BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 038581
         (792 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255557375|ref|XP_002519718.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223541135|gb|EEF42691.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 802

 Score = 1169 bits (3024), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 551/772 (71%), Positives = 644/772 (83%), Gaps = 4/772 (0%)

Query: 21  STNAVDAN-GSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQ 79
           + N  DAN   SS  +VCD  R+  LGL M++F FCDSSL Y +R KDLV++MTL EKVQ
Sbjct: 34  TLNHDDANPRGSSFTYVCDSSRYDNLGLDMTTFGFCDSSLSYEVRAKDLVNQMTLKEKVQ 93

Query: 80  QLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESL 139
           QLGD A+GVPRLG+P+YEWWSEALHGVS+VGPGT FDD++PGATSFPT ILTTASFNESL
Sbjct: 94  QLGDLAYGVPRLGIPKYEWWSEALHGVSDVGPGTFFDDLVPGATSFPTTILTTASFNESL 153

Query: 140 WKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYV 199
           WK IGQA S +ARAMYNLGRAGLTYWSPN+NV RDPRWGR  ETPGEDP+VVGRYAVNYV
Sbjct: 154 WKNIGQA-SAKARAMYNLGRAGLTYWSPNVNVVRDPRWGRTVETPGEDPYVVGRYAVNYV 212

Query: 200 RGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLR 259
           RGLQDVEG EN TDLN+RPLKVSSCCKHYAAYDV+ W+GV+R  FDARVTEQDM ETFLR
Sbjct: 213 RGLQDVEGTENYTDLNTRPLKVSSCCKHYAAYDVEKWQGVERLTFDARVTEQDMVETFLR 272

Query: 260 PFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN 319
           PFEMCVKEGD SSVMCS+NRVNGIP+CADPKLLNQT+RG+WDLHGYIV+DCDSI+VMVDN
Sbjct: 273 PFEMCVKEGDVSSVMCSFNRVNGIPTCADPKLLNQTIRGDWDLHGYIVSDCDSIEVMVDN 332

Query: 320 HKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRL 379
           HKFL D+ EDAVAQ LKAGLDLDCG YYTNFT  +V+QGK +E  ID+SLKYLY VLMRL
Sbjct: 333 HKFLGDTNEDAVAQVLKAGLDLDCGGYYTNFTETSVKQGKAREEYIDRSLKYLYVVLMRL 392

Query: 380 GFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPH 439
           GFFDG+PQY  LGK+DIC+ EN+ELA +AAREGIVLLKN+ +TLPL+  KVK +AVVGPH
Sbjct: 393 GFFDGTPQYQKLGKKDICTKENVELAKQAAREGIVLLKNN-DTLPLSMDKVKNLAVVGPH 451

Query: 440 ANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADA 499
           ANAT  MIGNYAG+PCRY+SPI GFS Y+NVTY+ GC DV CK+ + +F A  AAK ADA
Sbjct: 452 ANATRVMIGNYAGVPCRYVSPIDGFSIYSNVTYEIGC-DVPCKNESLVFPAVHAAKNADA 510

Query: 500 TIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
           TII+AGLDL++EAE LDR DL LPGYQTQLINQVA  A GPVILVIM+AGGVDI+FA  N
Sbjct: 511 TIIVAGLDLTIEAEGLDRNDLLLPGYQTQLINQVAGAANGPVILVIMAAGGVDISFARDN 570

Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
             IKAILW GYPG+EGG AIADVVFGK+NPGGRLPITWY  D+V+ +P+T M LRP + L
Sbjct: 571 EKIKAILWVGYPGQEGGHAIADVVFGKYNPGGRLPITWYEADFVEQVPMTYMQLRPDEEL 630

Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
           GYPG+TYKFY+G T+YPFGYGLSYT F YN+ S  ++  + LNK QHCR+L Y ++  K 
Sbjct: 631 GYPGKTYKFYDGSTVYPFGYGLSYTTFSYNITSAKRSKHIALNKFQHCRDLRYGNETFKP 690

Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
            CP VL + L C+D FE +V+ +N GS DGS+VV+VYSK P  I  +YIKQVIGF+RVFV
Sbjct: 691 SCPAVLTDHLPCNDDFELEVEVENTGSRDGSEVVMVYSKTPEGIVGSYIKQVIGFKRVFV 750

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
           +AG  +++ F FN CKS  I+DY A ++LP+G HTI VG+  VS P+++N++
Sbjct: 751 QAGSVEKVNFRFNVCKSFRIIDYNAYSILPSGGHTIMVGDDIVSIPLYINYS 802


>gi|449433577|ref|XP_004134574.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
 gi|449530107|ref|XP_004172038.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
          Length = 812

 Score = 1136 bits (2939), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 538/800 (67%), Positives = 635/800 (79%), Gaps = 10/800 (1%)

Query: 2   AKVVSS-LLCFSLSIALLVFSTNAV---------DANGSSSPVFVCDPGRFSKLGLQMSS 51
           AK+ SS ++  S+     +F+ NA          D    ++  FVCDP R+ KLGL  SS
Sbjct: 13  AKMASSPIMMISVLSLFFIFTANARVFPRRSLLDDPPAVNNFTFVCDPSRYDKLGLDFSS 72

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           F FCDSSL +  R KDL+ RMTL EK  QLG  A GV RLGLP Y WWSEALHGVSNVGP
Sbjct: 73  FGFCDSSLSFPERAKDLIDRMTLSEKAAQLGHVASGVDRLGLPPYNWWSEALHGVSNVGP 132

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
           GT FD V+PGATSFP VI T +SFNE LWK IGQAVSTEARAMYNLGRAGLTYWSP INV
Sbjct: 133 GTQFDKVVPGATSFPNVITTASSFNEDLWKTIGQAVSTEARAMYNLGRAGLTYWSPTINV 192

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
            RDPRWGR  ETPGEDPFVVG+YA NYVRGLQDVEG EN TDLNSRPLKVSSCCKHYAAY
Sbjct: 193 IRDPRWGRTVETPGEDPFVVGKYAKNYVRGLQDVEGSENVTDLNSRPLKVSSCCKHYAAY 252

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
           DVDNW GV+RY FDARVTEQDM ETF +PFEMCVKEGD SSVMCSYNRVNGIP+CADP L
Sbjct: 253 DVDNWLGVERYSFDARVTEQDMLETFNKPFEMCVKEGDVSSVMCSYNRVNGIPTCADPVL 312

Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
           L  T+RG W LHGYIV+DCDS++VMV++  +L D+ EDAVAQTLKAGLDLDCGQ Y N+T
Sbjct: 313 LKDTIRGNWGLHGYIVSDCDSVKVMVEDAHYLQDTNEDAVAQTLKAGLDLDCGQIYPNYT 372

Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAARE 411
            + V+QGKV   +ID +L  LY VLMRLG+FDG+  + SLGK DICSDE+IELA EAAR+
Sbjct: 373 ESTVRQGKVGMRNIDNALNNLYVVLMRLGYFDGNTGFESLGKPDICSDEHIELATEAARQ 432

Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT 471
           G VLLKND +TLP + +  KT+AVVGPHANAT AM+GNYAG+PCR  SP+ G S YA V 
Sbjct: 433 GTVLLKNDNDTLPFDPSNYKTLAVVGPHANATSAMLGNYAGVPCRMNSPMDGLSEYAKVK 492

Query: 472 YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLIN 531
           Y+ GCD VACK++  IF A EAA+T+DAT+I  G+DLS+EAESLDR DL LPGYQTQL+ 
Sbjct: 493 YQMGCDSVACKNDTFIFGAMEAARTSDATVIFVGIDLSIEAESLDRVDLLLPGYQTQLVQ 552

Query: 532 QVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGG 591
           QVA V+KGPV+LVI+SAGG+D++FA+ N+NIKAI+WAGYPGEEGGRAIADV+FGKFNPGG
Sbjct: 553 QVATVSKGPVVLVILSAGGIDVSFAKNNSNIKAIIWAGYPGEEGGRAIADVIFGKFNPGG 612

Query: 592 RLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLL 651
           RLP+TWY  DYV  LP+TSMPLRPV SLGYPGRTYKFY+GP +YPFG+GLSYT F +NL 
Sbjct: 613 RLPLTWYENDYVYQLPMTSMPLRPVKSLGYPGRTYKFYDGPVVYPFGHGLSYTFFLHNLT 672

Query: 652 SFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSD 711
           S  ++I ++L+    CR++ YT+   K  CP VLV+DL C +  EF+++ +N G  DGS 
Sbjct: 673 SAKRSIAIDLSNRTQCRDIAYTNGTFKPECPAVLVDDLTCTEEIEFQMEVENTGERDGSQ 732

Query: 712 VVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAG 771
           V++VYS PP  I++T+IKQV+GFQRVF++AG ++ + F  NACKSL +VD+    LLPAG
Sbjct: 733 VLLVYSVPPGGISSTHIKQVVGFQRVFLKAGDSETVTFKLNACKSLGLVDFTGYNLLPAG 792

Query: 772 EHTIFVGNGGVSFPIHLNFN 791
            HTI VG+G VSFP+ L+FN
Sbjct: 793 GHTIVVGDGEVSFPVELSFN 812


>gi|225432136|ref|XP_002274651.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
          Length = 809

 Score = 1083 bits (2802), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 523/810 (64%), Positives = 611/810 (75%), Gaps = 23/810 (2%)

Query: 1   MAKVVSSLLCFSLSIALLVF--STNAVDANGSSSP---------------VFVCDPGRFS 43
           M K++ SL  FSLSI  + F    +A+ +     P                +VCD  RF+
Sbjct: 1   MGKLLRSLF-FSLSIVWIAFFAVCSAIKSPLKDGPAAAPMAARGPIDGNYTYVCDESRFA 59

Query: 44  KLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEAL 103
            LGL M  F +CDSS PY +R KDLV RMTL EKV Q GD A GV R+GLP+Y WWSEAL
Sbjct: 60  ALGLDMKDFHYCDSSSPYEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLPKYNWWSEAL 119

Query: 104 HGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT 163
           HGVSN G    FD+V+PGATSFPTVIL+ ASFN+SLWK +GQAVSTEARAMYN G AGLT
Sbjct: 120 HGVSNFGRCVFFDEVVPGATSFPTVILSAASFNQSLWKTLGQAVSTEARAMYNSGNAGLT 179

Query: 164 YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
           +WSPNINV RDPRWGRI ETPGEDP +VG YAVNYVRGLQDV G EN TDLNSRPLKVSS
Sbjct: 180 FWSPNINVVRDPRWGRILETPGEDPHLVGLYAVNYVRGLQDVVGAENTTDLNSRPLKVSS 239

Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
           CCKHYAAYD+DNWKG DR HFDARV+ QDM ETF+ PFEMCVKEGD SSVMCSYN++NGI
Sbjct: 240 CCKHYAAYDLDNWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSSVMCSYNKINGI 299

Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           PSCAD +LL QT+RGEWDLHGYIV+DCDS++VM  + K+L  S  D+ AQ L AG++LDC
Sbjct: 300 PSCADSRLLKQTIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQALNAGMNLDC 359

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE 403
           G +       AV QGK  + D+D SL+YLY +LMR+GFFDG P + SLGK DICS E+IE
Sbjct: 360 GTFNNRSLTEAVNQGKANQADLDHSLRYLYVLLMRVGFFDGIPAFASLGKDDICSAEHIE 419

Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
           LA EAAR+GIVLLKND  TLPL S  VK +A+VGPHANAT AMIGNYAGIPC Y+SP+  
Sbjct: 420 LAREAARQGIVLLKNDNATLPLKS--VKNIALVGPHANATDAMIGNYAGIPCYYVSPLDA 477

Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
           FS    V Y+ GC DV C +   IF A EAAK ADATII AG DLS+EAE+LDR DL LP
Sbjct: 478 FSSMGEVRYEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDLSIEAEALDRVDLLLP 537

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
           GYQTQLINQVA+++ GPV+LVIMS GGVDI+FA  N  I AILWAGYPGE+GG AIADV+
Sbjct: 538 GYQTQLINQVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWAGYPGEQGGNAIADVI 597

Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
            GK+NPGGRLPITWY  DYV MLP+TSM LRPVDSLGYPGRTYKF+NG T+YPFGYG+SY
Sbjct: 598 LGKYNPGGRLPITWYEADYVDMLPMTSMALRPVDSLGYPGRTYKFFNGSTVYPFGYGMSY 657

Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
           T F Y+L +  +   +NL KLQ CR++ Y +D     CP VLV+DL C +  EF+V  +N
Sbjct: 658 TNFSYSLSTSQRWTNINLRKLQRCRSMVYINDTFVPDCPAVLVDDLSCKESIEFEVAVKN 717

Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
           VG  DGS+VV+VYS PP  IA T+IK+V+GF+RVFV+ G  +++KF  N CKSL IVD  
Sbjct: 718 VGRMDGSEVVVVYSSPPLGIAGTHIKKVVGFERVFVKVGGTEKVKFSMNVCKSLGIVDST 777

Query: 764 ANTLLPAGEHTIFVG---NGGVSFPIHLNF 790
              LLP+G HTI VG      V+FP H+N+
Sbjct: 778 GYALLPSGSHTIKVGGDNTTSVAFPFHVNY 807


>gi|224093292|ref|XP_002309869.1| predicted protein [Populus trichocarpa]
 gi|222852772|gb|EEE90319.1| predicted protein [Populus trichocarpa]
          Length = 694

 Score = 1075 bits (2779), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 510/726 (70%), Positives = 598/726 (82%), Gaps = 36/726 (4%)

Query: 67  DLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFP 126
           DLV++MTL+EKV QLG+ A+GVPRLGL +Y+WWSEALHGVSNVGPGT FDD+IPG+TSFP
Sbjct: 2   DLVNQMTLNEKVLQLGNKAYGVPRLGLAEYQWWSEALHGVSNVGPGTFFDDLIPGSTSFP 61

Query: 127 TVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGE 186
           TVI T A+FNESLWK IGQAVSTEARAMYNLGRAGLTYWSPNINV RDPRWGR  ETPGE
Sbjct: 62  TVITTAAAFNESLWKVIGQAVSTEARAMYNLGRAGLTYWSPNINVVRDPRWGRAIETPGE 121

Query: 187 DPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDA 246
           DP++VGRYAVNYVRGLQDVEG EN TD NSRPLKVSSCCKHYAAYDVDNWKGV+RY FDA
Sbjct: 122 DPYLVGRYAVNYVRGLQDVEGSENYTDPNSRPLKVSSCCKHYAAYDVDNWKGVERYTFDA 181

Query: 247 RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYI 306
           RV+EQDM ETFLRPFEMCVK+GD SSVMCSYNRVNGIP+CADPKLLNQT+RG+WDLHGYI
Sbjct: 182 RVSEQDMVETFLRPFEMCVKDGDVSSVMCSYNRVNGIPTCADPKLLNQTIRGDWDLHGYI 241

Query: 307 VADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDID 366
           V+DCDS+QVMV+NHK+L              GLDLDCG YYT     AV+QGKV+E DID
Sbjct: 242 VSDCDSLQVMVENHKWL--------------GLDLDCGAYYTENVEAAVRQGKVREADID 287

Query: 367 KSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLN 426
           KSL +LY VLMRLGFFDG PQY S GK D+CS ENIELA EAAREG VLLKN+ ++LPL+
Sbjct: 288 KSLNFLYVVLMRLGFFDGIPQYNSFGKNDVCSKENIELATEAAREGAVLLKNENDSLPLS 347

Query: 427 SAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNS 486
             KVKT+AV+GPH+NAT AMIGNYAGIPC+ ++PI G S YA V Y+ GC D+ACK  + 
Sbjct: 348 IEKVKTLAVIGPHSNATSAMIGNYAGIPCQIITPIEGLSKYAKVDYQMGCSDIACKDESF 407

Query: 487 IFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
           IF A E+AK ADATIILAG+DLS+EAESLDR+DL LPGYQTQLINQVA V+ GPV+LV+M
Sbjct: 408 IFPAMESAKKADATIILAGIDLSIEAESLDRDDLLLPGYQTQLINQVASVSNGPVVLVLM 467

Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           SAGGVDI+FA++N +IK+ILW GYPGEEGG AIADV+FGK+NPGGRLP+TW+  DYV ML
Sbjct: 468 SAGGVDISFAKSNGDIKSILWVGYPGEEGGNAIADVIFGKYNPGGRLPLTWHEADYVDML 527

Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
           P+TSMPLRP+DSLGYPGRTYKF+NG T+YPFG+GLSYTQF Y L S  +++ + L+K Q+
Sbjct: 528 PMTSMPLRPIDSLGYPGRTYKFFNGSTVYPFGHGLSYTQFTYKLTSTIRSLDIKLDKYQY 587

Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ--NVGSTDGSDVVIVYSKPPAEIA 724
           C +L Y +D+                    FK  F+  N G+ DGS+VVIVY+KPP  I 
Sbjct: 588 CHDLGYKNDS--------------------FKPSFEVLNAGAKDGSEVVIVYAKPPEGID 627

Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSF 784
           ATYIKQVIGF+RVFV AG ++++KF FNA KSL +VD+ A ++LP+G HTI +G+  +SF
Sbjct: 628 ATYIKQVIGFKRVFVPAGGSEKVKFEFNASKSLQVVDFNAYSVLPSGGHTIMLGDDIISF 687

Query: 785 PIHLNF 790
            + + F
Sbjct: 688 SVQIRF 693


>gi|225432134|ref|XP_002274619.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
          Length = 805

 Score = 1062 bits (2747), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 508/809 (62%), Positives = 625/809 (77%), Gaps = 23/809 (2%)

Query: 1   MAKVVSSLLCFSLSI---ALLVFST-------------NAVDANGSSSPVFVCDPGRFSK 44
           MAK  + L  FSLSI   A L  ST              A D  G+ +  +VCD  RF+ 
Sbjct: 1   MAKSFTRLF-FSLSILAIAFLAVSTARYTPRPNSRFLSQAFDVPGNYT--YVCDASRFAA 57

Query: 45  LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
           LGL M  F++CDSSLPY +RVKDLV R+TL+EK + + D A GVPR+GLP Y+WWSEALH
Sbjct: 58  LGLDMKDFVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWSEALH 117

Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
           GV+NVG  T FD+V+PGATSFP VIL+ ASFN+SLWK +GQ VSTEARAMYNLG AGLT+
Sbjct: 118 GVANVGSATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAMYNLGHAGLTF 177

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           WSPNINVARDPRWGRI ETPGEDP  VG Y VNYVRGLQD+EG EN TDLNSRPLK++S 
Sbjct: 178 WSPNINVARDPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTDLNSRPLKIASS 237

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           CKH+AAYD+D W  VDR HFDA+V+EQDM ETFLRPFEMCVKEGD SSVMCS+N +NGIP
Sbjct: 238 CKHFAAYDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSVMCSFNNINGIP 297

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
            CADP+ L   +R +W+LHGYIV+DC +I  +V + KFL  + E+ VA ++KAGLDL+CG
Sbjct: 298 PCADPRFLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVALSMKAGLDLECG 357

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIEL 404
            YY +    AV++G+V E D+DKSL YLY VLMR+GFFDG P   SLGK+DIC+DE+IEL
Sbjct: 358 HYYNDSLATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPSLASLGKKDICNDEHIEL 417

Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF 464
           A EAAR+GIVLLKND  TLPL    VK +A+VGPHANATVAMIGNYAGIPC Y+SP+  F
Sbjct: 418 AREAARQGIVLLKNDNATLPLK--PVKKLALVGPHANATVAMIGNYAGIPCHYVSPLDAF 475

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
           S   +VTY+ GC DV C ++  ++ A+EAAK ADATIIL G DLS+EAE  DREDL LPG
Sbjct: 476 SELGDVTYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSIEAEERDREDLLLPG 535

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
           YQT+++NQV +++ GPVILV+M  G +DI+FA+ N  I AILWAG+PGE+GG AIAD+VF
Sbjct: 536 YQTEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGFPGEQGGNAIADIVF 595

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           GK+NPGGR PITWY   YV MLP+TSM LRP++SLGYPGRTYKF+NG T+YPFGYGLSYT
Sbjct: 596 GKYNPGGRSPITWYENGYVGMLPMTSMALRPIESLGYPGRTYKFFNGSTVYPFGYGLSYT 655

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
            F Y+L + T+++ ++L +LQ CR++ Y+SD+ +  C  VLV+DL CD+ FEF+V  +NV
Sbjct: 656 NFSYSLTAPTRSVHISLTRLQQCRSMAYSSDSFQPECSAVLVDDLSCDESFEFQVAVKNV 715

Query: 705 GSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAA 764
           GS DGS+VV+VYS PP+ I  T+IKQVIGF+RVFV+ G  +++KF  N CKSL +VD + 
Sbjct: 716 GSMDGSEVVMVYSSPPSGIVGTHIKQVIGFERVFVKVGNTEKVKFSMNVCKSLGLVDSSG 775

Query: 765 NTLLPAGEHTIFVGNG--GVSFPIHLNFN 791
             LLP+G HTI  G+    VSFP  +N++
Sbjct: 776 YILLPSGSHTIMAGDNSTSVSFPFQVNYH 804


>gi|225432132|ref|XP_002274591.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Vitis vinifera]
          Length = 805

 Score = 1040 bits (2690), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 480/760 (63%), Positives = 589/760 (77%), Gaps = 4/760 (0%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
           +VCD  R++ LGL M SF FCD SL Y  R KDLVSRMTL EKV Q    A GV RLGLP
Sbjct: 47  YVCDESRYALLGLDMKSFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGVRRLGLP 106

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
           +Y WWSEALHG+SN+GPG  FD+ IPGATS PTVIL+TA+FN++LWK +G+ VSTE RAM
Sbjct: 107 EYSWWSEALHGISNLGPGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVSTEGRAM 166

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
           YNLG AGLT+WSPNINV RD RWGR  ET GEDPF+VG +AVNYVRGLQDVEG EN TDL
Sbjct: 167 YNLGHAGLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTENVTDL 226

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
           NSRPLKVSSCCKHYAAYD+D+W  VDR+ FDARV+EQDM+ETF+ PFE CV+EGD SSVM
Sbjct: 227 NSRPLKVSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSSVM 286

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           CS+N++NGIP C+DP+LL   +R EWDLHGYIV+DC  ++V+VDN  +L DSK DAVA+T
Sbjct: 287 CSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAKT 346

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQ 394
           L+AGLDL+CG YYT+    +V  GKV + ++D++LK +Y +LMR+G+FDG P Y SLG +
Sbjct: 347 LQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYESLGLK 406

Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
           DIC+ ++IELA EAAR+GIVLLKND   LPL     K +A+VGPHANAT  MIGNYAG+P
Sbjct: 407 DICAADHIELAREAARQGIVLLKNDYEVLPLKPG--KKIALVGPHANATEVMIGNYAGLP 464

Query: 455 CRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
           C+Y+SP+  FS   NVTY TGC D +C ++     A EAAK+A+ TII  G DLS+EAE 
Sbjct: 465 CKYVSPLEAFSAIGNVTYATGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTDLSIEAEF 524

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
           +DR D  LPG QT+LI QVAEV+ GPVILV++S   +DI FA+ N  I AILW G+PGE+
Sbjct: 525 VDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGEQ 584

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
           GG AIADVVFGK+NPGGRLP+TWY  DYV MLP++SM LRPVD LGYPGRTYKF++G T+
Sbjct: 585 GGHAIADVVFGKYNPGGRLPVTWYEADYVDMLPMSSMSLRPVDELGYPGRTYKFFDGSTV 644

Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
           YPFGYG+SYT+F Y+L +   +I ++LNK Q CR + YT D     CP VL++D+ CDD 
Sbjct: 645 YPFGYGMSYTKFSYSLATSKISIDIDLNKFQKCRTVAYTEDQKVPSCPAVLLDDMSCDDT 704

Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
            EF+V   NVG  DGS+V++VYS PP+ I  T+IKQVIGFQ+VFV AG  +R+KF  NAC
Sbjct: 705 IEFEVAVTNVGMVDGSEVLMVYSIPPSGIVGTHIKQVIGFQKVFVAAGDTERVKFSMNAC 764

Query: 755 KSLNIVDYAANTLLPAGEHTIFVGN--GGVSFPIHLNFNY 792
           KSL IVD    +LLP+G HTI VG+     S+ + +N++Y
Sbjct: 765 KSLRIVDSTGYSLLPSGSHTIRVGDYSNSASYSLQVNYHY 804


>gi|297736787|emb|CBI25988.3| unnamed protein product [Vitis vinifera]
          Length = 774

 Score = 1011 bits (2613), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 493/809 (60%), Positives = 602/809 (74%), Gaps = 54/809 (6%)

Query: 1   MAKVVSSLLCFSLSI---ALLVFST-------------NAVDANGSSSPVFVCDPGRFSK 44
           MAK  + L  FSLSI   A L  ST              A D  G+ +  +VCD  RF+ 
Sbjct: 1   MAKSFTRLF-FSLSILAIAFLAVSTARYTPRPNSRFLSQAFDVPGNYT--YVCDASRFAA 57

Query: 45  LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
           LGL M  F++CDSSLPY +RVKDLV R+TL+EK + + D A GVPR+GLP Y+WWSEALH
Sbjct: 58  LGLDMKDFVYCDSSLPYDVRVKDLVDRITLEEKARNVIDVASGVPRIGLPPYKWWSEALH 117

Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
           GV+NVG  T FD+V+PGATSFP VIL+ ASFN+SLWK +GQ VSTEARAMYNLG AGLT+
Sbjct: 118 GVANVGSATFFDEVVPGATSFPNVILSAASFNQSLWKTLGQVVSTEARAMYNLGHAGLTF 177

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           WSPNINVARDPRWGRI ETPGEDP  VG Y VNYVRGLQD+EG EN TDLNSRPLK++S 
Sbjct: 178 WSPNINVARDPRWGRILETPGEDPLTVGVYGVNYVRGLQDIEGTENTTDLNSRPLKIASS 237

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           CKH+AAYD+D W  VDR HFDA+V+EQDM ETFLRPFEMCVKEGD SSVMCS+N +NGIP
Sbjct: 238 CKHFAAYDLDQWFNVDRRHFDAKVSEQDMTETFLRPFEMCVKEGDTSSVMCSFNNINGIP 297

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
            CADP+ L   +R +W+LHGYIV+DC +I  +V + KFL  + E+ VA ++KAGLDL+CG
Sbjct: 298 PCADPRFLKGVIREQWNLHGYIVSDCWAIDTIVQDQKFLDVTSEEGVALSMKAGLDLECG 357

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIEL 404
            YY +    AV++G+V E D+DKSL YLY VLMR+GFFDG P   SLGK+DIC+DE+IEL
Sbjct: 358 HYYNDSLATAVREGRVSEHDVDKSLSYLYVVLMRVGFFDGIPSLASLGKKDICNDEHIEL 417

Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF 464
           A EAAR+GIVLLKND  TLPL    VK +A+VGPHANATVAMIGNYAGIPC Y+SP+  F
Sbjct: 418 AREAARQGIVLLKNDNATLPLK--PVKKLALVGPHANATVAMIGNYAGIPCHYVSPLDAF 475

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
           S   +VTY+ GC DV C ++  ++ A+EAAK ADATIIL G DLS+EAE  DREDL LPG
Sbjct: 476 SELGDVTYEVGCADVKCHNDTHVYKAAEAAKNADATIILVGTDLSIEAEERDREDLLLPG 535

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
           YQT+++NQV +++ GPVILV+M  G +DI+FA+ N  I AILWAG+PGE+GG AIAD+VF
Sbjct: 536 YQTEMVNQVTDLSTGPVILVVMCGGPIDISFAKNNPKIAAILWAGFPGEQGGNAIADIVF 595

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           GK+NPGGR PITWY   YV MLP+TSM LRP++SLGYPGRTYKF+NG T+YPFGYGLSYT
Sbjct: 596 GKYNPGGRSPITWYENGYVGMLPMTSMALRPIESLGYPGRTYKFFNGSTVYPFGYGLSYT 655

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
            F Y+L + T+++ ++L                                 FEF+V  +NV
Sbjct: 656 NFSYSLTAPTRSVHISLTS-------------------------------FEFQVAVKNV 684

Query: 705 GSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAA 764
           GS DGS+VV+VYS PP+ I  T+IKQVIGF+RVFV+ G  +++KF  N CKSL +VD + 
Sbjct: 685 GSMDGSEVVMVYSSPPSGIVGTHIKQVIGFERVFVKVGNTEKVKFSMNVCKSLGLVDSSG 744

Query: 765 NTLLPAGEHTIFVGNG--GVSFPIHLNFN 791
             LLP+G HTI  G+    VSFP  +N++
Sbjct: 745 YILLPSGSHTIMAGDNSTSVSFPFQVNYH 773


>gi|297736788|emb|CBI25989.3| unnamed protein product [Vitis vinifera]
          Length = 746

 Score =  967 bits (2499), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 484/810 (59%), Positives = 565/810 (69%), Gaps = 86/810 (10%)

Query: 1   MAKVVSSLLCFSLSIALLVFST--NAVDANGSSSP---------------VFVCDPGRFS 43
           M K++ SL  FSLSI  + F    +A+ +     P                +VCD  RF+
Sbjct: 1   MGKLLRSLF-FSLSIVWIAFFAVCSAIKSPLKDGPAAAPMAARGPIDGNYTYVCDESRFA 59

Query: 44  KLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEAL 103
            LGL M  F +CDSS PY +R KDLV RMTL EKV Q GD A GV R+GLP+Y WWSEAL
Sbjct: 60  ALGLDMKDFHYCDSSSPYEVRAKDLVDRMTLSEKVMQTGDQASGVERIGLPKYNWWSEAL 119

Query: 104 HGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT 163
           HGVSN G    FD+V+PGATSFPTVIL+ ASFN+SLWK +GQAVSTEARAMYN G AGLT
Sbjct: 120 HGVSNFGRCVFFDEVVPGATSFPTVILSAASFNQSLWKTLGQAVSTEARAMYNSGNAGLT 179

Query: 164 YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
           +WSPNINV RDPRWGRI ETPGEDP +VG YAVNY                         
Sbjct: 180 FWSPNINVVRDPRWGRILETPGEDPHLVGLYAVNY------------------------- 214

Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
              HYAAYD+DNWKG DR HFDARV+ QDM ETF+ PFEMCVKEGD SSVMCSYN++NGI
Sbjct: 215 ---HYAAYDLDNWKGADRVHFDARVSVQDMAETFVLPFEMCVKEGDVSSVMCSYNKINGI 271

Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           PSCAD +LL QT+RGEWDLHGYIV+DCDS++VM  + K+L  S  D+ AQ L AG++LDC
Sbjct: 272 PSCADSRLLKQTIRGEWDLHGYIVSDCDSVEVMAVDQKWLDSSFSDSAAQALNAGMNLDC 331

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE 403
           G +       AV QGK  + D+D SL+YLY +LMR+GFFDG P + SLGK DICS E+IE
Sbjct: 332 GTFNNRSLTEAVNQGKANQADLDHSLRYLYVLLMRVGFFDGIPAFASLGKDDICSAEHIE 391

Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
           LA EAAR+GIVLLKND  TLPL S  VK +A+VGPHANAT AMIGNYAGIPC Y+SP+  
Sbjct: 392 LAREAARQGIVLLKNDNATLPLKS--VKNIALVGPHANATDAMIGNYAGIPCYYVSPLDA 449

Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
           FS    V Y+ GC DV C +   IF A EAAK ADATII AG DLS+EAE+LDR DL LP
Sbjct: 450 FSSMGEVRYEKGCADVQCLNETYIFNAMEAAKRADATIIFAGTDLSIEAEALDRVDLLLP 509

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
           GYQTQLINQVA+++ GPV+LVIMS GGVDI+FA  N  I AILWAGYPGE+GG AIADV+
Sbjct: 510 GYQTQLINQVADLSTGPVVLVIMSGGGVDISFARDNPKIAAILWAGYPGEQGGNAIADVI 569

Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
            GK+NPGGRLPITWY  DYV MLP+TSM LRPVDSLGYPGRTYKF+NG T+YPFGYG+SY
Sbjct: 570 LGKYNPGGRLPITWYEADYVDMLPMTSMALRPVDSLGYPGRTYKFFNGSTVYPFGYGMSY 629

Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
           T F Y+L     T Q                                C +  EF+V  +N
Sbjct: 630 TNFSYSL----STSQ-------------------------------SCKESIEFEVAVKN 654

Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
           VG  DGS+VV+VYS PP  IA T+IK+V+GF+RVFV+ G  +++KF  N CKSL IVD  
Sbjct: 655 VGRMDGSEVVVVYSSPPLGIAGTHIKKVVGFERVFVKVGGTEKVKFSMNVCKSLGIVDST 714

Query: 764 ANTLLPAGEHTIFVG---NGGVSFPIHLNF 790
              LLP+G HTI VG      V+FP H+N+
Sbjct: 715 GYALLPSGSHTIKVGGDNTTSVAFPFHVNY 744


>gi|359477633|ref|XP_003632006.1| PREDICTED: LOW QUALITY PROTEIN: beta-D-xylosidase 3-like [Vitis
           vinifera]
          Length = 781

 Score =  940 bits (2430), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 474/784 (60%), Positives = 580/784 (73%), Gaps = 16/784 (2%)

Query: 19  VFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLP-YSIRVKDLVSRMTLDEK 77
           +F +   D  G+ S   VCDP RF+ LG  M  F++C+SSLP Y +RVKDLV RMTL+EK
Sbjct: 1   MFLSEGFDVPGNYS--HVCDPARFAALGFDMKDFVYCNSSLPIYDVRVKDLVDRMTLEEK 58

Query: 78  VQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV---GPGTHFDDVIPGATSFPTVILTTAS 134
              +   A GV R+GLP Y+WWSEALHGVS+V   GP T FD+ +PGATSFP VIL+ AS
Sbjct: 59  ATNVIYKAAGVERIGLPPYQWWSEALHGVSSVSINGP-TFFDETVPGATSFPNVILSAAS 117

Query: 135 FNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRY 194
           FN+SLWK I Q VS EARA YNLG AGLT+W PN+NVARDPRWGR  ET GEDPF V  Y
Sbjct: 118 FNQSLWKTIRQVVSKEARATYNLGHAGLTFWCPNVNVARDPRWGRTQETXGEDPFTVSVY 177

Query: 195 AVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
           AV+YVRGLQDVEG EN TDLNSRPLKVSS  KH+AAYD+DNW  VDR HF+ARV+EQDM 
Sbjct: 178 AVSYVRGLQDVEGTENTTDLNSRPLKVSSSGKHFAAYDLDNWLNVDRNHFNARVSEQDMA 237

Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
           ETFLRPFE CV+EGD S VMCS+N +NGIP CADP+L   T+R EW+LHGYIV+DC SI+
Sbjct: 238 ETFLRPFEACVREGDVSGVMCSFNNINGIPPCADPRLFKGTIRDEWNLHGYIVSDCWSIE 297

Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYT 374
            +V++ KFL  + E+AVA  LKAGLDL+CG YY +   +AV  G+V + D+D+SL  LY 
Sbjct: 298 TIVEDQKFLDVTGEEAVALNLKAGLDLECGHYYNDSPASAVMAGRVGQHDLDQSLSNLYV 357

Query: 375 VLMRLGFFDGSPQYVSLGKQDIC-SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
           VLMRLGFFDG P   SLGK DIC S E+IELA EAAR+GIVLLKND  TLPL S  VK +
Sbjct: 358 VLMRLGFFDGIPALASLGKDDICLSAEHIELAREAARQGIVLLKNDNATLPLKS--VKNL 415

Query: 434 AVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEA 493
           A+VGP+A+A  AM+GNYAG PCR +SP   FS   NVTY+ GC DV C ++  ++ A EA
Sbjct: 416 ALVGPNADAYGAMMGNYAGPPCRSVSPRDAFSAIGNVTYEMGCGDVLCHNDTYVYKAVEA 475

Query: 494 AKTADATIILAGL-DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMS--AGG 550
           AK AD TII+ G+ D+S+  E  DR DL LPGYQT L+NQ+A+    P+ILV+     G 
Sbjct: 476 AKHADTTIIVVGITDVSIGTEDKDRVDLLLPGYQTHLVNQIAKATTAPIILVVCGHCGGP 535

Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
           +DI+FA  N  I+ ILWAG+PGEEGG AIADVV+GK+NPGGRLP+TWY   YV MLP+TS
Sbjct: 536 IDISFARDNPGIEPILWAGFPGEEGGNAIADVVYGKYNPGGRLPVTWYENGYVGMLPMTS 595

Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
           M LR V+SLGYPGR YKF++G T+YPFG GLSYT F Y+L + T++I  +L KLQ CR++
Sbjct: 596 MALRSVESLGYPGRKYKFFSGSTVYPFGCGLSYTNFSYSLTAPTRSIHTHLKKLQPCRSM 655

Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
            Y+  +   +CP VLV+DL C++ FEF+V  + VGS DGS+VVIVYS PP+ I  T+IKQ
Sbjct: 656 AYSICSVIPQCPAVLVDDLSCNETFEFEVAVKTVGSMDGSEVVIVYSSPPSGIVGTHIKQ 715

Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG---GVSFPIH 787
           VIGF+RVFV+ G  +++KF  N CKSL IV  + +TLLP+G   I  G      VSFP  
Sbjct: 716 VIGFERVFVKVGXVEKVKFSMNVCKSLGIVHSSGHTLLPSGSDIIKAGGDNTISVSFPFQ 775

Query: 788 LNFN 791
             ++
Sbjct: 776 AAYH 779


>gi|297736786|emb|CBI25987.3| unnamed protein product [Vitis vinifera]
          Length = 745

 Score =  939 bits (2427), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 450/760 (59%), Positives = 550/760 (72%), Gaps = 64/760 (8%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
           +VCD  R++ LGL M SF FCD SL Y  R KDLVSRMTL EKV Q    A GV RLGLP
Sbjct: 47  YVCDESRYALLGLDMKSFAFCDKSLSYKERAKDLVSRMTLQEKVMQSVHTASGVRRLGLP 106

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
           +Y WWSEALHG+SN+GPG  FD+ IPGATS PTVIL+TA+FN++LWK +G+ VSTE RAM
Sbjct: 107 EYSWWSEALHGISNLGPGVFFDETIPGATSLPTVILSTAAFNQTLWKTLGRVVSTEGRAM 166

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
           YNLG AGLT+WSPNINV RD RWGR  ET GEDPF+VG +AVNYVRGLQDVEG EN    
Sbjct: 167 YNLGHAGLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTEN---- 222

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
                 VSSCCKHYAAYD+D+W  VDR+ FDARV+EQDM+ETF+ PFE CV+EGD SSVM
Sbjct: 223 ------VSSCCKHYAAYDIDSWLNVDRHTFDARVSEQDMKETFVSPFERCVREGDVSSVM 276

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           CS+N++NGIP C+DP+LL   +R EWDLHGYIV+DC  ++V+VDN  +L DSK DAVA+T
Sbjct: 277 CSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAKT 336

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQ 394
           L+AGLDL+CG YYT+    +V  GKV + ++D++LK +Y +LMR+G+FDG P Y SLG +
Sbjct: 337 LQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYESLGLK 396

Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
           DIC+ ++IELA EAAR+GIVLLKND   LPL     K +A+VGPHANAT  MIGNYAG+P
Sbjct: 397 DICAADHIELAREAARQGIVLLKNDYEVLPLKPG--KKIALVGPHANATEVMIGNYAGLP 454

Query: 455 CRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
           C+Y+SP+  FS   NVTY TG                        TII  G DLS+EAE 
Sbjct: 455 CKYVSPLEAFSAIGNVTYATGF-----------------------TIIFVGTDLSIEAEF 491

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
           +DR D  LPG QT+LI QVAEV+ GPVILV++S   +DI FA+ N  I AILW G+PGE+
Sbjct: 492 VDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGEQ 551

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
           GG AIADVVFGK+NPGGRLP+TWY  DYV MLP++SM LRPVD LGYPGRTYKF++G T+
Sbjct: 552 GGHAIADVVFGKYNPGGRLPVTWYEADYVDMLPMSSMSLRPVDELGYPGRTYKFFDGSTV 611

Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
           YPFGYG+SYT+F Y+L +   +I ++LNK Q CR                          
Sbjct: 612 YPFGYGMSYTKFSYSLATSKISIDIDLNKFQKCRT------------------------- 646

Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
             F+V   NVG  DGS+V++VYS PP+ I  T+IKQVIGFQ+VFV AG  +R+KF  NAC
Sbjct: 647 --FEVAVTNVGMVDGSEVLMVYSIPPSGIVGTHIKQVIGFQKVFVAAGDTERVKFSMNAC 704

Query: 755 KSLNIVDYAANTLLPAGEHTIFVGN--GGVSFPIHLNFNY 792
           KSL IVD    +LLP+G HTI VG+     S+ + +N++Y
Sbjct: 705 KSLRIVDSTGYSLLPSGSHTIRVGDYSNSASYSLQVNYHY 744


>gi|226506870|ref|NP_001146482.1| uncharacterized protein LOC100280070 precursor [Zea mays]
 gi|219887469|gb|ACL54109.1| unknown [Zea mays]
 gi|413947917|gb|AFW80566.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 835

 Score =  902 bits (2331), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 436/772 (56%), Positives = 552/772 (71%), Gaps = 17/772 (2%)

Query: 36  VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
           VCDP RF  LGL MS F +CD+SLPY+ RV+DLV R+ L+EKV+ LGD A G PR+GLP 
Sbjct: 62  VCDPARFVALGLDMSRFRYCDASLPYADRVRDLVGRLALEEKVRNLGDQAEGAPRVGLPP 121

Query: 96  YEWWSEALHGVSNVGPG-THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
           Y+WW EALHGVS+VGPG T F DV+PGATSFP VI + A+FNESLW+ IG  VSTE RAM
Sbjct: 122 YKWWGEALHGVSDVGPGGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAM 181

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG--HENAT 212
           YNLG A LTYWSPNINV RDPRWGR +ETPGEDPFVVGRYAVN+VRG+QDV+   +  A 
Sbjct: 182 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVDDRPYAAAA 241

Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
           D  SRP+KVSSCCKH+AAYDVD W   DR  FDA+V E+DM ETF RPFEMC+++GDAS 
Sbjct: 242 DPFSRPIKVSSCCKHFAAYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRDGDASC 301

Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
           VMCSYNR+NGIP+CAD +LL++TVR +W LHGYIV+DCDS++VMV + K+L  +  +A A
Sbjct: 302 VMCSYNRINGIPACADARLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTGVEATA 361

Query: 333 QTLKAGLDLDCGQY-------YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
             +KAGLDLDCG +       +T +  +AV+QGK+KE D+D +L  +YT LMRLGFFDG 
Sbjct: 362 AAMKAGLDLDCGMFWEGARDFFTTYGVDAVRQGKIKEGDVDNALSNVYTTLMRLGFFDGM 421

Query: 386 PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG--PHANAT 443
           P++ SLG  ++C+D + ELAA+AAR+G+VLLKND   LPL+  K+ +V++VG   H NAT
Sbjct: 422 PEFESLGASNVCTDGHKELAADAARQGMVLLKNDARRLPLDPNKINSVSLVGLLEHINAT 481

Query: 444 VAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
             M+G+Y G PCR ++P        N TY   CD  AC +   +  AS  AK ADATI++
Sbjct: 482 DVMLGDYRGKPCRIVTPYNAIRNMVNATYVHACDSGACNTAEGMGRASSTAKIADATIVI 541

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
           AGL++SVE ES DREDL LP  Q+  IN VA  +  P++LVIMSAGGVD++FA  NT I 
Sbjct: 542 AGLNMSVERESNDREDLLLPWNQSSWINAVAMASPTPIVLVIMSAGGVDVSFAHNNTKIG 601

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
           AI+WAGYPGEEGG AIADV+FGK+NPGGRLP+TW+  +YV  +P+TSM LRP  +LGYPG
Sbjct: 602 AIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWFKNEYVNQIPMTSMALRPDAALGYPG 661

Query: 624 RTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR-- 680
           RTYKFY GP  LYPFG+GLSYT F Y   +   T+ +++   +HC+ L Y   A      
Sbjct: 662 RTYKFYGGPAVLYPFGHGLSYTNFSYASGTTGATVTIHIGAWEHCKMLTYKMGAPSPSPA 721

Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
           CP + V    C +   F +   N G   G  VV VY+ PP E+    +KQ++ F+RVFV 
Sbjct: 722 CPALNVASHMCSEVVSFSLRVANTGGVGGDHVVPVYTAPPPEVGDAPLKQLVAFRRVFVP 781

Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGG--VSFPIHLNF 790
           AG    + F  N CK+  IV+  A T++P+G  T+ VG+    +SFP+ +N 
Sbjct: 782 AGAAVDVPFALNVCKTFAIVEETAYTVVPSGVSTVVVGDDALVLSFPVTINL 833


>gi|242052713|ref|XP_002455502.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
 gi|241927477|gb|EES00622.1| hypothetical protein SORBIDRAFT_03g012290 [Sorghum bicolor]
          Length = 825

 Score =  900 bits (2325), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 435/773 (56%), Positives = 553/773 (71%), Gaps = 17/773 (2%)

Query: 36  VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
           VCDP RF+ LGL MS F +CD+SLPY+ RV+DLV R++L+EKV+ LGD A G PR+GLP 
Sbjct: 50  VCDPVRFAALGLDMSRFRYCDASLPYAERVRDLVGRLSLEEKVRNLGDQAEGAPRVGLPP 109

Query: 96  YEWWSEALHGVSNVGPG-THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
           Y+WW EALHGVS+VGPG T F DV+PGATSFP VI + A+FNESLW+ IG  VSTE RAM
Sbjct: 110 YKWWGEALHGVSDVGPGGTWFGDVVPGATSFPLVINSAAAFNESLWRAIGGVVSTEIRAM 169

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV---EGHENA 211
           YNLG A LTYWSPNINV RDPRWGR +ETPGEDPFVVGRYAVN+VRG+QDV    G    
Sbjct: 170 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDVVIAAGAAAT 229

Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
            D  SRP+KVSSCCKH+AAYDVD W   DR  FDA+V E+DM ETF RPFEMC+++GDAS
Sbjct: 230 ADPFSRPIKVSSCCKHFAAYDVDAWFKADRLTFDAQVEERDMVETFERPFEMCIRDGDAS 289

Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
            VMCSYNR+NGIP+CAD +LL++TVR +W LHGYIV+DCDS++VMV + K+L  +  +A 
Sbjct: 290 CVMCSYNRINGIPACADARLLSETVRSQWQLHGYIVSDCDSVRVMVRDAKWLNYTGVEAT 349

Query: 332 AQTLKAGLDLDCGQY-------YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
           A  +KAGLDLDCG +       +T +  +AV+QGK+KE D+D +L  +YT LMRLGFFDG
Sbjct: 350 AAAMKAGLDLDCGMFWEGARDFFTTYGVDAVRQGKIKEADVDNALGNVYTTLMRLGFFDG 409

Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG--PHANA 442
            P++ SLG  D+C+ ++ ELAA+AAR+G+VLLKND   LPL+ +K+ +V++VG   H NA
Sbjct: 410 MPEFESLGADDVCTRDHKELAADAARQGMVLLKNDARRLPLDPSKINSVSLVGLLEHINA 469

Query: 443 TVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATII 502
           T  M+G+Y G PCR ++P        N TY   CD  AC +   +  AS  AK ADATI+
Sbjct: 470 TDVMLGDYRGKPCRIVTPYDAIRQVVNATYVHACDSGACSTAEGMGRASRTAKIADATIV 529

Query: 503 LAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
           +AGL++SVE ES DREDL LP  Q+  IN VAE +  P++LVIMSAGGVD++FA+ NT I
Sbjct: 530 IAGLNMSVERESNDREDLLLPWNQSSWINAVAEASTTPIVLVIMSAGGVDVSFAQNNTKI 589

Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
            AI+WAGYPGEEGG AIADV+FGK+NPGGRLP+TW+  +YV  +P+TSM LRP  + GYP
Sbjct: 590 GAIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWFKNEYVNQIPMTSMALRPDAAHGYP 649

Query: 623 GRTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT-- 679
           GRTYKFY GP  LYPFG+GLSYT F Y   +   T+ + +   +HC+ L Y S  + +  
Sbjct: 650 GRTYKFYGGPAVLYPFGHGLSYTSFTYASGTTGATVTIPIGAWEHCKMLTYKSGKAPSPS 709

Query: 680 -RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
             CP + V   RCD+   F +   N G   G  VV VY+ PP E+     KQ++ F+RVF
Sbjct: 710 PACPALNVASHRCDEVVSFSLRVANTGGVGGDHVVPVYTAPPPEVGDAPRKQLVEFRRVF 769

Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
           V AG    + F  N CK+  IV+  A T++P+G  T+ VG+  ++    +  N
Sbjct: 770 VPAGAAVDVPFALNVCKTFAIVEETAYTVVPSGVSTVIVGDDALALSFAVTIN 822


>gi|326523729|dbj|BAJ93035.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 810

 Score =  878 bits (2269), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 424/763 (55%), Positives = 548/763 (71%), Gaps = 21/763 (2%)

Query: 36  VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
           VCDP RF+ LGL+M+ F +CD+SLPY+ RV+DLV R+TL+EKV+ LGD A G  R+GLP 
Sbjct: 45  VCDPARFAALGLEMAGFRYCDASLPYADRVRDLVGRLTLEEKVRNLGDRAEGAARVGLPP 104

Query: 96  YEWWSEALHGVSNVGPG-THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
           Y WW EALHGVS+ GPG T F DV+PGATSFP VI + A+FNE+LW  IG AVSTE RAM
Sbjct: 105 YLWWGEALHGVSDTGPGGTRFGDVVPGATSFPLVINSAAAFNETLWGAIGGAVSTEIRAM 164

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE--NAT 212
           YNLG A LTYWSPNINV RDPRWGR +ETPGEDPFVVGRYAV++VR +QD++G       
Sbjct: 165 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVSFVRAMQDIDGAGPGAGA 224

Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
           D  +RP+KVSSCCKHYAAYDVD W   DR  FDA+V E+DM ETF RPFEMCV++GDAS 
Sbjct: 225 DPFARPIKVSSCCKHYAAYDVDAWLTADRLTFDAQVEERDMIETFERPFEMCVRDGDASC 284

Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
           VMCSYNR+NG+P+CA+ +LL++TVRGEW LHGYIV+DCDS++VMV + K+L  +  +A A
Sbjct: 285 VMCSYNRINGVPACANARLLSETVRGEWQLHGYIVSDCDSVRVMVRDAKWLGYNGVEATA 344

Query: 333 QTLKAGLDLDCGQY-------YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
             +KAGLDLDCG +       +T F  +AV+QGK++E+++D +L+ LY  LMRLGFFDG 
Sbjct: 345 AAMKAGLDLDCGMFWEGAQDFFTAFGLDAVRQGKLRESEVDNALRNLYLTLMRLGFFDGI 404

Query: 386 PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG--PHANAT 443
           P+  SLG  D+C++E+ ELAA+AAR+G+VL+KND   LPL+++KV ++++VG   H NAT
Sbjct: 405 PELESLGANDVCTEEHKELAADAARQGMVLIKNDHGRLPLDTSKVNSLSLVGLLQHINAT 464

Query: 444 VAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
             M+G+Y G PCR ++P        + T    CD  AC +       +   KT DATI++
Sbjct: 465 DVMLGDYRGKPCRVVTPYDAIRKVVSATSMQVCDHGACST-------AANGKTVDATIVI 517

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
           AGL++SVE E  DREDL LP  QT  IN VAE +  P+ILVI+SAGGVD++FA+ N  I 
Sbjct: 518 AGLNMSVEKEGNDREDLLLPWNQTNWINAVAEASPYPIILVIISAGGVDVSFAQNNPKIG 577

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
           AI+WAGYPGEEGG AIADV+FGK+NPGGRLP+TWY  +Y+  +P+TSM LRPV   GYPG
Sbjct: 578 AIVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKSEYISKIPMTSMALRPVADKGYPG 637

Query: 624 RTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT-SDASKTRC 681
           RTYKFY GP  LYPFG+GLSY+ F Y   +   ++ V +   + C+ L       +   C
Sbjct: 638 RTYKFYGGPEVLYPFGHGLSYSNFSYASDTTGASVTVRVGAWESCKQLTRKPGTTAPLAC 697

Query: 682 PGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRA 741
           P V V    C +   F +   N GS DG+ VV+VY+ PPAE+    +KQ++ F+RVFV A
Sbjct: 698 PAVNVAGHGCKEEVSFSLTVANRGSRDGAHVVMVYTVPPAEVDDAPLKQLVAFRRVFVPA 757

Query: 742 GRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSF 784
           G   ++ F  N CK+  IV+  A T++P+G  T+ VG+  +SF
Sbjct: 758 GAAVQVPFTLNVCKAFAIVEETAYTVVPSGVSTVLVGDDALSF 800


>gi|357128056|ref|XP_003565692.1| PREDICTED: beta-D-xylosidase 3-like [Brachypodium distachyon]
          Length = 821

 Score =  865 bits (2234), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 427/777 (54%), Positives = 547/777 (70%), Gaps = 24/777 (3%)

Query: 36  VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP-RLGLP 94
           VCDP RF+ LGL M+ F +CD+SLPY+ RV+DLV R+TL+EKV  LGD A G   R+GLP
Sbjct: 45  VCDPARFASLGLDMAGFRYCDASLPYAERVRDLVGRLTLEEKVANLGDQAKGAEQRVGLP 104

Query: 95  QYEWWSEALHGVSNVGPG-THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA 153
           +Y WW EALHGVS+  PG T F DV+PGATSFP V+ + A+FNE+LW+ IG A STE RA
Sbjct: 105 RYMWWGEALHGVSDTNPGGTRFGDVVPGATSFPLVLNSAAAFNETLWRAIGGATSTEIRA 164

Query: 154 MYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
           MYNLG A LTYWSPNINV RDPRWGR +ETPGEDPF+VGR+AV++VR +QD++   NA  
Sbjct: 165 MYNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFLVGRFAVSFVRAMQDIDDGANAGA 224

Query: 214 LNSRP----LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
             + P    LKVSSCCKHYAAYDVD W G DR  FDA V E+DM ETF RPFEMCV++GD
Sbjct: 225 GAADPFARRLKVSSCCKHYAAYDVDKWFGADRLSFDANVQERDMVETFERPFEMCVRDGD 284

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           AS VMCSYNR+NG+P+CA+ +LL  TVR +W LHGYIV+DCDS++VMV + K+L      
Sbjct: 285 ASCVMCSYNRINGVPACANGRLLTGTVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYDGVQ 344

Query: 330 AVAQTLKAGLDLDCGQY-------YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
           A A  +KAGLDLDCG +       +T +   AV+QGK+KE ++D++L +LY  LMRLGFF
Sbjct: 345 ATAAAMKAGLDLDCGMFWEGAKDFFTAYGLQAVRQGKLKEAEVDEALGHLYLTLMRLGFF 404

Query: 383 DGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG--PHA 440
           DGSP++ SLG  D+C++E+ E+AAEAAR+G+VLLKND + LPL++ KV ++A+VG   H 
Sbjct: 405 DGSPEFQSLGASDVCTEEHKEMAAEAARQGMVLLKNDHDRLPLDANKVNSLALVGLLQHI 464

Query: 441 NATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
           NAT  M+G+Y G PCR ++P        + T    CD  AC    +   A+ AAKT DAT
Sbjct: 465 NATDVMLGDYRGKPCRVVTPYEAIRKVVSGTSMQACDKGAC--GTTALGAAIAAKTVDAT 522

Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
           I++ GL++SVE E  DREDL LP  QTQ IN VAE ++ P+ LVI+SAGGVDI+FA+ N 
Sbjct: 523 IVITGLNMSVEREGNDREDLLLPWDQTQWINAVAEASRDPITLVIISAGGVDISFAQNNP 582

Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
            I AILWAGYPGEEGG  IADV+FGK+NPGGRLP+TWY  +Y+  LP+TSM LRPV   G
Sbjct: 583 KIGAILWAGYPGEEGGTGIADVLFGKYNPGGRLPLTWYKNEYIGKLPMTSMALRPVADKG 642

Query: 621 YPGRTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL--QHCRNLNYT--SD 675
           YPGRTYKFY+GP  LYPFG+GLSYT F Y+  +   ++ V +       C+NL Y   + 
Sbjct: 643 YPGRTYKFYSGPDVLYPFGHGLSYTNFTYDSYTTGASVTVKIGTAWEDSCKNLTYKPGTT 702

Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
           AS   CP + V    C +   F +   N G   GS VV VY+ PPAE+    +KQ++ F+
Sbjct: 703 ASTAPCPAINVAGHGCQEEVSFTLKVSNTGGIGGSHVVPVYTAPPAEVDDAPLKQLVAFR 762

Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGV--SFPIHLNF 790
           R+FV AG    + F  + CK+  IV+  A T++PAG   + VG+  +  SFP+ ++ 
Sbjct: 763 RMFVPAGDAVEVPFTLSVCKAFAIVEGTAYTVVPAGVSRVLVGDESLSFSFPVKIDL 819


>gi|14164501|dbj|BAB55751.1| putative alpha-L-arabinofuranosidase/beta-D- xylosidase isoenzyme
           ARA-I [Oryza sativa Japonica Group]
          Length = 818

 Score =  860 bits (2222), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 432/776 (55%), Positives = 545/776 (70%), Gaps = 26/776 (3%)

Query: 36  VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
           VCDP RF+  GL M+ F +CD+SLPY+ RV+DLV RMTL+EKV  LGD A G PR+GLP+
Sbjct: 46  VCDPARFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPR 105

Query: 96  YEWWSEALHGVSNVGPG-THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
           Y WW EALHGVS+VGPG T F D +PGATSFP VI + ASFNE+LW+ IG  VSTE RAM
Sbjct: 106 YLWWGEALHGVSDVGPGGTWFGDAVPGATSFPLVINSAASFNETLWRAIGGVVSTEIRAM 165

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
           YNLG A LTYWSPNINV RDPRWGR +ETPGEDPFVVGRYAVN+VRG+QD++G   A   
Sbjct: 166 YNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVVGRYAVNFVRGMQDIDGATTAASA 225

Query: 215 N------SRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
                  SRP+KVSSCCKHYAAYDVD W G DR  FDARV E+DM ETF RPFEMC+++G
Sbjct: 226 AAATDAFSRPIKVSSCCKHYAAYDVDAWNGTDRLTFDARVQERDMVETFERPFEMCIRDG 285

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
           DAS VMCSYNR+NG+P+CAD +LL +TVR +W LHGYIV+DCDS++VMV + K+L  +  
Sbjct: 286 DASCVMCSYNRINGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGV 345

Query: 329 DAVAQTLKAGLDLDCGQ-------YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
           +A A  +KAGLDLDCG        ++T +  +AV+QGK+KE+ +D +L  LY  LMRLGF
Sbjct: 346 EATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGF 405

Query: 382 FDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG--PH 439
           FDG P+  SLG  D+C++E+ ELAA+AAR+G+VLLKND   LPL+  KV +VA+ G   H
Sbjct: 406 FDGIPELESLGAADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQLQH 465

Query: 440 ANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADA 499
            NAT  M+G+Y G PCR ++P  G     + T    CD  +C +      A+ AAKT DA
Sbjct: 466 INATDVMLGDYRGKPCRVVTPYDGVRKVVSSTSVHACDKGSCDT------AAAAAKTVDA 519

Query: 500 TIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
           TI++AGL++SVE ES DREDL LP  Q   IN VAE +  P++LVIMSAGGVD++FA+ N
Sbjct: 520 TIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQDN 579

Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
             I A++WAGYPGEEGG AIADV+FGK+NPGGRLP+TWY  +YV  +P+TSM LRP    
Sbjct: 580 PKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKIPMTSMALRPDAEH 639

Query: 620 GYPGRTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD-AS 677
           GYPGRTYKFY G   LYPFG+GLSYT F Y   +    + V +   ++C+ L Y +  +S
Sbjct: 640 GYPGRTYKFYGGADVLYPFGHGLSYTNFTYASATAAAPVTVKVGAWEYCKQLTYKAGVSS 699

Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
              CP V V    C +   F V   N G  DG+ VV +Y+ PPAE+     KQ++ F+RV
Sbjct: 700 PPACPAVNVASHACQEEVSFAVTVANTGGRDGTHVVPMYTAPPAEVDGAPRKQLVAFRRV 759

Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGG--VSFPIHLNFN 791
            V AG    + F  N CK+  IV+  A T++P+G   + VG+    +SFP+ ++  
Sbjct: 760 RVAAGAAVEVAFALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALSLSFPVQIDLQ 815


>gi|357153280|ref|XP_003576399.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
           distachyon]
          Length = 807

 Score =  853 bits (2205), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 423/814 (51%), Positives = 552/814 (67%), Gaps = 60/814 (7%)

Query: 13  LSIALLVFSTNAVDANGSSSPVF---VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLV 69
           LS A  V  ++  D  GS+       VCD  RF+  GL MS + +CD+ LPY  RV+DL+
Sbjct: 18  LSTARAVLPSSNDDDGGSAKTAAYTKVCDASRFAAAGLDMSRYRYCDAKLPYGDRVRDLI 77

Query: 70  SRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV----------- 118
             MT++EKV  LGD+A G PR+GLP Y+WWSEALHG+S+ GP T FDD+           
Sbjct: 78  GWMTVEEKVSNLGDWAAGAPRVGLPPYKWWSEALHGLSSTGPTTKFDDLKKPRLHSGRAA 137

Query: 119 IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWG 178
           +   T F  VI + ASFNESLW+ IGQA+STEARAMYNLG+ GLTYWSPNINV RDPRWG
Sbjct: 138 VFNGTVFANVINSAASFNESLWRSIGQAISTEARAMYNLGKGGLTYWSPNINVVRDPRWG 197

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLN----SRPLKVSSCCKHYAAYDVD 234
           R  ETPGEDPFVVGRYAVN+VRG+QDV+  + A   N    SRPLK S+CCKHYAAYDVD
Sbjct: 198 RALETPGEDPFVVGRYAVNFVRGMQDVD--DAAAGFNGDPLSRPLKTSACCKHYAAYDVD 255

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +W G  R+ FDARVTE+DM ETF RPFEMCV++GDAS+VMCSYNRVNGIP+CAD +LL  
Sbjct: 256 DWYGHTRFKFDARVTERDMVETFQRPFEMCVRDGDASAVMCSYNRVNGIPACADARLLAG 315

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ--------- 345
           T+R +W LHGYIV+DCD+++VM DN  +L  +  +A A +LKAGLDLDCG+         
Sbjct: 316 TLRRDWGLHGYIVSDCDAVRVMTDNATWLGYTPAEASAASLKAGLDLDCGESWIVQKGKP 375

Query: 346 ---YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
              + + +   AV+QGK++E+DID +L  LYT LMRLG+FDG P+Y SL ++DICS+ + 
Sbjct: 376 VMDFLSTYGMAAVRQGKMRESDIDNALVNLYTTLMRLGYFDGMPRYESLDEKDICSEAHR 435

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA-TVAMIGNYAGIPCRYMSPI 461
            LA + AR+ +VLLKN    LPL+++K+ +VAV GPHA A    M G+Y G PCRY++P 
Sbjct: 436 SLALDGARQSMVLLKNLDGLLPLDASKLASVAVRGPHAEAPEKVMDGDYTGPPCRYITPR 495

Query: 462 AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLW 521
            G S   N++ + G                      D TI + G+++ +E E  DREDL 
Sbjct: 496 EGISKDVNISQQGG----------------------DVTIYMGGINMHIEREGNDREDLL 533

Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
           LP  QT+ I +VA  +  P++LVI+S GG+D++FA+++  I AILWAGYPG EGG AIAD
Sbjct: 534 LPKNQTEEILRVAAASPSPIVLVILSGGGIDVSFAQSHPKIGAILWAGYPGGEGGHAIAD 593

Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP-TLYPFGYG 640
           V+FG++NPGGRLP+TW+   Y+  LP+TSM LRP    GYPGRTYKFY+GP  LYPFGYG
Sbjct: 594 VIFGRYNPGGRLPLTWFKNKYIHQLPMTSMALRPRPEHGYPGRTYKFYDGPDVLYPFGYG 653

Query: 641 LSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
           LSYT+F+Y LL+    + +   + +HCR L+Y + +    CP V V    C +   F V 
Sbjct: 654 LSYTKFRYELLNKETAVTLAPGR-RHCRQLSYKTGSVGPDCPAVDVASHACAETVSFNVS 712

Query: 701 FQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIV 760
             N G  DG++ V+VY+ PPAE+A   IKQV  F+RV V+AG  + + F  N CK+  IV
Sbjct: 713 VVNAGKADGANAVLVYTAPPAELAGAPIKQVAAFRRVAVKAGAAETVVFTLNVCKAFGIV 772

Query: 761 DYAANTLLPAGEHTIFVGNG---GVSFPIHLNFN 791
           +  A T++P+G  T+ V NG    VSFP+ ++F+
Sbjct: 773 EKTAYTVVPSGVSTVIVENGDSSAVSFPVQISFS 806


>gi|242093144|ref|XP_002437062.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
 gi|241915285|gb|EER88429.1| hypothetical protein SORBIDRAFT_10g020500 [Sorghum bicolor]
          Length = 809

 Score =  838 bits (2165), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 414/789 (52%), Positives = 533/789 (67%), Gaps = 58/789 (7%)

Query: 36  VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
           VCD  RF+++GL MS+F +CD+SLPY+ RV+DL+  MT++EKV  LGD +HG PR+GLP 
Sbjct: 45  VCDADRFAEMGLNMSAFPYCDASLPYADRVRDLIGWMTVEEKVGNLGDVSHGAPRVGLPP 104

Query: 96  YEWWSEALHGVSNVGPGTHFDDV--IPG----------ATSFPTVILTTASFNESLWKKI 143
           Y+WWSEALHGVS+ GP   FDD+   PG          AT F  VI + ASFNE+LWK I
Sbjct: 105 YKWWSEALHGVSSTGPTMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWKSI 164

Query: 144 GQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
           GQAVSTEARAMYNLG+ GLTYWSPNINV RDPRWGR  ETPGEDPFV GRYAVN+VRG+Q
Sbjct: 165 GQAVSTEARAMYNLGKGGLTYWSPNINVVRDPRWGRALETPGEDPFVAGRYAVNFVRGMQ 224

Query: 204 DVEGHENA-TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
           D+ GH+    D ++RP+K S+CCKHYAAYDVD+W    R+ FDARV+E+DM ETFLRPFE
Sbjct: 225 DIPGHDGGGDDPSTRPIKTSACCKHYAAYDVDDWHNHTRFTFDARVSERDMAETFLRPFE 284

Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
           MCV++GDAS VMCSYNRVNGIP+CAD +LL+ T+RG+W LHGYIV+DCD+++VM DN  +
Sbjct: 285 MCVRDGDASGVMCSYNRVNGIPACADARLLSGTIRGDWQLHGYIVSDCDAVRVMTDNATW 344

Query: 323 LADSKEDAVAQTLKAGLDLDCGQYYTNFTGN------------AVQQGKVKETDIDKSLK 370
           L  +  ++ A +++AGLDLDC + +    G             AV QGK++E+DID +L+
Sbjct: 345 LHFTGAESSAASIRAGLDLDCAESWIEEKGRPLRDFLSEYGKAAVAQGKMRESDIDSALR 404

Query: 371 YLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKV 430
             Y  LMRLG+FD  P+Y SL + DIC+DE+  LA + AR+G+VLLKND   LPL+  K+
Sbjct: 405 NQYMTLMRLGYFDNIPRYASLNETDICTDEHKSLAHDGARQGMVLLKNDDGLLPLDPEKI 464

Query: 431 KTVAVVGPHANATVAMI-GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA 489
             VAV GPHA A   ++ G+Y G PCRY++P  G S    ++++                
Sbjct: 465 LAVAVHGPHARAPEKIMDGDYTGPPCRYVTPRQGISKDVKISHR---------------- 508

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
                  A+ TI L G++L +E E  DREDL LP  QT+ I   A+ +  P+ILVI+S G
Sbjct: 509 -------ANTTIYLGGINLHIEREGNDREDLLLPKNQTEEILHFAKASPNPIILVILSGG 561

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
           G+DI+FA  +  I AILWAGYPG EGG AIADV+FG++NPGGRLP+TW+   Y+Q +P+T
Sbjct: 562 GIDISFAHKHPKIGAILWAGYPGGEGGNAIADVIFGRYNPGGRLPLTWFKNKYIQQIPMT 621

Query: 610 SMPLRPVDSLGYPGRTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL-QHC 667
           SM  RPV   GYPGRTYKFY+GP  LYPFGYGLSYT+F Y   + T    V L     HC
Sbjct: 622 SMEFRPVPEKGYPGRTYKFYDGPEVLYPFGYGLSYTKFLYE--TSTNGTAVTLPATGGHC 679

Query: 668 RNLNYT-SDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
           + L+Y  S A+   C  V V    C +   F +   N G   G+ VV+VY+ PP E+A  
Sbjct: 680 KGLSYKPSVATTPACQAVDVAGHACTETVSFNISVTNAGGRGGAHVVLVYTAPPPEVAQA 739

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG----GV 782
            IKQV  F+RVFV A     + F  N CK+  IV+  A T++P+G   + V NG     V
Sbjct: 740 PIKQVAAFRRVFVPARSTATVPFTLNVCKAFGIVERTAYTVVPSGVSKVLVQNGDSSSSV 799

Query: 783 SFPIHLNFN 791
           SFP+ ++F+
Sbjct: 800 SFPVKIDFS 808


>gi|413954831|gb|AFW87480.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 814

 Score =  836 bits (2160), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 414/788 (52%), Positives = 535/788 (67%), Gaps = 58/788 (7%)

Query: 36  VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
           VCD  RF+++GL MS+F +CD+SLPY+ RV+DL+  MT++EKV  LGD +HG PR+GLP 
Sbjct: 52  VCDAERFAEMGLNMSAFPYCDASLPYADRVRDLIGWMTVEEKVGNLGDISHGAPRVGLPP 111

Query: 96  YEWWSEALHGVSNVGPGTHFDDV--IPG----------ATSFPTVILTTASFNESLWKKI 143
           Y+WWSEALHGVS+ GP   FDD+   PG          AT F  VI + ASFNE+LW  I
Sbjct: 112 YKWWSEALHGVSSTGPTMLFDDLHSKPGNHSGRATVNNATVFANVINSAASFNETLWNSI 171

Query: 144 GQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
           GQAVSTEARAMYNLG+ GLTYWSPNINV RDPRWGR  ETPGEDP+V GRYAVN+VRG+Q
Sbjct: 172 GQAVSTEARAMYNLGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVAGRYAVNFVRGMQ 231

Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
           D+ GH +  D ++RP+K S+CCKH+AAYDVDNW    R+ +DARV+E+DM ETFLRPFEM
Sbjct: 232 DIPGHYSG-DPSARPIKTSACCKHHAAYDVDNWHNQTRFTYDARVSERDMAETFLRPFEM 290

Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
           CV+EGD SSVMCSYNRVNG+P+CAD +LL+ TVRGEW L+GYIV+DCD+++VM DN  +L
Sbjct: 291 CVREGDVSSVMCSYNRVNGVPACADARLLSGTVRGEWHLNGYIVSDCDAVRVMTDNATWL 350

Query: 324 ADSKEDAVAQTLKAGLDLDCGQ------------YYTNFTGNAVQQGKVKETDIDKSLKY 371
             +  ++ A +L+AG+DLDC +            Y + +   AV QGK++E+DID +L  
Sbjct: 351 NFTAAESSAVSLRAGMDLDCAESWIEEEGRPLRDYLSEYGMAAVAQGKMRESDIDNALTN 410

Query: 372 LYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK 431
           LY  LMRLG+FD  P+Y SL + D+C+DE+  LA + AR+GIVLLKND   LPL+  K  
Sbjct: 411 LYMTLMRLGYFDNIPRYASLNETDVCTDEHKSLALDGARQGIVLLKNDHGLLPLDPKKTL 470

Query: 432 TVAVVGPHANATVAMI-GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAA 490
            VAV GPHA A   ++ G+Y G PCRY++P  G S    +++K                 
Sbjct: 471 AVAVHGPHARAPEKIMDGDYTGPPCRYVTPRQGISRDVKISHK----------------- 513

Query: 491 SEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
                 A  TI L G++L +E E  DREDL LP  QT+ I   A+ +  P+ILVI+S GG
Sbjct: 514 ------AKMTIYLGGINLYIEREGNDREDLLLPKNQTEEILHFAQASPTPIILVILSGGG 567

Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
           +DI+FA+ +  I AILWAGYPG EGG AIADV+FG++NPGGRLP+TW+   Y++ +P+TS
Sbjct: 568 IDISFAQKHPKIGAILWAGYPGGEGGNAIADVIFGRYNPGGRLPLTWFKNKYIEQIPMTS 627

Query: 611 MPLRPVDSLGYPGRTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL-QHCR 668
           M  RPV   GYPGRTYKFY+GP  LYPFGYGLSYT+F+Y   + T  + V+L     HC+
Sbjct: 628 MEFRPVPEKGYPGRTYKFYDGPEVLYPFGYGLSYTKFQYE--TSTDGVSVSLPAPGGHCK 685

Query: 669 NLNYT-SDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY 727
            L+Y  S A+   C  V V D  C +   F V   N G   G+ VV+VY+ PP E+A   
Sbjct: 686 GLSYKPSVATVPACQAVNVADHACTETVSFNVSVTNAGGRGGAHVVLVYTAPPPEVAEAP 745

Query: 728 IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG----GVS 783
           IKQV  F+RVFV A     + F  N CK+  IV+  A T++P+G   + V NG     VS
Sbjct: 746 IKQVAAFRRVFVAARSTATVPFALNVCKAFGIVERTAYTVVPSGVSKVLVENGDSSSSVS 805

Query: 784 FPIHLNFN 791
           FP+ ++ +
Sbjct: 806 FPVKIDLS 813


>gi|125535311|gb|EAY81859.1| hypothetical protein OsI_37025 [Oryza sativa Indica Group]
          Length = 816

 Score =  832 bits (2150), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 420/789 (53%), Positives = 532/789 (67%), Gaps = 57/789 (7%)

Query: 36  VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
           VCD  RF+ LGL M+ F +CD+SLPY+ RV+DL+ RMT++EKV  LGD+  G  R+GLP 
Sbjct: 51  VCDATRFAGLGLNMTEFRYCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPA 110

Query: 96  YEWWSEALHGVSNVGPGTHFDDV-----------IPGATSFPTVILTTASFNESLWKKIG 144
           Y WWSEALHG+S+ GP T FDD+           +  AT F  VI + ASFNE+LWK IG
Sbjct: 111 YRWWSEALHGLSSTGPTTKFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSIG 170

Query: 145 QAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD 204
           QAVSTEARAMYN+G+ GLTYWSPNINV RDPRWGR  ETPGEDP+VVGRYAVN+VRG+QD
Sbjct: 171 QAVSTEARAMYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQD 230

Query: 205 VEGHENAT---DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
           + GHE      D N+RPLK S+CCKHYAAYD+D+W    R+ FDARV E+DM ETF RPF
Sbjct: 231 IPGHEAVAAGGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPF 290

Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHK 321
           EMCV++GD SSVMCSYNRVNGIP+CAD +LL+QT+R +W LHGYIV+DCD+++VM DN  
Sbjct: 291 EMCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNAT 350

Query: 322 FLADSKEDAVAQTLKAGLDLDCGQ-------------YYTNFTGNAVQQGKVKETDIDKS 368
           +L  +  +A A  LKAGLDLDCG+             + T +   AV +GK++E+DID +
Sbjct: 351 WLGYTGAEASAAALKAGLDLDCGESWKNDTEGHPLMDFLTTYGMEAVNKGKMRESDIDNA 410

Query: 369 LKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSA 428
           L   Y  LMRLG+FD   QY SLG+QDIC+D++  LA + AR+GIVLLKND   LPL++ 
Sbjct: 411 LTNQYMTLMRLGYFDDITQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDAN 470

Query: 429 KVKTVAVVGPHANATVAMI-GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSI 487
           KV  V V GPH  A   ++ G+Y G PCRY++P  G S Y   +++              
Sbjct: 471 KVGFVNVRGPHVQAPEKIMDGDYTGPPCRYVTPRQGVSKYVRFSHR-------------- 516

Query: 488 FAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMS 547
                    A+ TI   GL+L++E E  DRED+ LP  QT+ I +VA+ +  P+ILVI+S
Sbjct: 517 ---------ANTTIYFGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVILS 567

Query: 548 AGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP 607
            GG+D++FA+ N  I AILWAGYPG EGG AIADV+FGK NP GRLP+TW+   Y+  LP
Sbjct: 568 GGGIDVSFAQNNPKIGAILWAGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKYIYQLP 627

Query: 608 LTSMPLRPVDSLGYPGRTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
           +TSM LRPV   GYPGRTYKFYNGP  LYPFGYGLSYT+F Y + +    + V +    H
Sbjct: 628 MTSMDLRPVAKHGYPGRTYKFYNGPDVLYPFGYGLSYTKFLYEMGTNGTALTVPVAG-GH 686

Query: 667 CRNLNYTSDASKT--RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
           C+ L+Y S  S     CP + VN   C +   F V   N G T GS  VIV+SKPPAE+ 
Sbjct: 687 CKKLSYKSGVSSAAPACPAINVNGHACTETVSFNVSVTNGGDTGGSHPVIVFSKPPAEVD 746

Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN--GGV 782
              IKQV+ F+ VFV A     + F  N CK+  IV+  A T++P+G  T+ V N    V
Sbjct: 747 DAPIKQVVAFRSVFVPAWSTVSVSFELNVCKAFGIVEKTAYTVVPSGVSTVLVENVDSSV 806

Query: 783 SFPIHLNFN 791
           SFP+ ++F+
Sbjct: 807 SFPVKISFS 815


>gi|115486735|ref|NP_001068511.1| Os11g0696400 [Oryza sativa Japonica Group]
 gi|77552754|gb|ABA95551.1| Glycosyl hydrolase family 3 C terminal domain containing protein
           [Oryza sativa Japonica Group]
 gi|113645733|dbj|BAF28874.1| Os11g0696400 [Oryza sativa Japonica Group]
          Length = 816

 Score =  830 bits (2145), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 420/787 (53%), Positives = 531/787 (67%), Gaps = 56/787 (7%)

Query: 36  VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
           VCD  RF+ LGL M+ F +CD+SLPY+ RV+DL+ RMT++EKV  LGD+  G  R+GLP 
Sbjct: 52  VCDATRFAGLGLNMTEFRYCDASLPYADRVRDLIGRMTVEEKVGALGDWTDGAARIGLPA 111

Query: 96  YEWWSEALHGVSNVGPGTHFDDV-----------IPGATSFPTVILTTASFNESLWKKIG 144
           Y WWSEALHG+S+ GP T FDD+           +  AT F  VI + ASFNE+LWK IG
Sbjct: 112 YRWWSEALHGLSSTGPTTKFDDLATPHLHSGVSAVYNATVFANVINSAASFNETLWKSIG 171

Query: 145 QAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD 204
           QAVSTEARAMYN+G+ GLTYWSPNINV RDPRWGR  ETPGEDP+VVGRYAVN+VRG+QD
Sbjct: 172 QAVSTEARAMYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQD 231

Query: 205 VEGHENAT---DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
           + GHE      D N+RPLK S+CCKHYAAYD+D+W    R+ FDARV E+DM ETF RPF
Sbjct: 232 IPGHEAVAAGGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPF 291

Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHK 321
           EMCV++GD SSVMCSYNRVNGIP+CAD +LL+QT+R +W LHGYIV+DCD+++VM DN  
Sbjct: 292 EMCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNAT 351

Query: 322 FLADSKEDAVAQTLKAGLDLDCGQYYTNFTG-------------NAVQQGKVKETDIDKS 368
           +L  +  +A A  LKAGLDLDCG+ + N T               AV +GK++E+DID +
Sbjct: 352 WLGYTGAEASAAALKAGLDLDCGESWKNDTDGHPLMDFLTTYGMEAVNKGKMRESDIDNA 411

Query: 369 LKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSA 428
           L   Y  LMRLG+FD   QY SLG+QDIC+D++  LA + AR+GIVLLKND   LPL++ 
Sbjct: 412 LTNQYMTLMRLGYFDDIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDAN 471

Query: 429 KVKTVAVVGPHANATVAMI-GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSI 487
           KV  V V GPH  A   ++ G+Y G PCRY++P  G S Y   +++              
Sbjct: 472 KVGFVNVRGPHVQAPEKIMDGDYTGPPCRYVTPRQGVSKYVRFSHR-------------- 517

Query: 488 FAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMS 547
                    A+ TI   GL+L++E E  DRED+ LP  QT+ I +VA+ +  P+ILVI+S
Sbjct: 518 ---------ANTTIYFGGLNLNIEREGNDREDILLPKNQTEEIIRVAKASPNPIILVILS 568

Query: 548 AGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP 607
            GG+D++FA+ N  I AILWAGYPG EGG AIADV+FGK NP GRLP+TW+   Y+  LP
Sbjct: 569 GGGIDVSFAQNNPKIGAILWAGYPGGEGGNAIADVIFGKHNPSGRLPLTWFKNKYIYQLP 628

Query: 608 LTSMPLRPVDSLGYPGRTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
           +TSM LRPV   GYPGRTYKFY+GP  LYPFGYGLSYT+F Y + +    + V +    H
Sbjct: 629 MTSMDLRPVAKHGYPGRTYKFYDGPDVLYPFGYGLSYTKFLYEMGTNGTALIVPVAG-GH 687

Query: 667 CRNLNYTSDASKT-RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAA 725
           C+ L+Y S  S    CP + VN   C +   F V   N G T GS  VIV+SKPPAE+  
Sbjct: 688 CKKLSYKSGVSTAPACPAINVNGHVCTETVSFNVSVTNGGDTGGSHPVIVFSKPPAEVDD 747

Query: 726 TYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN--GGVS 783
             +KQV+ F+ VFV A     + F  N CK+  IV+  A T++P+G  TI V N    VS
Sbjct: 748 APMKQVVAFKSVFVPAWSTVSVSFELNVCKAFGIVEKTAYTVVPSGVSTILVENVDSSVS 807

Query: 784 FPIHLNF 790
           FP+ ++F
Sbjct: 808 FPVKIDF 814


>gi|297843058|ref|XP_002889410.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335252|gb|EFH65669.1| hypothetical protein ARALYDRAFT_470222 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 763

 Score =  800 bits (2065), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 398/775 (51%), Positives = 528/775 (68%), Gaps = 33/775 (4%)

Query: 9   LCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDL 68
           + F  +I   + S+++V     S   F CD    +   L+     FC  S+P + RVKDL
Sbjct: 1   MAFLAAILFFLISSSSVCVQ--SRETFACDIKDAATATLR-----FCQLSVPITERVKDL 53

Query: 69  VSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTV 128
           + R+TL EKV  LG+ A  +PRLG+  YEWWSEALHGVSNVGPGT F  V P ATSFP V
Sbjct: 54  IGRLTLVEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQV 113

Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDP 188
           I T ASFN SLW+ IG+ VS EARAMYN G  GLTYWSPN+N+ RDPRWGR  ETPGEDP
Sbjct: 114 ITTVASFNASLWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDP 173

Query: 189 FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARV 248
            V G+YA +YVRGLQ   G++ +       LKV++CCKH+ AYD+DNW GVDR+HF+A+V
Sbjct: 174 VVAGKYAASYVRGLQ---GNDRSR------LKVAACCKHFTAYDLDNWNGVDRFHFNAKV 224

Query: 249 TEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVA 308
           ++QD+E+TF  PF MCVKEG+ +S+MCSYN VNG+P+CADP LL +T+R EW L+GYIV+
Sbjct: 225 SKQDIEDTFDVPFRMCVKEGNVASIMCSYNEVNGVPTCADPNLLKKTIRNEWGLNGYIVS 284

Query: 309 DCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKS 368
           DCDS+ V+ D   +   + E+A A ++KAGLDLDCG +    T +AV++  ++E+D+D +
Sbjct: 285 DCDSVGVLYDTQHYTG-TPEEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLRESDVDNA 343

Query: 369 LKYLYTVLMRLGFFDG---SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
           L    TV MRLG FDG   +  Y  LG   +C+  +  LA EAA++GIVLLKN  ++LPL
Sbjct: 344 LINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKNHGSSLPL 403

Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN 485
           +S + +TVAV+GP+++ATVAMIGNYAGI C Y SP+ G +GYA   ++ GC DV C  + 
Sbjct: 404 SSQRHRTVAVIGPNSDATVAMIGNYAGIACGYTSPVQGITGYARTVHQKGCVDVHCMDDR 463

Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
              AA EAA+ ADAT+++ GLD S+EAE  DR  L LPG Q +LI++VA+ AKGPVILV+
Sbjct: 464 LFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLPGKQQELISRVAKAAKGPVILVL 523

Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
           MS G +DI+FAE +  I AI+WAGYPG+EGG AIAD++FG  NPGG+LP+TWY  DY+  
Sbjct: 524 MSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADILFGSANPGGKLPMTWYPQDYLTN 583

Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQ 665
           LP+T M +RP+ S   PGRTY+FY+GP +YPFG+GLSYT+F +++    K I + +    
Sbjct: 584 LPMTEMSMRPIHSKRIPGRTYRFYDGPVVYPFGHGLSYTRFTHSIADAPKVIPIAV---- 639

Query: 666 HCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
             R  N T      R     V   RC+       VD  NVGS DG+  ++V+S PP    
Sbjct: 640 --RGRNGTVSGKSIR-----VTHARCNRLSLGVHVDVTNVGSRDGTHTMLVFSAPPGGEW 692

Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           A   KQ++ F+RV V  G  KR++   + CK L++VD A N  +P G+H I +G+
Sbjct: 693 APK-KQLVAFERVHVAVGEKKRVQVNIHVCKYLSVVDRAGNRRIPIGDHGIHIGD 746


>gi|18378991|ref|NP_563659.1| beta-glucosidase [Arabidopsis thaliana]
 gi|75250279|sp|Q94KD8.1|BXL2_ARATH RecName: Full=Probable beta-D-xylosidase 2; Short=AtBXL2; Flags:
           Precursor
 gi|14194121|gb|AAK56255.1|AF367266_1 At1g02640/T14P4_11 [Arabidopsis thaliana]
 gi|23506063|gb|AAN28891.1| At1g02640/T14P4_11 [Arabidopsis thaliana]
 gi|332189332|gb|AEE27453.1| beta-glucosidase [Arabidopsis thaliana]
          Length = 768

 Score =  798 bits (2061), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 395/775 (50%), Positives = 527/775 (68%), Gaps = 33/775 (4%)

Query: 9   LCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDL 68
           + F   I   + S+++V  +  S   F CD    +   L+     FC  S+P   RV+DL
Sbjct: 6   MAFLAVILFFLISSSSVCVH--SRETFACDTKDAATATLR-----FCQLSVPIPERVRDL 58

Query: 69  VSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTV 128
           + R+TL EKV  LG+ A  +PRLG+  YEWWSEALHGVSNVGPGT F  V P ATSFP V
Sbjct: 59  IGRLTLAEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQV 118

Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDP 188
           I T ASFN SLW+ IG+ VS EARAMYN G  GLTYWSPN+N+ RDPRWGR  ETPGEDP
Sbjct: 119 ITTVASFNASLWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDP 178

Query: 189 FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARV 248
            V G+YA +YVRGLQ   G++ +       LKV++CCKH+ AYD+DNW GVDR+HF+A+V
Sbjct: 179 VVAGKYAASYVRGLQ---GNDRSR------LKVAACCKHFTAYDLDNWNGVDRFHFNAKV 229

Query: 249 TEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVA 308
           ++QD+E+TF  PF MCVKEG+ +S+MCSYN+VNG+P+CADP LL +T+R +W L+GYIV+
Sbjct: 230 SKQDIEDTFDVPFRMCVKEGNVASIMCSYNQVNGVPTCADPNLLKKTIRNQWGLNGYIVS 289

Query: 309 DCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKS 368
           DCDS+ V+ D   +   + E+A A ++KAGLDLDCG +    T +AV++  ++E+D+D +
Sbjct: 290 DCDSVGVLYDTQHYTG-TPEEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLRESDVDNA 348

Query: 369 LKYLYTVLMRLGFFDG---SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
           L    TV MRLG FDG   +  Y  LG   +C+  +  LA EAA++GIVLLKN  ++LPL
Sbjct: 349 LINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKNHGSSLPL 408

Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN 485
           +S + +TVAV+GP+++ATV MIGNYAG+ C Y SP+ G +GYA   ++ GC DV C  + 
Sbjct: 409 SSQRHRTVAVIGPNSDATVTMIGNYAGVACGYTSPVQGITGYARTIHQKGCVDVHCMDDR 468

Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
              AA EAA+ ADAT+++ GLD S+EAE  DR  L LPG Q +L+++VA+ AKGPVILV+
Sbjct: 469 LFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLPGKQQELVSRVAKAAKGPVILVL 528

Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
           MS G +DI+FAE +  I AI+WAGYPG+EGG AIAD++FG  NPGG+LP+TWY  DY+  
Sbjct: 529 MSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADILFGSANPGGKLPMTWYPQDYLTN 588

Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQ 665
           LP+T M +RPV S   PGRTY+FY+GP +YPFG+GLSYT+F +N+    K I + +    
Sbjct: 589 LPMTEMSMRPVHSKRIPGRTYRFYDGPVVYPFGHGLSYTRFTHNIADAPKVIPIAV---- 644

Query: 666 HCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
             R  N T      R     V   RCD       V+  NVGS DG+  ++V+S PP    
Sbjct: 645 --RGRNGTVSGKSIR-----VTHARCDRLSLGVHVEVTNVGSRDGTHTMLVFSAPPGGEW 697

Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           A   KQ++ F+RV V  G  KR++   + CK L++VD A N  +P G+H I +G+
Sbjct: 698 APK-KQLVAFERVHVAVGEKKRVQVNIHVCKYLSVVDRAGNRRIPIGDHGIHIGD 751


>gi|356503923|ref|XP_003520749.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 775

 Score =  798 bits (2060), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 397/784 (50%), Positives = 517/784 (65%), Gaps = 31/784 (3%)

Query: 1   MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLP 60
           M+   S LL       LL    +  +A       F CDP   +       +  FC +SL 
Sbjct: 1   MSSTFSPLLNLIAVFLLLFLVRHTCEARDP----FACDPKNGA-----TENMPFCKASLA 51

Query: 61  YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
              RVKDLV R+TL EKV+ L + A  VPRLG+  YEWWSEALHGVSNVGPG  F+   P
Sbjct: 52  IPERVKDLVGRLTLQEKVRLLVNNAAAVPRLGMKGYEWWSEALHGVSNVGPGVKFNAQFP 111

Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRI 180
           GATSFP VI T ASFN SLW+ IGQ VS EARAMYN G AGLTYWSPN+N+ RDPRWGR 
Sbjct: 112 GATSFPQVITTAASFNASLWEAIGQVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRG 171

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            ETPGEDP + G YA +YVRGLQ  +G+          LKV++CCKH+ AYD+DNW G+D
Sbjct: 172 QETPGEDPVLAGTYAASYVRGLQGTDGNR---------LKVAACCKHFTAYDLDNWNGMD 222

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
           R+HF+A+V++QD+EETF  PF MCV EG  +SVMCSYN+VNG+P+CADP LL +TVRG W
Sbjct: 223 RFHFNAQVSKQDIEETFDVPFRMCVSEGKVASVMCSYNQVNGVPTCADPNLLKKTVRGLW 282

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
            L GYIV+DCDS+ V  DN  +   + E+A A  +KAGLDLDCG +    T NAV++G +
Sbjct: 283 QLDGYIVSDCDSVGVFYDNQHY-TPTPEEAAADAIKAGLDLDCGPFLAVHTQNAVEKGLL 341

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLK 417
            E D++ +L    TV MRLG FDG P    Y  LG +D+C   + ELA EAAR+GIVLLK
Sbjct: 342 SEADVNGALVNTLTVQMRLGMFDGEPSAHAYGKLGPKDVCKPAHQELALEAARQGIVLLK 401

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCD 477
           N    LPL+  +  TVAV+GP++ ATV MIGNYAG+ C Y +P+ G   YA   ++ GC+
Sbjct: 402 NTGPVLPLSPQRHHTVAVIGPNSKATVTMIGNYAGVACGYTNPLQGIGRYAKTIHQLGCE 461

Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVA 537
           +VACK++    +A  AA+ ADAT+++ GLD S+EAE++DR  L LPG Q  L+++VA  +
Sbjct: 462 NVACKNDKLFGSAINAARQADATVLVMGLDQSIEAETVDRTGLLLPGRQQDLVSKVAAAS 521

Query: 538 KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW 597
           KGP ILVIMS G VDI FA+ N  I  ILWAGYPG+ GG AIAD++FG  NPGG+LP+TW
Sbjct: 522 KGPTILVIMSGGSVDITFAKNNPRIVGILWAGYPGQAGGAAIADILFGTTNPGGKLPVTW 581

Query: 598 YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
           Y  +Y+  LP+T+M +R   S GYPGRTY+FYNGP +YPFG+GL+YT F + L S    +
Sbjct: 582 YPQEYLTKLPMTNMAMRGSKSAGYPGRTYRFYNGPVVYPFGHGLTYTHFVHTLASAPTVV 641

Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVY 716
            V LN  +     N ++ A       + V   RCD      +VD +NVGS DG+  ++V+
Sbjct: 642 SVPLNGHRRANVTNISNRA-------IRVTHARCDKLSISLEVDIKNVGSRDGTHTLLVF 694

Query: 717 SKPPAEIAATYI-KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTI 775
           S PPA      + KQ++ F+++ V A   +R+    + CK L++VD +    +P GEH+ 
Sbjct: 695 SAPPAGFGHWALEKQLVAFEKIHVPAKGLQRVGVNIHVCKLLSVVDKSGIRRIPLGEHSF 754

Query: 776 FVGN 779
            +G+
Sbjct: 755 NIGD 758


>gi|357445735|ref|XP_003593145.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
 gi|355482193|gb|AES63396.1| Beta-xylosidase/alpha-L-arabinofuranosidase [Medicago truncatula]
          Length = 775

 Score =  797 bits (2059), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 394/781 (50%), Positives = 533/781 (68%), Gaps = 28/781 (3%)

Query: 3   KVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYS 62
           KV S  LCFS+    ++ + N V   G +S VF CD  + + +    SS+ FCD SL   
Sbjct: 10  KVSSVFLCFSIFYVAVLLNCNHV--YGQTSTVFACDVAKNTNV----SSYGFCDKSLSVE 63

Query: 63  IRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGA 122
            RV DLV R+TL EK+  LG+ A  V RLG+P+YEWWSEALHGVSN+GPGTHF  ++PGA
Sbjct: 64  DRVSDLVKRLTLQEKIGNLGNSAVEVSRLGIPKYEWWSEALHGVSNIGPGTHFSSLVPGA 123

Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE 182
           TSFP  ILT ASFN SL++ IG  VS EARAMYN+G AGLTYWSPNIN+ RDPRWGR  E
Sbjct: 124 TSFPMPILTAASFNTSLFQAIGSVVSNEARAMYNVGLAGLTYWSPNINIFRDPRWGRGQE 183

Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY 242
           TPGEDP +  +YA  YV+GLQ  +      D +S  LKV++CCKHY AYDVDNWKGV RY
Sbjct: 184 TPGEDPLLSSKYAAGYVKGLQQTD------DGDSDKLKVAACCKHYTAYDVDNWKGVQRY 237

Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
            FDA V++QD+++TF  PF+ CV +G+ +SVMCSYN+VNG P+CADP LL   +RG+W L
Sbjct: 238 TFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTCADPDLLKGVIRGKWKL 297

Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKE 362
           +GYIV+DCDS++V+  +  +   + E+A A+T+ +GLDLDCG Y   +TG AV+QG V E
Sbjct: 298 NGYIVSDCDSVEVLFKDQHY-TKTPEEAAAKTILSGLDLDCGSYLGQYTGGAVKQGLVDE 356

Query: 363 TDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKND 419
             I+ ++   +  LMRLGFFDG P    Y +LG +D+C+ EN ELA EAAR+GIVLLKN 
Sbjct: 357 ASINNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTPENQELAREAARQGIVLLKNS 416

Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDV 479
             +LPL+S  +K++AV+GP+ANAT  MIGNY GIPC+Y SP+ G + +   +Y  GC DV
Sbjct: 417 PGSLPLSSKAIKSLAVIGPNANATRVMIGNYEGIPCKYTSPLQGLTAFVPTSYAPGCPDV 476

Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG 539
            C +N  I  A++ A +ADATII+ G +L++EAESLDR ++ LPG Q QL+N+VA V+KG
Sbjct: 477 QC-ANAQIDDAAKIAASADATIIVVGANLAIEAESLDRVNILLPGQQQQLVNEVANVSKG 535

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PVILVIMS GG+D++FA+TN  I +ILW GYPGE GG AIADV+FG +NP GRLP+TWY 
Sbjct: 536 PVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADVIFGSYNPSGRLPMTWYP 595

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
             YV+ +P+T+M +R   + GYPGRTY+FY G T++ FG G+S+   ++ ++   + + V
Sbjct: 596 QSYVEKIPMTNMNMRSDPATGYPGRTYRFYKGETVFSFGDGMSFGTVEHKIVKAPQLVSV 655

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSK 718
            L +   CR+L          C  + V D  C +  F+  +  +N+G    S  V+++  
Sbjct: 656 PLAEDHECRSL---------ECKSLDVADEHCQNLAFDIHLSVKNMGKMSSSHSVLLFFT 706

Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           PP  +     K ++GF++V +       ++F  + C  L++VD   N  +P G+H + VG
Sbjct: 707 PP-NVHNAPQKHLLGFEKVQLAGKSEGMVRFKVDVCNDLSVVDELGNRKVPLGDHMLHVG 765

Query: 779 N 779
           N
Sbjct: 766 N 766


>gi|9972374|gb|AAG10624.1|AC022521_2 Similar to xylosidase [Arabidopsis thaliana]
          Length = 763

 Score =  796 bits (2057), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 395/775 (50%), Positives = 527/775 (68%), Gaps = 33/775 (4%)

Query: 9   LCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDL 68
           + F   I   + S+++V  +  S   F CD    +   L+     FC  S+P   RV+DL
Sbjct: 1   MAFLAVILFFLISSSSVCVH--SRETFACDTKDAATATLR-----FCQLSVPIPERVRDL 53

Query: 69  VSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTV 128
           + R+TL EKV  LG+ A  +PRLG+  YEWWSEALHGVSNVGPGT F  V P ATSFP V
Sbjct: 54  IGRLTLAEKVSLLGNTAAAIPRLGIKGYEWWSEALHGVSNVGPGTKFGGVYPAATSFPQV 113

Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDP 188
           I T ASFN SLW+ IG+ VS EARAMYN G  GLTYWSPN+N+ RDPRWGR  ETPGEDP
Sbjct: 114 ITTVASFNASLWESIGRVVSNEARAMYNGGVGGLTYWSPNVNILRDPRWGRGQETPGEDP 173

Query: 189 FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARV 248
            V G+YA +YVRGLQ   G++ +       LKV++CCKH+ AYD+DNW GVDR+HF+A+V
Sbjct: 174 VVAGKYAASYVRGLQ---GNDRSR------LKVAACCKHFTAYDLDNWNGVDRFHFNAKV 224

Query: 249 TEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVA 308
           ++QD+E+TF  PF MCVKEG+ +S+MCSYN+VNG+P+CADP LL +T+R +W L+GYIV+
Sbjct: 225 SKQDIEDTFDVPFRMCVKEGNVASIMCSYNQVNGVPTCADPNLLKKTIRNQWGLNGYIVS 284

Query: 309 DCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKS 368
           DCDS+ V+ D   +   + E+A A ++KAGLDLDCG +    T +AV++  ++E+D+D +
Sbjct: 285 DCDSVGVLYDTQHYTG-TPEEAAADSIKAGLDLDCGPFLGAHTIDAVKKNLLRESDVDNA 343

Query: 369 LKYLYTVLMRLGFFDG---SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
           L    TV MRLG FDG   +  Y  LG   +C+  +  LA EAA++GIVLLKN  ++LPL
Sbjct: 344 LINTLTVQMRLGMFDGDIAAQPYGHLGPAHVCTPVHKGLALEAAQQGIVLLKNHGSSLPL 403

Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN 485
           +S + +TVAV+GP+++ATV MIGNYAG+ C Y SP+ G +GYA   ++ GC DV C  + 
Sbjct: 404 SSQRHRTVAVIGPNSDATVTMIGNYAGVACGYTSPVQGITGYARTIHQKGCVDVHCMDDR 463

Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
              AA EAA+ ADAT+++ GLD S+EAE  DR  L LPG Q +L+++VA+ AKGPVILV+
Sbjct: 464 LFDAAVEAARGADATVLVMGLDQSIEAEFKDRNSLLLPGKQQELVSRVAKAAKGPVILVL 523

Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
           MS G +DI+FAE +  I AI+WAGYPG+EGG AIAD++FG  NPGG+LP+TWY  DY+  
Sbjct: 524 MSGGPIDISFAEKDRKIPAIVWAGYPGQEGGTAIADILFGSANPGGKLPMTWYPQDYLTN 583

Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQ 665
           LP+T M +RPV S   PGRTY+FY+GP +YPFG+GLSYT+F +N+    K I + +    
Sbjct: 584 LPMTEMSMRPVHSKRIPGRTYRFYDGPVVYPFGHGLSYTRFTHNIADAPKVIPIAV---- 639

Query: 666 HCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
             R  N T      R     V   RCD       V+  NVGS DG+  ++V+S PP    
Sbjct: 640 --RGRNGTVSGKSIR-----VTHARCDRLSLGVHVEVTNVGSRDGTHTMLVFSAPPGGEW 692

Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           A   KQ++ F+RV V  G  KR++   + CK L++VD A N  +P G+H I +G+
Sbjct: 693 APK-KQLVAFERVHVAVGEKKRVQVNIHVCKYLSVVDRAGNRRIPIGDHGIHIGD 746


>gi|9294427|dbj|BAB02547.1| beta-1,4-xylosidase [Arabidopsis thaliana]
          Length = 876

 Score =  796 bits (2055), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 399/753 (52%), Positives = 515/753 (68%), Gaps = 32/753 (4%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
           + + FC+ SL Y  R KDLVSR++L EKVQQL + A GVPRLG+P YEWWSEALHGVS+V
Sbjct: 37  AKYGFCNVSLSYEARAKDLVSRLSLKEKVQQLVNKATGVPRLGVPPYEWWSEALHGVSDV 96

Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
           GPG HF+  +PGATSFP  ILT ASFN SLW K+G+ VSTEARAM+N+G AGLTYWSPN+
Sbjct: 97  GPGVHFNGTVPGATSFPATILTAASFNTSLWLKMGEVVSTEARAMHNVGLAGLTYWSPNV 156

Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
           NV RDPRWGR  ETPGEDP VV +YAVNYV+GLQDV  H+      SR LKVSSCCKHY 
Sbjct: 157 NVFRDPRWGRGQETPGEDPLVVSKYAVNYVKGLQDV--HDAG---KSRRLKVSSCCKHYT 211

Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
           AYD+DNWKG+DR+HFDA+VT+QD+E+T+  PF+ CV+EGD SSVMCSYNRVNGIP+CADP
Sbjct: 212 AYDLDNWKGIDRFHFDAKVTKQDLEDTYQTPFKSCVEEGDVSSVMCSYNRVNGIPTCADP 271

Query: 290 KLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
            LL   +RG+W L GYIV+DCDSIQV  ++  +   ++EDAVA  LKAGL+++CG +   
Sbjct: 272 NLLRGVIRGQWRLDGYIVSDCDSIQVYFNDIHY-TKTREDAVALALKAGLNMNCGDFLGK 330

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAA 406
           +T NAV+  K+  +D+D++L Y Y VLMRLGFFDG P+   + +LG  D+CS ++  LA 
Sbjct: 331 YTENAVKLKKLNGSDVDEALIYNYIVLMRLGFFDGDPKSLPFGNLGPSDVCSKDHQMLAL 390

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
           EAA++GIVLL+N +  LPL    VK +AV+GP+ANAT  MI NYAG+PC+Y SPI G   
Sbjct: 391 EAAKQGIVLLEN-RGDLPLPKTTVKKLAVIGPNANATKVMISNYAGVPCKYTSPIQGLQK 449

Query: 467 YA--NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
           Y    + Y+ GC DV C     I AA +A   AD T+++ GLD +VEAE LDR +L LPG
Sbjct: 450 YVPEKIVYEPGCKDVKCGDQTLISAAVKAVSEADVTVLVVGLDQTVEAEGLDRVNLTLPG 509

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
           YQ +L+  VA  AK  V+LVIMSAG +DI+FA+  + I+A+LW GYPGE GG AIA V+F
Sbjct: 510 YQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNLSTIRAVLWVGYPGEAGGDAIAQVIF 569

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G +NP GRLP TWY  ++   + +T M +RP  + G+PGR+Y+FY G  +Y FGYGLSY+
Sbjct: 570 GDYNPSGRLPETWYPQEFADKVAMTDMNMRPNSTSGFPGRSYRFYTGKPIYKFGYGLSYS 629

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTS--DASKTRCPGVLVNDLRCDDYFEFKVDFQ 702
            F   +LS    I +  N +    NLN T+  D S   C     +DL+        +  +
Sbjct: 630 SFSTFVLSAPSIIHIKTNPIM---NLNKTTSVDISTVNC-----HDLK----IRIVIGVK 677

Query: 703 NVGSTDGSDVVIVYSKPPAEIAATY-----IKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
           N G   GS VV+V+ KPP    +       + Q++GF+RV V     ++    F+ CK+L
Sbjct: 678 NHGLRSGSHVVLVFWKPPKCSKSLVGGGVPLTQLVGFERVEVGRSMTEKFTVDFDVCKAL 737

Query: 758 NIVDYAANTLLPAGEHTIFVG-NGGVSFPIHLN 789
           ++VD      L  G H + +G N       HLN
Sbjct: 738 SLVDTHGKRKLVTGHHKLVIGSNSDQQIYHHLN 770


>gi|15230897|ref|NP_188596.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
 gi|259585724|sp|Q9LJN4.2|BXL5_ARATH RecName: Full=Probable beta-D-xylosidase 5; Short=AtBXL5; Flags:
           Precursor
 gi|332642747|gb|AEE76268.1| putative beta-D-xylosidase 5 [Arabidopsis thaliana]
          Length = 781

 Score =  795 bits (2052), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 399/753 (52%), Positives = 515/753 (68%), Gaps = 32/753 (4%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
           + + FC+ SL Y  R KDLVSR++L EKVQQL + A GVPRLG+P YEWWSEALHGVS+V
Sbjct: 37  AKYGFCNVSLSYEARAKDLVSRLSLKEKVQQLVNKATGVPRLGVPPYEWWSEALHGVSDV 96

Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
           GPG HF+  +PGATSFP  ILT ASFN SLW K+G+ VSTEARAM+N+G AGLTYWSPN+
Sbjct: 97  GPGVHFNGTVPGATSFPATILTAASFNTSLWLKMGEVVSTEARAMHNVGLAGLTYWSPNV 156

Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
           NV RDPRWGR  ETPGEDP VV +YAVNYV+GLQDV  H+      SR LKVSSCCKHY 
Sbjct: 157 NVFRDPRWGRGQETPGEDPLVVSKYAVNYVKGLQDV--HDAG---KSRRLKVSSCCKHYT 211

Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
           AYD+DNWKG+DR+HFDA+VT+QD+E+T+  PF+ CV+EGD SSVMCSYNRVNGIP+CADP
Sbjct: 212 AYDLDNWKGIDRFHFDAKVTKQDLEDTYQTPFKSCVEEGDVSSVMCSYNRVNGIPTCADP 271

Query: 290 KLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
            LL   +RG+W L GYIV+DCDSIQV  ++  +   ++EDAVA  LKAGL+++CG +   
Sbjct: 272 NLLRGVIRGQWRLDGYIVSDCDSIQVYFNDIHY-TKTREDAVALALKAGLNMNCGDFLGK 330

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAA 406
           +T NAV+  K+  +D+D++L Y Y VLMRLGFFDG P+   + +LG  D+CS ++  LA 
Sbjct: 331 YTENAVKLKKLNGSDVDEALIYNYIVLMRLGFFDGDPKSLPFGNLGPSDVCSKDHQMLAL 390

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
           EAA++GIVLL+N +  LPL    VK +AV+GP+ANAT  MI NYAG+PC+Y SPI G   
Sbjct: 391 EAAKQGIVLLEN-RGDLPLPKTTVKKLAVIGPNANATKVMISNYAGVPCKYTSPIQGLQK 449

Query: 467 YA--NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
           Y    + Y+ GC DV C     I AA +A   AD T+++ GLD +VEAE LDR +L LPG
Sbjct: 450 YVPEKIVYEPGCKDVKCGDQTLISAAVKAVSEADVTVLVVGLDQTVEAEGLDRVNLTLPG 509

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
           YQ +L+  VA  AK  V+LVIMSAG +DI+FA+  + I+A+LW GYPGE GG AIA V+F
Sbjct: 510 YQEKLVRDVANAAKKTVVLVIMSAGPIDISFAKNLSTIRAVLWVGYPGEAGGDAIAQVIF 569

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G +NP GRLP TWY  ++   + +T M +RP  + G+PGR+Y+FY G  +Y FGYGLSY+
Sbjct: 570 GDYNPSGRLPETWYPQEFADKVAMTDMNMRPNSTSGFPGRSYRFYTGKPIYKFGYGLSYS 629

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTS--DASKTRCPGVLVNDLRCDDYFEFKVDFQ 702
            F   +LS    I +  N +    NLN T+  D S   C     +DL+        +  +
Sbjct: 630 SFSTFVLSAPSIIHIKTNPIM---NLNKTTSVDISTVNC-----HDLK----IRIVIGVK 677

Query: 703 NVGSTDGSDVVIVYSKPPAEIAATY-----IKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
           N G   GS VV+V+ KPP    +       + Q++GF+RV V     ++    F+ CK+L
Sbjct: 678 NHGLRSGSHVVLVFWKPPKCSKSLVGGGVPLTQLVGFERVEVGRSMTEKFTVDFDVCKAL 737

Query: 758 NIVDYAANTLLPAGEHTIFVG-NGGVSFPIHLN 789
           ++VD      L  G H + +G N       HLN
Sbjct: 738 SLVDTHGKRKLVTGHHKLVIGSNSDQQIYHHLN 770


>gi|292630922|sp|A5JTQ2.1|XYL1_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 1;
           AltName: Full=Xylan
           1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 1;
           Short=MsXyl1; Includes: RecName: Full=Beta-xylosidase;
           AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
           Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
           Full=Alpha-N-arabinofuranosidase; AltName:
           Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
           Flags: Precursor
 gi|146762261|gb|ABQ45227.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
           varia]
          Length = 774

 Score =  795 bits (2052), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 394/790 (49%), Positives = 536/790 (67%), Gaps = 28/790 (3%)

Query: 3   KVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYS 62
           KV S  LCFS+    ++ + N V   G +S VF CD  + +     +SS+ FCD+SL   
Sbjct: 9   KVSSVFLCFSIFYVTVLLNCNHV--YGQTSTVFACDVAKNT----NVSSYGFCDNSLSVE 62

Query: 63  IRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGA 122
            RV DLV R+TL EK+  LG+ A  V RLG+P+YEWWSEALHGVSN+GPGTHF  ++PGA
Sbjct: 63  DRVSDLVKRLTLQEKIGNLGNSAVEVSRLGIPKYEWWSEALHGVSNIGPGTHFSSLVPGA 122

Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE 182
           T+FP  ILT ASFN SL++ IG  VS EARAMYN+G AGLTYWSPNIN+ RDPRWGR  E
Sbjct: 123 TNFPMPILTAASFNTSLFQAIGSVVSNEARAMYNVGLAGLTYWSPNINIFRDPRWGRGQE 182

Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY 242
           TPGEDP +  +YA  YV+GLQ  +      D +S  LKV++CCKHY AYDVDNWKGV RY
Sbjct: 183 TPGEDPLLSSKYAAGYVKGLQQTD------DGDSDKLKVAACCKHYTAYDVDNWKGVQRY 236

Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
            FDA V++QD+++TF  PF+ CV +G+ +SVMCSYN+VNG P+CADP LL   +RG+W L
Sbjct: 237 TFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPTCADPDLLKGVIRGKWKL 296

Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKE 362
           +GYIV+DCDS++V+  +  +   + E+A A+T+ +GLDLDCG Y   +TG AV+QG V E
Sbjct: 297 NGYIVSDCDSVEVLYKDQHY-TKTPEEAAAKTILSGLDLDCGSYLGQYTGGAVKQGLVDE 355

Query: 363 TDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKND 419
             I  ++   +  LMRLGFFDG P    Y +LG +D+C+ EN ELA EAAR+GIVLLKN 
Sbjct: 356 ASITNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTPENQELAREAARQGIVLLKNS 415

Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDV 479
             +LPL+S  +K++AV+GP+ANAT  MIGNY GIPC+Y SP+ G + +   +Y  GC DV
Sbjct: 416 PRSLPLSSKAIKSLAVIGPNANATRVMIGNYEGIPCKYTSPLQGLTAFVPTSYAPGCPDV 475

Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG 539
            C +N  I  A++ A +ADATII+ G +L++EAESLDR ++ LPG Q QL+N+VA V+KG
Sbjct: 476 QC-ANAQIDDAAKIAASADATIIVVGANLAIEAESLDRVNILLPGQQQQLVNEVANVSKG 534

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PVILVIMS GG+D++FA+TN  I +ILW GYPGE GG AIADV+FG +NP GRLP+TWY 
Sbjct: 535 PVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADVIFGSYNPSGRLPMTWYP 594

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
             YV+ +P+T+M +R   + GYPGRTY+FY G T++ FG G+S+   ++ ++   + + V
Sbjct: 595 QSYVEKVPMTNMNMRADPATGYPGRTYRFYKGETVFSFGDGMSFGTVEHKIVKAPQLVSV 654

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSK 718
            L +   CR+L          C  + V D  C +  F+  +  +N+G    S  V+++  
Sbjct: 655 PLAEDHECRSL---------ECKSLDVADKHCQNLAFDIHLSVKNMGKMSSSHSVLLFFT 705

Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           PP  +     K ++GF++V +       ++F  + C  L++VD   N  +P G+H + VG
Sbjct: 706 PP-NVHNAPQKHLLGFEKVQLAGKSEGMVRFKVDVCNDLSVVDELGNRKVPLGDHMLHVG 764

Query: 779 NGGVSFPIHL 788
           N   S  + +
Sbjct: 765 NLKHSLSVRI 774


>gi|224111912|ref|XP_002316021.1| predicted protein [Populus trichocarpa]
 gi|222865061|gb|EEF02192.1| predicted protein [Populus trichocarpa]
          Length = 768

 Score =  794 bits (2050), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 390/760 (51%), Positives = 517/760 (68%), Gaps = 28/760 (3%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
           F CDP    KLGL   S  FC  +LP  +RV+DL+ R+TL EK++ L + A  VPRLG+ 
Sbjct: 28  FACDP----KLGL-TRSLKFCRVNLPIHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIQ 82

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            YEWWSEALHGVSNVGPGT F    PGAT+FP VI T ASFNESLW++IG+ VS EARAM
Sbjct: 83  GYEWWSEALHGVSNVGPGTKFGGAFPGATAFPQVITTAASFNESLWEEIGRVVSDEARAM 142

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
           YN G AGLTYWSPN+NV RDPRWGR  ETPGEDP V G+YA +YVRGLQ   G       
Sbjct: 143 YNGGMAGLTYWSPNVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNNGLR----- 197

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
               LKV++CCKHY AYD+DNW GVDRYHF+ARV++QD+E+T+  PF+ CV  G  +SVM
Sbjct: 198 ----LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKSCVVAGKVASVM 253

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           CSYN+VNG P+CADP LL  T+RGEW L+GYIV+DCDS+ V+ D   + A + E+A A T
Sbjct: 254 CSYNQVNGKPTCADPYLLKNTIRGEWGLNGYIVSDCDSVGVLFDTQHYTA-TPEEAAAST 312

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
           ++AGLDLDCG +    T NAV+ G +KE D++ +L    TV MRLG FDG P    + +L
Sbjct: 313 IRAGLDLDCGPFLAIHTENAVKGGLLKEEDVNMALANTITVQMRLGMFDGEPSAQPFGNL 372

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
           G +D+C+  + +LA +AAR+GIVLL+N   TLPL S  ++TVAV+GP+++ TV MIGNYA
Sbjct: 373 GPRDVCTPAHQQLALQAARQGIVLLQNRGRTLPL-SRTLQTVAVIGPNSDVTVTMIGNYA 431

Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
           G+ C Y +P+ G   YA   +  GC+DV C  N    AA  AA+ ADATI++ GLD S+E
Sbjct: 432 GVACGYTTPLQGIRRYAKTVHHPGCNDVFCNGNQQFNAAEVAARHADATILVMGLDQSIE 491

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
           AE  DR+ L LPGYQ +L++ VA  ++GP ILV+MS G +D++FA+ +  I AILW GYP
Sbjct: 492 AEFRDRKGLLLPGYQQELVSIVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWVGYP 551

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+ GG AIADV+FG  NPGG+LP+TWY  +Y+  +P+T+M +R   S GYPGRTY+FY G
Sbjct: 552 GQAGGAAIADVLFGTANPGGKLPMTWYPHNYLAKVPMTNMGMRADPSRGYPGRTYRFYKG 611

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
           P ++PFG+G+SYT F ++L+   + + V L  L   RN    S+A       + V+   C
Sbjct: 612 PVVFPFGHGMSYTTFAHSLVQAPREVSVPLASLHVSRNTTGASNA-------IRVSHANC 664

Query: 692 DDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
           +       +D +N G  DG+  ++V+S PP    +T  KQ+IGF++V +  G  KR+K  
Sbjct: 665 EALALGVHIDVKNTGDMDGTHTLLVFSSPPGGKWSTQ-KQLIGFEKVHLVTGSQKRVKID 723

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNF 790
            + CK L++VD      +P GEH +++G+   S  +  N 
Sbjct: 724 IHVCKHLSVVDRFGIRRIPIGEHDLYIGDLKHSISLQANL 763


>gi|357442285|ref|XP_003591420.1| Beta xylosidase [Medicago truncatula]
 gi|355480468|gb|AES61671.1| Beta xylosidase [Medicago truncatula]
          Length = 765

 Score =  793 bits (2049), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 387/776 (49%), Positives = 518/776 (66%), Gaps = 36/776 (4%)

Query: 9   LCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDL 68
           +  ++   LL+ S+ A D        F CDP   S      ++F FC +SLP   RV DL
Sbjct: 4   ILITIVFLLLLMSSEARDP-------FACDPKNTS-----TNNFPFCKASLPIPTRVNDL 51

Query: 69  VSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTV 128
           + R+TL EKV  L + A  VPR+G+  YEWWSEALHGVSNVGPGT F    P ATSFP V
Sbjct: 52  IGRLTLQEKVSMLVNNAAAVPRVGIKGYEWWSEALHGVSNVGPGTKFAGQFPAATSFPQV 111

Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDP 188
           I T ASFN SLW+ IG+  S EARAMYN G AGLTYWSPN+N+ RDPRWGR  ETPGEDP
Sbjct: 112 ITTVASFNASLWEAIGRVASDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDP 171

Query: 189 FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARV 248
            + G+YA +YVRGLQ  +         S  LKV++ CKH+ AYD+DNW GVDR+HF+A+V
Sbjct: 172 ILAGKYAASYVRGLQGTD---------SSRLKVAASCKHFTAYDLDNWNGVDRFHFNAKV 222

Query: 249 TEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVA 308
           ++QDME+TF  PF MCVKEG+ +SVMCSYN+VNG+P+CADP LL +T+RG+W L GYIV+
Sbjct: 223 SKQDMEDTFNVPFRMCVKEGNVASVMCSYNQVNGVPTCADPNLLKRTIRGQWHLDGYIVS 282

Query: 309 DCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKS 368
           DCDS+ V   N  + + + E+A A  +KAGLDLDCG +    T NAV++G + ETD++ +
Sbjct: 283 DCDSVGVFYTNQHYTS-TPEEAAADAIKAGLDLDCGPFLAQHTQNAVKKGLLTETDVNGA 341

Query: 369 LKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
           L    TV MRLG FDG P    Y +LG  D+C+  + ELA +AAR+GIVLLKN   +LPL
Sbjct: 342 LANTLTVQMRLGMFDGEPSAQPYGNLGPTDVCTPTHQELALDAARQGIVLLKNTGPSLPL 401

Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN 485
           ++   +TVAV+GP++NATV MIGNYAGI C Y SP+ G   YA   ++ GC +VAC  + 
Sbjct: 402 STKNHQTVAVIGPNSNATVTMIGNYAGIACGYTSPLQGIGKYARTIHEPGCANVACNDDK 461

Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
              +A  AA+ ADAT+++ GLD S+EAE +DR  L LPG+Q  L+++VA  ++GP ILV+
Sbjct: 462 QFGSALNAARQADATVLVMGLDQSIEAEMVDRTGLLLPGHQQDLVSKVAAASRGPTILVL 521

Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
           MS G +DI FA+ +  I  ILWAGYPG+ GG AIAD++FG  NPG +LP+TWY   Y++ 
Sbjct: 522 MSGGPIDITFAKNDPRIMGILWAGYPGQAGGAAIADILFGTTNPGAKLPMTWYPQGYLKN 581

Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQ 665
           L +T+M +RP  S GYPGRTY+FYNGP +YPFGYGLSYT F + L S  K + V ++  +
Sbjct: 582 LAMTNMAMRPSSSTGYPGRTYRFYNGPVVYPFGYGLSYTNFVHTLASAPKVVSVPVDGHR 641

Query: 666 HCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
              + N  +         + V   RC        +D +NVGS DG++ ++V+S PP    
Sbjct: 642 RGNSSNKAA---------IRVTHARCGKLSIRLDIDVKNVGSKDGTNTLLVFSVPPTGNG 692

Query: 725 A-TYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
                KQ++ F++V+V A   +R++   + CK L++VD +    +P G H+I +G+
Sbjct: 693 HWAPQKQLVAFEKVYVPAKAQQRVRINIHVCKLLSVVDKSGTRRIPMGAHSIHIGD 748


>gi|356501877|ref|XP_003519750.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 772

 Score =  791 bits (2042), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 385/756 (50%), Positives = 510/756 (67%), Gaps = 27/756 (3%)

Query: 29  GSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGV 88
           G +   F CDP   +   L      FC +SL    RVKDL+ R+TL EKV  L + A  V
Sbjct: 22  GEARDPFACDPKNTATKNLP-----FCKASLATGARVKDLIGRLTLQEKVNLLVNNAAAV 76

Query: 89  PRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVS 148
           PRLG+  YEWWSEALHGVSNVGPGT F    P ATSFP VI T ASFN SLW+ IG+  S
Sbjct: 77  PRLGIKGYEWWSEALHGVSNVGPGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVAS 136

Query: 149 TEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
            EARAMYN G AGLTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA +YVRGLQ  +G+
Sbjct: 137 DEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQGTDGN 196

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
                     LKV++ CKH+ AYD+DNW GVDR+HF+A+V++QD+E+TF  PF MCVKEG
Sbjct: 197 R---------LKVAASCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEG 247

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
             +SVMCSYN+VNG+P+CADP LL +TVRG+W L+GYIV+DCDS+ V  ++  + + + E
Sbjct: 248 KVASVMCSYNQVNGVPTCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHYTS-TPE 306

Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
           +A A  +KAGLDLDCG +    T NAV++G + E D++ +L    TV MRLG +DG P  
Sbjct: 307 EAAADAIKAGLDLDCGPFLGQHTQNAVKKGLISEADVNGALLNTLTVQMRLGMYDGEPSS 366

Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
             Y +LG +D+C+  + ELA EAAR+GIVLLKN   +LPL++ + +TVAV+GP++N T  
Sbjct: 367 HPYNNLGPRDVCTQSHQELALEAARQGIVLLKNKGPSLPLSTRRGRTVAVIGPNSNVTFT 426

Query: 446 MIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG 505
           MIGNYAGI C Y SP+ G   Y    Y+ GC +VAC  +     A  AA+ ADAT+++ G
Sbjct: 427 MIGNYAGIACGYTSPLQGIGTYTKTIYEHGCANVACTDDKQFGRAINAAQQADATVLVMG 486

Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
           LD S+EAE++DR  L LPG+Q  L+++VA  +KGP ILVIMS G VDI FA+ +  I+ I
Sbjct: 487 LDQSIEAETVDRASLLLPGHQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNDPRIQGI 546

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           LWAGYPG+ GG AIAD++FG  NPGG+LP+TWY   Y++ LP+T+M +R   S GYPGRT
Sbjct: 547 LWAGYPGQAGGAAIADILFGTSNPGGKLPMTWYPQGYIKNLPMTNMAMRASRSKGYPGRT 606

Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVL 685
           Y+FYNGP +YPFGYGLSYT F + L S  K + + ++  +H  + N  + A K       
Sbjct: 607 YRFYNGPVVYPFGYGLSYTHFVHTLTSAPKLVSIPVDGHRHGNSSNIANKAIK------- 659

Query: 686 VNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAA-TYIKQVIGFQRVFVRAGR 743
           V   RC        VD +NVGS DG   ++V+S PPA        KQ++ F++V + A  
Sbjct: 660 VTHARCGKLSINLHVDVKNVGSKDGIHTLLVFSAPPAGNGHWAPHKQLVAFEKVHIPAKA 719

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            +R++   + CK L++VD +    +P G H++ +G+
Sbjct: 720 QQRVRVKIHVCKLLSVVDRSGTRRIPMGLHSLHIGD 755


>gi|255548487|ref|XP_002515300.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223545780|gb|EEF47284.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 768

 Score =  790 bits (2040), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 385/749 (51%), Positives = 511/749 (68%), Gaps = 29/749 (3%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
           F CD    SK G    +  FC   LP   RVKDL+ R+TL EKV  L + A  V RLG+ 
Sbjct: 28  FACD----SKDG-TTKNLPFCQVKLPIQDRVKDLIGRLTLAEKVGLLVNNAGAVSRLGIK 82

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            YEWWSEALHGVSNVGPGT F    PGATSFP VI T ASFN +LW+ IG+ VS EARAM
Sbjct: 83  GYEWWSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNSTLWEAIGRVVSDEARAM 142

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
           YN G AGLTYWSPN+N+ RDPRWGR  ETPGEDP +VG+YA +YV+GLQ  +G       
Sbjct: 143 YNGGAAGLTYWSPNVNILRDPRWGRGQETPGEDPLLVGKYAASYVKGLQGNDGER----- 197

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
               LKV++CCKH+ AYD+DNW GVDR+HF+A+V++QDM++TF  PF MCVKEG  +SVM
Sbjct: 198 ----LKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDMKDTFDVPFRMCVKEGKVASVM 253

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           CSYN+VNGIP+CADP LL +TVR +W L+GYIV+DCDS+ V  D   + + + E+A A  
Sbjct: 254 CSYNQVNGIPTCADPNLLRKTVRTQWGLNGYIVSDCDSVGVFYDKQHYTS-TPEEAAADA 312

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
           +KAGLDLDCG +    T +AV++G + E D++ +L    TV MRLG FDG P    Y +L
Sbjct: 313 IKAGLDLDCGPFLAVHTQDAVKRGLISEADVNGALFNTLTVQMRLGMFDGEPSAQPYGNL 372

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
           G +D+C+  + ELA EA R+GIVLLKN   +LPL+  + +TVA++GP++N TV MIGNYA
Sbjct: 373 GPKDVCTPAHQELALEAGRQGIVLLKNHGPSLPLSPRRHRTVAIIGPNSNVTVTMIGNYA 432

Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
           G+ C+Y +P+ G   YA   ++ GC DV C ++     A +AA+ ADAT+++ GLD S+E
Sbjct: 433 GVACQYTTPLQGIGSYAKTIHQQGCADVGCVTDQLFSGAIDAARQADATVLVMGLDQSIE 492

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
           AE  DR  L LPG Q +L+++VA  +KGP ILV+MS G +D++FA+ +  I AILWAGYP
Sbjct: 493 AEFRDRTGLLLPGRQQELVSKVAMASKGPTILVLMSGGPIDVSFAKKDPKIAAILWAGYP 552

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+ GG AIADV+FG  NPGG+LP+TWY  +Y+  LP+T M +R   S GYPGRTY+FY G
Sbjct: 553 GQAGGAAIADVLFGTINPGGKLPMTWYPQEYITNLPMTEMAMRSSQSKGYPGRTYRFYQG 612

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
             +YPFG+G+SYT F +N+ S    + V L+   H  N + +  A       + V   +C
Sbjct: 613 KVVYPFGHGMSYTHFVHNIASAPTMVSVPLDG--HRGNTSISGKA-------IRVTHTKC 663

Query: 692 DDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
           +      +VD +NVGS DG+  ++VYS PPA   + + KQ++ F+RV V AG  +R+   
Sbjct: 664 NKLSLGIQVDVKNVGSKDGTHTLLVYSAPPAGRWSPH-KQLVAFERVHVSAGTQERVGIS 722

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            + CK L++VD +    +P GEH+I +GN
Sbjct: 723 IHVCKLLSVVDRSGIRRIPIGEHSIHIGN 751


>gi|356556038|ref|XP_003546334.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
          Length = 775

 Score =  788 bits (2036), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/750 (50%), Positives = 519/750 (69%), Gaps = 27/750 (3%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
           F CDP    + GL    F FC++ +P  +RV+DL++R+TL EK++ + + A  VPRLG+ 
Sbjct: 37  FACDP----RNGL-TRGFKFCNTHVPIHVRVQDLIARLTLPEKIRLVVNNAIAVPRLGIQ 91

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            YEWWSEALHGVSNVGPGT F    PGAT FP VI T ASFN+SLW++IG+ VS EARAM
Sbjct: 92  GYEWWSEALHGVSNVGPGTKFGGAFPGATMFPQVISTAASFNQSLWQEIGRVVSDEARAM 151

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
           YN G+AGLTYWSPN+N+ RDPRWGR  ETPGEDP +  +YA +YV+GLQ         D 
Sbjct: 152 YNGGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQG--------DS 203

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
               LKV++CCKHY AYD+DNW GVDR+HF+A+V++QD+E+T+  PF+ CV EG  +SVM
Sbjct: 204 AGNHLKVAACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKACVLEGQVASVM 263

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           CSYN+VNG P+CADP LL  T+RG+W L+GYIV+DCDS+ V  DN  +   + E+A A+ 
Sbjct: 264 CSYNQVNGKPTCADPDLLRNTIRGQWRLNGYIVSDCDSVGVFFDNQHY-TKTPEEAAAEA 322

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
           +KAGLDLDCG +    T +A+++G + E D++ +L  L +V MRLG FDG P    Y +L
Sbjct: 323 IKAGLDLDCGPFLAIHTDSAIRKGLISENDLNLALANLISVQMRLGMFDGEPSTQPYGNL 382

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
           G +D+C+  + +LA EAARE IVLL+N  N+LPL+ ++++T+ VVGP+A+ATV MIGNYA
Sbjct: 383 GPRDVCTSAHQQLALEAARESIVLLQNKGNSLPLSPSRLRTIGVVGPNADATVTMIGNYA 442

Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
           G+ C Y +P+ G + Y    ++ GC  VAC+ N    AA   A+ ADA +++ GLD +VE
Sbjct: 443 GVACGYTTPLQGIARYVKTAHQVGCRGVACRGNELFGAAETIARQADAIVLVMGLDQTVE 502

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
           AE+ DR  L LPG Q +L+ +VA  AKGPVIL+IMS G VDI+FA+ +  I AILW GYP
Sbjct: 503 AETRDRVGLLLPGLQQELVTRVARAAKGPVILLIMSGGPVDISFAKNDPKISAILWVGYP 562

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+ GG AIADV+FG  NPGGRLP+TWY   Y+  +P+T+M +RP  + GYPGRTY+FY G
Sbjct: 563 GQAGGTAIADVIFGTTNPGGRLPMTWYPQGYLAKVPMTNMDMRPNPTTGYPGRTYRFYKG 622

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
           P ++PFG+GLSY++F ++L    K + V +  LQ   N   +S A K       V+   C
Sbjct: 623 PVVFPFGHGLSYSRFSHSLALAPKQVSVPIMSLQALTNSTLSSKAVK-------VSHANC 675

Query: 692 DDYF--EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
           DD    EF VD +N GS DG+  ++++S+PP     + IKQ++GF +  V AG  +R+K 
Sbjct: 676 DDSLEMEFHVDVKNEGSMDGTHTLLIFSQPP-HGKWSQIKQLVGFHKTHVLAGSKQRVKV 734

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             + CK L++VD      +P GEH + +G+
Sbjct: 735 GVHVCKHLSVVDQFGVRRIPTGEHELHIGD 764


>gi|356534827|ref|XP_003535953.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 771

 Score =  788 bits (2036), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 386/756 (51%), Positives = 513/756 (67%), Gaps = 27/756 (3%)

Query: 29  GSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGV 88
           G +   F CDP   +   L      FC + L    RVKDL+ R+TL EKV  L + A  V
Sbjct: 21  GEARDPFACDPKNTATKNLP-----FCKAWLATGARVKDLIGRLTLQEKVNLLVNNAAAV 75

Query: 89  PRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVS 148
           PRLG+  YEWWSEALHGVSNVGPGT F    P ATSFP VI T ASFN SLW+ IG+  S
Sbjct: 76  PRLGIKGYEWWSEALHGVSNVGPGTKFGGQFPAATSFPQVITTAASFNASLWEAIGRVAS 135

Query: 149 TEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
            EARAMYN G AGLTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA +YVRGLQ+ +G+
Sbjct: 136 DEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPILAGKYAASYVRGLQETDGN 195

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
                     LKV++ CKH+ AYD+DNW GVDR+HF+A+V++QD+E+TF  PF MCVKEG
Sbjct: 196 R---------LKVAASCKHFTAYDLDNWNGVDRFHFNAQVSKQDIEDTFNVPFRMCVKEG 246

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
             +SVMCSYN+VNG+P+CADP LL +TVRG+W L+GYIV+DCDS+ V  ++  + + + E
Sbjct: 247 KVASVMCSYNQVNGVPTCADPILLKRTVRGQWGLNGYIVSDCDSVGVFYNSQHYTS-TPE 305

Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
           +A A  +KAGLDLDCG +    T NAV++G + ETD++ +L    TV MRLG +DG P  
Sbjct: 306 EAAADAIKAGLDLDCGPFLGQHTQNAVKKGLISETDVNGALLNTLTVQMRLGMYDGEPSS 365

Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
             Y  LG +D+C+  + ELA EAAR+GIVLLKN   +LPL++ +  TVAV+GP++N TV 
Sbjct: 366 HPYGKLGPRDVCTPSHQELALEAARQGIVLLKNKGPSLPLSTRRHPTVAVIGPNSNVTVT 425

Query: 446 MIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG 505
           MIGNYAGI C Y SP+ G   Y    ++ GC +VAC ++     A   A+ ADAT+++ G
Sbjct: 426 MIGNYAGIACGYTSPLEGIGRYTKTIHELGCANVACTNDKQFGRAINVAQQADATVLVMG 485

Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
           LD S+EAE++DR  L LPG Q  L+++VA  +KGP ILVIMS G VDI FA+ N  I+AI
Sbjct: 486 LDQSIEAETVDRAGLLLPGRQQDLVSKVAAASKGPTILVIMSGGPVDITFAKNNPRIQAI 545

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           LWAGYPG+ GG AIAD++FG  NPGG+LP+TWY   Y++ LP+T+M +R   S GYPGRT
Sbjct: 546 LWAGYPGQAGGAAIADILFGTSNPGGKLPMTWYPQGYIKNLPMTNMAMRASRSKGYPGRT 605

Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVL 685
           Y+FYNGP +YPFGYGLSYT F + L S  K + + ++  +H    N +S A+K     + 
Sbjct: 606 YRFYNGPVVYPFGYGLSYTHFVHTLASAPKLVSIPVDGHRHG---NSSSIANKA----IK 658

Query: 686 VNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAA-TYIKQVIGFQRVFVRAGR 743
           V   RC       +VD +NVGS DG+  ++V+S PPA        KQ++ FQ++ + +  
Sbjct: 659 VTHARCGKLSISLQVDVKNVGSKDGTHTLLVFSAPPAGNGHWAPHKQLVAFQKLHIPSKA 718

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            +R+    + CK L++VD +    +P G H++ +G+
Sbjct: 719 QQRVNVNIHVCKLLSVVDRSGTRRVPMGLHSLHIGD 754


>gi|225437531|ref|XP_002270249.1| PREDICTED: probable beta-D-xylosidase 2 [Vitis vinifera]
 gi|297743965|emb|CBI36935.3| unnamed protein product [Vitis vinifera]
          Length = 768

 Score =  788 bits (2036), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 384/769 (49%), Positives = 517/769 (67%), Gaps = 28/769 (3%)

Query: 14  SIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMT 73
           S +LL+F       +G +   F CDP   +  G     F FC  S+    RVKDL+ R+T
Sbjct: 8   SSSLLIFLVVLAVVSGEARDPFACDPKDGANAG-----FPFCRKSIGIGERVKDLIGRLT 62

Query: 74  LDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTA 133
           L+EKV+ L + A GVPRLG+  YEWWSEALHGVSNVGPGT F    PGATSFP VI T A
Sbjct: 63  LEEKVRLLVNNAAGVPRLGIKGYEWWSEALHGVSNVGPGTKFSGDFPGATSFPQVITTAA 122

Query: 134 SFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGR 193
           SFN SLW+ IGQ VS EARAMYN G AGLT+WSPN+N+ RDPRWGR  ETPGEDP + G+
Sbjct: 123 SFNSSLWEAIGQVVSDEARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAGK 182

Query: 194 YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDM 253
           YA  YVRGLQ      NA D     LKV++CCKH+ AYD+DNW GVDR+HFDARV++Q+M
Sbjct: 183 YAARYVRGLQG-----NAGDR----LKVAACCKHFTAYDLDNWNGVDRFHFDARVSKQEM 233

Query: 254 EETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI 313
           E+TF  PF  CV EG  +SVMCSYN+VNG+P+CADP LL  TVR +W L+GY+V+DCDS+
Sbjct: 234 EDTFDVPFRSCVVEGKVASVMCSYNQVNGVPTCADPNLLRNTVRKQWHLNGYVVSDCDSV 293

Query: 314 QVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
            V  DN  +  ++ E+A A  +KAGLDLDCG +    T +A+++G V E D+D +L    
Sbjct: 294 GVFYDNQHY-TNTPEEAAADAIKAGLDLDCGPFLAVHTQDAIKKGLVSEADVDSALVNTV 352

Query: 374 TVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKV 430
           TV MRLG FDG P    +  LG +D+CS  + ELA EAAR+GIVLLKN  ++LPL++   
Sbjct: 353 TVQMRLGMFDGEPSAQPFGDLGPKDVCSPAHQELAIEAARQGIVLLKNHGHSLPLSTRSH 412

Query: 431 KTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAA 490
           +++AV+GP+++A V MIGNYAGIPC Y +P+ G   Y+   ++ GC DVAC  +     A
Sbjct: 413 RSIAVIGPNSDANVTMIGNYAGIPCEYTTPLQGIGRYSRTIHQKGCADVACSEDQLFAGA 472

Query: 491 SEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
            +AA  ADAT+++ GLD S+EAE+ DR DL LPG Q +L+++VA  ++GP +LV+MS G 
Sbjct: 473 IDAASQADATVLVMGLDQSIEAEAKDRADLLLPGRQQELVSKVAMASRGPTVLVLMSGGP 532

Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
           VD++FA+ +  I AI+WAGYPG+ GG AIAD++FG  NPGG+LP+TWY  +Y+  +P+T+
Sbjct: 533 VDVSFAKKDPRIAAIVWAGYPGQAGGAAIADILFGVANPGGKLPMTWYPQEYLSKVPMTT 592

Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
           M +R + S  YPGRTY+FY GP +Y FG+GLSYT F + +      + + L+      + 
Sbjct: 593 MAMRAIPSKAYPGRTYRFYKGPVVYRFGHGLSYTNFVHTIAQAPTAVAIPLHG-----HH 647

Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
           N T      R      N L         +D +NVG+ DGS  ++V+SKPPA   A + KQ
Sbjct: 648 NTTVSGKAIRVTHAKCNRLS----IALHLDVKNVGNKDGSHTLLVFSKPPAGHWAPH-KQ 702

Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           ++ F++V V A   +R++   + CK L++VD +    +P G+H + +G+
Sbjct: 703 LVAFEKVHVAARTQQRVQINIHVCKYLSVVDRSGIRRIPMGQHGLHIGD 751


>gi|371917282|dbj|BAL44717.1| SlArf/Xyl2 [Solanum lycopersicum]
          Length = 774

 Score =  787 bits (2033), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 394/778 (50%), Positives = 525/778 (67%), Gaps = 29/778 (3%)

Query: 14  SIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMT 73
           S+ + +F   ++ A   + P F CD     +      +F FC ++LP   RV+DL+ R+T
Sbjct: 13  SLFIFIFLFVSIQA---ARPPFACD-----QKNRAFRNFPFCQTNLPIGDRVRDLIGRLT 64

Query: 74  LDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTA 133
           L EKV+ LG+ A  VPRLG+  YEWWSEALHGVSNVGPGT F    PGATSFP VI T A
Sbjct: 65  LQEKVKLLGNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTKFGGEFPGATSFPQVITTAA 124

Query: 134 SFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGR 193
           SFN SLW++IG+ VS EARAMYN    GLTYWSPN+N+ RDPRWGR  ETPGEDP V   
Sbjct: 125 SFNASLWEEIGRVVSDEARAMYNGEMGGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAAL 184

Query: 194 YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDM 253
           YA  YVRGLQ   G+E+   L     KV++CCKHY AYD+DNW GVDR+HF+A+VT+QD+
Sbjct: 185 YAERYVRGLQ---GNEDGDSL-----KVAACCKHYTAYDLDNWGGVDRFHFNAKVTKQDI 236

Query: 254 EETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI 313
           E+TF  PF  CVK+G  +S+MCSYN+VNGIP+CADP+LL +T+RG W L+GYIV+DCDS+
Sbjct: 237 EDTFDVPFRSCVKQGKVASIMCSYNQVNGIPTCADPQLLRKTIRGGWGLNGYIVSDCDSV 296

Query: 314 QVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
            V  D   + + + E+A A  +KAGLDLDCG + +  T NAV  G +KE  ID +L    
Sbjct: 297 GVFYDTQHYTS-TPEEAAAAAIKAGLDLDCGPFLSQHTENAVHIGILKEAAIDTNLANTV 355

Query: 374 TVLMRLGFFDGSP---QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKV 430
            V MRLG FDG P   QY  LG +D+CS  + ELA EAAR+GIVLLKN    LPL+  + 
Sbjct: 356 AVQMRLGMFDGEPSAQQYGHLGPRDVCSPAHQELAVEAARQGIVLLKNHGPALPLSPRRH 415

Query: 431 KTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAA 490
           +TVAV+GP+++ TV MIGNYAG+ C Y SP+ G S YA   ++ GC DVAC  +     A
Sbjct: 416 RTVAVIGPNSDVTVTMIGNYAGVACGYTSPLQGISKYAKTIHEKGCGDVACSDDKLFAGA 475

Query: 491 SEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
             AA+ ADAT+++ GLD S+EAE  DR  L LPG+Q +LI++V++ ++GPV+LV+MS G 
Sbjct: 476 VNAARQADATVLVMGLDQSIEAEFRDRTGLLLPGFQQELISEVSKASRGPVVLVLMSGGP 535

Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
           VD+ FA  +  I AI+WAGYPG+ GG AIADV+FG  NPGG+LP+TWY  +Y+  LP+T+
Sbjct: 536 VDVTFANNDPRIGAIVWAGYPGQGGGAAIADVLFGAHNPGGKLPMTWYPQEYLNNLPMTT 595

Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
           M +R   + GYPGRTY+FY GP +YPFG+GLSYT+F   +    KT+ + ++  +H  N 
Sbjct: 596 MDMRSNLAKGYPGRTYRFYKGPLVYPFGHGLSYTKFITTIFEAPKTLAIPIDG-RHTYNS 654

Query: 671 NYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
           +  S+ S      + V   +C     +  VD +NVG  DGS  ++V+SKPP +I   + K
Sbjct: 655 STISNKS------IRVTHAKCSKISVQIHVDVKNVGPKDGSHTLLVFSKPPVDIWVPH-K 707

Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIH 787
           Q++ FQ+V+V A   +R+    + CK L++VD A    +P GEH+I +G+   S  + 
Sbjct: 708 QLVAFQKVYVPARSKQRVAINIHVCKYLSVVDRAGVRRIPIGEHSIHIGDAKHSLSLQ 765


>gi|449469042|ref|XP_004152230.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
          Length = 769

 Score =  786 bits (2030), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 393/771 (50%), Positives = 518/771 (67%), Gaps = 32/771 (4%)

Query: 13  LSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRM 72
           LSI +L+   +A+    S +P F CDP          + + FC  SL    RVKDL+ R+
Sbjct: 9   LSIFILL---SAIHGRASRAP-FACDPNNSVT-----TDYPFCRRSLVVGERVKDLIGRL 59

Query: 73  TLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTT 132
           TL+EKV+ L   A GVPRLG+  Y+WWSEALHGVSNVGPGT F    P ATSFP VI T 
Sbjct: 60  TLEEKVKLLVSNAGGVPRLGIKAYQWWSEALHGVSNVGPGTRFGGEFPAATSFPQVISTA 119

Query: 133 ASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVG 192
           ASFN SLW+ IG+ VS EARAMYN G  GLTYWSPN+N+ RDPRWGR  ETPGEDP + G
Sbjct: 120 ASFNASLWEAIGRVVSDEARAMYNGGVGGLTYWSPNVNIFRDPRWGRGQETPGEDPILAG 179

Query: 193 RYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQD 252
            YAVNYVRGLQ  EG+          LKV++CCKH+ AYD+DNW GVDR+HF+A+V++QD
Sbjct: 180 TYAVNYVRGLQGTEGNR---------LKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQD 230

Query: 253 MEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDS 312
           +E+TF  PF MCVK G  SSVMCSYN+VNG+P+CADP LL  T+R +W L GYIV+DCDS
Sbjct: 231 IEDTFEVPFRMCVKGGKVSSVMCSYNQVNGVPTCADPNLLTNTLRSQWHLDGYIVSDCDS 290

Query: 313 IQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYL 372
           + V  ++  + + + E+A A  +KAGLDLDCG +    T NAV++G + E+ I+ +L   
Sbjct: 291 VGVFYNSQHYTS-TPEEAAAMAIKAGLDLDCGSFLETHTENAVKRGLLNESHINGALSNT 349

Query: 373 YTVLMRLGFFDG---SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAK 429
            +V MRLG FDG   +  Y  LG + +CSD N +LA +AAR+GIVLL+N + +LPL++ +
Sbjct: 350 LSVQMRLGMFDGDLKTQPYAHLGAKHVCSDHNRQLAVDAARQGIVLLENRRGSLPLSTNR 409

Query: 430 VKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA 489
            + VAVVGP++NAT+ MIGNYAGI C Y++P+ G S Y    ++ GC  VAC+SN     
Sbjct: 410 HRIVAVVGPNSNATLTMIGNYAGIACEYITPLQGISKYTRTIHQEGCRGVACRSNKFFGG 469

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A EAA+ ADA +++ GLD S+EAE  DR  L LPG Q  L+ +VA VAKGPVILV+MS G
Sbjct: 470 AIEAARVADAVVLVMGLDQSIEAEFRDRAGLLLPGLQPDLVLKVASVAKGPVILVLMSGG 529

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
            +D++FA+ +  I  I+W GYPG+ GG AIADV+FG+ NPGG+LP+TWY  DYV  LP+T
Sbjct: 530 PIDVSFAKDHPKISGIIWGGYPGQAGGLAIADVLFGQTNPGGKLPMTWYPQDYVSKLPMT 589

Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
           +M LRP  S  YPGRTY+FY GP +YPFG+GLSYT F + +LS   T+ V +   +H  N
Sbjct: 590 TMSLRPGTS--YPGRTYRFYKGPVVYPFGHGLSYTAFTHKILSAPTTLTVPVTGHRHPHN 647

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
                  S+     V V   +CD      KV  +N+G+ DG+  ++VYS PP  +     
Sbjct: 648 ------GSEFWGKAVRVTHAKCDRLSLVIKVAVRNIGARDGAHTLLVYSIPPMGVWVPQ- 700

Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           KQ++ F++V + A   K ++   + CK L++VD      +P GEH I +G+
Sbjct: 701 KQLVAFEKVHIDAQALKEVQINIHVCKLLSVVDKYGIRRVPMGEHGIDIGD 751


>gi|356572781|ref|XP_003554544.1| PREDICTED: probable beta-D-xylosidase 2-like [Glycine max]
          Length = 771

 Score =  786 bits (2029), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 386/752 (51%), Positives = 503/752 (66%), Gaps = 31/752 (4%)

Query: 35  FVCDP--GRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
           F CDP  G   K+        FC  SL  + RVKDL+ R+TL+EKV+ L + A  VPRLG
Sbjct: 27  FACDPKNGGTKKMA-------FCKVSLAIAERVKDLIGRLTLEEKVRLLVNNAAAVPRLG 79

Query: 93  LPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR 152
           +  YEWWSEALHGVSN+GP   F+   P ATSFP VI T ASFN SLW+ IGQ VS EAR
Sbjct: 80  MKGYEWWSEALHGVSNLGPAVKFNAQFPAATSFPQVITTAASFNASLWEAIGQVVSDEAR 139

Query: 153 AMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
           AMYN G AGLTYWSPN+N+ RDPRWGR  ETPGEDP + G YA  YVRGLQ    +    
Sbjct: 140 AMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGTYAATYVRGLQGTHANR--- 196

Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
                 LKV++CCKH+ AYD+DNW G+DR+HF+A+V++QD+E+TF  PF+MCV EG  +S
Sbjct: 197 ------LKVAACCKHFTAYDLDNWNGMDRFHFNAQVSKQDIEDTFDVPFKMCVSEGKVAS 250

Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
           VMCSYN+VNG+P+CADP LL +TVRG W L GYIV+DCDS+ V  DN  +   + E+A A
Sbjct: 251 VMCSYNQVNGVPTCADPNLLKKTVRGLWQLDGYIVSDCDSVGVFYDNQHY-TPTPEEAAA 309

Query: 333 QTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YV 389
             +KAGLDLDCG +    T NAV++G + E D++ +L    TV MRLG FDG P    Y 
Sbjct: 310 DAIKAGLDLDCGPFLAVHTQNAVKKGLLSEADVNGALVNTLTVQMRLGMFDGEPTAHPYG 369

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
            LG +D+C   + ELA EAAR+GIVLLKN    LPL+S   +TVAV+GP++ AT+ MIGN
Sbjct: 370 HLGPKDVCKPAHQELALEAARQGIVLLKNTGPVLPLSSQLHRTVAVIGPNSKATITMIGN 429

Query: 450 YAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLS 509
           YAG+ C Y +P+ G   YA   ++ GC +VACK++     A  AA+ ADAT+++ GLD S
Sbjct: 430 YAGVACGYTNPLQGIGRYARTVHQLGCQNVACKNDKLFGPAINAARQADATVLVMGLDQS 489

Query: 510 VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAG 569
           +EAE++DR  L LPG Q  L+++VA  +KGP ILV+MS G VDI FA+ N  I  ILWAG
Sbjct: 490 IEAETVDRTGLLLPGRQPDLVSKVAAASKGPTILVLMSGGPVDITFAKNNPRIVGILWAG 549

Query: 570 YPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFY 629
           YPG+ GG AIAD++FG  NPGG+LP+TWY  +Y+  LP+T+M +R   S GYPGRTY+FY
Sbjct: 550 YPGQAGGAAIADILFGTANPGGKLPVTWYPEEYLTKLPMTNMAMRATKSAGYPGRTYRFY 609

Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
           NGP +YPFG+GL+YT F + L S    + V LN  +     N ++ A       + V   
Sbjct: 610 NGPVVYPFGHGLTYTHFVHTLASAPTVVSVPLNGHRRANVTNISNRA-------IRVTHA 662

Query: 690 RCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI-KQVIGFQRVFVRAGRNKRI 747
           RCD      +VD +NVGS DG+  ++V+S PPA      + KQ++ F++V V A    R+
Sbjct: 663 RCDKLSITLQVDIKNVGSRDGTHTLLVFSAPPAGFGHWALEKQLVAFEKVHVPAKGQHRV 722

Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
               + CK L++VD +    +P GEH+  +G+
Sbjct: 723 GVNIHVCKLLSVVDRSGIRRIPLGEHSFNIGD 754


>gi|449484229|ref|XP_004156823.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
           [Cucumis sativus]
          Length = 769

 Score =  786 bits (2029), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 393/771 (50%), Positives = 518/771 (67%), Gaps = 32/771 (4%)

Query: 13  LSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRM 72
           LSI +L+   +A+    S +P F CDP          + + FC  SL    RVKDL+ R+
Sbjct: 9   LSIFILL---SAIHGRASRAP-FACDPNNSVT-----TDYPFCRRSLVVEERVKDLIGRL 59

Query: 73  TLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTT 132
           TL+EKV+ L   A GVPRLG+  Y+WWSEALHGVSNVGPGT F    P ATSFP VI T 
Sbjct: 60  TLEEKVKLLVSNAGGVPRLGIKAYQWWSEALHGVSNVGPGTRFGGEFPAATSFPQVISTA 119

Query: 133 ASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVG 192
           ASFN SLW+ IG+ VS EARAMYN G  GLTYWSPN+N+ RDPRWGR  ETPGEDP + G
Sbjct: 120 ASFNASLWEAIGRVVSDEARAMYNGGVGGLTYWSPNVNIFRDPRWGRGQETPGEDPILAG 179

Query: 193 RYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQD 252
            YAVNYVRGLQ  EG+          LKV++CCKH+ AYD+DNW GVDR+HF+A+V++QD
Sbjct: 180 TYAVNYVRGLQGTEGNR---------LKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQD 230

Query: 253 MEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDS 312
           +E+TF  PF MCVK G  SSVMCSYN+VNG+P+CADP LL  T+R +W L GYIV+DCDS
Sbjct: 231 IEDTFEVPFRMCVKGGKVSSVMCSYNQVNGVPTCADPNLLTNTLRSQWHLDGYIVSDCDS 290

Query: 313 IQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYL 372
           + V  ++  + + + E+A A  +KAGLDLDCG +    T NAV++G + E+ I+ +L   
Sbjct: 291 VGVFYNSQHYTS-TPEEAAAMAIKAGLDLDCGSFLETHTENAVKRGLLNESHINGALSNT 349

Query: 373 YTVLMRLGFFDG---SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAK 429
            +V MRLG FDG   +  Y  LG + +CSD N +LA +AAR+GIVLL+N + +LPL++ +
Sbjct: 350 LSVQMRLGMFDGDLKTQPYAHLGAKHVCSDHNRQLAVDAARQGIVLLENRRGSLPLSTNR 409

Query: 430 VKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA 489
            + VAVVGP++NAT+ MIGNYAGI C Y++P+ G S Y    ++ GC  VAC+SN     
Sbjct: 410 HRIVAVVGPNSNATLTMIGNYAGIACEYITPLQGISKYTRTIHQEGCRGVACRSNKFFGG 469

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A EAA+ ADA +++ GLD S+EAE  DR  L LPG Q  L+ +VA VAKGPVILV+MS G
Sbjct: 470 AIEAARVADAVVLVMGLDQSIEAEFRDRAGLLLPGLQPDLVLKVASVAKGPVILVLMSGG 529

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
            +D++FA+ +  I  I+W GYPG+ GG AIADV+FG+ NPGG+LP+TWY  DYV  LP+T
Sbjct: 530 PIDVSFAKDHPKISGIIWGGYPGQAGGLAIADVLFGQTNPGGKLPMTWYPQDYVSKLPMT 589

Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
           +M LRP  S  YPGRTY+FY GP +YPFG+GLSYT F + +LS   T+ V +   +H  N
Sbjct: 590 TMSLRPGTS--YPGRTYRFYKGPVVYPFGHGLSYTAFTHKILSAPTTLTVPVTGHRHPHN 647

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
                  S+     V V   +CD      KV  +N+G+ DG+  ++VYS PP  +     
Sbjct: 648 ------GSEFWGKAVRVTHAKCDRLSLVIKVAVRNIGARDGAHTLLVYSIPPMGVWVPQ- 700

Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           KQ++ F++V + A   K ++   + CK L++VD      +P GEH I +G+
Sbjct: 701 KQLVAFEKVHIDAQALKEVQINIHVCKLLSVVDKYGIRRVPMGEHGIDIGD 751


>gi|357444469|ref|XP_003592512.1| Xylosidase [Medicago truncatula]
 gi|355481560|gb|AES62763.1| Xylosidase [Medicago truncatula]
          Length = 781

 Score =  785 bits (2028), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 399/755 (52%), Positives = 520/755 (68%), Gaps = 22/755 (2%)

Query: 31  SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
           +S    CD G       + S+F FC++SL Y  R KDLVSR+TL EK QQL + + G+ R
Sbjct: 20  TSQKHACDKG-----SPKTSNFPFCNTSLSYETRAKDLVSRLTLQEKAQQLVNPSTGISR 74

Query: 91  LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
           LG+P YEWWSEALHGVSNVGPGT FD  +PGATSFP VIL+ ASFNE+LW  +GQ VS E
Sbjct: 75  LGVPAYEWWSEALHGVSNVGPGTRFDSRVPGATSFPAVILSAASFNETLWYTMGQVVSNE 134

Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
           ARAMYN+  AGLT+WSPN+NV RDPRWGR  ETPGEDP VV RYAVNYVRGLQ+V    +
Sbjct: 135 ARAMYNVDLAGLTFWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEVGDEAS 194

Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
           A       LKVSSCCKHY AYDVDNWKGVDR+HFDA+VT+QD+E+T+  PF+ CV EG  
Sbjct: 195 A---KGDRLKVSSCCKHYTAYDVDNWKGVDRFHFDAKVTKQDLEDTYQPPFKSCVLEGHV 251

Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
           SSVMCSYNRVNGIP+CADP LL   +RG+W L GYIV+DCDS++V  ++  +   + EDA
Sbjct: 252 SSVMCSYNRVNGIPTCADPDLLQGVIRGQWGLDGYIVSDCDSVEVYYNSIHY-TKTPEDA 310

Query: 331 VAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQY 388
           VA  LKAGL+++CG +   +T NAV   KV  + +D++L Y Y VLMRLGFF+   S  +
Sbjct: 311 VALALKAGLNMNCGDFLKKYTANAVNLKKVDVSIVDQALVYNYIVLMRLGFFENPKSLPF 370

Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
            +LG  D+C+ EN +LA EAA++GIVLL+N++  LPL+  K+K +AV+GP+ANAT  MI 
Sbjct: 371 ANLGPSDVCTKENQQLALEAAKQGIVLLENNKGALPLSKTKIKNLAVIGPNANATTVMIS 430

Query: 449 NYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
           NYAGIPCRY SP+ G   Y ++VTY  GC DV C + N   AA +AA +ADA +++ GLD
Sbjct: 431 NYAGIPCRYSSPLQGLQKYISSVTYARGCSDVKCSNQNLFAAAVKAAASADAVVLVVGLD 490

Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
            S+EAE LDR +L LPG+Q +L+  VA   KG +ILVIM+AG +DI+F ++ +NI  ILW
Sbjct: 491 QSIEAEGLDRVNLTLPGFQEKLVKDVAAATKGTLILVIMAAGPIDISFTKSVSNIGGILW 550

Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
            GYPG++GG AIA V+FG +NPGGR P TWY   YV  +P+T M +R   S  +PGRTY+
Sbjct: 551 VGYPGQDGGNAIAQVIFGDYNPGGRSPFTWYPQSYVDQVPMTDMNMRANSSRNFPGRTYR 610

Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN-KLQHCRNLNYTSDASKTRCPGVLV 686
           FYNG +LY FGYGLSY+ F  ++ S   TI +  N  +    N  +  D        + +
Sbjct: 611 FYNGKSLYEFGYGLSYSTFSTHIASAPSTIMLQKNTSISKPLNNIFLDDQV------IDI 664

Query: 687 NDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAE--IAATYIKQVIGFQRVFVRAGR 743
           + + C +  F   +  +N G  DGS VV+V+ +PP+   ++   +KQ+IGF+R  V+ G+
Sbjct: 665 STISCFNLTFSLVIGVKNNGPFDGSHVVLVFLEPPSSEAVSGVPLKQLIGFERAQVKVGK 724

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
            + +    + CK L+ VD      L  G+H I VG
Sbjct: 725 TEFVTVKIDICKMLSNVDSDGKRKLVIGQHNILVG 759


>gi|356525896|ref|XP_003531557.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Glycine max]
          Length = 776

 Score =  784 bits (2024), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 381/779 (48%), Positives = 524/779 (67%), Gaps = 26/779 (3%)

Query: 5   VSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIR 64
           V   LCF   + +     N    +G +S VF CD  +   L    + + FCD SL    R
Sbjct: 11  VPVFLCFFSFMFVATVLLNCDRVSGQTSSVFACDVAKNPAL----AGYGFCDKSLSLEDR 66

Query: 65  VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATS 124
           V DLV R+TL EK+  L + A  V RLG+P+YEWWSEALHGVSNVGPGTHF  ++PGATS
Sbjct: 67  VADLVKRLTLQEKIGSLVNSATSVSRLGIPKYEWWSEALHGVSNVGPGTHFSSLVPGATS 126

Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
           FP  ILT ASFN SL++ IG+ VSTEARAMYN+G AGLTYWSPNIN+ RDPRWGR  ETP
Sbjct: 127 FPMPILTAASFNASLFEAIGRVVSTEARAMYNVGLAGLTYWSPNINIFRDPRWGRGQETP 186

Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
           GEDP +  +YA  YV+GLQ  +      D +S  LKV++CCKHY AYD+DNWKG+ RY F
Sbjct: 187 GEDPLLSSKYATGYVKGLQQTD------DGDSNKLKVAACCKHYTAYDLDNWKGIQRYTF 240

Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
           +A VT+QDM++TF  PF+ CV +G+ +SVMCSYN+VNG P+CADP LL   +RGEW L+G
Sbjct: 241 NAVVTQQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGVIRGEWKLNG 300

Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETD 364
           YIV+DCDS++V+  +  +   + E+A A+T+ AGLDL+CG Y   +T  AV+QG + E  
Sbjct: 301 YIVSDCDSVEVLFKDQHY-TKTPEEAAAETILAGLDLNCGNYLGQYTEGAVKQGLLDEAS 359

Query: 365 IDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQN 421
           I+ ++   +  LMRLGFFDG P    Y +LG  D+C+ EN ELA EAAR+GIVLLKN   
Sbjct: 360 INNAVSNNFATLMRLGFFDGDPSKQTYGNLGPNDVCTSENRELAREAARQGIVLLKNSLG 419

Query: 422 TLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVAC 481
           +LPLN+  +K++AV+GP+ANAT  MIGNY GIPC Y+SP+   +     +Y  GC +V C
Sbjct: 420 SLPLNAKAIKSLAVIGPNANATRVMIGNYEGIPCNYISPLQALTALVPTSYAAGCPNVQC 479

Query: 482 KSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPV 541
            +N  +  A++ A +ADAT+I+ G  L++EAESLDR ++ LPG Q  L+++VA  +KGPV
Sbjct: 480 -ANAELDDATQIAASADATVIVVGASLAIEAESLDRINILLPGQQQLLVSEVANASKGPV 538

Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
           ILVIMS GG+D++FA++N  I +ILW GYPGE GG AIADV+FG +NP GRLP+TWY   
Sbjct: 539 ILVIMSGGGMDVSFAKSNDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYPQS 598

Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
           YV  +P+T+M +R   + GYPGRTY+FY G T++ FG G+S++  ++ ++   + + V L
Sbjct: 599 YVNKVPMTNMNMRADPATGYPGRTYRFYKGETVFSFGDGISFSNIEHKIVKAPQLVSVPL 658

Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP 720
            +   CR+         + C  + V D  C +  F+  +  +N+G    S VV+++  PP
Sbjct: 659 AEDHECRS---------SECMSLDVADEHCQNLAFDIHLGVKNMGKMSSSHVVLLFFTPP 709

Query: 721 AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            ++     K ++GF++V +      +++F  + CK L++VD   N  +P G+H + VGN
Sbjct: 710 -DVHNAPQKHLLGFEKVHLPGKSEAQVRFKVDICKDLSVVDELGNRKVPLGQHLLHVGN 767


>gi|356558612|ref|XP_003547598.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Glycine max]
          Length = 776

 Score =  784 bits (2024), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 384/781 (49%), Positives = 530/781 (67%), Gaps = 30/781 (3%)

Query: 5   VSSLLCF-SLS-IALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYS 62
           V   LCF S + +A ++ + N V  +G +S VF CD  +   L    + + FCD SL   
Sbjct: 11  VPVFLCFFSFTFVASVLLNCNRV--SGQTSAVFACDVAKNPAL----AGYGFCDKSLSVE 64

Query: 63  IRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGA 122
            RV DLV R+TL EK+  L + A  V RLG+P+YEWWSEALHGVSNVGPGTHF  ++PGA
Sbjct: 65  DRVADLVKRLTLQEKIGSLVNSATSVSRLGIPKYEWWSEALHGVSNVGPGTHFSSLVPGA 124

Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE 182
           TSFP  ILT ASFN SL++ IG+ VSTEARAMYN+G AGLTYWSPNIN+ RDPRWGR  E
Sbjct: 125 TSFPMPILTAASFNASLFEAIGRVVSTEARAMYNVGLAGLTYWSPNINIFRDPRWGRGQE 184

Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY 242
           TPGEDP +  +YA  YV+GLQ  +      D +S  LKV++CCKHY AYD+DNWKG+ RY
Sbjct: 185 TPGEDPLLSSKYATGYVKGLQQTD------DGDSNKLKVAACCKHYTAYDLDNWKGIQRY 238

Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
            F+A VT+QDM++TF  PF+ CV +G+ +SVMCSYN+VNG P+CADP LL   +RGEW L
Sbjct: 239 TFNAVVTQQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGIIRGEWKL 298

Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKE 362
           +GYIV+DCDS++V+  +  +   + E+A AQT+ AGLDL+CG Y   +T  AV+QG + E
Sbjct: 299 NGYIVSDCDSVEVLFKDQHY-TKTPEEAAAQTILAGLDLNCGNYLGQYTEGAVKQGLLDE 357

Query: 363 TDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKND 419
             I+ ++   +  LMRLGFFDG P    Y +LG +D+C+ EN ELA EAAR+GIVLLKN 
Sbjct: 358 ASINNAVSNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTSENRELAREAARQGIVLLKNS 417

Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDV 479
             +LPLN+  +K++AV+GP+ANAT  MIGNY GIPC Y+SP+   +     +Y  GC +V
Sbjct: 418 PGSLPLNAKTIKSLAVIGPNANATRVMIGNYEGIPCNYISPLQTLTALVPTSYAAGCPNV 477

Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG 539
            C +N  +  A++ A +ADAT+I+ G  L++EAESLDR ++ LPG Q  L+++VA  +KG
Sbjct: 478 QC-ANAELDDATQIAASADATVIIVGASLAIEAESLDRINILLPGQQQLLVSEVANASKG 536

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PVILVIMS GG+D++FA++N  I +ILW GYPGE GG AIADV+FG +NP GRLP+TWY 
Sbjct: 537 PVILVIMSGGGMDVSFAKSNDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWYP 596

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
             YV  +P+T+M +R   + GYPGRTY+FY G T++ FG G+S++  ++ ++   + + V
Sbjct: 597 QAYVNKVPMTNMNMRADPATGYPGRTYRFYKGETVFSFGDGISFSSIEHKIVKAPQLVSV 656

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSK 718
            L +   CR+         + C  + + D  C +  F+  +  +N G    S VV+++  
Sbjct: 657 PLAEDHECRS---------SECMSLDIADEHCQNLAFDIHLGVKNTGKMSTSHVVLLFFT 707

Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           PP ++     K ++GF++V +      +++F  + CK L++VD   N  +P G+H + VG
Sbjct: 708 PP-DVHNAPQKHLLGFEKVHLPGKSEAQVRFKVDVCKDLSVVDELGNRKVPLGQHLLHVG 766

Query: 779 N 779
           N
Sbjct: 767 N 767


>gi|356529243|ref|XP_003533205.1| PREDICTED: beta-D-xylosidase 1-like [Glycine max]
          Length = 774

 Score =  782 bits (2020), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 380/750 (50%), Positives = 516/750 (68%), Gaps = 27/750 (3%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
           F CDP    + GL    F FC++ +P  +RV+DL++R+TL EK++ + + A  VPRLG+ 
Sbjct: 36  FACDP----RNGL-TRGFKFCNTHVPIHVRVQDLIARLTLPEKIRLVVNNAIAVPRLGIQ 90

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            YEWWSEALHGVSNVGPGT F    PGAT FP VI T ASFN+SLW++IG+ VS EARAM
Sbjct: 91  GYEWWSEALHGVSNVGPGTKFGGAFPGATMFPQVISTAASFNQSLWQEIGRVVSDEARAM 150

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
           YN G+AGLTYWSPN+N+ RDPRWGR  ETPGEDP +  +YA +YV+GLQ         D 
Sbjct: 151 YNGGQAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLAAKYAASYVKGLQG--------DG 202

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
               LKV++CCKHY AYD+DNW GVDR+HF+A+V++QD+E+T+  PF+ CV EG  +SVM
Sbjct: 203 AGNRLKVAACCKHYTAYDLDNWNGVDRFHFNAKVSKQDLEDTYDVPFKACVLEGQVASVM 262

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           CSYN+VNG P+CADP LL  T+RG+W L+GYIV+DCDS+ V  DN  +   + E+A A+ 
Sbjct: 263 CSYNQVNGKPTCADPDLLRNTIRGQWGLNGYIVSDCDSVGVFFDNQHY-TRTPEEAAAEA 321

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
           +KAGLDLDCG +    T +A+++G + E D++ +L  L TV MRLG FDG P    + +L
Sbjct: 322 IKAGLDLDCGPFLAIHTDSAIRKGLISENDLNLALANLITVQMRLGMFDGEPSTQPFGNL 381

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
           G +D+C+  + +LA EAARE IVLL+N  N+LPL+ ++++ V V+GP+ +ATV MIGNYA
Sbjct: 382 GPRDVCTPAHQQLALEAARESIVLLQNKGNSLPLSPSRLRIVGVIGPNTDATVTMIGNYA 441

Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
           G+ C Y +P+ G + Y    ++ GC  VAC+ N    AA   A+  DAT+++ GLD ++E
Sbjct: 442 GVACGYTTPLQGIARYVKTAHQVGCRGVACRGNELFGAAEIIARQVDATVLVMGLDQTIE 501

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
           AE+ DR  L LPG Q +L+ +VA  AKGPVILVIMS G VD++FA+ N  I AILW GYP
Sbjct: 502 AETRDRVGLLLPGLQQELVTRVARAAKGPVILVIMSGGPVDVSFAKNNPKISAILWVGYP 561

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+ GG AIADV+FG  NPGGRLP+TWY   Y+  +P+T+M +RP  + GYPGRTY+FY G
Sbjct: 562 GQAGGTAIADVIFGATNPGGRLPMTWYPQGYLAKVPMTNMDMRPNPATGYPGRTYRFYKG 621

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
           P ++PFG+GLSY++F  +L    K + V +  LQ   N   +S A K       V+   C
Sbjct: 622 PVVFPFGHGLSYSRFSQSLALAPKQVSVQILSLQALTNSTLSSKAVK-------VSHANC 674

Query: 692 DDYF--EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
           DD    EF VD +N GS DG+  ++++SKPP     + IKQ++ F +  V AG  +R+K 
Sbjct: 675 DDSLETEFHVDVKNEGSMDGTHTLLIFSKPPPG-KWSQIKQLVTFHKTHVPAGSKQRLKV 733

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             ++CK L++VD      +P GEH + +G+
Sbjct: 734 NVHSCKHLSVVDQFGVRRIPTGEHELHIGD 763


>gi|255556320|ref|XP_002519194.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
 gi|223541509|gb|EEF43058.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
          Length = 782

 Score =  781 bits (2018), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 384/759 (50%), Positives = 520/759 (68%), Gaps = 26/759 (3%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
           F CDP       L+     FC ++LP  +RV+DL+SR+TL EK++ L + A  VPRLG+ 
Sbjct: 42  FACDPRNGVTRNLK-----FCRANLPIHVRVRDLISRLTLQEKIRLLVNNAAAVPRLGIQ 96

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            YEWWSEALHGVSNVGPG  F    PGATSFP VI T ASFN+SLW++IG+ VS EARAM
Sbjct: 97  GYEWWSEALHGVSNVGPGVKFGGAFPGATSFPQVITTAASFNQSLWEQIGRVVSDEARAM 156

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
           YN G AGLTYWSPN+NV RDPRWGR  ETPGEDP + G+YA +YVRGLQ   G +     
Sbjct: 157 YNGGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQSSTGLK----- 211

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
               LKV++CCKHY AYD+DNW GVDRYHF+ARV++QD+E+T+  PF+ CV EG  +SVM
Sbjct: 212 ----LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKACVVEGKVASVM 267

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           CSYN+VNG P+CADP LL  T+RG+W L+GYIV+DCDS+ V+ DN  + + + E+A A T
Sbjct: 268 CSYNQVNGKPTCADPILLKNTIRGQWGLNGYIVSDCDSVGVLYDNQHYTS-TPEEAAAAT 326

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
           +KAGLDLDCG +    T NAV++G + E D++ +L    TV MRLG FDG P    Y +L
Sbjct: 327 IKAGLDLDCGPFLAIHTENAVKKGLLVEEDVNLALANTITVQMRLGMFDGEPSAHPYGNL 386

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
           G +D+C+  + ELA EAAR+GIVLL+N    LPL+S++  T+AV+GP+++ TV MIGNYA
Sbjct: 387 GPRDVCTPAHQELALEAARQGIVLLENRGQALPLSSSRHHTIAVIGPNSDVTVTMIGNYA 446

Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
           GI C+Y SP+ G S YA   ++ GC DVAC SN    AA  AA+ ADAT+++ GLD S+E
Sbjct: 447 GIACKYTSPLQGISRYAKTLHQNGCGDVACHSNQQFGAAEAAARQADATVLVMGLDQSIE 506

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
           AE  DR  L LPG+Q +L+++VA  ++GP ILV+MS G +D++FA+ +  + AILWAGYP
Sbjct: 507 AEFRDRVGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRVGAILWAGYP 566

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+ GG AIADV+FG  NPGG+LP+TWY   Y+  +P+T+M +RP  + GYPGRTY+FY G
Sbjct: 567 GQAGGAAIADVLFGTTNPGGKLPMTWYPQGYLAKVPMTNMGMRPDPATGYPGRTYRFYKG 626

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
             ++PFG+G+SYT F ++L    K + + +  L     LN T  +   R     V+ + C
Sbjct: 627 NVVFPFGHGMSYTSFSHSLTQAPKEVSLPITNLY---ALNTTISSKAIR-----VSHINC 678

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
                  ++ +N G+ DG+  ++V+S PP+    +  KQ+IGF++V + AG   ++K   
Sbjct: 679 QTSLGIDINVKNTGTMDGTHTLLVFSSPPSGEKESSNKQLIGFEKVDLVAGSQIQVKIDI 738

Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNF 790
           + CK L+ VD      +P G+H I++G+   S  +  N 
Sbjct: 739 HVCKHLSAVDRFGIRRIPIGDHHIYIGDLKHSISLQANM 777


>gi|292630923|sp|A5JTQ3.1|XYL2_MEDVA RecName: Full=Beta-xylosidase/alpha-L-arabinofuranosidase 2;
           AltName: Full=Xylan
           1,4-beta-xylosidase/Alpha-N-arabinofuranosidase 2;
           Short=MsXyl2; Includes: RecName: Full=Beta-xylosidase;
           AltName: Full=1,4-beta-D-xylan xylohydrolase; AltName:
           Full=Xylan 1,4-beta-xylosidase; Includes: RecName:
           Full=Alpha-N-arabinofuranosidase; AltName:
           Full=Alpha-L-arabinofuranosidase; Short=Arabinosidase;
           Flags: Precursor
 gi|146762263|gb|ABQ45228.1| beta-xylosidase/alpha-L-arabinosidase [Medicago sativa subsp. x
           varia]
          Length = 774

 Score =  780 bits (2014), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 385/779 (49%), Positives = 523/779 (67%), Gaps = 28/779 (3%)

Query: 5   VSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIR 64
           VS  LCF +  A L+ S   V +   +S VF CD  +   L    +++ FC+  L    R
Sbjct: 11  VSVFLCFFVLFATLLLSGGRVSSQ--TSAVFACDVAKNPAL----ANYGFCNKKLSVDAR 64

Query: 65  VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATS 124
           VKDLV R+TL EKV  L + A  V RLG+P+YEWWSEALHGVSN+GPGTHF +VIPGATS
Sbjct: 65  VKDLVRRLTLQEKVGNLVNSAVDVSRLGIPKYEWWSEALHGVSNIGPGTHFSNVIPGATS 124

Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
           FP  IL  ASFN SL++ IG+ VSTEARAM+N+G AGLTYWSPNIN+ RDPRWGR  ETP
Sbjct: 125 FPMPILIAASFNASLFQTIGKVVSTEARAMHNVGLAGLTYWSPNINIFRDPRWGRGQETP 184

Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
           GEDP +  +YA  YV+GLQ  +      D +S  LKV++CCKHY AYDVD+WKGV RY F
Sbjct: 185 GEDPLLASKYAAGYVKGLQQTD------DGDSNKLKVAACCKHYTAYDVDDWKGVQRYTF 238

Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
           +A VT+QD+++T+  PF+ CV +G+ +SVMCSYN+VNG P+CADP LL   +RG+W L+G
Sbjct: 239 NAVVTQQDLDDTYQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLKGVIRGKWKLNG 298

Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETD 364
           YIV+DCDS+ V+  N  +   + E+A A+++ AGLDL+CG +   +T  AV+QG + E  
Sbjct: 299 YIVSDCDSVDVLFKNQHY-TKTPEEAAAKSILAGLDLNCGSFLGRYTEGAVKQGLIGEAS 357

Query: 365 IDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQN 421
           I+ ++   +  LMRLGFFDG P    Y +LG +D+C+  N ELA EAAR+GIVLLKN   
Sbjct: 358 INNAVYNNFATLMRLGFFDGDPSKQPYGNLGPKDVCTSANQELAREAARQGIVLLKNCAG 417

Query: 422 TLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVAC 481
           +LPLN+  +K++AV+GP+ANAT AMIGNY GIPC+Y SP+ G +     ++  GC DV C
Sbjct: 418 SLPLNAKAIKSLAVIGPNANATRAMIGNYEGIPCKYTSPLQGLTALVPTSFAAGCPDVQC 477

Query: 482 KSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPV 541
            +N ++  A + A +ADAT+I+ G +L++EAES DR ++ LPG Q QL+ +VA VAKGPV
Sbjct: 478 -TNAALDDAKKIAASADATVIVVGANLAIEAESHDRINILLPGQQQQLVTEVANVAKGPV 536

Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
           IL IMS GG+D++FA+TN  I +ILW GYPGE GG AIADV+FG  NP GRLP+TWY   
Sbjct: 537 ILAIMSGGGMDVSFAKTNKKITSILWVGYPGEAGGAAIADVIFGYHNPSGRLPMTWYPQS 596

Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
           YV  +P+T+M +RP  + GYPGRTY+FY G T++ FG G+SY+ F++ L+   + + V L
Sbjct: 597 YVDKVPMTNMNMRPDPATGYPGRTYRFYKGETVFSFGDGISYSTFEHKLVKAPQLVSVPL 656

Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP 720
            +   CR+         ++C  + V    C +  F+  +  +N G    S  V ++S PP
Sbjct: 657 AEDHVCRS---------SKCKSLDVVGEHCQNLAFDIHLRIKNKGKMSSSQTVFLFSTPP 707

Query: 721 AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           A   A   K ++ F++V +       + F  + CK L +VD   N  +  G+H + VG+
Sbjct: 708 AVHNAPQ-KHLLAFEKVLLTGKSEALVSFKVDVCKDLGLVDELGNRKVALGKHMLHVGD 765


>gi|357511337|ref|XP_003625957.1| Beta-xylosidase [Medicago truncatula]
 gi|355500972|gb|AES82175.1| Beta-xylosidase [Medicago truncatula]
          Length = 771

 Score =  778 bits (2010), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 387/785 (49%), Positives = 522/785 (66%), Gaps = 37/785 (4%)

Query: 1   MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLP 60
           M+   S     +L I LL  S +A D+       F CD    +   L      FC+  L 
Sbjct: 1   MSSTFSLSPLITLFILLLQSSCDARDS-------FACDAKDAATKNLP-----FCNVKLA 48

Query: 61  YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
              RVKDL+ R+T+ EKV  L + A  VPR+G+  YEWWSEALHGVSNVGPGT F  V P
Sbjct: 49  IPERVKDLIGRLTMQEKVNLLVNNAPAVPRVGMKSYEWWSEALHGVSNVGPGTRFGGVFP 108

Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRI 180
            ATSFP VI T ASFN SLW+ IG+ VS EARAMYN G AGLTYWSPN+N+ RDPRWGR 
Sbjct: 109 AATSFPQVITTAASFNASLWEAIGRVVSDEARAMYNGGAAGLTYWSPNVNIFRDPRWGRG 168

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            ETPGEDP + GRYA +YV+GLQ  +G++         LKV++CCKH+ AYDVDNW GVD
Sbjct: 169 QETPGEDPVLAGRYAASYVKGLQGTDGNK---------LKVAACCKHFTAYDVDNWNGVD 219

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
           R+HF+A V++QD+E+TF  PF MCVKEG  +SVMCSYN+VNG+P+CADP LL +TVRG W
Sbjct: 220 RFHFNALVSKQDIEDTFDVPFRMCVKEGKVASVMCSYNQVNGVPTCADPNLLKKTVRGVW 279

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
            L GYIV+DCDS+ V+ ++  + + + E+A A  +KAGLDLDCG +    T +AV++G +
Sbjct: 280 GLDGYIVSDCDSVGVLYNSQHYTS-TPEEAAADAIKAGLDLDCGPFLGVHTQDAVKKGLL 338

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLK 417
            E D++ +L     V MRLG FDG P    Y  LG +D+C   + ELA EAAR+GIVLLK
Sbjct: 339 TEADVNNALVNTLKVQMRLGMFDGEPSAQAYGRLGPKDVCKPAHQELALEAARQGIVLLK 398

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCD 477
           N   TLPL+  + +TVAV+GP+++ TV MIGNYAGI C Y SP+ G   YA   ++ GC 
Sbjct: 399 NTGPTLPLSPQRHRTVAVIGPNSDVTVTMIGNYAGIACGYTSPLQGIGRYAKTIHQQGCS 458

Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVA 537
           +VAC+ +     A +AA+ ADATI++ GLD S+EAE++DR  L LPG+Q  L+++VA  +
Sbjct: 459 NVACRDDKQFGPALDAARHADATILVIGLDQSIEAETVDRTSLLLPGHQQDLVSKVAAAS 518

Query: 538 KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW 597
           KGP ILV+MS G VDI FA+ +  +  ILWAGYPG+ GG AIAD++FG  +PGG+LP+TW
Sbjct: 519 KGPTILVLMSGGPVDITFAKNDPKVAGILWAGYPGQAGGAAIADILFGTASPGGKLPVTW 578

Query: 598 YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
           Y  +Y++ L +T+M +RP   +GYPGRTY+FY GP +YPFG+GL+YT F + L S    +
Sbjct: 579 YPQEYLKNLAMTNMAMRP-SKIGYPGRTYRFYKGPVVYPFGHGLTYTHFVHELSSAPTVV 637

Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVY 716
            V ++  +H  N N ++ A       + V   RC        VD +NVGS DG+  ++V+
Sbjct: 638 SVPVHGHRHGNNTNISNKA-------IRVTHARCGKLSIALHVDVKNVGSRDGTHTLLVF 690

Query: 717 SKPPAEIAATYI--KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
           S PP      ++  K ++ F++V V A   +R++   + CK L++VD +    +P GEH+
Sbjct: 691 SAPP-NGGNHWVPQKSLVAFEKVHVPAKTKQRVRVNIHVCKLLSVVDKSGIRRIPMGEHS 749

Query: 775 IFVGN 779
           + +G+
Sbjct: 750 LHIGD 754


>gi|297834874|ref|XP_002885319.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
 gi|297331159|gb|EFH61578.1| beta-1,4-xylosidase [Arabidopsis lyrata subsp. lyrata]
          Length = 865

 Score =  775 bits (2001), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 401/799 (50%), Positives = 520/799 (65%), Gaps = 56/799 (7%)

Query: 5   VSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIR 64
           V   +  SL IA LV S      N      F CD     +     + + FC+ SL Y  R
Sbjct: 3   VGRFVGVSLLIAALVSSLCESQKN------FACD-----RNDPATAKYGFCNVSLSYEAR 51

Query: 65  VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATS 124
            KDLVSR++L EKVQQL + A GV RLG+P YEWWSEALHGVS+VGPG  F+  +PGATS
Sbjct: 52  AKDLVSRLSLKEKVQQLVNKATGVSRLGVPPYEWWSEALHGVSDVGPGVRFNGTVPGATS 111

Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
           FP  ILT ASFN SLW K+G+ VSTEARAM+N+G AGLTYWSPN+N+ RDPRWGR  ETP
Sbjct: 112 FPATILTAASFNTSLWLKMGEVVSTEARAMHNVGLAGLTYWSPNVNIFRDPRWGRGQETP 171

Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
           GEDP VV +YAVNYV+GLQDV+         SR LKVSSCCKHY AYD+DNWKG+DR+HF
Sbjct: 172 GEDPLVVSKYAVNYVKGLQDVQDAG-----KSRRLKVSSCCKHYTAYDLDNWKGIDRFHF 226

Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
           DA+VT+QD+E+T+  PF+ CV+EGD SSVMCSYNRVNGIP+CADP LL   +RG+W L G
Sbjct: 227 DAKVTKQDLEDTYQPPFKSCVEEGDVSSVMCSYNRVNGIPTCADPNLLRGVIRGQWRLDG 286

Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETD 364
           YIV+DCDSIQV  D+  +             K  L+++CG +   +T NAV+  K+  ++
Sbjct: 287 YIVSDCDSIQVYFDDIHY------------TKTRLNMNCGDFLGKYTENAVKLKKLNGSE 334

Query: 365 IDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQN 421
           +D++L Y Y VLMRLGFFDG P+   +  LG  D+CS ++  LA EAA++GIVLL+N + 
Sbjct: 335 VDEALIYNYIVLMRLGFFDGDPKSLPFGQLGPSDVCSKDHQMLALEAAKQGIVLLEN-RG 393

Query: 422 TLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA--NVTYKTGCDDV 479
            LPL+   VK +AV+GP+ANAT  MI NYAG+PC+Y SP+ G   Y    V Y+ GC DV
Sbjct: 394 DLPLSKTAVKKIAVIGPNANATKVMISNYAGVPCKYTSPLQGLQKYVPEKVVYEPGCKDV 453

Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG 539
            C     I AA +A   AD T+++ GLD +VEAE LDR +L LPGYQ +L+  VA  AK 
Sbjct: 454 NCGEQTLISAAVKAVSEADVTVLVVGLDQTVEAEGLDRVNLTLPGYQEKLVRDVANAAKK 513

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
            V+LVIMSAG +DI+FA+  + I A+LW GYPGE GG AIA V+FG +NP GRLP TWY+
Sbjct: 514 TVVLVIMSAGPIDISFAKNLSTISAVLWVGYPGEAGGDAIAQVIFGDYNPSGRLPETWYS 573

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
            ++   + +T M +RP  + G+PGR+Y+FY G  +Y FGYGLSY+ F   +LS    I +
Sbjct: 574 QEFADKVAMTDMNMRPNSTSGFPGRSYRFYTGKPIYKFGYGLSYSAFSTFVLSAPSIIHI 633

Query: 660 NLNKLQHCRNLNYTS--DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
             N +    NLN T+  D S   C     +DL+        +  +N G   GS VV+V+ 
Sbjct: 634 KTNPIL---NLNKTTSIDISTVNC-----HDLK----IRIVIGVKNRGQRSGSHVVLVFW 681

Query: 718 KPPAEIAATYI------KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAG 771
           KPP + + T +       Q++GF+RV V     +++   F+ CK+L++VD      L  G
Sbjct: 682 KPP-KCSKTLVGAGVPQTQLVGFERVEVGRSMTEKVTVEFDVCKALSLVDTHGKRKLVTG 740

Query: 772 EHTIFVG-NGGVSFPIHLN 789
            HT+ +G N       HLN
Sbjct: 741 HHTLVIGSNSDQQIYHHLN 759


>gi|356524862|ref|XP_003531047.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Glycine max]
          Length = 765

 Score =  775 bits (2000), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 381/749 (50%), Positives = 509/749 (67%), Gaps = 26/749 (3%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
           F CD G+   +    + + FCD SL    RVKDLV R+TL EK+  L + A  V RLG+P
Sbjct: 30  FACDVGKSPAV----AGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAVDVSRLGIP 85

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
           +YEWWSEALHGVSNVGPGT F +VIPGATSFP  ILT ASFN SL++ IG+ VSTEARAM
Sbjct: 86  KYEWWSEALHGVSNVGPGTRFSNVIPGATSFPMPILTAASFNTSLFEVIGRVVSTEARAM 145

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
           YN+G AGLTYWSPNIN+ RDPRWGR  ETPGEDP +  +YA  YV+GLQ  +G +     
Sbjct: 146 YNVGLAGLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQQTDGGD----- 200

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
               LKV++CCKHY AYDVDNWKG+ RY F+A VT+QDME+TF  PF+ CV +G+ +SVM
Sbjct: 201 -PNKLKVAACCKHYTAYDVDNWKGIQRYTFNAVVTKQDMEDTFQPPFKSCVIDGNVASVM 259

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           CSYN+VNG P+CADP LL   VRGEW L+GYIV+DCDS++V+  +  +   + E+A A +
Sbjct: 260 CSYNKVNGKPTCADPDLLKGVVRGEWKLNGYIVSDCDSVEVLYKDQHY-TKTPEEAAAIS 318

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
           + AGLDL+CG++   +T  AV+QG + E  I+ ++   +  LMRLGFFDG P+   Y +L
Sbjct: 319 ILAGLDLNCGRFLGQYTEGAVKQGLIDEASINNAVTNNFATLMRLGFFDGDPRKQPYGNL 378

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
           G +D+C+ EN ELA EAAR+GIVLLKN   +LPLN+  +K++AV+GP+ANAT  MIGNY 
Sbjct: 379 GPKDVCTQENQELAREAARQGIVLLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYE 438

Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
           GIPC+Y+SP+ G + +A  +Y  GC DV C  N  +  A + A +ADAT+I+ G  L++E
Sbjct: 439 GIPCKYISPLQGLTAFAPTSYAAGCLDVRC-PNPVLDDAKKIAASADATVIVVGASLAIE 497

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
           AESLDR ++ LPG Q  L+++VA  +KGPVILVIMS GG+D++FA+ N  I +ILW GYP
Sbjct: 498 AESLDRVNILLPGQQQLLVSEVANASKGPVILVIMSGGGMDVSFAKNNNKITSILWVGYP 557

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           GE GG AIADV+FG  NP GRLP+TWY   YV  +P+T+M +RP  + GYPGRTY+FY G
Sbjct: 558 GEAGGAAIADVIFGFHNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPATGYPGRTYRFYKG 617

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
            T++ FG GLSY+   + L+   + + V L +   CR+         + C  + V    C
Sbjct: 618 ETVFAFGDGLSYSSIVHKLVKAPQLVSVQLAEDHVCRS---------SECKSIDVVGEHC 668

Query: 692 DDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
            +  F+  +  +N G    +  V ++S PPA   A   K ++GF++V +       + F 
Sbjct: 669 QNLVFDIHLRIKNKGKMSSAHTVFLFSTPPAVHNAPQ-KHLLGFEKVHLIGKSEALVSFK 727

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            + CK L+IVD   N  +  G+H + VG+
Sbjct: 728 VDVCKDLSIVDELGNRKVALGQHLLHVGD 756


>gi|359485890|ref|XP_002264183.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Vitis vinifera]
          Length = 774

 Score =  774 bits (1998), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 393/782 (50%), Positives = 526/782 (67%), Gaps = 28/782 (3%)

Query: 2   AKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPY 61
           A  V+  LCF    +  + S   V A   SSPVF CD      LG     F FC++SL  
Sbjct: 8   APKVTVFLCFLSCFSHFLSSPKWVLAQ--SSPVFACDVENNPTLG----QFGFCNTSLET 61

Query: 62  SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
           + RV DLV R+TL+EK+  L + A  V RLG+P+YEWWSEALHGVS VGPGTHF+ V+PG
Sbjct: 62  AARVADLVKRLTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSVVPG 121

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
           ATSFP VILT ASFN SL++ IG+AVSTEARAMYN+G AGLT+WSPN+N+ RDPRWGR  
Sbjct: 122 ATSFPQVILTAASFNASLFEAIGKAVSTEARAMYNVGLAGLTFWSPNVNIFRDPRWGRGQ 181

Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
           ETPGEDP +  +YA  YVRGLQ  +      D +   LKV++CCKHY AYD+DNWKGVDR
Sbjct: 182 ETPGEDPLLSSKYASGYVRGLQQSD------DGSPDRLKVAACCKHYTAYDLDNWKGVDR 235

Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
           +HF+A VT+QDM++TF  PF+ CV +G+ +SVMCSYN+VNG P+CADP LL+  VRGEW 
Sbjct: 236 FHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWK 295

Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVK 361
           L+GYIV+DCDS+ V  ++  +   + E+A A+ + AGLDL+CG +    T  AV+ G V 
Sbjct: 296 LNGYIVSDCDSVDVFYNSQHY-TKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVD 354

Query: 362 ETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKN 418
           E+ +DK++   +  LMRLGFFDG+P    Y  LG +D+C+ E+ ELA EAAR+GIVLLKN
Sbjct: 355 ESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVCTSEHQELAREAARQGIVLLKN 414

Query: 419 DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDD 478
            + +LPL+   +KT+AV+GP+AN T  MIGNY G PC+Y +P+ G +     TY  GC +
Sbjct: 415 SKGSLPLSPTAIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATTYLPGCSN 474

Query: 479 VACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAK 538
           VAC +   I  A + A  ADAT+++ G+D S+EAE  DR ++ LPG Q  LI +VA+ +K
Sbjct: 475 VACGTAQ-IDEAKKIAAAADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASK 533

Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
           G VILV+MS GG DI+FA+ +  I +ILW GYPGE GG AIADV+FG +NP GRLP+TWY
Sbjct: 534 GNVILVVMSGGGFDISFAKNDDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWY 593

Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
              YV  +P+T+M +RP  + GYPGRTY+FY G T+Y FG GLSYTQF ++L+   K++ 
Sbjct: 594 PQSYVDKVPMTNMNMRPDPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPKSVS 653

Query: 659 VNLNKLQHCRNLNYTS-DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
           + + +   C +    S DA +  C  ++         F+  +   N G+  GS  V ++S
Sbjct: 654 IPIEEGHSCHSSKCKSVDAVQESCQNLV---------FDIHLRVNNAGNISGSHTVFLFS 704

Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
            PP+ +  +  K ++GF++VFV A     ++F  + CK L+IVD      +  G H + V
Sbjct: 705 SPPS-VHNSPQKHLLGFEKVFVTAKAKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHV 763

Query: 778 GN 779
           GN
Sbjct: 764 GN 765


>gi|297797477|ref|XP_002866623.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
 gi|297312458|gb|EFH42882.1| beta-xylosidase 4 [Arabidopsis lyrata subsp. lyrata]
          Length = 784

 Score =  773 bits (1995), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 391/792 (49%), Positives = 522/792 (65%), Gaps = 29/792 (3%)

Query: 1   MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLP 60
           ++ V  + LCF L    L        +N  SSPVF CD      L    +++ FC++ L 
Sbjct: 18  VSSVFLTFLCFFLYFLDL--------SNAQSSPVFACDVAANPSL----AAYGFCNTVLK 65

Query: 61  YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
              RV DLV+R+TL EK+  L   A+GV RLG+P YEWWSEALHGVS +GPGTHF   +P
Sbjct: 66  IEYRVADLVARLTLQEKIGFLVSKANGVTRLGIPTYEWWSEALHGVSYIGPGTHFSSQVP 125

Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRI 180
           GATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGLTYWSPN+N+ RDPRWGR 
Sbjct: 126 GATSFPQVILTAASFNVSLFQAIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRG 185

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            ETPGEDP +  +YA  YV+GLQ+ +G +      S  LKV++CCKHY AYDVDNWKGV+
Sbjct: 186 QETPGEDPLLASKYASGYVKGLQETDGGD------SNRLKVAACCKHYTAYDVDNWKGVE 239

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
           RY F+A VT+QDM++T+  PF+ CV +G+ +SVMCSYN+VNG P+CADP LL+  +RGEW
Sbjct: 240 RYSFNAVVTQQDMDDTYQPPFKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEW 299

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
            L+GYIV+DCDS+ V+  N  +     E A A ++ AGLDL+CG +    T  AV+ G V
Sbjct: 300 KLNGYIVSDCDSVDVLYKNQHYTKTPAE-AAAISILAGLDLNCGSFLGQHTEEAVKSGLV 358

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLK 417
            E  IDK++   +  LMRLGFFDG+P+   Y  LG  D+C+  N ELAA+AAR+GIVLLK
Sbjct: 359 NEAAIDKAISNNFLTLMRLGFFDGNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLK 418

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCD 477
           N    LPL+   +KT+AV+GP+AN T  MIGNY G PC+Y +P+ G +G  + TY  GC 
Sbjct: 419 N-TGFLPLSPKSIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAGAVSTTYLPGCS 477

Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVA 537
           +VAC   + +  A++ A TAD T++L G D S+EAES DR DL LPG Q +L+ QVA+ A
Sbjct: 478 NVACAVAD-VAGATKLAATADVTVLLIGADQSIEAESRDRVDLNLPGQQQELVIQVAKAA 536

Query: 538 KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW 597
           KGPV+LVIMS GG DI FA+ +  I  ILW GYPGE GG AIAD++FG++NP GRLP+TW
Sbjct: 537 KGPVLLVIMSGGGFDITFAKNDPKIAGILWVGYPGEAGGIAIADIIFGRYNPSGRLPMTW 596

Query: 598 YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
           Y   YV+ +P+T M +RP  S GYPGRTY+FY G T+Y FG GLSYT+F ++L+     +
Sbjct: 597 YPQSYVEKVPMTIMNMRPDKSKGYPGRTYRFYTGETVYAFGDGLSYTKFSHSLVKAPSLV 656

Query: 658 QVNLNKLQHCRNLNYTS-DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY 716
            ++L +   CR+    S DA    C   +         FE ++  +N G  +G   V ++
Sbjct: 657 SLSLEENHVCRSSECQSLDAIGPHCENAVSGG---GSAFEVQIKVRNGGDREGIHTVFLF 713

Query: 717 SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
           + PPA I  +  K ++GF+++ +       ++F    CK L++VD      +  G+H + 
Sbjct: 714 TTPPA-IHGSPRKHLLGFEKIRLGKMEEAVVRFKVEVCKDLSVVDEIGKRKIGLGKHLLH 772

Query: 777 VGNGGVSFPIHL 788
           VG+   S  I +
Sbjct: 773 VGDLKHSLSIRI 784


>gi|74355968|dbj|BAE44362.1| alpha-L-arabinofuranosidase [Raphanus sativus]
          Length = 780

 Score =  771 bits (1992), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 385/789 (48%), Positives = 524/789 (66%), Gaps = 37/789 (4%)

Query: 11  FSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVS 70
           FSLS+  L    ++   N  S+PVF CD      L    +++ FC++++    RV DLV+
Sbjct: 18  FSLSLIFLCLLDSS---NAQSTPVFACDVAGNPSL----AAYGFCNTAIKIEYRVADLVA 70

Query: 71  RMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVIL 130
           R+TL EK+  L    HGV RLG+P YEWWSEALHGVS VGPGT F   +PGATSFP VIL
Sbjct: 71  RLTLQEKIGVLTSKLHGVARLGIPTYEWWSEALHGVSYVGPGTRFSGQVPGATSFPQVIL 130

Query: 131 TTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFV 190
           T ASFN SL++ IG+ VSTEARAMYN+G AGLTYWSPN+N+ RDPRWGR  ETPGEDP +
Sbjct: 131 TAASFNVSLFQAIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLL 190

Query: 191 VGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTE 250
             +YA  YV+GLQ+ +    ++D N   LKV++CCKHY AYDVDNWKGV+RY F+A V +
Sbjct: 191 SSKYASGYVKGLQETD----SSDANR--LKVAACCKHYTAYDVDNWKGVERYSFNAVVNQ 244

Query: 251 QDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADC 310
           QD+++T+  PF+ CV +G+ +SVMCSYN+VNG P+CADP LL+  +RGEW L+GYIV+DC
Sbjct: 245 QDLDDTYQPPFKSCVVDGNVASVMCSYNKVNGKPTCADPDLLSGVIRGEWKLNGYIVSDC 304

Query: 311 DSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLK 370
           DS+ V+  N  +   + E+A A ++ AGLDL+CG +  + T  AV+ G VKE  IDK++ 
Sbjct: 305 DSVDVLYKNQHY-TKTPEEAAAISINAGLDLNCGYFLGDHTEAAVKAGLVKEAAIDKAIT 363

Query: 371 YLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNS 427
             +  LMRLGFFDG P+   Y  LG +D+C+  N ELAAEAAR+GIVLLKN    LPL+ 
Sbjct: 364 NNFLTLMRLGFFDGDPKKQIYGGLGPKDVCTPANQELAAEAARQGIVLLKN-TGALPLSP 422

Query: 428 AKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSI 487
             +KT+AV+GP+AN T  MIGNY G PC+Y +P+ G +G  + TY  GC +VAC   + +
Sbjct: 423 KTIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAGTVHTTYLPGCSNVACAVAD-V 481

Query: 488 FAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMS 547
             +++ A  +DAT+++ G D S+EAES DR DL LPG Q +L+ QVA+ AKGPV LVIMS
Sbjct: 482 AGSTKLAAASDATVLVIGADQSIEAESRDRVDLNLPGQQQELVTQVAKAAKGPVFLVIMS 541

Query: 548 AGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP 607
            GG DI FA+ +  I  ILW GYPGE GG A ADV+FG++NP GRLP+TWY   YV+ +P
Sbjct: 542 GGGFDITFAKNDAKIAGILWVGYPGEAGGIATADVIFGRYNPSGRLPMTWYPQSYVEKVP 601

Query: 608 LTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC 667
           +T+M +RP  S GYPGRTY+FY G T+Y FG GLSYT+F ++L+   + + ++L +   C
Sbjct: 602 MTNMNMRPDKSNGYPGRTYRFYTGETVYAFGDGLSYTKFSHSLVKAPRLVSLSLEENHVC 661

Query: 668 RNLNYTSDASKTRCPGVLVNDLRCDD--------YFEFKVDFQNVGSTDGSDVVIVYSKP 719
           R+         + C  +      CD+         FE  +  QN G  +G   V +++ P
Sbjct: 662 RS---------SECQSLNAIGPHCDNAVSGTGGKAFEVHIKVQNGGDREGIHTVFLFTTP 712

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           PA +  +  K ++GF+++ +       +KF  + CK L++VD      +  G+H + VG+
Sbjct: 713 PA-VHGSPRKHLLGFEKIRLGKMEEAVVKFKVDVCKDLSVVDEVGKRKIGLGQHLLHVGD 771

Query: 780 GGVSFPIHL 788
              S  I +
Sbjct: 772 VKHSLSIRI 780


>gi|356574315|ref|XP_003555294.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 5-like
           [Glycine max]
          Length = 901

 Score =  771 bits (1991), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 389/744 (52%), Positives = 514/744 (69%), Gaps = 17/744 (2%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           + S+F FCD+SL Y  R KDLVSR+TL EK QQL + + G+ RLG+P YEWWSEALHGVS
Sbjct: 30  KTSNFPFCDTSLSYEDRAKDLVSRLTLQEKTQQLVNPSAGISRLGVPAYEWWSEALHGVS 89

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSP 167
           N+GPGT FD  +PGATSFP VIL+ ASFN SLW+K+GQ VSTEARAMYN+  AGLT+WSP
Sbjct: 90  NLGPGTRFDKKVPGATSFPAVILSAASFNASLWQKMGQVVSTEARAMYNVDLAGLTFWSP 149

Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
           N+NV RDPRWGR  ETPGEDP VV RYAV Y+RGLQ+VE   +A    +  LKVSSCCKH
Sbjct: 150 NVNVFRDPRWGRGQETPGEDPLVVSRYAVMYLRGLQEVEDEASA---KADRLKVSSCCKH 206

Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
           Y AYD+DNWKG+DR+HFDA+VT+QD+E+++  PF+ CV EG  SSVMCSYNRVNGIP+CA
Sbjct: 207 YTAYDLDNWKGIDRFHFDAKVTKQDLEDSYQPPFKSCVVEGHVSSVMCSYNRVNGIPTCA 266

Query: 288 DPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
           DP LL   +RG+W L GYIV+DCDS++V  +   + A + EDAVA  LKAGL+++CG + 
Sbjct: 267 DPDLLKGIIRGQWGLDGYIVSDCDSVEVYYNAIHYTA-TPEDAVALALKAGLNMNCGDFL 325

Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIELA 405
             +T NAV   KV    +D++L Y Y VLMRLGFFD   S  + +LG  D+C+ +N +LA
Sbjct: 326 KKYTANAVNLKKVDVATVDQALVYNYIVLMRLGFFDDPKSLPFANLGPSDVCTKDNQQLA 385

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
            +AA++GIVLL+N+   LPL+   +K +AV+GP+ANAT  MI NYAGIPCRY SP+ G  
Sbjct: 386 LDAAKQGIVLLENNNGALPLSQTNIKKLAVIGPNANATTVMISNYAGIPCRYTSPLQGLQ 445

Query: 466 GY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
            Y ++V Y  GC +V C + + I AA +AA +ADA +++ GLD S+EAE LDRE+L LPG
Sbjct: 446 KYISSVNYAPGCSNVKCDNQSLIAAAVKAAASADAVVLVVGLDQSIEAEGLDRENLTLPG 505

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
           +Q + +  VA   KG VILVIM+AG +DI+  ++ +NI  ILW GYPG+ GG AIA V+F
Sbjct: 506 FQEKFVKDVAGATKGKVILVIMAAGPIDISSTKSVSNIGGILWVGYPGQAGGDAIAQVIF 565

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G +NPGGR P TWY   YV  +P+T M +R   S  +PGRTY+FYNG +LY FG+GLSY+
Sbjct: 566 GDYNPGGRSPFTWYPQSYVDQVPMTDMNMRANKSRNFPGRTYRFYNGNSLYEFGHGLSYS 625

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP------GVLVNDLRCDDY-FEF 697
            F   + S   +I +    +    N+  +S+ S T+         + ++ + C D  F  
Sbjct: 626 TFSMYVASAPSSIMIENTSISEPHNM-LSSNNSGTQVESLSDGQAIDISTINCQDLTFLL 684

Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAE--IAATYIKQVIGFQRVFVRAGRNKRIKFVFNACK 755
            +  +N G  +GS VV+V+ +P     +    IKQ+IGF+RV V  G  + +    + C+
Sbjct: 685 VIGVKNNGPLNGSHVVLVFWEPATSEFVIGAPIKQLIGFERVQVVVGVTEFVTVKIDICQ 744

Query: 756 SLNIVDYAANTLLPAGEHTIFVGN 779
            ++ VD      L  G+HTI VG+
Sbjct: 745 LISNVDSDGKRKLVIGQHTILVGS 768


>gi|224054312|ref|XP_002298197.1| predicted protein [Populus trichocarpa]
 gi|222845455|gb|EEE83002.1| predicted protein [Populus trichocarpa]
          Length = 741

 Score =  770 bits (1989), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 389/754 (51%), Positives = 508/754 (67%), Gaps = 28/754 (3%)

Query: 32  SPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL 91
           SPVF CD      L    +SF FC++SL  S RV DLV R+TL EK+  L + A  V RL
Sbjct: 1   SPVFACDVVSNPSL----ASFGFCNTSLGVSDRVVDLVKRLTLQEKILFLVNSAGSVSRL 56

Query: 92  GLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEA 151
           G+P+YEWWSEALHGVS VGPGTHF  V+PGATSFP VILT ASFN SL+  IG+ VSTEA
Sbjct: 57  GIPKYEWWSEALHGVSYVGPGTHFSSVVPGATSFPQVILTAASFNTSLFVAIGKVVSTEA 116

Query: 152 RAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
           RAMYN+G AGLT+WSPNIN+ RDPRWGR  ETPGEDP +  +Y   YV+GLQ  +     
Sbjct: 117 RAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSGYVKGLQQRD----- 171

Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
            D N   LKV++CCKHY AYD+DNWKGVDRYHF+A VT+QDM++TF  PF+ CV +G+ +
Sbjct: 172 -DGNPDGLKVAACCKHYTAYDLDNWKGVDRYHFNAVVTKQDMDDTFQPPFKSCVVDGNVA 230

Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
           SVMCSYN+VNGIP+CADP LL+  +RGEW L+GYIV DCDSI V  ++  +   + E+A 
Sbjct: 231 SVMCSYNKVNGIPTCADPDLLSGVIRGEWKLNGYIVTDCDSIDVFYNSQHY-TKTPEEAA 289

Query: 332 AQTLKAG--LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
           A+ + AG  LDL+CG +    T  AV  G V E+ ID+++   +  LMRLGFFDG P   
Sbjct: 290 AKAILAGIRLDLNCGSFLGKHTEAAVTAGLVNESAIDRAVSNNFATLMRLGFFDGDPSKQ 349

Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
            Y  LG +D+C+ EN ELA EAAR+GIVLLKN   +LPL+   +K +AV+GP+AN T  M
Sbjct: 350 LYGKLGPKDVCTAENQELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTM 409

Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
           IGNY G PC+Y +P+ G +     TY  GC +VAC S   +  A + A  ADAT+++ G 
Sbjct: 410 IGNYEGTPCKYTTPLQGLAALVATTYLPGCSNVAC-STAQVDDAKKIAAAADATVLVMGA 468

Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
           DLS+EAES DR D+ LPG Q  LI  VA  + GPVILVIMS GG+D++FA+TN  I +IL
Sbjct: 469 DLSIEAESRDRVDILLPGQQQLLITAVANASTGPVILVIMSGGGMDVSFAKTNDKITSIL 528

Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
           W GYPGE GG AIAD++FG +NP GRLP+TWY   YV  +P+T+M +RP  S GYPGRTY
Sbjct: 529 WVGYPGEAGGAAIADIIFGSYNPSGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTY 588

Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV 686
           +FY G T+Y FG GLSY++F + L      + V L +   C    Y+S+     C  V  
Sbjct: 589 RFYTGETVYSFGDGLSYSEFSHELTQAPGLVSVPLEENHVC----YSSE-----CKSVAA 639

Query: 687 NDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNK 745
            +  C +  F+  +  +N G+T GS  V ++S PP+ +  +  K ++GF++VF+ A  + 
Sbjct: 640 AEQTCQNLTFDVHLRIKNTGTTSGSHTVFLFSTPPS-VHNSPQKHLVGFEKVFLHAQTDS 698

Query: 746 RIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            + F  + CK L++VD   +  +  GEH + +G+
Sbjct: 699 HVGFKVDVCKDLSVVDELGSKKVALGEHVLHIGS 732


>gi|255545293|ref|XP_002513707.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223547158|gb|EEF48654.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 777

 Score =  769 bits (1986), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 381/753 (50%), Positives = 507/753 (67%), Gaps = 25/753 (3%)

Query: 31  SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
           SSPVF CD     K    ++SF FC+ SL  S RV DLV+R+TL EK+  L + A  V R
Sbjct: 37  SSPVFACD----VKSNPSLASFGFCNVSLGISDRVTDLVNRLTLQEKIGFLVNSAGSVSR 92

Query: 91  LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
           LG+P+YEWWSEALHGVS VGPGTHF +++PGATSFP VILT ASFN SL++ IG+ VSTE
Sbjct: 93  LGIPKYEWWSEALHGVSYVGPGTHFSNIVPGATSFPQVILTAASFNASLFEAIGKVVSTE 152

Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
           ARAMYN+G AGLT+WSPNIN+ RDPRWGR  ETPGEDP +  +Y   YVRGLQ  +  + 
Sbjct: 153 ARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLSSKYGSCYVRGLQQTDNGD- 211

Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
                S  LKV++CCKHY AYD+DNWKG DRYHF+A VT+QD+++TF  PF+ CV +G+ 
Sbjct: 212 -----SERLKVAACCKHYTAYDLDNWKGTDRYHFNAVVTKQDLDDTFQPPFKSCVIDGNV 266

Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
           +SVMCSYN+VNG P+CADP LL   +RGEW L+GYIV+DCDS+ V+ ++  +   + E+A
Sbjct: 267 ASVMCSYNQVNGKPTCADPDLLAGIIRGEWKLNGYIVSDCDSVDVIYNSQHY-TKTPEEA 325

Query: 331 VAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--- 387
            A T+ AGLDL+CG +    T  AV  G +  + +DK++   +  LMRLGFFDG P    
Sbjct: 326 AAITILAGLDLNCGSFLGKHTEAAVNAGLLNVSAVDKAVSNNFATLMRLGFFDGDPSKQL 385

Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
           Y  LG +D+C+  N ELA EAAR+GIVLLKN   +LPL+   +KT+AV+GP+AN T  MI
Sbjct: 386 YGKLGPKDVCTAVNQELAREAARQGIVLLKNSPGSLPLSPTAIKTLAVIGPNANVTKTMI 445

Query: 448 GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
           GNY G PC+Y +P+ G +     TY  GC +VAC +   +  A + A +ADAT+++ G D
Sbjct: 446 GNYEGTPCKYTTPLQGLTASVATTYLAGCSNVACAAAQ-VDDAKKLAASADATVLVMGAD 504

Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
            S+EAES DR D+ LPG Q  LI QVA V+KGPVILVIMS GG+D++FA+TN  I +ILW
Sbjct: 505 QSIEAESRDRVDVLLPGQQQLLITQVANVSKGPVILVIMSGGGMDVSFAKTNDKITSILW 564

Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
            GYPGE GG AIADV+FG +NP GRLP+TWY   YV  +P+T+M +RP  S GYPGRTY+
Sbjct: 565 VGYPGEAGGAAIADVIFGYYNPSGRLPMTWYPQAYVDKVPMTNMNMRPDPSSGYPGRTYR 624

Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
           FY G T+Y FG GLSY+++K+ L+   + + + L     CR        S ++C  V   
Sbjct: 625 FYTGETVYSFGDGLSYSEYKHQLVQAPQLVSIPLEDDHVCR--------SSSKCISVDAG 676

Query: 688 DLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKR 746
           +  C    F   +  +N+G   G+  V ++  PP+ +  +  K ++ F++V + A     
Sbjct: 677 EQNCQGLAFNIDLKVRNIGKVRGTHTVFLFFTPPS-VHNSPQKHLVDFEKVSLDAKTYGM 735

Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           + F  + CK L++VD   +  +  G H + VGN
Sbjct: 736 VSFKVDVCKHLSVVDEFGSRKVALGGHVLHVGN 768


>gi|15237736|ref|NP_201262.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
 gi|75262663|sp|Q9FLG1.1|BXL4_ARATH RecName: Full=Beta-D-xylosidase 4; Short=AtBXL4; Flags: Precursor
 gi|10178060|dbj|BAB11424.1| beta-xylosidase [Arabidopsis thaliana]
 gi|332010539|gb|AED97922.1| beta-D-xylosidase 4 [Arabidopsis thaliana]
          Length = 784

 Score =  767 bits (1981), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 388/785 (49%), Positives = 516/785 (65%), Gaps = 29/785 (3%)

Query: 8   LLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKD 67
            LCF L    L FS      N  SSPVF CD      L    +++ FC++ L    RV D
Sbjct: 25  FLCFFLY--FLNFS------NAQSSPVFACDVAANPSL----AAYGFCNTVLKIEYRVAD 72

Query: 68  LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
           LV+R+TL EK+  L   A+GV RLG+P YEWWSEALHGVS +GPGTHF   +PGATSFP 
Sbjct: 73  LVARLTLQEKIGFLVSKANGVTRLGIPTYEWWSEALHGVSYIGPGTHFSSQVPGATSFPQ 132

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
           VILT ASFN SL++ IG+ VSTEARAMYN+G AGLTYWSPN+N+ RDPRWGR  ETPGED
Sbjct: 133 VILTAASFNVSLFQAIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGED 192

Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
           P +  +YA  YV+GLQ+ +G +      S  LKV++CCKHY AYDVDNWKGV+RY F+A 
Sbjct: 193 PLLASKYASGYVKGLQETDGGD------SNRLKVAACCKHYTAYDVDNWKGVERYSFNAV 246

Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
           VT+QDM++T+  PF+ CV +G+ +SVMCSYN+VNG P+CADP LL+  +RGEW L+GYIV
Sbjct: 247 VTQQDMDDTYQPPFKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIV 306

Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDK 367
           +DCDS+ V+  N  +     E A A ++ AGLDL+CG +    T  AV+ G V E  IDK
Sbjct: 307 SDCDSVDVLYKNQHYTKTPAE-AAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDK 365

Query: 368 SLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLP 424
           ++   +  LMRLGFFDG+P+   Y  LG  D+C+  N ELAA+AAR+GIVLLKN    LP
Sbjct: 366 AISNNFLTLMRLGFFDGNPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKN-TGCLP 424

Query: 425 LNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSN 484
           L+   +KT+AV+GP+AN T  MIGNY G PC+Y +P+ G +G  + TY  GC +VAC   
Sbjct: 425 LSPKSIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAGTVSTTYLPGCSNVACAVA 484

Query: 485 NSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
           + +  A++ A TAD ++++ G D S+EAES DR DL LPG Q +L+ QVA+ AKGPV+LV
Sbjct: 485 D-VAGATKLAATADVSVLVIGADQSIEAESRDRVDLHLPGQQQELVIQVAKAAKGPVLLV 543

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
           IMS GG DI FA+ +  I  ILW GYPGE GG AIAD++FG++NP G+LP+TWY   YV+
Sbjct: 544 IMSGGGFDITFAKNDPKIAGILWVGYPGEAGGIAIADIIFGRYNPSGKLPMTWYPQSYVE 603

Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
            +P+T M +RP  + GYPGRTY+FY G T+Y FG GLSYT+F + L+     + + L + 
Sbjct: 604 KVPMTIMNMRPDKASGYPGRTYRFYTGETVYAFGDGLSYTKFSHTLVKAPSLVSLGLEEN 663

Query: 665 QHCRNLNYTS-DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEI 723
             CR+    S DA    C   +         FE  +  +N G  +G   V +++ PPA I
Sbjct: 664 HVCRSSECQSLDAIGPHCENAVSGG---GSAFEVHIKVRNGGDREGIHTVFLFTTPPA-I 719

Query: 724 AATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
             +  K ++GF+++ +       ++F    CK L++VD      +  G+H + VG+   S
Sbjct: 720 HGSPRKHLVGFEKIRLGKREEAVVRFKVEICKDLSVVDEIGKRKIGLGKHLLHVGDLKHS 779

Query: 784 FPIHL 788
             I +
Sbjct: 780 LSIRI 784


>gi|226531269|ref|NP_001145980.1| uncharacterized protein LOC100279508 precursor [Zea mays]
 gi|219885199|gb|ACL52974.1| unknown [Zea mays]
 gi|413920228|gb|AFW60160.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 794

 Score =  767 bits (1980), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 389/763 (50%), Positives = 499/763 (65%), Gaps = 27/763 (3%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
           F C PG  +      +S  FC  SLP   R +DLVSR+T  EKV+ L + A GVPRLG+ 
Sbjct: 27  FACAPGGPA------ASLPFCRQSLPLRARARDLVSRLTRAEKVRLLVNNAAGVPRLGVA 80

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            YEWWSEALHGVS+ GPG  F    PGAT+FP VI T AS N +LW+ +G+AVS EARAM
Sbjct: 81  GYEWWSEALHGVSDTGPGVRFGGAFPGATAFPQVIGTAASLNATLWELVGRAVSDEARAM 140

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
           YN GRAGLT+WSPN+N+ RDPRWGR  ETPGEDP V  RYA  YVRGLQ      N    
Sbjct: 141 YNGGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGGHR 200

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
           N   LK+++CCKH+ AYD+D W G DR+HF+A V  QD+E+TF  PF  CV++G A+SVM
Sbjct: 201 NR--LKLAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAASVM 258

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           CSYN+VNG+P+CAD   L  T+RG W L GYIV+DCDS+ V   +  +   + EDA A T
Sbjct: 259 CSYNQVNGVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHY-TRTPEDAAAAT 317

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
           L+AGLDLDCG +   + G+AV  GKV + D+D +L    TV MRLG FDG P    +  L
Sbjct: 318 LRAGLDLDCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPFGRL 377

Query: 392 GKQDICSDENIELAAEAAREGIVLLKN------DQNTLPLNSAKVKTVAVVGPHANATVA 445
           G  D+C+ E+ +LA +AAR+G+VLLKN      +++ LPL  A  + VAVVGPHA+ATVA
Sbjct: 378 GPADVCTREHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADATVA 437

Query: 446 MIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
           MIGNYAG PCRY +P+ G + YA  V ++ GC DVAC+ N  I AA EAA+ ADAT+++A
Sbjct: 438 MIGNYAGKPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATVVVA 497

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           GLD  VEAE LDR  L LPG Q +LI+ VA+ +KGPVILV+MS G +DIAFA+ +  I  
Sbjct: 498 GLDQRVEAEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPRIDG 557

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           ILW GYPG+ GG+AIADV+FG  NPG +LP+TWY+ DY+Q +P+T+M +R   + GYPGR
Sbjct: 558 ILWVGYPGQAGGQAIADVIFGHHNPGAKLPVTWYHQDYLQKVPMTNMAMRANPARGYPGR 617

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP-- 682
           TY+FY GPT+YPFG+GLSYTQF + L      + V L+   H      +   +    P  
Sbjct: 618 TYRFYTGPTIYPFGHGLSYTQFTHTLAHAPTQLTVRLSGSGHSAASAASLLNATLARPVR 677

Query: 683 GVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP-----AEIAATYIKQVIGFQR 736
            V V   RC+       VD  NVG  DG+  V+VY   P     A  A    +Q++ F++
Sbjct: 678 AVRVAHARCEGLTVPVHVDVSNVGDRDGAHAVLVYHAAPSPSHAAPGADAPARQLVAFEK 737

Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           V V AG   R++     C  L++ D      +P GEH + +G 
Sbjct: 738 VHVPAGGVARVEMRIGVCDRLSVADRNGVRRVPVGEHRLMIGE 780


>gi|357130854|ref|XP_003567059.1| PREDICTED: probable beta-D-xylosidase 2-like [Brachypodium
           distachyon]
          Length = 779

 Score =  767 bits (1980), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 389/760 (51%), Positives = 500/760 (65%), Gaps = 29/760 (3%)

Query: 31  SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
           + P F C PG  S      +   FC  +LP   R +DLV+R+T  EKV+ L + A GVPR
Sbjct: 23  TRPPFACAPGGPS------TRLPFCRQALPPRARARDLVARLTRAEKVRLLVNNAAGVPR 76

Query: 91  LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
           LG+  YEWWSEALHGVS+ GPG  F    PGAT+FP VI T ASFN SLW+ IG+AVS E
Sbjct: 77  LGVEGYEWWSEALHGVSDTGPGVRFGGAFPGATAFPQVIGTAASFNASLWELIGRAVSDE 136

Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
            RA+YN  +AGLT+WSPN+N+ RDPRWGR  ETPGEDP V GRYA  YVRGLQ       
Sbjct: 137 GRAIYNGRQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSGRYAAAYVRGLQQQHAGR- 195

Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
                   LK ++CCKH+ AYD+D W G DR+HF+A VT QD+E+TF  PF  CV EG A
Sbjct: 196 --------LKTAACCKHFTAYDLDRWSGADRFHFNAIVTPQDLEDTFNAPFRACVVEGRA 247

Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
           ++VMCSYN+VNG+P+CAD   L  T+RG+W L GYIV+DCDS+ V      +   ++EDA
Sbjct: 248 AAVMCSYNQVNGVPTCADQGFLRGTIRGKWKLDGYIVSDCDSVDVFYREQHY-TRTREDA 306

Query: 331 VAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG---SPQ 387
           VA TL+AGLDLDCG +   +T  AV QGKVKE DID ++    TV MRLG FDG   +  
Sbjct: 307 VAATLRAGLDLDCGPFLAQYTEAAVAQGKVKEADIDAAVVNTVTVQMRLGMFDGDVAAQP 366

Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKN---DQNTLPLNSAKVK-TVAVVGPHANAT 443
           +  LG Q +C+  + ELA EAA + IVLLKN   +   LPL+S   + TVAVVGPH+ AT
Sbjct: 367 FGHLGPQHVCTPAHRELALEAACQSIVLLKNGGGNNMRLPLSSHHRRGTVAVVGPHSEAT 426

Query: 444 VAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACK-SNNSIFAASEAAKTADATI 501
           VAMIGNYAG PC Y +P+ G   YA  T ++ GC DVAC+ S   I AA +AA+ ADAT+
Sbjct: 427 VAMIGNYAGKPCAYTTPLQGVGRYARATVHQAGCTDVACQGSGQPIDAAVDAARHADATV 486

Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
           ++ GLD SVEAE LDR  L LPG Q +L++ VA  +KGPVILV+MS G VDIAFA+ + N
Sbjct: 487 VVVGLDQSVEAEGLDRTTLLLPGRQAELVSAVARASKGPVILVLMSGGPVDIAFAQNDRN 546

Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
           + AILWAGYPG+ GG+AIADV+FG  NPGG+LP+TWY  DY++  P+T+M +R   + GY
Sbjct: 547 VAAILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPEDYLRKAPMTNMAMRADPARGY 606

Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRC 681
           PGRTY+FY GPT++PFG+GLSYT+F + L        + + +    R     +  + +  
Sbjct: 607 PGRTYRFYAGPTIHPFGHGLSYTKFAHTLAH--APAHLTVRRAAGHRTTAAINTTTASHL 664

Query: 682 PGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFV 739
             V V   +C+       VD +NVGS DG+  V VY+ PP A I    ++Q++ F++V V
Sbjct: 665 NDVRVAHAQCEGLSVSVHVDVKNVGSRDGAHTVFVYASPPIAAIHGAPVRQLVAFEKVHV 724

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            AG   R+K   + C SL+I D      +P GEH + +G 
Sbjct: 725 AAGAVARVKMGVDVCGSLSIADQEGVRRIPIGEHRLMIGE 764


>gi|147844622|emb|CAN82161.1| hypothetical protein VITISV_035506 [Vitis vinifera]
          Length = 925

 Score =  766 bits (1978), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/741 (51%), Positives = 506/741 (68%), Gaps = 16/741 (2%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
           S F FC++SLPY  R  DLVSR+TL EK +QL + A G+ RLG+P YEWWSEALHGVSN 
Sbjct: 37  SQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGVSNS 96

Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
           G G HF D IP  T FP VIL+ ASFNESLW  +GQ VSTE RAMYN+G+AGLTYWSPN+
Sbjct: 97  GIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLTYWSPNV 156

Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
           N+ RDPRWGR  ETPGEDP VV RYAVNYVRGLQ+V G E   +  +  LKVSSCCKHY 
Sbjct: 157 NIFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEV-GKEG--NFAADRLKVSSCCKHYT 213

Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
           AYDVD WKGVDR+HFDA+VT QD+E+T+  PF+ CV+EG  SSVMCSYNRVNG+P+CA+P
Sbjct: 214 AYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKXCVEEGHVSSVMCSYNRVNGVPTCANP 273

Query: 290 KLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
           +LL   +R +W L GYIV+DCDSI V  +   +  ++ EDAVA  LKAGL+L+CG Y  +
Sbjct: 274 ELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNY-TETPEDAVALALKAGLNLNCGSYLGD 332

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK---QDICSDENIELAA 406
           +T NAV  GKVKE+ +B++L Y Y VLMRLGFFDG P  +  GK    D+C+ ++  LA 
Sbjct: 333 YTKNAVNLGKVKESIVBQALIYNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTVDHQLLAL 392

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
           +AA++GIVLL N+   LPL+    KT+AV+GP+A+AT  M+ NYAG+PCRY SP+ G   
Sbjct: 393 DAAKQGIVLLHNN-GALPLSPNTTKTLAVIGPNADATNTMLSNYAGVPCRYTSPLQGLQK 451

Query: 467 YAN-VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
           Y + V+Y+ GC +V+C     I  A+  A  ADAT+++ GLDL +EAE LDR +L LPG+
Sbjct: 452 YVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVVGLDLFIEAEDLDRVNLTLPGF 511

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q +L+ + A+ A G VILV+MSAG VDI+F +  + I  ILW GYPG+ GG AI+ V+FG
Sbjct: 512 QEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGGILWVGYPGQAGGDAISQVIFG 571

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
            +NPGGR P TWY  +YV  +P+T M +RP  +  +PGRTY+FY G +LY FG+GLSY+ 
Sbjct: 572 DYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATXNFPGRTYRFYTGKSLYQFGHGLSYST 631

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNL---NY-TSDASKTRCPGVLVNDLRCDDY--FEFKV 699
           F   + S   T+ V+L       N+   NY T     T    + ++ + C +    +  +
Sbjct: 632 FYKFIKSAPXTVLVHLLPQMDMPNIFSSNYPTMPNPNTNGQAIDISAIDCRNLSNIDIVI 691

Query: 700 DFQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
             +N G  DG+ VV+ + KPP + +      +++GF+RV V+ G+ + +    + C  ++
Sbjct: 692 GVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELVGFERVEVKRGKTEMVGMRLDVCGKIS 751

Query: 759 IVDYAANTLLPAGEHTIFVGN 779
            VD      L  G HT+ VG+
Sbjct: 752 NVDEEGKRKLVMGMHTLVVGS 772


>gi|359481045|ref|XP_002268626.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Vitis vinifera]
 gi|296089342|emb|CBI39114.3| unnamed protein product [Vitis vinifera]
          Length = 774

 Score =  766 bits (1977), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 387/782 (49%), Positives = 524/782 (67%), Gaps = 28/782 (3%)

Query: 2   AKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPY 61
           A  V+  LCF    +  + S   V   G SSPVF CD      LG     F FC++SL  
Sbjct: 8   APKVTVFLCFLSCFSHFLSSPKWV--LGQSSPVFACDVENNPTLG----QFGFCNTSLET 61

Query: 62  SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
           + RV DLV R+TL+EK+  L + A  V RLG+P+YEWWSEALHGVS VGPGTHF+ ++PG
Sbjct: 62  AARVADLVKRLTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSIVPG 121

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
           ATSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGLT+WSPN+N+ RDPRWGR  
Sbjct: 122 ATSFPQVILTAASFNASLFEAIGKVVSTEARAMYNVGLAGLTFWSPNVNIFRDPRWGRGQ 181

Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
           ETPGEDP +  +YA  YVRGLQ  +G + + D     LKV++CCKHY AYD+DNWKGVDR
Sbjct: 182 ETPGEDPLLSSKYASAYVRGLQ--QGDDGSPDR----LKVAACCKHYTAYDLDNWKGVDR 235

Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
            HF+A VT+QDM++TF  PF+ CV +G+ +SVMCS+N+VNG P+CADP LL+  VRGEW 
Sbjct: 236 LHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSFNQVNGKPTCADPDLLSGIVRGEWK 295

Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVK 361
           L+GYIV+DCDS+ V  ++  +   + E+A A+ + AGLDL+CG +    T  AV+ G V 
Sbjct: 296 LNGYIVSDCDSVDVFYNSQHY-TKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVD 354

Query: 362 ETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKN 418
           E+ +DK++   +  LMRLGFFDG+P    Y  LG +D+C+ E+ E+A EAAR+GIVLLKN
Sbjct: 355 ESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVCTSEHQEMAREAARQGIVLLKN 414

Query: 419 DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDD 478
            + +LPL+   +KT+A++GP+AN T  MIGNY G PC+Y +P+ G +     TY  GC +
Sbjct: 415 SKGSLPLSPTAIKTLAIIGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATTYLPGCSN 474

Query: 479 VACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAK 538
           VAC +   I  A + A  ADAT+++ G+D S+EAE  DR  + LPG Q  LI +VA+ +K
Sbjct: 475 VACGTAQ-IDEAKKIAAAADATVLIVGIDQSIEAEGRDRVSIQLPGQQPLLITEVAKASK 533

Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
           G VILV+MS GG DI+FA+ +  I +ILW GYPGE GG AIADV+FG +NP GRLP+TWY
Sbjct: 534 GNVILVVMSGGGFDISFAKNDDKIASILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWY 593

Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
              YV  +P+T+M +RP  + GYPGRTY+FY G T+Y FG GLSYTQF ++L+   K++ 
Sbjct: 594 PQSYVDKVPMTNMNMRPDPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPKSVS 653

Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYS 717
           + + +   C +         ++C  V      C +  F+  +   N G+  GS  V ++S
Sbjct: 654 IPIEEGHSCHS---------SKCKSVDAVQESCQNLAFDIHLRVNNAGNISGSHTVFLFS 704

Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
            PP+ +  +  K ++GF++VFV A     ++F  + CK L+IVD      +  G H + V
Sbjct: 705 SPPS-VHNSPQKHLLGFEKVFVTAKAEALVRFKVDVCKDLSIVDELGTQKVALGLHVLHV 763

Query: 778 GN 779
           G+
Sbjct: 764 GS 765


>gi|225428983|ref|XP_002264114.1| PREDICTED: probable beta-D-xylosidase 5-like [Vitis vinifera]
          Length = 818

 Score =  765 bits (1975), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/741 (51%), Positives = 506/741 (68%), Gaps = 16/741 (2%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
           S F FC++SLPY  R  DLVSR+TL EK +QL + A G+ RLG+P YEWWSEALHGVSN 
Sbjct: 61  SQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGVSNS 120

Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
           G G HF D IP  T FP VIL+ ASFNESLW  +GQ VSTE RAMYN+G+AGLTYWSPN+
Sbjct: 121 GIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLTYWSPNV 180

Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
           N+ RDPRWGR  ETPGEDP VV RYAVNYVRGLQ+V G E   +  +  LKVSSCCKHY 
Sbjct: 181 NIFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEV-GKEG--NFAADRLKVSSCCKHYT 237

Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
           AYDVD WKGVDR+HFDA+VT QD+E+T+  PF+ CV+EG  SSVMCSYNRVNG+P+CA+P
Sbjct: 238 AYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKSCVEEGHVSSVMCSYNRVNGVPTCANP 297

Query: 290 KLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
           +LL   +R +W L GYIV+DCDSI V  +   +  ++ EDAVA  LKAGL+L+CG Y  +
Sbjct: 298 ELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNY-TETPEDAVALALKAGLNLNCGSYLGD 356

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK---QDICSDENIELAA 406
           +T NAV  GKVKE+ ++++L Y Y VLMRLGFFDG P  +  GK    D+C+ ++  LA 
Sbjct: 357 YTKNAVNLGKVKESIVNQALIYNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTVDHQLLAL 416

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
           +AA++GIVLL N+   LPL+    KT+AV+GP+A+AT  M+ NYAG+PCRY SP+ G   
Sbjct: 417 DAAKQGIVLLHNN-GALPLSPNTTKTLAVIGPNADATNTMLSNYAGVPCRYTSPLQGLQK 475

Query: 467 YAN-VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
           Y + V+Y+ GC +V+C     I  A+  A  ADAT+++ GLDL +EAE LDR +L LPG+
Sbjct: 476 YVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVVGLDLFIEAEDLDRVNLTLPGF 535

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q +L+ + A+ A G VILV+MSAG VDI+F +  + I  ILW GYPG+ GG AI+ V+FG
Sbjct: 536 QEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGGILWVGYPGQAGGDAISQVIFG 595

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
            +NPGGR P TWY  +YV  +P+T M +RP  +  +PGRTY+FY G +LY FG+GLSY+ 
Sbjct: 596 DYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATSNFPGRTYRFYTGKSLYQFGHGLSYST 655

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNL---NY-TSDASKTRCPGVLVNDLRCDDY--FEFKV 699
           F   + S   T+ V+L       N+   NY T     T    + ++ + C +    +  +
Sbjct: 656 FYKFIKSAPTTVLVHLLPQMDMPNIFSSNYPTMPNPNTNGQAIDISAIDCRNLSNIDIVI 715

Query: 700 DFQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
             +N G  DG+ VV+ + KPP + +      +++GF+RV V+ G+ + +    + C  ++
Sbjct: 716 GVKNAGEIDGTHVVLAFWKPPRSGVRGAPGVELVGFERVEVKRGKTEMVGMRLDVCGKIS 775

Query: 759 IVDYAANTLLPAGEHTIFVGN 779
            VD      L  G HT+ VG+
Sbjct: 776 NVDEEGKRKLVMGMHTLVVGS 796


>gi|350534908|ref|NP_001233910.1| beta-D-xylosidase 1 precursor [Solanum lycopersicum]
 gi|37359706|dbj|BAC98298.1| LEXYL1 [Solanum lycopersicum]
          Length = 770

 Score =  764 bits (1973), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/773 (48%), Positives = 511/773 (66%), Gaps = 27/773 (3%)

Query: 11  FSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVS 70
           FS+ I  ++ S+        +SPVF CD      LG    +  FCD+SL    RV DLV+
Sbjct: 12  FSI-IGFILLSSLLKQVLAQNSPVFACDVTSNPALG----NLTFCDASLAVENRVNDLVN 66

Query: 71  RMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVIL 130
           R+TL EK+  L   A GV RLG+P+YEWWSEALHGV+  GPG HF  ++PGATSFP VIL
Sbjct: 67  RLTLGEKIGFLVSGAGGVSRLGIPKYEWWSEALHGVAYTGPGVHFTSLVPGATSFPQVIL 126

Query: 131 TTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFV 190
           T ASFN +L++ IG+ VSTEARAMYN+G AGLTYWSPN+N+ RDPRWGR  ETPGEDP +
Sbjct: 127 TAASFNVTLFQTIGKVVSTEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPTL 186

Query: 191 VGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTE 250
             +Y V YV GLQ  +      D ++  LKV++CCKHY AYDVDNWKG++RY F+A V +
Sbjct: 187 TSKYGVAYVEGLQQTD------DGSTNKLKVAACCKHYTAYDVDNWKGIERYSFNAVVRQ 240

Query: 251 QDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADC 310
           QD+++TF  PF  CV EG  +SVMCSYN+VNG P+C DP LL   VRGEW L+GYIV DC
Sbjct: 241 QDLDDTFQPPFRSCVLEGAVASVMCSYNQVNGKPTCGDPNLLAGIVRGEWKLNGYIVTDC 300

Query: 311 DSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLK 370
           DS+QV+  +  +   + E+A A  L +G+DL+CG + + +T  AV Q  V E+ ID+++ 
Sbjct: 301 DSLQVIFKSQNY-TKTPEEAAALGLNSGVDLNCGSWLSTYTQGAVNQKLVNESVIDRAIS 359

Query: 371 YLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNS 427
             +  LMRLGFFDG+P+   Y +LG +D+C+ EN ELA EAAR+GIVLLKN   +LPL  
Sbjct: 360 NNFATLMRLGFFDGNPKSRIYGNLGPKDVCTPENQELAREAARQGIVLLKNTAGSLPLTP 419

Query: 428 AKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSI 487
             +K++AV+GP+AN T  MIGNY GIPC+Y +P+ G +      YK GC DV+C +   I
Sbjct: 420 TAIKSLAVIGPNANVTKTMIGNYEGIPCKYTTPLQGLTASVATIYKPGCADVSCNTAQ-I 478

Query: 488 FAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMS 547
             A + A TADA +++ G D S+E ESLDR  + LPG Q+ L+ +VA+VAKGPVILVIMS
Sbjct: 479 DDAKQIATTADAVVLVMGSDQSIEKESLDRTSITLPGQQSILVAEVAKVAKGPVILVIMS 538

Query: 548 AGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP 607
            GG+D+ FA  N  I +ILW G+PGE GG A+ADV+FG +NP GRLP+TWY   Y  ++P
Sbjct: 539 GGGMDVQFAVDNPKITSILWVGFPGEAGGAALADVIFGYYNPSGRLPMTWYPQSYADVVP 598

Query: 608 LTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC 667
           +T M +RP  +  YPGRTY+FY GPT++ FG+GLSY+QFK++L    + + + L +   C
Sbjct: 599 MTDMNMRPNPATNYPGRTYRFYTGPTVFTFGHGLSYSQFKHHLDKAPQFVSLPLGEKHTC 658

Query: 668 RNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
           R          ++C  V      C +  F+  +  +NVG   GS ++ +++ PP+   A 
Sbjct: 659 R---------LSKCKTVDAVGQSCSNMGFDIHLRVKNVGKISGSHIIFLFTSPPSVHNAP 709

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             K ++GF++V +       +KF  N CK L++ D   N  +  G H + +G+
Sbjct: 710 K-KHLLGFEKVHLTPQGEGVVKFNVNVCKHLSVHDELGNRKVALGPHVLHIGD 761


>gi|255573163|ref|XP_002527511.1| Beta-glucosidase, putative [Ricinus communis]
 gi|223533151|gb|EEF34909.1| Beta-glucosidase, putative [Ricinus communis]
          Length = 810

 Score =  760 bits (1963), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/760 (50%), Positives = 510/760 (67%), Gaps = 21/760 (2%)

Query: 31  SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
           +S  F CD     K   Q + + FC++SL Y  R KDL+SR+TL EKVQQ+ + A G+PR
Sbjct: 21  ASQNFACD-----KNSPQTNDYSFCNTSLSYQDRAKDLISRLTLQEKVQQVVNHAAGIPR 75

Query: 91  LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
           LG+P YEWWSEALHGVSNVG G  F+  +PGATSFP +IL+ ASFNE+LW K+GQ VSTE
Sbjct: 76  LGIPAYEWWSEALHGVSNVGFGVRFNGTVPGATSFPAMILSAASFNETLWLKMGQVVSTE 135

Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
           AR M+++G AGLTYWSPN+NV RDPRWGR  ETPGEDP VV RYAVNYVRGLQ+V    N
Sbjct: 136 ARTMHSVGLAGLTYWSPNVNVFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEVGDEGN 195

Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
           +T   +  LKVSSCCKHY AYD+D WKGVDR+HFDA+VT+QD+E+T+  PF  CV+E   
Sbjct: 196 ST---ADKLKVSSCCKHYTAYDLDKWKGVDRFHFDAKVTKQDLEDTYQPPFRSCVEEAHV 252

Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
           SSVMCSYNRVNGIP+CADP LL   +RGEW+L GYIV+DCDSI+V  D+  + A + EDA
Sbjct: 253 SSVMCSYNRVNGIPTCADPDLLKGIIRGEWNLDGYIVSDCDSIEVYYDSINYTA-TPEDA 311

Query: 331 VAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--- 387
           VA  LKAGL+++CG++   +T +AV+  KV+E+ +D++L Y + VLMRLGFFDG P+   
Sbjct: 312 VALALKAGLNMNCGEFLGKYTVDAVKLNKVEESVVDQALIYNFIVLMRLGFFDGDPKSLL 371

Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
           + +LG  D+CSD + +LA +AAR+GIVLL N +  LPL+    + +AV+GP+AN T  MI
Sbjct: 372 FGNLGPSDVCSDGHQKLALDAARQGIVLLYN-KGALPLSKNNTRNLAVIGPNANVTTTMI 430

Query: 448 GNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
            NYAGIPC+Y +P+ G   Y + VTY  GC  V+C  +  I AA++AA  ADA ++L GL
Sbjct: 431 SNYAGIPCKYTTPLQGLQKYVSTVTYAAGCKSVSCSDDTLIDAATQAAAAADAVVLLVGL 490

Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
           D S+E E LDRE+L LPG+Q +L+  V     G V+LV+MS+  +D++FA   + IK IL
Sbjct: 491 DQSIEREGLDRENLTLPGFQEKLVVDVVNATNGTVVLVVMSSSPIDVSFAVNKSKIKGIL 550

Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
           W GYPG+ GG A+A V+FG +NP GR P TWY  +Y   +P+T M +R   +  +PGRTY
Sbjct: 551 WVGYPGQAGGDAVAQVMFGDYNPAGRSPFTWYPQEYAHQVPMTDMNMRANSTANFPGRTY 610

Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH----CRNLNYTSDASKTRCP 682
           +FY G TLY FG+GLSY+ F   ++S   T+ +  N            N T +       
Sbjct: 611 RFYAGNTLYKFGHGLSYSTFSNFIISGPSTLLLKTNSDLKPDIILSTHNSTEEHPFINSQ 670

Query: 683 GVLVNDLRC-DDYFEFKVDFQNVGSTDGSDVVIVYSKPP--AEIAATYIKQVIGFQRVFV 739
            + +  L C +      +  +N G   G  VV+V+ KPP  +E+      Q++GF RV V
Sbjct: 671 AMDITTLNCTNSLLSLILGVRNNGPVSGDHVVLVFWKPPNSSEVTGAANVQLVGFSRVEV 730

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             G+ + +    + CK L++VD      L  G+H   +G+
Sbjct: 731 NRGKTQNVTLEIDVCKRLSLVDSEGKRKLVTGQHIFTIGS 770


>gi|449438167|ref|XP_004136861.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Cucumis sativus]
          Length = 782

 Score =  760 bits (1962), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 381/755 (50%), Positives = 511/755 (67%), Gaps = 26/755 (3%)

Query: 28  NGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG 87
           +  S   F CD    ++    +S F FCDSSL +  RV+DLV R+TL EK+  L + A  
Sbjct: 40  SAQSPTAFACD----AETNPSVSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARN 95

Query: 88  VPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAV 147
           V RLG+P+YEWWSEALHGVS VGPGT F +V+PGATSFP VILT ASFN SL++ IG+ V
Sbjct: 96  VTRLGIPKYEWWSEALHGVSYVGPGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVV 155

Query: 148 STEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG 207
           STEARAMYN+G AGLTYWSPN+N+ RDPRWGR  ETPGEDP +  +YA  YVRGLQ  + 
Sbjct: 156 STEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRD- 214

Query: 208 HENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKE 267
                D +   LKV++CCKHY AYD+DNWKG DRYHF+A V+ QD+E+TF  PF+ CV +
Sbjct: 215 -----DGDPDRLKVAACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVID 269

Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSK 327
           G+ +SVMCSYN+VNG P+CADP LL   +RG+W L+GYIV+DCDS+ V+ ++  +   S 
Sbjct: 270 GNVASVMCSYNQVNGKPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHY-TKSP 328

Query: 328 EDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ 387
           E+A A+T+ AGLDLDCG +    T  AV  G V E  I K++      LMRLGFFDG+P 
Sbjct: 329 EEAAAKTILAGLDLDCGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPS 388

Query: 388 ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
              Y  LG +D+C+ E+ ELA EAAR+GIVLLKN   +LPL+S+ +K++AV+GP+AN T 
Sbjct: 389 KQLYGKLGPKDVCTPEHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTK 448

Query: 445 AMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
            MIGNY G PC+Y +P+ G S   + +++ GC +VAC S   +  A + A +ADAT+++ 
Sbjct: 449 TMIGNYEGTPCKYTTPLQGLSAVVSTSFQPGCANVACTSAQ-LDEAKKIAASADATVLVV 507

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           G D S+EAES DR DL LPG Q  LI +VA+ +KGPVILVIM+ GG+DI FA+ +  I +
Sbjct: 508 GSDQSIEAESRDRVDLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITS 567

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           ILW G+PGE GG AIADV+FG FNP GRLP+TWY   YV+ +P+T M +RP  S G+PGR
Sbjct: 568 ILWVGFPGEAGGAAIADVIFGSFNPSGRLPMTWYPQSYVEKVPMTDMRMRPSASNGFPGR 627

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           TY+FY G T+Y FG GLSY+ FK++L+   K + + L +   C +         ++C  +
Sbjct: 628 TYRFYTGETIYSFGDGLSYSDFKHHLVKAPKLVSIPLEEGHICHS---------SKCHSL 678

Query: 685 LVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
            V    C +  F+  +  +NVG   GS  V +YS PP+ +  +  K ++GF++V +  G 
Sbjct: 679 EVVQESCQNLGFDVHLRVKNVGQRSGSHTVFLYSTPPS-VHNSPQKHLLGFEKVSLGRGG 737

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
              ++F  + CK L++ D   +  +  G H + VG
Sbjct: 738 ETVVRFKVDVCKDLSVADEVGSRKVALGLHILHVG 772


>gi|297745522|emb|CBI40687.3| unnamed protein product [Vitis vinifera]
          Length = 751

 Score =  758 bits (1958), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 390/781 (49%), Positives = 517/781 (66%), Gaps = 49/781 (6%)

Query: 2   AKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPY 61
           A  V+  LCF    +  + S   V A   SSPVF CD      LG     F FC++SL  
Sbjct: 8   APKVTVFLCFLSCFSHFLSSPKWVLAQ--SSPVFACDVENNPTLG----QFGFCNTSLET 61

Query: 62  SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
           + RV DLV R+TL+EK+  L + A  V RLG+P+YEWWSEALHGVS VGPGTHF+ V+PG
Sbjct: 62  AARVADLVKRLTLEEKIGFLVNSAASVSRLGIPKYEWWSEALHGVSYVGPGTHFNSVVPG 121

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
           ATSFP VILT ASFN SL++ IG+AVSTEARAMYN+G AGLT+WSPN+N+ RDPRWGR  
Sbjct: 122 ATSFPQVILTAASFNASLFEAIGKAVSTEARAMYNVGLAGLTFWSPNVNIFRDPRWGRGQ 181

Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
           ETPGEDP +  +YA  YVRGLQ  +      D +   LKV++CCKHY AYD+DNWKGVDR
Sbjct: 182 ETPGEDPLLSSKYASGYVRGLQQSD------DGSPDRLKVAACCKHYTAYDLDNWKGVDR 235

Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
           +HF+A VT+QDM++TF  PF+ CV +G+ +SVMCSYN+VNG P+CADP LL+  VRGEW 
Sbjct: 236 FHFNAVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPACADPDLLSGIVRGEWK 295

Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVK 361
           L+GYIV+DCDS+ V  ++  +   + E+A A+ + AGLDL+CG +    T  AV+ G V 
Sbjct: 296 LNGYIVSDCDSVDVFYNSQHY-TKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVD 354

Query: 362 ETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKN 418
           E+ +DK++   +  LMRLGFFDG+P    Y  LG +D+C+ E+ ELA EAAR+GIVLLKN
Sbjct: 355 ESAVDKAVSNNFATLMRLGFFDGNPSKAIYGKLGPKDVCTSEHQELAREAARQGIVLLKN 414

Query: 419 DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDD 478
            + +LPL+   +KT+AV+GP+AN T  MIGNY G PC+Y +P+ G +     TY  GC +
Sbjct: 415 SKGSLPLSPTAIKTLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLTALVATTYLPGCSN 474

Query: 479 VACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAK 538
           VAC +   I  A + A  ADAT+++ G+D S+EAE  DR ++ LPG Q  LI +VA+ +K
Sbjct: 475 VACGTAQ-IDEAKKIAAAADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKASK 533

Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
           G VILV+MS GG DI+FA+ +  I +ILW GYPGE GG AIADV+FG +NP GRLP+TWY
Sbjct: 534 GNVILVVMSGGGFDISFAKNDDKITSILWVGYPGEAGGAAIADVIFGFYNPSGRLPMTWY 593

Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
              YV  +P+T+M +RP  + GYPGRTY+FY G T+Y FG GLSYTQF ++L        
Sbjct: 594 PQSYVDKVPMTNMNMRPDPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHL-------- 645

Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSK 718
                         + DA +  C  ++         F+  +   N G+  GS  V ++S 
Sbjct: 646 --------------SVDAVQESCQNLV---------FDIHLRVNNAGNISGSHTVFLFSS 682

Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           PP+ +  +  K ++GF++VFV A     ++F  + CK L+IVD      +  G H + VG
Sbjct: 683 PPS-VHNSPQKHLLGFEKVFVTAKAKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVG 741

Query: 779 N 779
           N
Sbjct: 742 N 742


>gi|449479116|ref|XP_004155509.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Cucumis sativus]
          Length = 809

 Score =  758 bits (1958), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 381/756 (50%), Positives = 511/756 (67%), Gaps = 26/756 (3%)

Query: 28  NGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG 87
           +  S   F CD    ++    +S F FCDSSL +  RV+DLV R+TL EK+  L + A  
Sbjct: 67  SAQSPTAFACD----AETNPSVSGFAFCDSSLGFEARVEDLVKRLTLQEKIGFLINNARN 122

Query: 88  VPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAV 147
           V RLG+P+YEWWSEALHGVS VGPGT F +V+PGATSFP VILT ASFN SL++ IG+ V
Sbjct: 123 VTRLGIPKYEWWSEALHGVSYVGPGTKFSNVVPGATSFPQVILTAASFNASLFEAIGKVV 182

Query: 148 STEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG 207
           STEARAMYN+G AGLTYWSPN+N+ RDPRWGR  ETPGEDP +  +YA  YVRGLQ  + 
Sbjct: 183 STEARAMYNVGLAGLTYWSPNVNIFRDPRWGRGQETPGEDPLLSSKYAAGYVRGLQQRD- 241

Query: 208 HENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKE 267
                D +   LKV++CCKHY AYD+DNWKG DRYHF+A V+ QD+E+TF  PF+ CV +
Sbjct: 242 -----DGDPDRLKVAACCKHYTAYDLDNWKGTDRYHFNAVVSPQDLEDTFQPPFKSCVID 296

Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSK 327
           G+ +SVMCSYN+VNG P+CADP LL   +RG+W L+GYIV+DCDS+ V+ ++  +   S 
Sbjct: 297 GNVASVMCSYNQVNGKPTCADPDLLAGVIRGQWKLNGYIVSDCDSVDVLYNSQHY-TKSP 355

Query: 328 EDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ 387
           E+A A+T+ AGLDLDCG +    T  AV  G V E  I K++      LMRLGFFDG+P 
Sbjct: 356 EEAAAKTILAGLDLDCGDFLGKHTEAAVTGGLVNEAAISKAVFNNLLTLMRLGFFDGNPS 415

Query: 388 ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
              Y  LG +D+C+ E+ ELA EAAR+GIVLLKN   +LPL+S+ +K++AV+GP+AN T 
Sbjct: 416 KQLYGKLGPKDVCTPEHQELAREAARQGIVLLKNSPKSLPLSSSAIKSLAVIGPNANVTK 475

Query: 445 AMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
            MIGNY G PC+Y +P+ G S   + +++ GC +VAC S   +  A + A +ADAT+++ 
Sbjct: 476 TMIGNYEGTPCKYTTPLQGLSAVVSTSFQPGCANVACTSAQ-LDEAKKIAASADATVLVV 534

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           G D S+EAES DR DL LPG Q  LI +VA+ +KGPVILVIM+ GG+DI FA+ +  I +
Sbjct: 535 GSDQSIEAESRDRVDLNLPGQQALLITEVAKASKGPVILVIMTGGGMDITFAKKDDKITS 594

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           ILW G+PGE GG AIADV+FG FNP GRLP+TWY   YV+ +P+T M +RP  S G+PGR
Sbjct: 595 ILWVGFPGEAGGAAIADVIFGSFNPSGRLPMTWYPQSYVEKVPMTDMRMRPSASNGFPGR 654

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           TY+FY G T+Y FG GLSY+ FK++L+   K + + L +   C +         ++C  +
Sbjct: 655 TYRFYTGETIYSFGDGLSYSDFKHHLVKAPKLVSIPLEEGHICHS---------SKCHSL 705

Query: 685 LVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
            V    C +  F+  +  +NVG   GS  V +YS PP+ +  +  K ++GF++V +  G 
Sbjct: 706 EVVQESCQNLGFDVHLRVKNVGQRSGSHTVFLYSTPPS-VHNSPQKHLLGFEKVSLGRGG 764

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
              ++F  + CK L++ D   +  +  G H + VG 
Sbjct: 765 ETVVRFKVDVCKDLSVADEVGSRKVALGLHILHVGT 800


>gi|224099193|ref|XP_002311398.1| predicted protein [Populus trichocarpa]
 gi|222851218|gb|EEE88765.1| predicted protein [Populus trichocarpa]
          Length = 755

 Score =  758 bits (1957), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 386/760 (50%), Positives = 512/760 (67%), Gaps = 28/760 (3%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
           F CD    +K GL   S  FC  ++P  +RV+DL+ R+TL EK++ L + A  VPRLG+ 
Sbjct: 20  FACD----AKNGL-TRSLKFCRVNMPLHVRVRDLIGRLTLQEKIRLLVNNAAAVPRLGIQ 74

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            YEWWSEALHGVSNVGPGT F    PGATSFP VI T ASFN+SLW++IG+ VS EARAM
Sbjct: 75  GYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNKSLWEEIGRVVSDEARAM 134

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
           +N G AGLTYWSPN+NV RDPRWGR  ETPGEDP V G+YA +YVRGLQ   G       
Sbjct: 135 FNGGMAGLTYWSPNVNVFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNSGFR----- 189

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
               LKV++CCKHY AYD+DNW GVDRYHF+ARV++QD+E+T+  PF+ CV EG  +SVM
Sbjct: 190 ----LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYDVPFKSCVVEGKVASVM 245

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           CSYN+VNG P+CADP LL  T+RGEW L+GYIV+DCDS+ V+ +N  + A  +E A A T
Sbjct: 246 CSYNQVNGKPTCADPNLLKNTIRGEWRLNGYIVSDCDSVGVLYENQHYTATPEE-AAAAT 304

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
           +KAGLDLDCG +    T NAV+ G + E D++ +L    TV MRLG FDG P    +  L
Sbjct: 305 IKAGLDLDCGPFLAIHTENAVKGGLLNEEDVNMALANTITVQMRLGLFDGEPSAQPFGKL 364

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
           G +D+C+  + +LA  AA++GIVLL+N   TLPL+   + TVAV+GP A+ TV MIGNYA
Sbjct: 365 GPRDVCTPAHQQLALHAAQQGIVLLQNSGRTLPLSRPNL-TVAVIGPIADVTVTMIGNYA 423

Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
           G+ C Y +P+ G S YA   +++GC DVAC  N     A  AA  ADAT+++ GLD S+E
Sbjct: 424 GVACGYTTPLQGISRYAKTIHQSGCIDVACNGNQQFGMAEAAASQADATVLVMGLDQSIE 483

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
           AE  DR+DL LPGYQ +LI++VA  ++GP ILV+MS G +D++FA+ +  I AILWAGYP
Sbjct: 484 AEFRDRKDLLLPGYQQELISRVARASRGPTILVLMSGGPIDVSFAKNDPRIGAILWAGYP 543

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+ GG AIADV+FG  NPGG+LP+TWY  DY+  +P+T+M +R   S GYPGRTY+FY G
Sbjct: 544 GQAGGAAIADVLFGTTNPGGKLPMTWYPQDYLAKVPMTNMGMRADPSRGYPGRTYRFYKG 603

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
           P ++PFG+G+SYT F ++L+   + + V    L   +N     ++       + V+   C
Sbjct: 604 PVVFPFGHGMSYTTFAHSLVQAPQEVAVPFTSLYALQNTTAARNS-------IRVSHANC 656

Query: 692 DDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
           +       +D +N G  DG   ++V+S PP E   +  K++IGF++V + AG  KR+K  
Sbjct: 657 EPLVLGVHIDVKNTGDMDGIQTLLVFSSPP-EGKWSANKKLIGFEKVHIVAGSKKRVKID 715

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNF 790
              CK L++VD      LP G+H + +G+   S  +  N 
Sbjct: 716 IPVCKHLSVVDRFGIRRLPIGKHDLHIGDLKHSISLQANL 755


>gi|357449039|ref|XP_003594795.1| Beta xylosidase [Medicago truncatula]
 gi|355483843|gb|AES65046.1| Beta xylosidase [Medicago truncatula]
          Length = 762

 Score =  758 bits (1957), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 377/772 (48%), Positives = 511/772 (66%), Gaps = 29/772 (3%)

Query: 10  CFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLV 69
           CF   I  ++  +  V  +    P F CDP    K GL   S+ FC++ +P   RV+DL+
Sbjct: 3   CFKNLITFMLLISILVTLSEGRVP-FACDP----KNGL-TRSYKFCNTRVPIHARVQDLI 56

Query: 70  SRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVI 129
            R+ L EK++ + + A  VPRLG+  YEWWSEALHGVSNVGPGT F      ATSFP VI
Sbjct: 57  GRLALPEKIRLVVNNAIAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFSAATSFPQVI 116

Query: 130 LTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPF 189
            T ASFN+SLW +IG+ VS EARAMYN G AGLT+WSPN+N+ RDPRWGR  ETPGEDP 
Sbjct: 117 TTAASFNQSLWLEIGRIVSDEARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPT 176

Query: 190 VVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVT 249
           V G+YA +YV+GLQ   G  N        LKV++CCKHY AYD+DNW GVDR+HF+A+V+
Sbjct: 177 VAGKYAASYVQGLQG-NGAGNR-------LKVAACCKHYTAYDLDNWNGVDRFHFNAKVS 228

Query: 250 EQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVAD 309
           +QD+ +T+  PF+ CV++G  +SVMCSYN+VNG P+CADP+LL  T+RGEW L+GYIV+D
Sbjct: 229 KQDLADTYDVPFKACVRDGKVASVMCSYNQVNGKPTCADPELLRNTIRGEWGLNGYIVSD 288

Query: 310 CDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSL 369
           CDS+ V+ DN  +   + E A A  +KAGLDLDCG +    T  A++QG + E D++ +L
Sbjct: 289 CDSVGVLYDNQHY-TRTPEQAAAAAIKAGLDLDCGPFLALHTDGAIKQGLISENDLNLAL 347

Query: 370 KYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSA 428
             L TV MRLG FDG  Q Y +LG +D+C   + ++A EAAR+GIVLL+N  N LPL+  
Sbjct: 348 ANLITVQMRLGMFDGDAQPYGNLGTRDVCLPSHNDVALEAARQGIVLLQNKGNALPLSPT 407

Query: 429 KVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIF 488
           + +TV V+GP+++ TV MIGNYAGI C Y +P+ G + Y    ++ GC DV C  N    
Sbjct: 408 RYRTVGVIGPNSDVTVTMIGNYAGIACGYTTPLQGIARYVKTIHQAGCKDVGCGGNQLFG 467

Query: 489 AASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSA 548
            + + A+ ADAT+++ GLD S+EAE  DR  L LPG+Q +L+++VA  A+GPVILV+MS 
Sbjct: 468 LSEQVARQADATVLVMGLDQSIEAEFRDRTGLLLPGHQQELVSRVARAARGPVILVLMSG 527

Query: 549 GGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL 608
           G +D+ FA+ +  I AILW GYPG+ GG AIADV+FG+ NP GRLP TWY  DYV+ +P+
Sbjct: 528 GPIDVTFAKNDPKISAILWVGYPGQSGGTAIADVIFGRTNPSGRLPNTWYPQDYVRKVPM 587

Query: 609 TSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
           T+M +R   + GYPGRTY+FY GP ++PFG+GLSY++F ++L    K + V         
Sbjct: 588 TNMDMRANPATGYPGRTYRFYKGPVVFPFGHGLSYSRFTHSLALAPKQVSVQFTTPLTQA 647

Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY 727
             N ++ A K       V+   CD+    F VD +N GS DG+  ++VYSK P       
Sbjct: 648 FTNSSNKAMK-------VSHANCDELEVGFHVDVKNEGSMDGAHTLLVYSKAP-----NG 695

Query: 728 IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           +KQ++ F + +V AG   R+K   + C  L+ VD      +P GEH + +G+
Sbjct: 696 VKQLVNFHKTYVPAGSKTRVKVGVHVCNHLSAVDEFGVRRIPMGEHELQIGD 747


>gi|298364130|gb|ADI79208.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Malus x domestica]
          Length = 774

 Score =  757 bits (1955), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/753 (49%), Positives = 500/753 (66%), Gaps = 26/753 (3%)

Query: 31  SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
           + P F CDP       L+     FC   +P  +RV+DL+ R+TL EK+  L + A  VPR
Sbjct: 28  ARPPFACDPRNPITRTLK-----FCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPR 82

Query: 91  LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
           LG+  YEWWSEALHGVSNVGPGT F   + GATSFP VI T ASFNESLW++IG+ VS E
Sbjct: 83  LGIQGYEWWSEALHGVSNVGPGTKFGTFL-GATSFPQVITTAASFNESLWEEIGRVVSDE 141

Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
           ARAMYN G AGLT+WSPN+N+ RDPRWGR  ETPGEDP +  +Y   YV+GLQ       
Sbjct: 142 ARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPILAAKYGARYVKGLQG------ 195

Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
             D     LKV++CCKHY AYD+DNW GVDR+HF+ARV++QD+E+T+  PF  CV +G+ 
Sbjct: 196 --DGAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFRACVVDGNV 253

Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
           +SVMCSYN+VNG P+CADP+LL  T+RG+W L+GYIV+DCDS+ V  DN  +   + E+A
Sbjct: 254 ASVMCSYNQVNGKPTCADPELLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHY-TKTPEEA 312

Query: 331 VAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---Q 387
            A  +KAGLDLDCG +    T  AV+ G+V E DI+ +L    TV MRLG FDG P   +
Sbjct: 313 AAYAIKAGLDLDCGPFLGIHTEAAVRFGQVNEIDINYALANTITVQMRLGMFDGEPSAQR 372

Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
           Y +LG  D+C   + ELA EAAR+GIVLL+N  N+LPL++ + +TVAV+GP+++ T  MI
Sbjct: 373 YGNLGLADVCKPSSNELALEAARQGIVLLENRGNSLPLSTMRHRTVAVIGPNSDVTETMI 432

Query: 448 GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
           GNYAGI C Y +P+ G + Y    ++ GC DV C  N  I AA  AA+ ADAT+++ GLD
Sbjct: 433 GNYAGIACGYTTPLQGIARYTRTIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLD 492

Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
            S+EAE  DR DL LPG+Q +L+++VA  ++GP ILVIMS G +D+ FA+ +  I AI+W
Sbjct: 493 QSIEAEFRDRTDLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIW 552

Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
            GYPG+ GG AIADV+FG  NP G+LP+TWY  +YV  LP+T M +R   + GYPGRTY+
Sbjct: 553 VGYPGQAGGTAIADVLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYR 612

Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
           FY GP ++PFG GLSYT+F ++L      + V    L   +N     +        + V+
Sbjct: 613 FYKGPVVFPFGLGLSYTRFSHSLAQGPTLVSVPFTSLVASKNTTMLGNHD------IRVS 666

Query: 688 DLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKR 746
              CD    +  +D +N G+ DG+  ++V++ PP    A   KQ++GF +V + AG  +R
Sbjct: 667 HTNCDSLSLDVHIDIKNSGTMDGTHTLLVFATPPTGKWAPN-KQLVGFHKVHIVAGSERR 725

Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           ++     CK L++VD      +P G+H + +G+
Sbjct: 726 VRVGVQVCKHLSVVDELGIRRIPLGQHKLEIGD 758


>gi|183579871|dbj|BAG28345.1| arabinofuranosidase [Citrus unshiu]
          Length = 769

 Score =  756 bits (1952), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 372/733 (50%), Positives = 498/733 (67%), Gaps = 31/733 (4%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
           F CDP    + GL   S  FC +S+P  +RV+DL+ R+TL EK++ L + A  VPRLG+ 
Sbjct: 28  FACDP----RNGL-TRSLRFCRTSVPIHVRVQDLIGRLTLQEKIRLLVNNAAAVPRLGIQ 82

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            YEWWSEALHGVSNVGPGT F    PGATSFP VI T A+FNESLW++IG+ VS EARAM
Sbjct: 83  GYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAAAFNESLWEEIGRVVSDEARAM 142

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
           YN G AGLTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA +YVR LQ   G       
Sbjct: 143 YNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRRLQGNTGSR----- 197

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
               LKV++CCKHY AYD+DNW GVDRYHF+ARV++QD+E+T+  PF+ CV EG  +SVM
Sbjct: 198 ----LKVAACCKHYTAYDLDNWNGVDRYHFNARVSKQDLEDTYNVPFKACVVEGKVASVM 253

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           CSYN+VNG P+CADP +L  T+RG+W L GYIV+DCDS+ V+ +   +   + E+A A  
Sbjct: 254 CSYNQVNGKPTCADPDILKNTIRGQWRLDGYIVSDCDSVGVLYNTQHY-TRTPEEAAADA 312

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
           +KAGLDLDCG +    T  AV+ G ++E D++ +  Y  TV MRLG FDG P    + +L
Sbjct: 313 IKAGLDLDCGPFLAIHTEGAVRGGLLREEDVNLASAYTITVQMRLGMFDGEPSAQPFGNL 372

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
           G +D+C+  + +LA +AA +GIVLLKN   TLPL++ +  TVAV+GP+++ TV MIGNYA
Sbjct: 373 GPRDVCTPAHQQLALQAAHQGIVLLKNSARTLPLSTLRHHTVAVIGPNSDVTVTMIGNYA 432

Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
           G+ C Y +P+ G S YA   ++ GC  VAC  N  I AA  AA+ ADAT+++ GLD S+E
Sbjct: 433 GVACGYTTPLQGISRYAKTIHQAGCLGVACNGNQLIGAAEVAARQADATVLVMGLDQSIE 492

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
           AE +DR  L LPG Q +L+++VA+ ++GPV+LV+M  G VD++FA+ +  I AILW GYP
Sbjct: 493 AEFIDRAGLLLPGRQQELVSRVAKASRGPVVLVLMCGGPVDVSFAKNDPRIGAILWVGYP 552

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+ GG AIADV+FG+ NPGG+LP+TWY  DYV  LP+T M +R     GYPGRTY+FY G
Sbjct: 553 GQAGGAAIADVLFGRANPGGKLPMTWYPQDYVARLPMTDMRMRA--GRGYPGRTYRFYKG 610

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNL-NKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
           P ++PFG+G+SYT F + L        V +   L   +N   +S+A       + V    
Sbjct: 611 PVVFPFGHGMSYTTFAHTLSKAPNQFSVPIATSLYAFKNTTISSNA-------IRVAHTN 663

Query: 691 CDDYFE--FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
           C+D       VD +N G   G+  ++V++KPPA   +   KQ+IGF++V V AG  + ++
Sbjct: 664 CNDAMSLGLHVDVKNTGDMAGTHTLLVFAKPPAGNWSPN-KQLIGFKKVHVTAGALQSVR 722

Query: 749 FVFNACKSLNIVD 761
              + CK L++VD
Sbjct: 723 LDIHVCKHLSVVD 735


>gi|15239867|ref|NP_199747.1| beta-xylosidase 1 [Arabidopsis thaliana]
 gi|75262458|sp|Q9FGY1.1|BXL1_ARATH RecName: Full=Beta-D-xylosidase 1; Short=AtBXL1; AltName:
           Full=Alpha-L-arabinofuranosidase; Flags: Precursor
 gi|9759419|dbj|BAB09906.1| xylosidase [Arabidopsis thaliana]
 gi|21539545|gb|AAM53325.1| xylosidase [Arabidopsis thaliana]
 gi|332008419|gb|AED95802.1| beta-xylosidase 1 [Arabidopsis thaliana]
          Length = 774

 Score =  756 bits (1952), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 383/779 (49%), Positives = 522/779 (67%), Gaps = 28/779 (3%)

Query: 7   SLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVK 66
           +LL  +  + +LVF    V ++ S  P+F CDP      GL   +  FC +++P  +RV+
Sbjct: 7   ALLIGNKVVVILVFLLCLVHSSESLRPLFACDPAN----GL-TRTLRFCRANVPIHVRVQ 61

Query: 67  DLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFP 126
           DL+ R+TL EK++ L + A  VPRLG+  YEWWSEALHG+S+VGPG  F    PGATSFP
Sbjct: 62  DLLGRLTLQEKIRNLVNNAAAVPRLGIGGYEWWSEALHGISDVGPGAKFGGAFPGATSFP 121

Query: 127 TVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGE 186
            VI T ASFN+SLW++IG+ VS EARAMYN G AGLTYWSPN+N+ RDPRWGR  ETPGE
Sbjct: 122 QVITTAASFNQSLWEEIGRVVSDEARAMYNGGVAGLTYWSPNVNILRDPRWGRGQETPGE 181

Query: 187 DPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDA 246
           DP V  +YA +YVRGLQ        T   +R LKV++CCKHY AYD+DNW GVDR+HF+A
Sbjct: 182 DPIVAAKYAASYVRGLQ-------GTAAGNR-LKVAACCKHYTAYDLDNWNGVDRFHFNA 233

Query: 247 RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYI 306
           +VT+QD+E+T+  PF+ CV EG  +SVMCSYN+VNG P+CAD  LL  T+RG+W L+GYI
Sbjct: 234 KVTQQDLEDTYNVPFKSCVYEGKVASVMCSYNQVNGKPTCADENLLKNTIRGQWRLNGYI 293

Query: 307 VADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDID 366
           V+DCDS+ V   N +    + E+A A+++KAGLDLDCG +   FT  AV++G + E DI+
Sbjct: 294 VSDCDSVDVFF-NQQHYTSTPEEAAARSIKAGLDLDCGPFLAIFTEGAVKKGLLTENDIN 352

Query: 367 KSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
            +L    TV MRLG FDG+   Y +LG +D+C+  +  LA EAA +GIVLLKN   +LPL
Sbjct: 353 LALANTLTVQMRLGMFDGNLGPYANLGPRDVCTPAHKHLALEAAHQGIVLLKNSARSLPL 412

Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN 485
           +  + +TVAV+GP+++ T  MIGNYAG  C Y SP+ G S YA   ++ GC  VACK N 
Sbjct: 413 SPRRHRTVAVIGPNSDVTETMIGNYAGKACAYTSPLQGISRYARTLHQAGCAGVACKGNQ 472

Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
              AA  AA+ ADAT+++ GLD S+EAE+ DR  L LPGYQ  L+ +VA+ ++GPVILV+
Sbjct: 473 GFGAAEAAAREADATVLVMGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQASRGPVILVL 532

Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
           MS G +D+ FA+ +  + AI+WAGYPG+ GG AIA+++FG  NPGG+LP+TWY  DYV  
Sbjct: 533 MSGGPIDVTFAKNDPRVAAIIWAGYPGQAGGAAIANIIFGAANPGGKLPMTWYPQDYVAK 592

Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLL-SFTKTIQVNLNKL 664
           +P+T M +R   S  YPGRTY+FY GP ++PFG+GLSYT F ++L  S    + V+L+  
Sbjct: 593 VPMTVMAMRA--SGNYPGRTYRFYKGPVVFPFGFGLSYTTFTHSLAKSPLAQLSVSLS-- 648

Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDY--FEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
               NLN  +    +    + V+   C+ +      V+  N G  DG+  V V+++PP  
Sbjct: 649 ----NLNSANTILNSSSHSIKVSHTNCNSFPKMPLHVEVSNTGEFDGTHTVFVFAEPPIN 704

Query: 723 -IAATYI-KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            I    + KQ+I F++V V AG  + ++   +ACK L +VD      +P GEH + +G+
Sbjct: 705 GIKGLGVNKQLIAFEKVHVMAGAKQTVQVDVDACKHLGVVDEYGKRRIPMGEHKLHIGD 763


>gi|86553064|gb|AAS17751.2| beta xylosidase [Fragaria x ananassa]
          Length = 772

 Score =  755 bits (1949), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 383/777 (49%), Positives = 518/777 (66%), Gaps = 34/777 (4%)

Query: 7   SLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVK 66
           SL+   L ++ L+F  N V A     P F CDP      G     F FC + +P  +RV+
Sbjct: 10  SLIALVLCVSALLF--NLVHAR----PPFACDPRNPLTRG-----FKFCRTRVPVHVRVQ 58

Query: 67  DLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFP 126
           DL+ R+TL EK++ L + A  VPRLG+  YEWWSEALHGVSNVGPGT F    PGATSFP
Sbjct: 59  DLIGRLTLQEKIRLLVNNAIAVPRLGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFP 118

Query: 127 TVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGE 186
            VI T ASFN+SLW++IGQ VS EARAMYN G+AGLTYWSPN+N+ RDPRWGR  ETPGE
Sbjct: 119 QVITTAASFNQSLWQEIGQVVSDEARAMYNGGQAGLTYWSPNVNIFRDPRWGRGQETPGE 178

Query: 187 DPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDA 246
           DP +  +YA +YV+GLQ         D     LKV++CCKHY AYD+DNW GVDR+HF+A
Sbjct: 179 DPVLSAKYAASYVKGLQG--------DGAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNA 230

Query: 247 RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYI 306
           RV++QD+ +T+  PF  CV EG  +SVMCSYN+VNG P+CADP LL  T+RGEW L+GYI
Sbjct: 231 RVSKQDLADTYDVPFRGCVLEGKVASVMCSYNQVNGKPTCADPDLLKNTIRGEWKLNGYI 290

Query: 307 VADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDID 366
           V+DCDS+ V  D   +   + E+A A+ +KAGLDLDCG +    T  A++ G + E D+D
Sbjct: 291 VSDCDSVGVFYDQQHY-TRTPEEAAAEAIKAGLDLDCGPFLAIHTEGAIKAGLLPEIDVD 349

Query: 367 KSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTL 423
            +L    TV MRLG FDG P   QY +LG +D+C+  + ELA EA+R+GIVLL+N+ +TL
Sbjct: 350 YALANTLTVQMRLGMFDGEPSAQQYGNLGPRDVCTPAHQELALEASRQGIVLLQNNGHTL 409

Query: 424 PLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKS 483
           PL++ + +TVAVVGP+++ T  MIGNYAG+ C Y +P+ G   Y    ++ GC +VAC +
Sbjct: 410 PLSTVRHRTVAVVGPNSDVTETMIGNYAGVACGYTTPLQGIGRYTKTIHQQGCTNVACTT 469

Query: 484 NNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVIL 543
           N    AA  AA+ ADAT+++ GLD S+EAE  DR DL +PG+Q +L+++VA  ++GP +L
Sbjct: 470 NQLFGAAEAAARQADATVLVMGLDQSIEAEFRDRTDLVMPGHQQELVSRVARASRGPTVL 529

Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYV 603
           V+MS G +D++FA+ +  I AI+W GYPG+ GG A+ADV+FG  NP G+LP+TWY  DYV
Sbjct: 530 VLMSGGPIDVSFAKNDPKIGAIIWVGYPGQAGGTAMADVLFGTTNPSGKLPMTWYPQDYV 589

Query: 604 QMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNK 663
             +P+T+M +R     GYPGRTY+FY GP ++PFG GLSYT F ++L     ++ V L  
Sbjct: 590 SKVPMTNMAMRA--GRGYPGRTYRFYKGPVVFPFGLGLSYTTFAHSLAQVPTSVSVPLTS 647

Query: 664 LQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
           L    N    S A       V V+   C+       V  +N G+ DG+  ++V+S PP+ 
Sbjct: 648 LSATTNSTMLSSA-------VRVSHTNCNPLSLALHVVVKNTGARDGTHTLLVFSSPPSG 700

Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             A   KQ++GF +V + AG +KR+K   + CK L++VD      +P GEH + +G+
Sbjct: 701 KWAAN-KQLVGFHKVHIVAGSHKRVKVDVHVCKHLSVVDQFGIRRIPIGEHKLQIGD 756


>gi|371917280|dbj|BAL44716.1| SlArf/Xyl1 [Solanum lycopersicum]
          Length = 771

 Score =  754 bits (1948), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/771 (49%), Positives = 511/771 (66%), Gaps = 24/771 (3%)

Query: 11  FSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVS 70
           F L I +L F+ +     G S   F CDP       L+     FC +SLP  +RV+DL++
Sbjct: 6   FILIIFVLAFAYS-----GESRQPFACDPANAGIRNLR-----FCKTSLPIHVRVQDLIA 55

Query: 71  RMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVIL 130
           R+TL EK++ L + A  V RLG+  YEWWSEALHGVSN G G  F    PGATSFP VI 
Sbjct: 56  RLTLQEKIRLLVNNAAPVQRLGISGYEWWSEALHGVSNTGYGVKFGGAFPGATSFPQVIT 115

Query: 131 TTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFV 190
           T ASFN SLW++IG+ VS E RAMYN G AGLT+WSPN+N+ RDPRWGR  ETPGEDP +
Sbjct: 116 TAASFNASLWEEIGRVVSEEGRAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPHL 175

Query: 191 VGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTE 250
           V +Y V+YV+GLQ   G  N        LKV++CCKHY AYD+D+W G DRYHF+A+V+ 
Sbjct: 176 VAQYGVSYVKGLQGGGGRGNTR------LKVAACCKHYTAYDLDDWNGYDRYHFNAKVSM 229

Query: 251 QDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADC 310
           QD+E+T+  PF+ CV EG+ +SVMCSYN++NG PSCADP LL  T+R +W L+GYIV+DC
Sbjct: 230 QDLEDTYNAPFKACVVEGNVASVMCSYNQINGKPSCADPTLLRDTIRNQWHLNGYIVSDC 289

Query: 311 DSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLK 370
           DS+ V+ +   +     EDA A T+KAGLDLDCG +    T  AV  GKV + +I+ +L 
Sbjct: 290 DSVGVLFEKQHY-TRYPEDAAAITIKAGLDLDCGPFLAIHTDKAVHTGKVSQVEINNALA 348

Query: 371 YLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAK 429
              TV MRLG FDG +  Y +LG +D+CS  + +LA +AAREGIVLLKN    LPL++ +
Sbjct: 349 NTITVQMRLGMFDGPNGPYANLGPKDVCSPAHQQLALQAAREGIVLLKNIGQALPLSTKR 408

Query: 430 VKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA 489
            +TVAV+GP+++AT+AMIGNYAG+PC Y+SP+ G S YA   ++ GC  VAC  N +   
Sbjct: 409 HRTVAVIGPNSDATLAMIGNYAGVPCGYISPLQGISRYARTIHQQGCMGVACPGNQNFGL 468

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A  AA+ ADAT+++ GLD S+EAE+ DR  L LPG+Q  LI++VA  +KGPV+LV+MS G
Sbjct: 469 AEVAARHADATVLVMGLDQSIEAEAKDRVTLLLPGHQQDLISRVAMASKGPVVLVLMSGG 528

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
            +D+ FA+ +  + +I+W GYPG+ GG AIADV+FG  NPGG+LP+TWY  DYV  + + 
Sbjct: 529 PIDVTFAKNDPRVSSIVWVGYPGQAGGAAIADVLFGATNPGGKLPMTWYPQDYVAKVSMA 588

Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV-NLNKLQHCR 668
           +M +R   S GYPGRTY+FY GPT++PFG G+SYT F  +L+S   T+ V  L+      
Sbjct: 589 NMDMRANPSKGYPGRTYRFYKGPTVFPFGAGISYTTFSQHLVSAPITVSVPTLHSHDLVS 648

Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
           N   T   +K     +  N    D   +  +D +N G  DG+  V+++S PP     T  
Sbjct: 649 NNTTTLMKAKATVRTIHTNCESLD--IDMHIDVKNTGDMDGTHAVLIFSTPPDP---TET 703

Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           KQ++ F++V V AG  +R+K   NACK L++ D      +  GEH I VG+
Sbjct: 704 KQLVAFEKVHVVAGAKQRVKINMNACKHLSVADEYGVRRIYMGEHKIHVGD 754


>gi|408354266|gb|AFU54452.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
          Length = 775

 Score =  754 bits (1948), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/755 (49%), Positives = 506/755 (67%), Gaps = 29/755 (3%)

Query: 31  SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
           + P F CDP      GL+     FC  ++P  +RV+DL+ R+TL EK++ L + A  VPR
Sbjct: 28  ARPPFACDPHNPITRGLK-----FCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPR 82

Query: 91  LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
           LG+  YEWWSEALHGVSNVGPGT F    PGATSFP VI T ASFNESLW++IG+ V  E
Sbjct: 83  LGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRVVPDE 142

Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
           ARAMYN G AGLTYWSPN+N+ RDPRWGR  ETPGEDP +  +YA  YV+GLQ       
Sbjct: 143 ARAMYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQG------ 196

Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
             D     LKV++CCKHY AYD+DNW GV+R+HF+ARV++QD+ +T+  PF+ CV EG  
Sbjct: 197 --DGAGNRLKVAACCKHYTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHV 254

Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
           +SVMCSYN+VNG P+CADP LL  T+RG+W L+GYIV+DCDS+ V+ +   +   + E+A
Sbjct: 255 ASVMCSYNQVNGKPTCADPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHY-TRTPEEA 313

Query: 331 VAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---Q 387
            A  +KAGLDLDCG +    T  AV++G V + +I+ +L    TV MRLG FDG P   Q
Sbjct: 314 AADAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQ 373

Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
           Y +LG +D+C+  + +LA EAAR+GIVLL+N   +LPL+  + +TVAV+GP+++ TV MI
Sbjct: 374 YGNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMI 433

Query: 448 GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
           GNYAG+ C Y +P+ G   Y    ++ GC DV C  N    AA  AA+ ADAT+++ GLD
Sbjct: 434 GNYAGVACGYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLD 493

Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
            S+EAE +DR  L LPG+Q +L+++VA  ++GP ILV+MS G +D+ FA+ +  I AI+W
Sbjct: 494 QSIEAEFVDRVGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIW 553

Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
            GYPG+ GG AIADV+FG  NPGG+LP+TWY  +YV  LP+T M +R   + GYPGRTY+
Sbjct: 554 VGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYR 613

Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
           FY GP ++PFG GLSYT F +NL     ++ V L  L+   N    S A       V V+
Sbjct: 614 FYRGPVVFPFGLGLSYTTFAHNLAHGPTSVSVPLTSLKATANSTMLSKA-------VRVS 666

Query: 688 DLRCDDY--FEFKVDFQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFVRAGRN 744
              C+     +  VD +N GS DG+  ++V++ PP  + AA+  KQ++GF ++ + AG  
Sbjct: 667 HADCNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPDGKWAAS--KQLVGFHKIHIAAGSE 724

Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            R++   + CK L++VD      +P GEH + +G+
Sbjct: 725 TRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGD 759


>gi|408354264|gb|AFU54451.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Prunus salicina]
          Length = 775

 Score =  754 bits (1947), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/755 (49%), Positives = 506/755 (67%), Gaps = 29/755 (3%)

Query: 31  SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
           + P F CDP      GL+     FC  ++P  +RV+DL+ R+TL EK++ L + A  VPR
Sbjct: 28  ARPPFACDPHNPITRGLK-----FCRVTVPIHVRVQDLIGRLTLQEKIRLLVNNAIAVPR 82

Query: 91  LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
           LG+  YEWWSEALHGVSNVGPGT F    PGATSFP VI T ASFNESLW++IG+ V  E
Sbjct: 83  LGIQGYEWWSEALHGVSNVGPGTKFGGAFPGATSFPQVITTAASFNESLWQEIGRGVPDE 142

Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
           ARAMYN G AGLTYWSPN+N+ RDPRWGR  ETPGEDP +  +YA  YV+GLQ       
Sbjct: 143 ARAMYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLASKYAARYVKGLQG------ 196

Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
             D     LKV++CCKHY AYD+DNW GV+R+HF+ARV++QD+ +T+  PF+ CV EG  
Sbjct: 197 --DGAGNRLKVAACCKHYTAYDLDNWNGVNRFHFNARVSKQDLADTYNVPFKACVVEGHV 254

Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
           +SVMCSYN+VNG P+CADP LL  T+RG+W L+GYIV+DCDS+ V+ +   +   + E+A
Sbjct: 255 ASVMCSYNQVNGKPTCADPDLLKGTIRGQWRLNGYIVSDCDSVGVLYEEQHY-TRTPEEA 313

Query: 331 VAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---Q 387
            A  +KAGLDLDCG +    T  AV++G V + +I+ +L    TV MRLG FDG P   Q
Sbjct: 314 AADAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQ 373

Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
           Y +LG +D+C+  + +LA EAAR+GIVLL+N   +LPL+  + +TVAV+GP+++ TV MI
Sbjct: 374 YGNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTMI 433

Query: 448 GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
           GNYAG+ C Y +P+ G   Y    ++ GC DV C  N    AA  AA+ ADAT+++ GLD
Sbjct: 434 GNYAGVACGYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLD 493

Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
            S+EAE +DR  L LPG+Q +L+++VA  ++GP ILV+MS G +D+ FA+ +  I AI+W
Sbjct: 494 QSIEAEFVDRVGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIW 553

Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
            GYPG+ GG AIADV+FG  NPGG+LP+TWY  +YV  LP+T M +R   + GYPGRTY+
Sbjct: 554 VGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYR 613

Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
           FY GP ++PFG GLSYT F +NL     ++ V L  L+   N    S A       V V+
Sbjct: 614 FYRGPVVFPFGLGLSYTTFAHNLAHGPTSVSVPLTSLKATANSTMLSKA-------VRVS 666

Query: 688 DLRCDDY--FEFKVDFQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFVRAGRN 744
              C+     +  VD +N GS DG+  ++V++ PP  + AA+  KQ++GF ++ + AG  
Sbjct: 667 HADCNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPDGKWAAS--KQLVGFHKIHIAAGSE 724

Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            R++   + CK L++VD      +P GEH + +G+
Sbjct: 725 TRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGD 759


>gi|157041199|dbj|BAF79669.1| beta-D-xylosidase [Pyrus pyrifolia]
          Length = 774

 Score =  754 bits (1946), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 375/753 (49%), Positives = 503/753 (66%), Gaps = 26/753 (3%)

Query: 31  SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
           + P F CDP       L+     FC   +P  +RV+DL+ R+TL EK+  L + A  VPR
Sbjct: 28  ARPPFACDPRNPITRTLK-----FCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPR 82

Query: 91  LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
           LG+  YEWWSEALHGVSNVGPGT F   + GATSFP VI T ASFNESLW++IG+ VS E
Sbjct: 83  LGIQGYEWWSEALHGVSNVGPGTKFGTFL-GATSFPQVITTAASFNESLWEEIGRVVSDE 141

Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
           ARAMYN G AGLT+WSPN+N+ RDPRWGR  ETPGEDP +  +Y   YV+GLQ       
Sbjct: 142 ARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQG------ 195

Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
             D     LKV++CCKHY AYD+DNW GVDR+HF+ARV++QD+E+T+  PF+ CV +G+ 
Sbjct: 196 --DGAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNV 253

Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
           +SVMCSYN+VNG P+CADP LL  T+RG+W L+GYIV+DCDS+ V  DN  +   + E A
Sbjct: 254 ASVMCSYNQVNGKPTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHY-TKTPEAA 312

Query: 331 VAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---Q 387
            A  +KAGLDLDCG +    T  A++ G+V E DI+ +L    TV MRLG FDG P   +
Sbjct: 313 AAYAIKAGLDLDCGPFLGIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQR 372

Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
           Y +LG  D+C   + ELA EAAR+GIVLL+N  N+LPL++ + +TVAV+GP+++ T  MI
Sbjct: 373 YGNLGLADVCKPSSNELALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMI 432

Query: 448 GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
           GNYAGI C Y +P+ G + Y    ++ GC DV C  N  I AA  AA+ ADAT+++ GLD
Sbjct: 433 GNYAGIACGYTTPLQGIARYTRTIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLD 492

Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
            S+EAE  DR  L LPG+Q +L+++VA  ++GP ILVIMS G +D+ FA+ +  I AI+W
Sbjct: 493 QSIEAEFRDRTGLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPRIGAIIW 552

Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
            GYPG+ GG AIADV+FG  NP G+LP+TWY  +YV  LP+T M +R   + GYPGRTY+
Sbjct: 553 VGYPGQAGGTAIADVLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYR 612

Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
           FY GP ++PFG GLSYT+F ++L      + V L  L   +N    S+       GV V+
Sbjct: 613 FYKGPVVFPFGMGLSYTRFSHSLAQGPTLVSVPLTSLVAAKNTTMLSNH------GVRVS 666

Query: 688 DLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKR 746
              CD    +F +D +N G+ DG+  ++V++  PA   A   KQ++GF +V + AG  +R
Sbjct: 667 HTNCDSLSLDFHIDIKNTGTMDGTHTLLVFATQPAGKWAPN-KQLVGFHKVHIVAGSERR 725

Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           ++   + CK L+IVD      +P G+H + +G+
Sbjct: 726 VRVGVHVCKHLSIVDKLGIRRIPLGQHKLEIGD 758


>gi|224070626|ref|XP_002303181.1| predicted protein [Populus trichocarpa]
 gi|222840613|gb|EEE78160.1| predicted protein [Populus trichocarpa]
          Length = 773

 Score =  754 bits (1946), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 390/773 (50%), Positives = 513/773 (66%), Gaps = 28/773 (3%)

Query: 11  FSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVS 70
           F L    LVF +  V A   SSPVF CD      L    +S  FC++S+  + RV DLV 
Sbjct: 16  FLLFCMFLVFLSTHVSAQ--SSPVFACDVVSNPSL----ASLGFCNTSIGINDRVVDLVK 69

Query: 71  RMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVIL 130
           R+TL EK+  L + A  V RLG+P+YEWWSEALHGVS VGPGTHF D + GATSFP VIL
Sbjct: 70  RLTLQEKIVFLVNSAGNVSRLGIPKYEWWSEALHGVSYVGPGTHFSDDVAGATSFPQVIL 129

Query: 131 TTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFV 190
           T ASFN SL++ IG+ VSTEARAMYN+G AGLT+WSPNIN+ RDPRWGR  ETPGEDP +
Sbjct: 130 TAASFNTSLFEAIGKVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLL 189

Query: 191 VGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTE 250
             +Y   YV+GLQ  +      D +   LKV++CCKHY AYD+DNWKG DRYHF+A VT+
Sbjct: 190 SSKYGSCYVKGLQQRD------DGDPDKLKVAACCKHYTAYDLDNWKGSDRYHFNAVVTK 243

Query: 251 QDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADC 310
           QDM++TF  PF+ CV +G+ +SVMCSYN+VNG P+CADP LL+  +RGEW+L+GYIV DC
Sbjct: 244 QDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWNLNGYIVTDC 303

Query: 311 DSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLK 370
           DS+ V   +  +    +E A A  L AG+DL+CG +    T  AV+ G V E  ID ++ 
Sbjct: 304 DSLDVFYKSQNYTKTPEEAAAAAIL-AGVDLNCGSFLGQHTEAAVKGGLVNEHAIDIAVS 362

Query: 371 YLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNS 427
             +  LMRLGFFDG P    Y  LG +D+C+ EN ELA EAAR+GIVLLKN   +LPL+ 
Sbjct: 363 NNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQGIVLLKNTAGSLPLSP 422

Query: 428 AKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSI 487
             +K +AV+GP+AN T  MIGNY G PC+Y +P+ G +     TY  GC +VAC S   +
Sbjct: 423 TAIKNLAVIGPNANVTKTMIGNYEGTPCKYTTPLQGLAASVATTYLPGCSNVAC-STAQV 481

Query: 488 FAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMS 547
             A + A  ADAT+++ G DLS+EAES DR D+ LPG Q  LI  VA V+ GPVILVIMS
Sbjct: 482 DDAKKLAAAADATVLVMGADLSIEAESRDRVDVLLPGQQQLLITAVANVSCGPVILVIMS 541

Query: 548 AGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP 607
            GG+D++FA TN  I +ILW GYPGE GG AIAD++FG +NP GRLP+TWY   YV  +P
Sbjct: 542 GGGMDVSFARTNDKITSILWVGYPGEAGGAAIADIIFGYYNPSGRLPMTWYPQSYVDKVP 601

Query: 608 LTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC 667
           +T+M +RP  S GYPGRTY+FY G T+Y FG GLSY+QF + L+   + + V L +   C
Sbjct: 602 MTNMNMRPDPSNGYPGRTYRFYTGETVYSFGDGLSYSQFTHELIQAPQLVYVPLEESHVC 661

Query: 668 RNLNYTSDASKTRCPGVLVNDLRCDD-YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
            +         + C  V+ ++  C +  F+  +  +N G+  GS  V ++S PPA +  +
Sbjct: 662 HS---------SECQSVVASEQTCQNSTFDMLLRVKNEGTISGSHTVFLFSSPPA-VHNS 711

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             K ++GF++VF+ A   + ++F  + CK L++VD   +  +  GEH + VG+
Sbjct: 712 PQKHLVGFEKVFLNAQTGRHVRFKVDICKDLSVVDELGSKKVALGEHVLHVGS 764


>gi|115460876|ref|NP_001054038.1| Os04g0640700 [Oryza sativa Japonica Group]
 gi|38344900|emb|CAE02971.2| OSJNBb0079B02.3 [Oryza sativa Japonica Group]
 gi|113565609|dbj|BAF15952.1| Os04g0640700 [Oryza sativa Japonica Group]
 gi|116310882|emb|CAH67823.1| OSIGBa0138H21-OSIGBa0138E01.14 [Oryza sativa Indica Group]
 gi|218195682|gb|EEC78109.1| hypothetical protein OsI_17615 [Oryza sativa Indica Group]
          Length = 765

 Score =  753 bits (1945), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/755 (49%), Positives = 510/755 (67%), Gaps = 29/755 (3%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           + +PVF CD    +     +S + FCD +   + R  DL+ R+TL EKV  L +    +P
Sbjct: 26  AQTPVFACDASNAT-----VSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALP 80

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P YEWWSEALHGVS VGPGT F  ++PGATSFP  ILT ASFN SL++ IG+ VST
Sbjct: 81  RLGIPAYEWWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGEVVST 140

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           EARAM+N+G AGLT+WSPNIN+ RDPRWGR  ETPGEDP +  +YAV YV GLQD  G  
Sbjct: 141 EARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGGGS 200

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
           +A       LKV++CCKHY AYDVDNWKGV+RY FDA V++QD+++TF  PF+ CV +G+
Sbjct: 201 DA-------LKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGN 253

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
            +SVMCSYN+VNG P+CAD  LL+  +RG+W L+GYIV+DCDS+ V+ +N  +  +  ED
Sbjct: 254 VASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKN-PED 312

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
           A A T+K+GLDL+CG +    T  AVQ GK+ E+D+D+++   + VLMRLGFFDG P+  
Sbjct: 313 AAAITIKSGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKL 372

Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
            + SLG +D+C+  N ELA EAAR+GIVLLKN    LPL++  +K++AV+GP+ANA+  M
Sbjct: 373 PFGSLGPKDVCTSSNQELAREAARQGIVLLKN-TGALPLSAKSIKSMAVIGPNANASFTM 431

Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAG 505
           IGNY G PC+Y +P+ G        Y+ GC +V C  N+  + AA++AA +AD T+++ G
Sbjct: 432 IGNYEGTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVG 491

Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
            D SVE ESLDR  L LPG Q QL++ VA  ++GPVILV+MS G  DI+FA+++  I AI
Sbjct: 492 ADQSVERESLDRTSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAI 551

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           LW GYPGE GG A+AD++FG  NPGGRLP+TWY   +   + +T M +RP  S GYPGRT
Sbjct: 552 LWVGYPGEAGGAALADILFGYHNPGGRLPVTWYPASFADKVSMTDMRMRPDSSTGYPGRT 611

Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN-YTSDASKTRCPGV 684
           Y+FY G T+Y FG GLSYT+F ++L+S  + + V L +   C   + ++ +A+   C  +
Sbjct: 612 YRFYTGDTVYAFGDGLSYTKFAHSLVSAPEQVAVQLAEGHACHTEHCFSVEAAGEHCGSL 671

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
                     F+  +  +N G   G   V ++S PP+ + +   K ++GF++V +  G+ 
Sbjct: 672 ---------SFDVHLRVRNAGGMAGGHTVFLFSSPPS-VHSAPAKHLLGFEKVSLEPGQA 721

Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             + F  + CK L++VD   N  +  G HT+ VG+
Sbjct: 722 GVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGD 756


>gi|297811069|ref|XP_002873418.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
 gi|297319255|gb|EFH49677.1| beta-xylosidase 3 [Arabidopsis lyrata subsp. lyrata]
          Length = 780

 Score =  753 bits (1943), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 389/792 (49%), Positives = 526/792 (66%), Gaps = 33/792 (4%)

Query: 4   VVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCD-PGRFSKLGLQMSSFLFCDSSLPYS 62
           V + LLCF L I+          +N  SSPVF CD  G  S  GL+     FC++ L   
Sbjct: 16  VSTLLLCFLLCIS--------EQSNAQSSPVFACDVTGNPSLAGLR-----FCNTGLNIK 62

Query: 63  IRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGA 122
            RV DLV R+TL+EK+  LG  A GV RLG+P Y+WWSEALHGVSNVG G+ F   +PGA
Sbjct: 63  SRVTDLVGRLTLEEKIGFLGSNAIGVSRLGIPAYKWWSEALHGVSNVGGGSSFSGQVPGA 122

Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE 182
           TSFP VILT ASFN SL++ IG+ VSTEARAMYN+G AGLT+WSPN+N+ RDPRWGR  E
Sbjct: 123 TSFPQVILTAASFNVSLFQAIGKVVSTEARAMYNVGSAGLTFWSPNVNIFRDPRWGRGQE 182

Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY 242
           TPGEDP +  +YAV YVRGLQ+ +G +         LKV++CCKHY AYDVDNWK V R+
Sbjct: 183 TPGEDPELSSKYAVAYVRGLQETDGGD------PNRLKVAACCKHYTAYDVDNWKDVHRF 236

Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
            F+A V +QDM +TF  PF+ CV +G+ +SVMCSYN+VNG P+CADP LL+  +RG+W L
Sbjct: 237 TFNAVVNQQDMADTFQPPFKSCVVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGQWKL 296

Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKE 362
           +GYIV+DCDS+ V+     +   + E+AVA+++ AGLDL+C  +   +   AV+ G V E
Sbjct: 297 NGYIVSDCDSVDVLYTKQHY-TKTPEEAVAKSILAGLDLNCDHFTGQYAMKAVKVGLVNE 355

Query: 363 TDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDENIELAAEAAREGIVLLKN 418
           T IDK++   +  LMRLGFFDG P+    Y  LG  D+C+  N ELA +AAR+GIVLLKN
Sbjct: 356 TAIDKAISNNFATLMRLGFFDGDPKKQQLYGGLGPNDVCTANNQELARDAARQGIVLLKN 415

Query: 419 DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDD 478
              +LPL+ + +KT+AV+GP+ANAT  MIGNY GIPC+Y +P+ G +   + TY+ GC +
Sbjct: 416 SAGSLPLSPSAIKTLAVIGPNANATETMIGNYNGIPCKYTTPLQGLAETVSSTYQLGC-N 474

Query: 479 VACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAK 538
           VAC +   + +A+  A +ADA +++ G D S+E E+LDR DL+LPG Q +L+ QVA+VAK
Sbjct: 475 VAC-AEPDLGSAAALAASADAVVLVMGADQSIEQENLDRLDLYLPGKQQELVTQVAKVAK 533

Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
           GPV+LVIMS G  DI FA+    I  I+W GYPGE GG AIADV+FG+ NP G LP+TWY
Sbjct: 534 GPVVLVIMSGGAFDITFAKNEEKITGIMWVGYPGEAGGLAIADVIFGRHNPSGNLPMTWY 593

Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
              YV+ +P+T+M +RP  S GYPGRTY+FY G T+Y FG GLSYT F + +L   K + 
Sbjct: 594 PQSYVEKVPMTNMNMRPDKSNGYPGRTYRFYTGETVYAFGDGLSYTNFNHQILKAPKLVS 653

Query: 659 VNLNKLQHCRNLNYTS-DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
           ++L++   CR+    S DA    C   +   L     FE ++  +NVG  +GS  V +++
Sbjct: 654 LDLDENHACRSSECQSVDAIGPHCDNAVGGGLN----FEVQLKVRNVGDREGSHTVFLFT 709

Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
            PP E+  +  K ++GF+++ +       I+F  + CK L++VD      +  G + + V
Sbjct: 710 TPP-EVHGSPRKHLLGFEKIRLGEKEETVIRFNVDVCKDLSVVDEIGKRKIALGHYLLHV 768

Query: 778 GNGGVSFPIHLN 789
           G+   S  I ++
Sbjct: 769 GSFKHSLTISVS 780


>gi|65736613|dbj|BAD98523.1| alpha-L-arabinofuranosidase / beta-D-xylosidase [Pyrus pyrifolia]
          Length = 774

 Score =  753 bits (1943), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 375/753 (49%), Positives = 503/753 (66%), Gaps = 26/753 (3%)

Query: 31  SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
           + P F CDP       L+     FC   +P  +RV+DL+ R+TL EK+  L + A  VPR
Sbjct: 28  ARPPFACDPRNPITRTLK-----FCRVRVPIHVRVQDLIGRLTLQEKIGLLVNNAIAVPR 82

Query: 91  LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
           LG+  YEWWSEALHGVSNVGPGT F   + GATSFP VI T ASFNESLW++IG+ VS E
Sbjct: 83  LGIQGYEWWSEALHGVSNVGPGTKFGTFL-GATSFPQVITTAASFNESLWEEIGRVVSDE 141

Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
           ARAMYN G AGLT+WSPN+N+ RDPRWGR  ETPGEDP +  +Y   YV+GLQ       
Sbjct: 142 ARAMYNGGAAGLTFWSPNVNIFRDPRWGRGQETPGEDPVLAAKYGARYVKGLQG------ 195

Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
             D     LKV++CCKHY AYD+DNW GVDR+HF+ARV++QD+E+T+  PF+ CV +G+ 
Sbjct: 196 --DGAGNRLKVAACCKHYTAYDLDNWNGVDRFHFNARVSKQDLEDTYNVPFKACVVDGNV 253

Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
           +SVMCSYN+VNG P+CADP LL  T+RG+W L+GYIV+DCDS+ V  DN  +   + E A
Sbjct: 254 ASVMCSYNQVNGKPTCADPDLLKGTIRGQWKLNGYIVSDCDSVGVYYDNQHY-TKTPEAA 312

Query: 331 VAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---Q 387
            A  +KAGLDLDCG +    T  A++ G+V E DI+ +L    TV MRLG FDG P   +
Sbjct: 313 AAYAIKAGLDLDCGPFLGIHTEAAIRTGQVNEIDINYALANTITVQMRLGMFDGEPSTQR 372

Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
           Y +LG  D+C   + ELA EAAR+GIVLL+N  N+LPL++ + +TVAV+GP+++ T  MI
Sbjct: 373 YGNLGLADVCKPSSNELALEAARQGIVLLENRGNSLPLSTIRHRTVAVIGPNSDVTETMI 432

Query: 448 GNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
           GNYAGI C Y +P+ G + Y    ++ GC DV C  N  I AA  AA+ ADAT+++ GLD
Sbjct: 433 GNYAGIACGYTTPLQGIARYTRTIHQAGCTDVHCNGNQLIGAAEVAARQADATVLVIGLD 492

Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
            S+EAE  DR  L LPG+Q +L+++VA  ++GP ILVIMS G +D+ FA+ +  I AI+W
Sbjct: 493 QSIEAEFRDRTGLLLPGHQQELVSRVARASRGPTILVIMSGGPIDVTFAKNDPCIGAIIW 552

Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
            GYPG+ GG AIADV+FG  NP G+LP+TWY  +YV  LP+T M +R   + GYPGRTY+
Sbjct: 553 VGYPGQAGGTAIADVLFGTTNPSGKLPMTWYPQNYVANLPMTDMAMRADPARGYPGRTYR 612

Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
           FY GP ++PFG GLSYT+F ++L      + V L  L   +N    S+       GV V+
Sbjct: 613 FYKGPVVFPFGMGLSYTRFSHSLAQGPTLVSVPLTSLVAAKNTTMLSNH------GVRVS 666

Query: 688 DLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKR 746
              CD    +F +D +N G+ DG+  ++V++  PA   A   KQ++GF +V + AG  +R
Sbjct: 667 HTNCDSLSLDFHIDIKNTGTMDGTHTLLVFATQPAGKWAPN-KQLVGFHKVHIVAGSERR 725

Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           ++   + CK L+IVD      +P G+H + +G+
Sbjct: 726 VRVGVHVCKHLSIVDKLGIRRIPLGQHKLEIGD 758


>gi|296083056|emb|CBI22460.3| unnamed protein product [Vitis vinifera]
          Length = 896

 Score =  752 bits (1941), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 376/735 (51%), Positives = 490/735 (66%), Gaps = 52/735 (7%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
           S F FC++SLPY  R  DLVSR+TL EK +QL + A G+ RLG+P YEWWSEALHGVSN 
Sbjct: 61  SQFPFCNTSLPYQDRASDLVSRLTLQEKAKQLINSATGISRLGVPDYEWWSEALHGVSNS 120

Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
           G G HF D IP  T FP VIL+ ASFNESLW  +GQ VSTE RAMYN+G+AGLTYWSPN+
Sbjct: 121 GIGVHFHDPIPAVTIFPAVILSAASFNESLWYTMGQVVSTEGRAMYNVGQAGLTYWSPNV 180

Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
           N+ RDPRWGR  ETPGEDP VV RYAVNYVRGLQ+V G E   +  +  LKVSSCCKHY 
Sbjct: 181 NIFRDPRWGRGQETPGEDPLVVSRYAVNYVRGLQEV-GKEG--NFAADRLKVSSCCKHYT 237

Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
           AYDVD WKGVDR+HFDA+VT QD+E+T+  PF+ CV+EG  SSVMCSYNRVNG+P+CA+P
Sbjct: 238 AYDVDKWKGVDRFHFDAKVTLQDLEDTYQPPFKSCVEEGHVSSVMCSYNRVNGVPTCANP 297

Query: 290 KLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
           +LL   +R +W L GYIV+DCDSI V  +   +  ++ EDAVA  LKAGL+L+CG Y  +
Sbjct: 298 ELLKGVIRDQWGLDGYIVSDCDSIMVYHERMNY-TETPEDAVALALKAGLNLNCGSYLGD 356

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK---QDICSDENIELAA 406
           +T NAV  GKVKE+ ++++L Y Y VLMRLGFFDG P  +  GK    D+C+ ++  LA 
Sbjct: 357 YTKNAVNLGKVKESIVNQALIYNYIVLMRLGFFDGDPTMLPFGKMGPSDVCTVDHQLLAL 416

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
           +AA++GIVLL N+   LPL+    KT+AV+GP+A+AT  M+ NYAG+PCRY SP+ G   
Sbjct: 417 DAAKQGIVLLHNN-GALPLSPNTTKTLAVIGPNADATNTMLSNYAGVPCRYTSPLQGLQK 475

Query: 467 YAN-VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
           Y + V+Y+ GC +V+C     I  A+  A  ADAT+++ GLDL +EAE LDR +L LPG+
Sbjct: 476 YVSAVSYEKGCANVSCSEETLIEGAASIASMADATVVVVGLDLFIEAEDLDRVNLTLPGF 535

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q +L+ + A+ A G VILV+MSAG VDI+F +  + I  ILW GYPG+ GG AI+ V+FG
Sbjct: 536 QEKLVMEAAKAANGTVILVVMSAGPVDISFVKNVSKIGGILWVGYPGQAGGDAISQVIFG 595

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
            +NPGGR P TWY  +YV  +P+T M +RP  +  +PGRTY+FY G +LY FG+GLSY+ 
Sbjct: 596 DYNPGGRSPFTWYPQEYVDQVPMTDMNMRPNATSNFPGRTYRFYTGKSLYQFGHGLSYST 655

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
           F  NL +    I V                                          +N G
Sbjct: 656 FYKNLSNIDIVIGV------------------------------------------KNAG 673

Query: 706 STDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAA 764
             DG+ VV+ + KPP + +      +++GF+RV V+ G+ + +    + C  ++ VD   
Sbjct: 674 EIDGTHVVLAFWKPPRSGVRGAPGVELVGFERVEVKRGKTEMVGMRLDVCGKISNVDEEG 733

Query: 765 NTLLPAGEHTIFVGN 779
              L  G HT+ VG+
Sbjct: 734 KRKLVMGMHTLVVGS 748


>gi|297795695|ref|XP_002865732.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
 gi|297311567|gb|EFH41991.1| beta-xylosidase 1 [Arabidopsis lyrata subsp. lyrata]
          Length = 774

 Score =  751 bits (1938), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/777 (49%), Positives = 520/777 (66%), Gaps = 26/777 (3%)

Query: 8   LLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKD 67
           LL  +  + +LVF    V ++ S  P+F CDP      GL   +  FC  ++P  +RV+D
Sbjct: 8   LLIGNKVVVILVFLLCLVHSSESLRPLFACDPAN----GL-TRTLRFCRVNVPIHVRVQD 62

Query: 68  LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
           L+ R+TL EK++ L + A  VPRLG+  YEWWSEALHGVS+VGPG+ F    PGATSFP 
Sbjct: 63  LIGRLTLQEKIRNLVNNAAAVPRLGIGGYEWWSEALHGVSDVGPGSKFGGAFPGATSFPQ 122

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
           VI T ASFN+SLW++IG+ VS EARAMYN G AGLTYWSPN+N+ RDPRWGR  ETPGED
Sbjct: 123 VITTAASFNQSLWEEIGRVVSDEARAMYNGGVAGLTYWSPNVNILRDPRWGRGQETPGED 182

Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
           P V  +YA +YVRGLQ        T   +R LKV++CCKHY AYD+DNW GVDR+HF+A+
Sbjct: 183 PIVAAKYAASYVRGLQ-------GTAAGNR-LKVAACCKHYTAYDLDNWNGVDRFHFNAK 234

Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
           VT+QD+E+T+  PF+ CV EG  +SVMCSYN+VNG P+CAD  LL  T+RG+W L+GYIV
Sbjct: 235 VTQQDLEDTYNVPFKSCVYEGKVASVMCSYNQVNGKPTCADENLLKNTIRGKWRLNGYIV 294

Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDK 367
           +DCDS+ V   N +    + E+A A ++KAGLDLDCG +   FT  AV++G + E DI+ 
Sbjct: 295 SDCDSVDVFF-NQQHYTSTPEEAAAASIKAGLDLDCGPFLAIFTEGAVKKGLLTENDINL 353

Query: 368 SLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLN 426
           +L    TV MRLG FDG+   Y +LG +D+CS  +  LA EAA +GIVLLKN   +LPL+
Sbjct: 354 ALANTLTVQMRLGMFDGNLGPYANLGPRDVCSLAHKHLALEAAHQGIVLLKNSGRSLPLS 413

Query: 427 SAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNS 486
             + +TVAV+GP+++ T  MIGNYAG  C Y +P+ G S YA   ++ GC  VACK N  
Sbjct: 414 PRRHRTVAVIGPNSDVTETMIGNYAGKACAYTTPLQGISRYARTLHQAGCAGVACKGNQG 473

Query: 487 IFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
             AA  AA+ ADAT+++ GLD S+EAE+ DR  L LPGYQ  L+ +VA+ ++GPVILV+M
Sbjct: 474 FGAAEAAAREADATVLVMGLDQSIEAETRDRTGLLLPGYQQDLVTRVAQASRGPVILVLM 533

Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           S G +D+ FA+ +  + AI+WAGYPG+ GG AIA+++FG  NPGG+LP+TWY  DYV  +
Sbjct: 534 SGGPIDVTFAKNDPRVAAIIWAGYPGQAGGAAIANIIFGAANPGGKLPMTWYPQDYVAKV 593

Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
           P+T M +R   S  YPGRTY+FY GP ++PFG+GLSYT F  N L+ +   Q++++    
Sbjct: 594 PMTVMAMRA--SGNYPGRTYRFYKGPVVFPFGFGLSYTTFT-NSLAKSPLAQLSVS---- 646

Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDY--FEFKVDFQNVGSTDGSDVVIVYSKPPAE-I 723
             NLN  +    +    + V+   C+ +      V+  N G  DG+  V V+++PP   I
Sbjct: 647 LSNLNSANAILNSTSHSIKVSHTNCNSFPKMPLHVEVSNTGEFDGTHTVFVFAEPPKNGI 706

Query: 724 AATYI-KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
               + KQ+I F++V V AG  + ++   +ACK L +VD      +P G+H + +G+
Sbjct: 707 KGLGVNKQLIAFEKVHVMAGAKQTVRVDVDACKHLGVVDEYGKRRIPMGKHKLHIGD 763


>gi|32481073|gb|AAP83934.1| auxin-induced beta-glucosidase [Chenopodium rubrum]
          Length = 767

 Score =  750 bits (1937), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/779 (49%), Positives = 516/779 (66%), Gaps = 35/779 (4%)

Query: 6   SSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRV 65
           ++  CF +   LL        A   ++P+  CDP    K GL   +  FC  +LP   RV
Sbjct: 5   NNFFCFLVLFILL-------SAEARAAPL-ACDP----KSGL-TRALRFCRVNLPIRARV 51

Query: 66  KDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
           +DL+ R+ L EKV+ L + A  VPRLG+  YEWWSEALHGVSNVGPGT F    P ATSF
Sbjct: 52  QDLIGRLNLQEKVKLLVNNAAPVPRLGISGYEWWSEALHGVSNVGPGTKFRGAFPAATSF 111

Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPG 185
           P VI T ASFN SLW+ IGQ VS EARAMYN G AGLTYWSPN+N+ RDPRWGR  ETPG
Sbjct: 112 PQVITTAASFNASLWEAIGQVVSDEARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPG 171

Query: 186 EDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFD 245
           EDP +  +YA +YVRGLQ +         N   LKV++CCKHY AYD+DNW  VDR+HF+
Sbjct: 172 EDPTLASQYAASYVRGLQGI--------YNKNRLKVAACCKHYTAYDLDNWNAVDRFHFN 223

Query: 246 ARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGY 305
           A+V++QD+E+T+  PF+ CV+EG  +SVMCSYN+VNG P+CADP LL  T+RG+W L+GY
Sbjct: 224 AKVSKQDLEDTYNVPFKGCVQEGRVASVMCSYNQVNGKPTCADPDLLRNTIRGQWRLNGY 283

Query: 306 IVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDI 365
           IV+DCDS+ V+ D+  +   + E+A A T+KAGLDLDCG +    T  AV++G + E D+
Sbjct: 284 IVSDCDSVGVLYDDQHY-TRTPEEAAADTIKAGLDLDCGPFLAVHTEAAVKRGLLTEADV 342

Query: 366 DKSLKYLYTVLMRLGFFDG---SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNT 422
           +++L   +TV MRLG FDG   +  +  LG +D+CS  + +LA +AAR+GIVLL+N   +
Sbjct: 343 NQALTNTFTVQMRLGMFDGEAAAQPFGHLGPKDVCSPAHQDLALQAARQGIVLLQNRGRS 402

Query: 423 LPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACK 482
           LPL++A+ + +AV+GP+A+ATV MIGNYAG+ C Y SP+ G + YA   ++ GC  VAC 
Sbjct: 403 LPLSTARHRNIAVIGPNADATVTMIGNYAGVACGYTSPLQGIARYAKTVHQAGCIGVACT 462

Query: 483 SNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVI 542
           SN    AA+ AA  ADAT+++ GLD S+EAE  DR  + LPG+Q +L+++VA  ++GP I
Sbjct: 463 SNQQFGAATAAAAHADATVLVMGLDQSIEAEFRDRASVLLPGHQQELVSKVALASRGPTI 522

Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
           LV+M  G VD+ FA+ +  I AILW GYPG+ GG AIADV+FG  NPGG+LP TWY   Y
Sbjct: 523 LVLMCGGPVDVTFAKNDPKISAILWVGYPGQAGGTAIADVLFGTTNPGGKLPNTWYPQSY 582

Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL- 661
           V  +P+T + +R   S GYPGRTY+FY GP ++PFG+GLSYT+F  +L      + V L 
Sbjct: 583 VAKVPMTDLAMRANPSNGYPGRTYRFYKGPVVFPFGFGLSYTRFTQSLAHAPTKVMVPLA 642

Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP 720
           N+  +    ++  DA K       V    CD+      +D +N G  DGS  ++V+S PP
Sbjct: 643 NQFTNSNITSFNKDALK-------VLHTNCDNIPLSLHIDVKNKGKVDGSHTILVFSTPP 695

Query: 721 AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
               ++  KQ+IGF+RV V AG  +R++   + C  L+  D      +P GEHT+ +G+
Sbjct: 696 KGTKSSE-KQLIGFKRVHVFAGSKQRVRMNIHVCNHLSRADEFGVRRIPIGEHTLHIGD 753


>gi|15242492|ref|NP_196535.1| beta-xylosidase 3 [Arabidopsis thaliana]
 gi|75264323|sp|Q9LXD6.1|BXL3_ARATH RecName: Full=Beta-D-xylosidase 3; Short=AtBXL3; AltName:
           Full=Alpha-L-arabinofuranosidase; Flags: Precursor
 gi|7671416|emb|CAB89357.1| beta-xylosidase-like protein [Arabidopsis thaliana]
 gi|9759004|dbj|BAB09531.1| beta-xylosidase [Arabidopsis thaliana]
 gi|15450735|gb|AAK96639.1| AT5g09730/F17I14_80 [Arabidopsis thaliana]
 gi|332004056|gb|AED91439.1| beta-xylosidase 3 [Arabidopsis thaliana]
          Length = 773

 Score =  746 bits (1926), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 372/775 (48%), Positives = 516/775 (66%), Gaps = 25/775 (3%)

Query: 11  FSLSIALLVFSTNAVD-ANGSSSPVFVCD-PGRFSKLGLQMSSFLFCDSSLPYSIRVKDL 68
           FS+S   L F     + +N  SSPVF CD  G  S  GL+     FC++ L    RV DL
Sbjct: 9   FSVSTLFLCFIVCISEQSNNQSSPVFACDVTGNPSLAGLR-----FCNAGLSIKARVTDL 63

Query: 69  VSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTV 128
           V R+TL+EK+  L   A GV RLG+P Y+WWSEALHGVSNVG G+ F   +PGATSFP V
Sbjct: 64  VGRLTLEEKIGFLTSKAIGVSRLGIPSYKWWSEALHGVSNVGGGSRFTGQVPGATSFPQV 123

Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDP 188
           ILT ASFN SL++ IG+ VSTEARAMYN+G AGLT+WSPN+N+ RDPRWGR  ETPGEDP
Sbjct: 124 ILTAASFNVSLFQAIGKVVSTEARAMYNVGSAGLTFWSPNVNIFRDPRWGRGQETPGEDP 183

Query: 189 FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARV 248
            +  +YAV YV+GLQ+ +G +         LKV++CCKHY AYD+DNW+ V+R  F+A V
Sbjct: 184 TLSSKYAVAYVKGLQETDGGD------PNRLKVAACCKHYTAYDIDNWRNVNRLTFNAVV 237

Query: 249 TEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVA 308
            +QD+ +TF  PF+ CV +G  +SVMCSYN+VNG P+CADP LL+  +RG+W L+GYIV+
Sbjct: 238 NQQDLADTFQPPFKSCVVDGHVASVMCSYNQVNGKPTCADPDLLSGVIRGQWQLNGYIVS 297

Query: 309 DCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKS 368
           DCDS+ V+     + A + E+AVA++L AGLDL+C  +       AV+ G V ET IDK+
Sbjct: 298 DCDSVDVLFRKQHY-AKTPEEAVAKSLLAGLDLNCDHFNGQHAMGAVKAGLVNETAIDKA 356

Query: 369 LKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
           +   +  LMRLGFFDG P+   Y  LG +D+C+ +N ELA + AR+GIVLLKN   +LPL
Sbjct: 357 ISNNFATLMRLGFFDGDPKKQLYGGLGPKDVCTADNQELARDGARQGIVLLKNSAGSLPL 416

Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN 485
           + + +KT+AV+GP+ANAT  MIGNY G+PC+Y +P+ G +   + TY+ GC +VAC  + 
Sbjct: 417 SPSAIKTLAVIGPNANATETMIGNYHGVPCKYTTPLQGLAETVSSTYQLGC-NVAC-VDA 474

Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
            I +A + A +ADA +++ G D S+E E  DR DL+LPG Q +L+ +VA  A+GPV+LVI
Sbjct: 475 DIGSAVDLAASADAVVLVVGADQSIEREGHDRVDLYLPGKQQELVTRVAMAARGPVVLVI 534

Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
           MS GG DI FA+ +  I +I+W GYPGE GG AIADV+FG+ NP G LP+TWY   YV+ 
Sbjct: 535 MSGGGFDITFAKNDKKITSIMWVGYPGEAGGLAIADVIFGRHNPSGNLPMTWYPQSYVEK 594

Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQ 665
           +P+++M +RP  S GYPGR+Y+FY G T+Y F   L+YT+F + L+   + + ++L++  
Sbjct: 595 VPMSNMNMRPDKSKGYPGRSYRFYTGETVYAFADALTYTKFDHQLIKAPRLVSLSLDENH 654

Query: 666 HCRNLNYTS-DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
            CR+    S DA    C     N +     FE  ++ +N G   GS  V +++  P ++ 
Sbjct: 655 PCRSSECQSLDAIGPHCE----NAVEGGSDFEVHLNVKNTGDRAGSHTVFLFTTSP-QVH 709

Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            + IKQ++GF+++ +       ++F  N CK L++VD      +  G H + VG+
Sbjct: 710 GSPIKQLLGFEKIRLGKSEEAVVRFNVNVCKDLSVVDETGKRKIALGHHLLHVGS 764


>gi|449436749|ref|XP_004136155.1| PREDICTED: probable beta-D-xylosidase 2-like [Cucumis sativus]
          Length = 772

 Score =  746 bits (1925), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 373/771 (48%), Positives = 509/771 (66%), Gaps = 30/771 (3%)

Query: 15  IALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTL 74
           I +L+  +      G +   F CDP   +     +S + FC  +LP   RVKDL+ R+TL
Sbjct: 9   IPILIILSAIFRHGGGAREPFACDPKDAA-----LSRYPFCRVALPIPERVKDLIGRLTL 63

Query: 75  DEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTAS 134
            EKV+ L + A  VPRLG+  YEWWSEALHGVSNVGPGT F    PGATSFP VI T AS
Sbjct: 64  QEKVRLLVNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTEFGGDFPGATSFPQVITTVAS 123

Query: 135 FNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRY 194
           FN SLW+ IG+ VS EARAMYN G AGLTYWSPN+N+ RDPRWGR  ETPGEDP V G Y
Sbjct: 124 FNVSLWEAIGRVVSDEARAMYNGGAAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEY 183

Query: 195 AVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
           A  Y++GLQ  +G           LKV++CCKH+ AYD+DNW G DR+HF+A+VT QDM 
Sbjct: 184 AARYIKGLQGNDGDR---------LKVAACCKHFTAYDLDNWNGTDRFHFNAKVTRQDMV 234

Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
           +TF  PF  CVKEG  +SVMCSYN+VNG+P+CADP LL  T+R +W L+GYIV+DCDS+ 
Sbjct: 235 DTFEVPFRKCVKEGKVASVMCSYNQVNGVPTCADPNLLKGTIRNQWGLNGYIVSDCDSVG 294

Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYT 374
           V  DN  + + + E+A A  +KAGLDLDCG +    T +AV++G + +T I+ +L    T
Sbjct: 295 VFYDNQHYTS-TAEEAAADAIKAGLDLDCGPFLAVHTEDAVKKGLLTQTHINNALANTIT 353

Query: 375 VLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK 431
           V MRLG FDG+P    Y  LG +++CS  + +LA +AAR+GIVLLKN    LPL++   +
Sbjct: 354 VQMRLGMFDGAPSSHAYGKLGPKNVCSPSHQQLALDAARQGIVLLKNRLPGLPLSADHHR 413

Query: 432 TVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAAS 491
           TVAV+GP+++  V MIGNYAG+ C Y++P+ G   Y  V ++ GCD+VAC ++ S   A 
Sbjct: 414 TVAVIGPNSDVNVTMIGNYAGVACGYVTPLEGIKRYTTVVHRKGCDNVACATDYSFTDAL 473

Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
            AA TADAT+++ GLD SVEAE+ DR+ L LPG Q +L+ +VA  ++GP ++++MS G +
Sbjct: 474 AAASTADATVLVMGLDQSVEAETKDRDGLLLPGRQQELVLKVAAASRGPTVVILMSGGPI 533

Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSM 611
           D++FA+ +  I AILW GYPG+ GG AIADV+FG  NPGG+LP+TWY   Y+  LP+T+M
Sbjct: 534 DVSFADNDPRISAILWVGYPGQAGGAAIADVLFGTTNPGGKLPMTWYPQSYLSNLPMTNM 593

Query: 612 PLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN 671
            +R   S  YPGRTY+FY GP +Y FG+GLSYT F + ++     + ++L+  +      
Sbjct: 594 AMRSTSS--YPGRTYRFYAGPVVYEFGHGLSYTNFIHTIVKAPTIVSISLSGHRQ----- 646

Query: 672 YTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI-- 728
            T  AS      + V   +C        VD +N G  DG   ++V+S PPA   AT++  
Sbjct: 647 -THSASTLSSKAIRVTHAKCQKLSLVIHVDVENKGDRDGFHTMLVFSTPPAN-GATWVPR 704

Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           KQ++ F+++ + +   +R++   + CK L++VD      +P G+H I +GN
Sbjct: 705 KQLVAFEKLHLASREKRRLQVHVHVCKYLSVVDKLGVRRIPLGDHYIHIGN 755


>gi|225431898|ref|XP_002276351.1| PREDICTED: beta-D-xylosidase 1-like [Vitis vinifera]
          Length = 770

 Score =  744 bits (1922), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 370/749 (49%), Positives = 513/749 (68%), Gaps = 26/749 (3%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
           F CDP    + G+   +  FC  SLP   R +DLV R+TL EK++ L + A  VPRLG+ 
Sbjct: 27  FACDP----RNGV-TRNLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIK 81

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            YEWWSEALHGVSNVGPGT F    PGATSFP VI T ASFN SLW++IG+ VS EARAM
Sbjct: 82  GYEWWSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAM 141

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
           YN G AGLTYWSPN+N+ RDPRWGR  ETPGEDP V  +YA  YVRGLQ      NA D 
Sbjct: 142 YNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQG-----NARDR 196

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
               LKV++CCKHY AYD+D+W G+DR+HF+ARV++QD+E+T+  PF+ CV EG+ +SVM
Sbjct: 197 ----LKVAACCKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVM 252

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           CSYN+VNG P+CADP LL  T+RGEW L+GYIV+DCDS+ V  D   + A + E+A A  
Sbjct: 253 CSYNQVNGKPTCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHYTA-TPEEAAAVA 311

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
           +KAGLDLDCG +    T  A++ GK+ E D++ +L    +V MRLG FDG P    Y +L
Sbjct: 312 IKAGLDLDCGPFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNL 371

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
           G +D+C+  + +LA EAAR+GIVL++N    LPL++++ +T+AV+GP+++ T  MIGNYA
Sbjct: 372 GPRDVCTPAHQQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYA 431

Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
           G+ C Y +P+ G   YA   ++ GC  VAC+ +    AA  AA+ ADAT+++ GLD S+E
Sbjct: 432 GVACGYTTPLQGIGRYARTIHQAGCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIE 491

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
           AE  DR D+ LPG Q +L+++VA  ++GP +LV+MS G +D++FA+ +  I AI+W GYP
Sbjct: 492 AEFRDRVDILLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYP 551

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+ GG AIADV+FG+ NPGG+LP+TWY   Y++  P+T+M +R + S GYPGRTY+FYNG
Sbjct: 552 GQAGGTAIADVLFGRTNPGGKLPVTWYPQSYLRKAPMTNMAMRAIPSRGYPGRTYRFYNG 611

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
           P ++PFG+GLSY+ F ++L     T+ V+L  LQ  +N    S  +      + ++   C
Sbjct: 612 PVVFPFGHGLSYSTFAHSLAQAPTTVSVSLASLQTIKNSTIVSSGA------IRISHANC 665

Query: 692 DDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
           +     F +D +N G+ DGS  ++++S PP    +   K+++ F++V V AG  +R++F 
Sbjct: 666 NTQPLGFHIDVKNTGTMDGSHTLLLFSTPPPGTWSPN-KRLLAFEKVHVGAGSQERVRFD 724

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            + CK L++VD+     +P GEH   +G+
Sbjct: 725 VHVCKHLSVVDHFGIHRIPMGEHHFHIGD 753


>gi|449505346|ref|XP_004162442.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 2-like
           [Cucumis sativus]
          Length = 772

 Score =  743 bits (1919), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 372/771 (48%), Positives = 508/771 (65%), Gaps = 30/771 (3%)

Query: 15  IALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTL 74
           I +L+  +      G +   F CDP   +     +S + FC  +LP   RVKDL+ R+TL
Sbjct: 9   IPILIILSAIFRHGGGAREPFACDPKDAA-----LSRYPFCRVALPIPERVKDLIGRLTL 63

Query: 75  DEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTAS 134
            EKV+ L + A  VPRLG+  YEWWSEALHGVSNVGPGT F    PGATSFP VI T AS
Sbjct: 64  QEKVRLLVNNAAAVPRLGIKGYEWWSEALHGVSNVGPGTEFGGDFPGATSFPQVITTVAS 123

Query: 135 FNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRY 194
           FN SLW+ IG+ VS EARAMYN G AGLTYWSPN+N+ RDPRWGR  ETPGEDP V G Y
Sbjct: 124 FNVSLWEAIGRVVSDEARAMYNGGAAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGEY 183

Query: 195 AVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
           A  Y++GLQ  +G           LKV++CCKH+ AYD+DNW G DR+HF+A+VT QDM 
Sbjct: 184 AARYIKGLQGNDGDR---------LKVAACCKHFTAYDLDNWNGTDRFHFNAKVTRQDMV 234

Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
           +TF  PF  CVKEG  +SVMCSYN+VNG+P+CADP LL  T+R +W L+GYIV+DCDS+ 
Sbjct: 235 DTFEVPFRKCVKEGKVASVMCSYNQVNGVPTCADPNLLKGTIRNQWGLNGYIVSDCDSVG 294

Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYT 374
           V  DN  + + + E+A A  +KAGLDLDCG +    T +AV++  + +T I+ +L    T
Sbjct: 295 VFYDNQHYTS-TAEEAAADAIKAGLDLDCGPFLAVHTEDAVKKXLLTQTHINNALANTIT 353

Query: 375 VLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK 431
           V MRLG FDG+P    Y  LG +++CS  + +LA +AAR+GIVLLKN    LPL++   +
Sbjct: 354 VQMRLGMFDGAPSSHAYGKLGPKNVCSPSHQQLALDAARQGIVLLKNRLPGLPLSAXHHR 413

Query: 432 TVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAAS 491
           TVAV+GP+++  V MIGNYAG+ C Y++P+ G   Y  V ++ GCD+VAC ++ S   A 
Sbjct: 414 TVAVIGPNSDVNVTMIGNYAGVACGYVTPLEGIKRYTTVVHRKGCDNVACATDYSFTDAL 473

Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
            AA TADAT+++ GLD SVEAE+ DR+ L LPG Q +L+ +VA  ++GP ++++MS G +
Sbjct: 474 AAASTADATVLVMGLDQSVEAETKDRDGLLLPGRQQELVLKVAAASRGPTVVILMSGGPI 533

Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSM 611
           D++FA+ +  I AILW GYPG+ GG AIADV+FG  NPGG+LP+TWY   Y+  LP+T+M
Sbjct: 534 DVSFADNDPRISAILWVGYPGQAGGAAIADVLFGTTNPGGKLPMTWYPQSYLSNLPMTNM 593

Query: 612 PLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN 671
            +R   S  YPGRTY+FY GP +Y FG+GLSYT F + ++     + ++L+  +      
Sbjct: 594 AMRSTSS--YPGRTYRFYAGPVVYEFGHGLSYTNFIHTIVKAPTIVSISLSGHRQ----- 646

Query: 672 YTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI-- 728
            T  AS      + V   +C        VD +N G  DG   ++V+S PPA   AT++  
Sbjct: 647 -THSASTLSSKAIRVTHAKCQKLSLVIHVDVENKGDRDGFHTMLVFSTPPAN-GATWVPR 704

Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           KQ++ F+++ + +   +R++   + CK L++VD      +P G+H I +GN
Sbjct: 705 KQLVAFEKLHLASREKRRLQVHVHVCKYLSVVDKLGVRRIPLGDHYIHIGN 755


>gi|357166259|ref|XP_003580652.1| PREDICTED: beta-D-xylosidase 4-like [Brachypodium distachyon]
          Length = 774

 Score =  736 bits (1901), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 368/756 (48%), Positives = 500/756 (66%), Gaps = 29/756 (3%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           + +PVF CD    +  G     + FCD +   S R  DLVSR+TL +KV  L +    + 
Sbjct: 33  AQTPVFACDAANSTVAG-----YAFCDRAKSASARAADLVSRLTLADKVGFLVNKQPALA 87

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P YEWWSEALHGVS VGPGT F  ++PGATSFP  ILT ASFN SL++ IG+ VS 
Sbjct: 88  RLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSN 147

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           EARAM+N+G AGLT+WSPNIN+ RDPRWGR  ETPGEDP +  RYAV YV GLQD     
Sbjct: 148 EARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASRYAVGYVSGLQDAGADA 207

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
           +       PLKV++CCKHY AYDVDNWKGV+RY FDA+V++QD+++TF  PF+ CV +G 
Sbjct: 208 DG------PLKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVIDGK 261

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
            +SVMCSYN+VNG P+CAD  LL+  +RG+W L+GYIV+DCDS+ V+     +   + E+
Sbjct: 262 VASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYSQQHY-TKTPEE 320

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
           A A T+K+GLDL+CG +    T  AVQ G + E+D+D+++   + +LMRLGFFDG P+  
Sbjct: 321 AAAITIKSGLDLNCGDFLAKHTVAAVQAGNLSESDVDRAITNNFIMLMRLGFFDGDPRKL 380

Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
            Y SLG +D+C+  N ELA E AR+GIVLLKND   LPL++  +K++AV+GP+ANA+  M
Sbjct: 381 AYGSLGPKDVCTSSNQELARETARQGIVLLKND-GALPLSAKSIKSMAVIGPNANASFTM 439

Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAG 505
           IGNY G PC+Y +P+ G        Y+ GC +V C  N+  + AA+ AA +AD T+++ G
Sbjct: 440 IGNYEGTPCKYTTPLHGLGNNVATVYQPGCSNVGCSGNSLQLSAATAAAASADVTVLVVG 499

Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
            D S+E E+LDR  L LPG Q  LI+ VA  +KG VILV+MS G  DI+FA+ +  I AI
Sbjct: 500 ADQSIEREALDRTSLLLPGQQPDLISAVANASKGHVILVVMSGGPFDISFAKASDKISAI 559

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           LW GYPGE GG AIAD++FGK+NP GRLP+TWY   +   +P+T M +RP +S GYPGRT
Sbjct: 560 LWVGYPGEAGGAAIADIIFGKYNPSGRLPVTWYPASFADKVPMTDMRMRPDNSTGYPGRT 619

Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQHCRNLNYTS-DASKTRCPG 683
           Y+FY G T++ FG GLSYT   +NL++   + + + L +   C      S +A+   C G
Sbjct: 620 YRFYTGETVFAFGDGLSYTTMSHNLVAAPPSEVSMQLAEGHACHTKECASVEAAGDHCEG 679

Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
           +          FE ++   N G   G+  V+++S PPA +     K ++GF+++ +  G+
Sbjct: 680 MA---------FEVRLRVHNTGEMAGAHTVLLFSSPPA-VHNAPAKHLLGFEKLNLEPGQ 729

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
                F  + CK L++VD   N  +  G HT+ VG+
Sbjct: 730 AGVAAFKVDVCKDLSVVDELGNRKVALGGHTLHVGD 765


>gi|255545664|ref|XP_002513892.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
 gi|223546978|gb|EEF48475.1| Periplasmic beta-glucosidase precursor, putative [Ricinus communis]
          Length = 774

 Score =  736 bits (1901), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 369/771 (47%), Positives = 508/771 (65%), Gaps = 34/771 (4%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           S+ P F CDP   S      SSFLFC +SLP S RV+DLVSR+TLDEK+ QL   A  +P
Sbjct: 24  STEPPFSCDPSNPS-----TSSFLFCKTSLPISQRVRDLVSRLTLDEKISQLVSSAPSIP 78

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P YEWWSEALHGV+NVG G HF+  I  ATSFP VILT ASF+   W +IGQ +  
Sbjct: 79  RLGIPAYEWWSEALHGVANVGRGIHFEGAIKAATSFPQVILTAASFDAYQWYRIGQVIGR 138

Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ----- 203
           EARA+YN G+A G+T+W+PNIN+ RDPRWGR  ETPGEDP V G+YAV+YVRG+Q     
Sbjct: 139 EARAVYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGKYAVSYVRGVQGDSFQ 198

Query: 204 --DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
              ++GH          L+ S+CCKH+ AYD+DNWKGV+R+ FDARVT QD+ +T+  PF
Sbjct: 199 GGKLKGH----------LQASACCKHFTAYDLDNWKGVNRFVFDARVTMQDLADTYQPPF 248

Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHK 321
           + CV++G AS +MC+YNRVNGIPSCAD  LL++T RG+WD HGYI +DCD++ ++ DN  
Sbjct: 249 QSCVQQGKASGIMCAYNRVNGIPSCADFNLLSRTARGQWDFHGYIASDCDAVSIIYDNQG 308

Query: 322 FLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
           + A S EDAV   LKAG+D++CG Y    T  AV+Q K+ E  ID++L  L++V MRLG 
Sbjct: 309 Y-AKSPEDAVVDVLKAGMDVNCGSYLQKHTKAAVEQKKLPEASIDRALHNLFSVRMRLGL 367

Query: 382 FDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGP 438
           F+G+P    + ++G   +CS E+  LA EAAR GIVLLKN    LPL  +K  ++AV+GP
Sbjct: 368 FNGNPTEQPFSNIGPDQVCSQEHQILALEAARNGIVLLKNSARLLPLQKSKTVSLAVIGP 427

Query: 439 HANATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTA 497
           +AN+   ++GNYAG PC+ ++P+     Y  N  Y +GCD V C S+ SI  A + AK  
Sbjct: 428 NANSVQTLLGNYAGPPCKTVTPLQALQYYVKNTIYYSGCDTVKC-SSASIDKAVDIAKGV 486

Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
           D  +++ GLD + E E LDR DL LPG Q +LI  VA+ AK P++LV++S G VDI+FA+
Sbjct: 487 DRVVMIMGLDQTQEREELDRLDLVLPGKQQELITNVAKSAKNPIVLVLLSGGPVDISFAK 546

Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD 617
            + NI +ILWAGYPGE GG A+A+++FG  NPGG+LP+TWY  ++V+ +P+T M +RP  
Sbjct: 547 YDENIGSILWAGYPGEAGGIALAEIIFGDHNPGGKLPMTWYPQEFVK-VPMTDMRMRPDP 605

Query: 618 SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDAS 677
           S GYPGRTY+FY G  ++ FGYGLSY+++ Y L   ++T ++ LN+    R ++  SD  
Sbjct: 606 SSGYPGRTYRFYKGRNVFEFGYGLSYSKYSYELKYVSQT-KLYLNQSSTMRIID-NSDPV 663

Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
           +      L  +   +  F  KV  +N G   G   V+++++          +Q+IGF+ V
Sbjct: 664 RATLVAQLGAEFCKESKFSVKVGVENQGEMAGKHPVLLFARHARHGNGRPRRQLIGFKSV 723

Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
            + AG    I+F  + C+  +  +     ++  G H + V  GG  +PI +
Sbjct: 724 ILNAGEKAEIEFELSPCEHFSRANEDGLRVMEEGTHFLMV--GGDKYPISV 772


>gi|242077366|ref|XP_002448619.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
 gi|241939802|gb|EES12947.1| hypothetical protein SORBIDRAFT_06g030270 [Sorghum bicolor]
          Length = 767

 Score =  735 bits (1898), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 370/755 (49%), Positives = 502/755 (66%), Gaps = 30/755 (3%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           + +PVF CD    +     ++S+ FC+ S   S R  DLVSR+TL EKV  L D    +P
Sbjct: 29  AQTPVFACDASNAT-----LASYGFCNRSASASARAADLVSRLTLAEKVGFLVDKQAALP 83

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P YEWWSEALHGVS VGPGT F  ++P ATSFP  ILT ASFN +L++ IG+ VS 
Sbjct: 84  RLGIPLYEWWSEALHGVSYVGPGTRFSSLVPAATSFPQPILTAASFNATLFRAIGEVVSN 143

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           EARAM+N+G AGLT+WSPNIN+ RDPRWGR  ETPGEDP +  +YAV YV GLQD     
Sbjct: 144 EARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQDAGS-- 201

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
                 S  LKV++CCKHY AYDVDNWKGV+RY F+A V++QD+++TF  PF+ CV +G+
Sbjct: 202 -----GSGSLKVAACCKHYTAYDVDNWKGVERYTFNAVVSQQDLDDTFQPPFKSCVVDGN 256

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
            +SVMCSYN+VNG P+CAD  LL+  +RG+W L+GYI +DCDS+ V+ +N  +   + ED
Sbjct: 257 VASVMCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHY-TKTPED 315

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
           A A ++KAGLDL+CG +    T  AVQ GK+ E+D+D+++   +  LMRLGFFDG P+  
Sbjct: 316 AAAISIKAGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFITLMRLGFFDGDPRKL 375

Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
            + +LG  D+C+  N ELA EAAR+GIVLLKN    LPL+++ +K++AV+GP+ANA+  M
Sbjct: 376 PFGNLGPSDVCTSSNQELAREAARQGIVLLKN-SGALPLSASSIKSLAVIGPNANASFTM 434

Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAG 505
           IGNY G PC+Y +P+ G        Y+ GC +V C  N+  + AA++AA +AD T+++ G
Sbjct: 435 IGNYEGTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVG 494

Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
            D S+E ESLDR  L LPG Q QL++ VA  ++GP ILVIMS G  DI+FA+++  I AI
Sbjct: 495 ADQSIERESLDRTSLLLPGQQPQLVSAVANASRGPCILVIMSGGPFDISFAKSSDKIAAI 554

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           LW GYPGE GG AIADV+FG  NP GRLP+TWY   + + +P+  M +RP  S GYPGRT
Sbjct: 555 LWVGYPGEAGGAAIADVLFGHHNPSGRLPVTWYPESFTK-VPMIDMRMRPDASTGYPGRT 613

Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVL 685
           Y+FY G T+Y FG GLSYT F ++L+S  K + + L +   C            +CP V 
Sbjct: 614 YRFYTGDTVYAFGDGLSYTSFAHHLVSAPKQVALQLAEGHTCLT---------EQCPSVE 664

Query: 686 VNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
                C+   F+  +  +N G   G+  V ++S PPA +     K ++GF++V +  G+ 
Sbjct: 665 AEGAHCEGLAFDVHLRVRNAGDMSGAHTVFLFSSPPA-VHNAPAKHLLGFEKVSLEPGQA 723

Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             + F  + CK L++VD   N  +  G HT+ VG+
Sbjct: 724 GVVAFKVDVCKDLSVVDELGNRKVALGNHTLHVGD 758


>gi|449466797|ref|XP_004151112.1| PREDICTED: beta-D-xylosidase 1-like [Cucumis sativus]
          Length = 770

 Score =  734 bits (1895), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 369/730 (50%), Positives = 491/730 (67%), Gaps = 25/730 (3%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           FC  SL    RVKDL+ R+TL EK++ L + A  VPRLG+  YEWWSEALHGVSNVGPGT
Sbjct: 46  FCQESLGIEERVKDLIGRLTLGEKIRLLVNNAIAVPRLGIRGYEWWSEALHGVSNVGPGT 105

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVAR 173
            F    PGATSFP VI T ASFN+SLW  IG+ VS EARAMYN G AGLTYWSPN+N+ R
Sbjct: 106 KFGGTFPGATSFPQVITTAASFNQSLWLLIGRVVSDEARAMYNGGTAGLTYWSPNVNIFR 165

Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
           DPRWGR  ETPGEDP +  +YA NYV+GLQ  +G +         LKV++CCKHY AYD+
Sbjct: 166 DPRWGRGQETPGEDPILAAKYAANYVQGLQGNDGKKR--------LKVAACCKHYTAYDL 217

Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
           DNW GVDRYHF+A+V++QD+E+T+  PF+ CV EG  +SVMCSYN+VNG P+CADP LL 
Sbjct: 218 DNWNGVDRYHFNAKVSKQDLEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDLLK 277

Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN 353
            T+RG W L GYIV+DCDS+ V+ D+  F   + E+A A T+KAGLDLDCG +    T  
Sbjct: 278 NTIRGAWGLDGYIVSDCDSVGVLYDSQHF-TPTPEEAAASTIKAGLDLDCGPFLAVHTAT 336

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAR 410
           AV +G +KE D++ +L  L +V MRLG FDG P    Y +LG +D+C+  +  LA EAAR
Sbjct: 337 AVGRGLLKEVDLNNALANLLSVQMRLGMFDGEPAAQPYGNLGPKDVCTPAHKHLALEAAR 396

Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANV 470
           +GIVLL+N    LPL+  + +TVAV+GP+++ATV MIGNYAG+ C Y +P+ G S Y   
Sbjct: 397 QGIVLLQNRAGALPLSPTRHRTVAVIGPNSDATVTMIGNYAGVACEYTTPVQGISKYVKT 456

Query: 471 TYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLI 530
            +  GC +VAC  +  I  A  AA+ ADA +++ GLD S+EAES DR  + LPG Q +L+
Sbjct: 457 IHAKGCANVACVGDQLIGEAEAAARVADAAVVVVGLDQSIEAESRDRNGVLLPGKQEELV 516

Query: 531 NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
            ++    KGP ++V+MS G +D++FA+ +  I  ILW GYPG+ GG AIADV+FG  NPG
Sbjct: 517 RRIGLACKGPTVVVLMSGGPIDVSFAKNDGKISGILWVGYPGQAGGAAIADVLFGATNPG 576

Query: 591 GRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNL 650
           G+LP+TWY   Y+  +P+T+M LRP  S GYPGRTY+FY GP ++PFG+GLSY++F    
Sbjct: 577 GKLPMTWYPQSYLAKVPMTNMGLRPDPSTGYPGRTYRFYKGPVVFPFGFGLSYSKFSQ-- 634

Query: 651 LSFTKT-IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
            SF +   +++L       N + T   S T C    V+DL         +D +N G+ DG
Sbjct: 635 -SFAEAPTKISLPLSSLSPNSSATVKVSHTDCAS--VSDL------PIMIDVKNTGTVDG 685

Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
           S  ++V+S  P +  +   K +IGF++V + AG  KR++   + C  L+ VD      +P
Sbjct: 686 SHTILVFSTVPNQTWSPE-KHLIGFEKVHLIAGSQKRVRIGIHVCDHLSRVDEFGTRRIP 744

Query: 770 AGEHTIFVGN 779
            GEH + +G+
Sbjct: 745 MGEHKLHIGD 754


>gi|326494302|dbj|BAJ90420.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326521150|dbj|BAJ96778.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326527851|dbj|BAK08165.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 775

 Score =  733 bits (1891), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/756 (48%), Positives = 498/756 (65%), Gaps = 27/756 (3%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           + +PVF CD    +     ++++ FC+     S R +DLVSR+TL EKV  L +    + 
Sbjct: 32  AQAPVFACDASNAT-----LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALG 86

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P YEWWSEALHGVS VGPGT F  ++PGATSFP  ILT ASFN SL++ IG+ VST
Sbjct: 87  RLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVST 146

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           EARAM+N+G AGLT+WSPNIN+ RDPRWGR  ETPGEDP +  +YAV YV GLQD     
Sbjct: 147 EARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDA---- 202

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
            A  +    LKV++CCKHY AYDVDNWKGV+RY FDA+V++QD+++TF  PF+ CV +G+
Sbjct: 203 GAGGVTDGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGN 262

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
            +SVMCSYN+VNG P+CAD  LL   +RG+W L+GYIV+DCDS+ V+     +   + E+
Sbjct: 263 VASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHY-TKTPEE 321

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
           A A T+K+GLDL+CG +    T  AVQ G++ E D+D+++   + +LMRLGFFDG P+  
Sbjct: 322 AAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQL 381

Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
            + SLG +D+C+  N ELA E AR+GIVLLKN    LPL++  +K++AV+GP+ANA+  M
Sbjct: 382 AFGSLGPKDVCTSSNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTM 440

Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAG 505
           IGNY G PC+Y +P+ G     N  Y+ GC +V C  N+  +  A  AA +AD T+++ G
Sbjct: 441 IGNYEGTPCKYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVG 500

Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
            D S+E ESLDR  L LPG QTQL++ VA  + GPVILV+MS G  DI+FA+ +  I AI
Sbjct: 501 ADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAI 560

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           LW GYPGE GG A+AD++FG  NP GRLP+TWY   Y   + +T M +RP  S GYPGRT
Sbjct: 561 LWVGYPGEAGGAALADILFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRT 620

Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           Y+FY G T++ FG GLSYT+  ++L+S   + + + L +   CR            C  V
Sbjct: 621 YRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYVSMRLAEDHPCR---------AEECASV 671

Query: 685 LVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
                 CDD  F+ K+  +N G   G+  V+++S PP    A   K ++GF++V +  G 
Sbjct: 672 EAAGDHCDDLAFDVKLQVRNAGEVAGAHSVLLFSSPPPAHNAP-AKHLLGFEKVSLAPGE 730

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
              + F  + C+ L++VD      +  G HT+ VG+
Sbjct: 731 AGTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGD 766


>gi|413919688|gb|AFW59620.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 773

 Score =  731 bits (1888), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 369/755 (48%), Positives = 498/755 (65%), Gaps = 30/755 (3%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           + +P F CD    +     ++S+ FC+ S   + R  DLVSR+TL EKV  L D    +P
Sbjct: 35  AQTPAFACDASNAT-----LASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALP 89

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P YEWWSEALHGVS VGPGT F  ++PGATSFP  ILT ASFN +L++ IG+ VS 
Sbjct: 90  RLGVPLYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSN 149

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           EARAM+N+G AGLT+WSPNIN+ RDPRWGR  ETPGEDP +  +YAV YV GLQ      
Sbjct: 150 EARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAVSGA 209

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
            A       LKV++CCKHY AYDVDNWKGV+RY FDA V++QD+++TF  PF+ CV +G+
Sbjct: 210 GA-------LKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGN 262

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
            +SVMCSYN+VNG P+CAD  LL+  +RG+W L+GYI +DCDS+ V+ +N  +   + ED
Sbjct: 263 VASVMCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHY-TKTPED 321

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
           A A ++KAGLDL+CG +    T  AVQ GK+ E+D+D+++      LMRLGFFDG P+  
Sbjct: 322 AAAISIKAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPREL 381

Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
            + +LG  D+C+  N ELA EAAR+GIVLLKN    LPL++  +K++AV+GP+ANA+  M
Sbjct: 382 PFGNLGPSDVCTPSNQELAREAARQGIVLLKN-TGKLPLSATSIKSMAVIGPNANASFTM 440

Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAG 505
           IGNY G PC+Y +P+ G        Y+ GC +V C  N+  + AA++AA +AD T+++ G
Sbjct: 441 IGNYEGTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVG 500

Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
            D S+E ESLDR  L LPG Q QL++ VA  + GP ILV+MS G  DI+FA+++  I AI
Sbjct: 501 ADQSIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAI 560

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           LW GYPGE GG AIADV+FG  NP GRLP+TWY   + + +P+T M +RP  S GYPGRT
Sbjct: 561 LWVGYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTK-VPMTDMRMRPDPSTGYPGRT 619

Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVL 685
           Y+FY G T+Y FG GLSYT F ++L+S  K + + L +   C            +CP V 
Sbjct: 620 YRFYTGDTVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACLT---------EQCPSVE 670

Query: 686 VNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
                C+   F+  +  +N G   G   V ++S PPA +     K ++GF++V +  G+ 
Sbjct: 671 AEGAHCEGLAFDVHLRVRNAGERSGGHTVFLFSSPPA-VHNAPAKHLLGFEKVSLEPGQA 729

Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             + F  + CK L++VD   N  +  G HT+ VG+
Sbjct: 730 GVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGD 764


>gi|326492918|dbj|BAJ90315.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 775

 Score =  731 bits (1888), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 365/756 (48%), Positives = 498/756 (65%), Gaps = 27/756 (3%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           + +PVF CD    +     ++++ FC+     S R +DLVSR+TL EKV  L +    + 
Sbjct: 32  AQAPVFACDASNAT-----LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALG 86

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P YEWWSEALHGVS VGPGT F  ++PGATSFP  ILT ASFN SL++ IG+ VST
Sbjct: 87  RLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVST 146

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           EARAM+N+G AGLT+WSPNIN+ RDPRWGR  ETPGEDP +  +YAV YV GLQD     
Sbjct: 147 EARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDA---- 202

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
            A  +    LKV++CCKHY AYDVDNWKGV+RY FDA+V++QD+++TF  PF+ CV +G+
Sbjct: 203 GAGGVTDGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGN 262

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
            +SVMCSYN+VNG P+CAD  LL   +RG+W L+GYIV+DCDS+ V+     +   + E+
Sbjct: 263 VASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHY-TKTPEE 321

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
           A A T+K+GLDL+CG +    T  AVQ G++ E D+D+++   + +LMRLGFFDG P+  
Sbjct: 322 AAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQL 381

Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
            + SLG +D+C+  N ELA E AR+GIVLLKN    LPL++  +K++AV+GP+ANA+  M
Sbjct: 382 AFGSLGPKDVCTSSNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTM 440

Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAG 505
           IGNY G PC+Y +P+ G     N  Y+ GC +V C  N+  +  A  AA +AD T+++ G
Sbjct: 441 IGNYEGTPCKYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVG 500

Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
            D S+E ESLDR  L LPG QTQL++ VA  + GPVILV+MS G  DI+FA+ +  I AI
Sbjct: 501 ADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAI 560

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           LW GYPGE GG A+AD++FG  NP G+LP+TWY   Y   + +T M +RP  S GYPGRT
Sbjct: 561 LWVGYPGEAGGAALADILFGSHNPSGKLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRT 620

Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           Y+FY G T++ FG GLSYT+  ++L+S   + + + L +   CR            C  V
Sbjct: 621 YRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYVSMRLAEDHPCR---------AEECASV 671

Query: 685 LVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
                 CDD  F+ K+  +N G   G+  V+++S PP    A   K ++GF++V +  G 
Sbjct: 672 EAAGDHCDDLAFDVKLQVRNAGEVAGAHSVLLFSSPPPAHNAP-AKHLLGFEKVSLAPGE 730

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
              + F  + C+ L++VD      +  G HT+ VG+
Sbjct: 731 AGTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGD 766


>gi|302786124|ref|XP_002974833.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
 gi|300157728|gb|EFJ24353.1| hypothetical protein SELMODRAFT_101733 [Selaginella moellendorffii]
          Length = 784

 Score =  726 bits (1873), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/754 (48%), Positives = 507/754 (67%), Gaps = 24/754 (3%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
           + CD    + LG    SF FCD+ L   +RV+DLVSR+TLDEKV ++ + A G+PRLG+P
Sbjct: 36  YACDVSSNASLG----SFPFCDTKLGIDVRVQDLVSRLTLDEKVDEMVNAAQGIPRLGVP 91

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            Y+WW EALHGV++  PG  F  + P ATSFP  I T ASFN +L+  IG+AVS+EARA+
Sbjct: 92  SYQWWQEALHGVAS-SPGVQFGGLAPAATSFPMPIATAASFNSTLFYSIGEAVSSEARAL 150

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
           +NLGRAGLT+WSPN+N+ RDPRWGR  ETPGEDP +  ++A  YVRGLQ      +A+D 
Sbjct: 151 HNLGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQGGAYEGSASD- 209

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
               LKVS+CCKH  AYDVDNWKG+DRYHF+A V+EQD+ +T+  PF+ C+++G  SSVM
Sbjct: 210 --GFLKVSACCKHLTAYDVDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGRVSSVM 267

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           CSYNRVNG+P+CAD  LL +TVR  W  +GYIV+DCD++QV+ ++  + A S EDAVA +
Sbjct: 268 CSYNRVNGVPTCADRNLLTETVRNSWGFNGYIVSDCDALQVLFEDTTY-APSAEDAVADS 326

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
           + AGLDL+CG +      +A+Q GK+ E D+D ++  L    MRLG FDG P    Y SL
Sbjct: 327 ILAGLDLNCGTFLGKHAKSALQAGKITEADLDHAVSNLMRTRMRLGLFDGDPNSQPYSSL 386

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
           G  DICS+++ +LA +AA +G+VLLKND  +LPL++A +KTVA++GP+ANAT  M+GNY 
Sbjct: 387 GATDICSNDHQQLALDAALQGVVLLKND-GSLPLSTA-LKTVALIGPNANATYTMLGNYE 444

Query: 452 GIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSV 510
           GIPC+Y+SP+ G   Y +N+ Y  GC +VAC   + + +A E A  ADA +++ GLD S 
Sbjct: 445 GIPCKYISPLQGMQIYSSNILYSPGCRNVACNEGDLVASAVEVATKADAVVLVVGLDQSQ 504

Query: 511 EAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGY 570
           E E+ DR  L LPG Q+QL++ +A     P++LVIMSAG VDI+  + N+ I +++W GY
Sbjct: 505 ERETFDRTSLLLPGMQSQLVSNIANAVTSPIVLVIMSAGPVDISTFKDNSRISSVIWLGY 564

Query: 571 PGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYN 630
           PG+ GG A+A VVFG +NPGGRLP TWY+ ++   + +  M +RP    GYPGR+Y+FY 
Sbjct: 565 PGQSGGAALAHVVFGAYNPGGRLPNTWYHEEFTN-VSMLDMQMRPNPLSGYPGRSYRFYT 623

Query: 631 GPTLYPFGYGLSYTQFKYN-LLSFTKT--IQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
           G  LY FG GLSY+ + Y  LL+ TK    + N    + C  +N +   +K+ C  +  +
Sbjct: 624 GTPLYNFGDGLSYSTYFYKFLLAPTKLSFFKSNTGNSRGCPAVNRSK--AKSGCFHLPAD 681

Query: 688 DLR-CDD-YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNK 745
           DL  C+   F+  V+  N+G   GS  V+++S PP  +    +KQ+I FQ+V + +   +
Sbjct: 682 DLETCNSILFQVSVEVSNLGPRSGSHSVLIFSAPP-PVEGAPLKQLIAFQKVHLESDTTQ 740

Query: 746 RIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           R+ F  + CK L+ V       L +G H + +GN
Sbjct: 741 RLIFGIDPCKHLSSVRRNGKRFLHSGRHKLLIGN 774


>gi|296083274|emb|CBI22910.3| unnamed protein product [Vitis vinifera]
          Length = 738

 Score =  720 bits (1859), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/748 (48%), Positives = 498/748 (66%), Gaps = 56/748 (7%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
           F CDP    + G+   +  FC  SLP   R +DLV R+TL EK++ L + A  VPRLG+ 
Sbjct: 27  FACDP----RNGV-TRNLPFCRVSLPIQERARDLVGRLTLQEKIRLLVNNAIDVPRLGIK 81

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            YEWWSEALHGVSNVGPGT F    PGATSFP VI T ASFN SLW++IG+ VS EARAM
Sbjct: 82  GYEWWSEALHGVSNVGPGTKFGGSFPGATSFPQVITTAASFNASLWEEIGRVVSDEARAM 141

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
           YN G AGLTYWSPN+N+ RDPRWGR  ETPGEDP V  +YA  YVRGLQ      NA D 
Sbjct: 142 YNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPAVAAKYAAAYVRGLQG-----NARDR 196

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
               LKV++CCKHY AYD+D+W G+DR+HF+ARV++QD+E+T+  PF+ CV EG+ +SVM
Sbjct: 197 ----LKVAACCKHYTAYDLDHWGGIDRFHFNARVSKQDLEDTYDVPFKACVVEGNVASVM 252

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           CSYN+VNG P+CADP LL  T+RGEW L+GYIV+DCDS+ V  D   + A + E+A A  
Sbjct: 253 CSYNQVNGKPTCADPHLLRDTIRGEWKLNGYIVSDCDSVGVFYDEQHYTA-TPEEAAAVA 311

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
           +KAGLDLDCG +    T  A++ GK+ E D++ +L    +V MRLG FDG P    Y +L
Sbjct: 312 IKAGLDLDCGPFLAIHTEAAIRGGKLTEADVNGALMNTISVQMRLGMFDGEPSAQPYGNL 371

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
           G +D+C+  + +LA EAAR+GIVL++N    LPL++++ +T+AV+GP+++ T  MIGNYA
Sbjct: 372 GPRDVCTPAHQQLALEAARQGIVLVQNRGPALPLSTSRHRTIAVIGPNSDVTETMIGNYA 431

Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
           G+ C Y +P+ G   YA   ++ GC  VAC+ +    AA  AA+ ADAT+++ GLD S+E
Sbjct: 432 GVACGYTTPLQGIGRYARTIHQAGCSGVACRDDQQFGAAVAAARQADATVLVMGLDQSIE 491

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
           AE  DR D+ LPG Q +L+++VA  ++GP +LV+MS G +D++FA+ +  I AI+W GYP
Sbjct: 492 AEFRDRVDILLPGRQQELVSKVAVASRGPTVLVLMSGGPIDVSFAKNDPRIAAIIWVGYP 551

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+ GG AIADV+FG+ NPGG+LP+TWY   Y++  P+T+M +R + S GYPGRTY+FYNG
Sbjct: 552 GQAGGTAIADVLFGRTNPGGKLPVTWYPQSYLRKAPMTNMAMRAIPSRGYPGRTYRFYNG 611

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
           P ++PFG+GLSY+ F ++L     T                                   
Sbjct: 612 PVVFPFGHGLSYSTFAHSLAQAPTTP---------------------------------- 637

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
                F +D +N G+ DGS  ++++S PP    +   K+++ F++V V AG  +R++F  
Sbjct: 638 ---LGFHIDVKNTGTMDGSHTLLLFSTPPPGTWSPN-KRLLAFEKVHVGAGSQERVRFDV 693

Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGN 779
           + CK L++VD+     +P GEH   +G+
Sbjct: 694 HVCKHLSVVDHFGIHRIPMGEHHFHIGD 721


>gi|18025340|gb|AAK38481.1| alpha-L-arabinofuranosidase/beta-D-xylosidase isoenzyme ARA-I
           [Hordeum vulgare]
          Length = 777

 Score =  720 bits (1859), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/753 (47%), Positives = 491/753 (65%), Gaps = 27/753 (3%)

Query: 33  PVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
           PVF CD    +     ++++ FC+     S R +DLVSR+TL EKV  L +    + RLG
Sbjct: 37  PVFACDASNAT-----LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALGRLG 91

Query: 93  LPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR 152
           +P YEWWSEALHGVS VGPGT F  ++PGATSFP  ILT ASFN SL++ IG+ VSTEAR
Sbjct: 92  IPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVSTEAR 151

Query: 153 AMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
           AM+N+G AGLT+WSPNIN+ RDPRWGR  ETPGEDP +  +YAV YV GLQD      A 
Sbjct: 152 AMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDA----GAG 207

Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
            +    LKV++CCKHY AYDVDNWKGV+RY FDA+V++QD+++TF  PF+ CV +G+ +S
Sbjct: 208 GVTDGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVAS 267

Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
           VMCSYN+VNG P+CAD  LL   +RG+W L+GYIV+DCDS+ V+     +   + E+A A
Sbjct: 268 VMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHY-TKTPEEAAA 326

Query: 333 QTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YV 389
            T+K+G+DL+CG +    T  AVQ G++ E D+D+++   + +LMRLGFFDG P+   + 
Sbjct: 327 ITIKSGVDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFG 386

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
           SLG +D+C+  N ELA E AR+GIVLLKN    LPL++  +K++AV+GP+ANA+  MIGN
Sbjct: 387 SLGPKDVCTSSNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTMIGN 445

Query: 450 YAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAGLDL 508
           Y G PC+Y +P+ G     N  Y+ GC +V C  N+  +  A  AA +AD T+++ G D 
Sbjct: 446 YEGTPCKYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQ 505

Query: 509 SVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
           S+E ESLDR  L LPG QTQL++ VA  + GPVILV+MS G  DI+FA+ +  I A LW 
Sbjct: 506 SIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAATLWV 565

Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
           GYPGE GG A+ D +FG  NP GRLP+TWY   Y   + +T M +RP  S GYPGRTY+F
Sbjct: 566 GYPGEAGGAALDDTLFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRF 625

Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
           Y G T++ FG GLSYT+  ++L+S   + + + L +   CR            C  V   
Sbjct: 626 YTGDTVFAFGDGLSYTKMSHSLVSAPPSYVSMRLAEDHLCR---------AEECASVEAA 676

Query: 688 DLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKR 746
              CDD   + K+  +N G   G+  V+++S PP    A   K ++GF++V +  G    
Sbjct: 677 GDHCDDLALDVKLQVRNAGEVAGAHSVLLFSSPPPAHNAP-AKHLVGFEKVSLAPGEAGT 735

Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           + F  + C+ L++VD      +  G HT+  G+
Sbjct: 736 VAFRVDVCRDLSVVDELGGRKVALGGHTLHDGD 768


>gi|302760655|ref|XP_002963750.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
 gi|300169018|gb|EFJ35621.1| hypothetical protein SELMODRAFT_80102 [Selaginella moellendorffii]
          Length = 785

 Score =  718 bits (1854), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 363/762 (47%), Positives = 506/762 (66%), Gaps = 30/762 (3%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           ++ P + CD    + LG    SF FCD+ L   +RV+DLVSR+TLDEKV ++ + A G+P
Sbjct: 32  TAQPRYACDVSSNASLG----SFPFCDTKLGVDVRVQDLVSRLTLDEKVDEMVNAAQGIP 87

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P Y+WW EALHGV++  PG  F  + P ATSFP  I   ASFN +L+  IG+AVS+
Sbjct: 88  RLGVPSYQWWQEALHGVAS-SPGVQFGGLAPAATSFPMPIAMAASFNSTLFYSIGEAVSS 146

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           EARA++NLGRAGLT+WSPN+N+ RDPRWGR  ETPGEDP +  ++A  YVRGLQ      
Sbjct: 147 EARALHNLGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLASKFASLYVRGLQGGAYGG 206

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
           +A+D     LKVS+CCKH  AYD+DNWKG+DRYHF+A V+EQD+ +T+  PF+ C+++G 
Sbjct: 207 SASD---GFLKVSACCKHLTAYDMDNWKGMDRYHFNAEVSEQDLVDTYNPPFQSCIEDGR 263

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
            SSVMCSYNRVNG+P+CAD  LL +TVR  W  +GYIV+DCD++QV+ ++  + A S ED
Sbjct: 264 VSSVMCSYNRVNGVPTCADRSLLTETVRNSWGFNGYIVSDCDALQVLFEDTTY-APSAED 322

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG---SP 386
           AVA ++ AGLDL+CG +      +A+Q GKV E D+D ++  L    MRLG FDG   + 
Sbjct: 323 AVADSILAGLDLNCGTFLGKHAKSALQAGKVTEADLDHAISNLMRTRMRLGLFDGDLNTR 382

Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
            Y SLG  DICS+++ +LA +AA +G+VLLKND  +LPL++A +KTVA++GP+ANAT  M
Sbjct: 383 PYSSLGATDICSNDHQQLALDAALQGVVLLKND-GSLPLSTA-LKTVALIGPNANATYTM 440

Query: 447 IGNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG 505
           +GNY GIPC+Y+SP+ G   Y  N+ Y  GC DVAC   + + +A E A  ADA +++ G
Sbjct: 441 LGNYEGIPCKYVSPLQGMQIYNNNILYSPGCRDVACSEGDLVASAVEVATKADAVVLVVG 500

Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
           LD S E E+ DR  L LPG Q+QL++ +A     P++LVIMSAG VDI+  + N+ I ++
Sbjct: 501 LDQSQERETFDRTSLLLPGMQSQLVSNIANAVTCPIVLVIMSAGPVDISTFKDNSRISSV 560

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           +W GYPG+ GG A+A VVFG +NPGGRLP TWY+ ++   + +  M +RP    GYPGR+
Sbjct: 561 IWIGYPGQSGGAALAHVVFGAYNPGGRLPNTWYHEEFTN-VSMLDMRMRPNPPSGYPGRS 619

Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNL------LSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
           Y+FY G  LY FG GLSY+ + Y        LSF K+   N    + C  +N +   ++ 
Sbjct: 620 YRFYTGTPLYNFGDGLSYSTYLYKFLLAPTRLSFFKS---NTRNSRDCPTVNRSE--AEF 674

Query: 680 RCPGVLVNDLR-CDD-YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
            C  +  +DL  C+   F+  V+  N+G   GS  V+++S PP  +    +KQ+I FQ+V
Sbjct: 675 GCFHLPADDLETCNSILFQVSVEVSNLGPRSGSHSVLIFSAPP-PVEGAPLKQLIAFQKV 733

Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            + +   +R+ F  + CK L+ V       L +G H + +GN
Sbjct: 734 HLESDTTQRLIFGIDPCKHLSSVRRNGKRFLHSGRHKLLIGN 775


>gi|224066931|ref|XP_002302285.1| predicted protein [Populus trichocarpa]
 gi|222844011|gb|EEE81558.1| predicted protein [Populus trichocarpa]
          Length = 773

 Score =  716 bits (1849), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 359/772 (46%), Positives = 505/772 (65%), Gaps = 36/772 (4%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           S+ P F CD    S       +F FC+++LP S R +DLVSR+TLDEK+ QL + A  +P
Sbjct: 23  STQPPFSCDSSNPS-----TKAFPFCETTLPISQRARDLVSRLTLDEKISQLVNSAPPIP 77

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P YEWWSEALHGVSN GPG HF+D I GATSFP VILT ASF+   W +IGQA+  
Sbjct: 78  RLGIPGYEWWSEALHGVSNAGPGIHFNDNIKGATSFPQVILTAASFDAYQWYRIGQAIGK 137

Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ----- 203
           EARA+YN G+A G+T+W+PNIN+ RDPRWGR  ETPGEDP V G YA +YV+G+Q     
Sbjct: 138 EARALYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLVTGLYAASYVKGVQGDSFE 197

Query: 204 --DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
              ++GH          L+ S+CCKH+ AYD+DNWKG++R+ FDARVT QD+ +T+  PF
Sbjct: 198 GGKIKGH----------LQASACCKHFTAYDLDNWKGMNRFVFDARVTMQDLADTYQPPF 247

Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHK 321
           + CV++G AS +MC+YN+VNG+PSCAD  LL++T R +W   GYI +DCD++ ++ D+  
Sbjct: 248 KSCVEQGRASGIMCAYNKVNGVPSCADSNLLSKTARAQWGFRGYITSDCDAVSIIHDDQG 307

Query: 322 FLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
           + A S EDAV   LKAG+D++CG Y       AV+Q K+ E+DIDK+L  L++V MRLG 
Sbjct: 308 Y-AKSPEDAVVDVLKAGMDVNCGSYLLKHAKVAVEQKKLSESDIDKALHNLFSVRMRLGL 366

Query: 382 FDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGP 438
           F+G P+   + ++G   +CS E+  LA EAAR GIVLLKN    LPL+ +K K++AV+GP
Sbjct: 367 FNGRPEGQLFGNIGPDQVCSQEHQILALEAARNGIVLLKNSARLLPLSKSKTKSLAVIGP 426

Query: 439 HANATVAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTA 497
           +AN+   ++GNYAG PCR+++P+     Y   T Y   CD V C S+ S+  A + AK A
Sbjct: 427 NANSGQMLLGNYAGPPCRFVTPLQALQSYIKQTVYHPACDTVQC-SSASVDRAVDVAKGA 485

Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
           D  +++ GLD + E E LDR DL LPG Q +LI  VA+ AK PV+LV+ S G VDI+FA+
Sbjct: 486 DNVVLMMGLDQTQEREELDRTDLLLPGKQQELIIAVAKAAKNPVVLVLFSGGPVDISFAK 545

Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD 617
            + NI +ILWAGYPGE G  A+A++VFG  NPGGRLP+TWY  ++V+ +P+T M +RP  
Sbjct: 546 NDKNIGSILWAGYPGEGGAIALAEIVFGDHNPGGRLPMTWYPQEFVK-VPMTDMGMRPEA 604

Query: 618 SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTK-TIQVNLNKLQHCRNLNYTSDA 676
           S GYPGRTY+FY G +++ FGYG+SY+++ Y L + ++ T+ +N +   H  N     D+
Sbjct: 605 SSGYPGRTYRFYRGRSVFEFGYGISYSKYSYELTAVSQNTLYLNQSSTMHIIN---DFDS 661

Query: 677 SKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQR 736
            ++     L  +    +    ++  +N G   G   V+++++          KQ+IGFQ 
Sbjct: 662 VRSTLISELGTEFCEQNKCRARIGVKNHGEMAGKHPVLLFARQEKHGNGRPRKQLIGFQS 721

Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
           V + AG    I+F  + C+ L+  +     ++  G H + V   G  +PI +
Sbjct: 722 VVLGAGERAEIEFEVSPCEHLSRANEDGLMVMEEGRHFLVV--DGDEYPISV 771


>gi|371917286|dbj|BAL44719.1| SlArf/Xyl4 [Solanum lycopersicum]
          Length = 775

 Score =  716 bits (1848), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 364/790 (46%), Positives = 504/790 (63%), Gaps = 29/790 (3%)

Query: 9   LCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDL 68
           L  S  I  ++ S + V    S+ P F CD         Q  S  FC + LP S+RV DL
Sbjct: 3   LHISTLITTILISLSLVSIVQSTQPPFSCDSSN-----PQTKSLKFCQTGLPISVRVLDL 57

Query: 69  VSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTV 128
           VSR+TLDEK+ QL + A  +PRLG+P YEWWSE+LHGV + G G  F+  I GATSFP V
Sbjct: 58  VSRLTLDEKISQLVNSAPAIPRLGIPAYEWWSESLHGVGSAGKGIFFNGSIAGATSFPQV 117

Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGED 187
           ILT A+F+E+LW +IGQ +  EAR +YN G+A G+T+W+PNIN+ RDPRWGR  ETPGED
Sbjct: 118 ILTAATFDENLWYRIGQVIGVEARGVYNAGQAIGMTFWAPNINIFRDPRWGRGQETPGED 177

Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
           P + G+YA+ YVRG+Q      N   L    L+ S+CCKH+ AYD+D WK +DR+ F+A 
Sbjct: 178 PIMTGKYAIRYVRGVQG--DSFNGGQLKKGHLQASACCKHFTAYDLDQWKNLDRFSFNAI 235

Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
           VT QDM +TF  PF+ C+++  AS +MCSYN VNGIPSCA+  LL +T R +W  HGYI 
Sbjct: 236 VTPQDMADTFQPPFQDCIQKAQASGIMCSYNSVNGIPSCANYNLLTKTARQQWGFHGYIT 295

Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDK 367
           +DCD++QVM DNH++  ++ ED+ A  LKAG+D+DCG Y   +T +AV + KV +  ID+
Sbjct: 296 SDCDAVQVMHDNHRY-GNTPEDSTAFALKAGMDIDCGDYLKKYTKSAVMKKKVSQVHIDR 354

Query: 368 SLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLP 424
           +L  L+++ MRLG F+G P+   Y ++    +C+ ++ +LA EAAR GIVLLKN    LP
Sbjct: 355 ALHNLFSIRMRLGLFNGDPRKQLYGNISPSQVCAPQHQQLALEAARNGIVLLKNTGKLLP 414

Query: 425 LNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKS 483
           L+ AK  ++AV+G +AN    + GNY G PC+Y+  +    GYA +V Y+ GC+   C S
Sbjct: 415 LSKAKTNSLAVIGHNANNAYILRGNYDGPPCKYIEILKALVGYAKSVQYQQGCNAANCTS 474

Query: 484 NNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVIL 543
            N I  A   A+ AD  +++ GLD + E E  DR+DL LPG Q  LIN VA+ AK PVIL
Sbjct: 475 AN-IDQAVNIARNADYVVLIMGLDQTQEREQFDRDDLVLPGQQENLINSVAKAAKKPVIL 533

Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYV 603
           VI+S G VDI+FA+ N  I +ILWAGYPGE GG A+A+++FG+ NPGG+LP+TWY   +V
Sbjct: 534 VILSGGPVDISFAKYNPKIGSILWAGYPGEAGGIALAEIIFGEHNPGGKLPVTWYPQAFV 593

Query: 604 QMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLN 662
           + +P+T M +RP    GYPGRTY+FY GP +Y FGYGLSYT + Y   S T  TIQ  LN
Sbjct: 594 K-IPMTDMRMRPDPKTGYPGRTYRFYKGPKVYEFGYGLSYTTYSYGFHSATPNTIQ--LN 650

Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDD----YFEFKVDFQNVGSTDGSDVVIVYSK 718
           +L   + +  +     T      V+++  D+     F   V  +N G  DG   V+++ K
Sbjct: 651 QLLSVKTVENSDSIRYT-----FVDEIGSDNCEKAKFSAHVSVENSGEMDGKHPVLLFVK 705

Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                  + IKQ++GFQ V ++AG N ++ F  + C+ L+  +     ++  G   + VG
Sbjct: 706 QDKARNGSPIKQLVGFQSVSLKAGENSQLVFEISPCEHLSSANEDGLMMIEEGSRYLVVG 765

Query: 779 NGGVSFPIHL 788
           +     PI++
Sbjct: 766 DA--EHPINI 773


>gi|302811514|ref|XP_002987446.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
 gi|300144852|gb|EFJ11533.1| hypothetical protein SELMODRAFT_426206 [Selaginella moellendorffii]
          Length = 772

 Score =  712 bits (1837), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/739 (49%), Positives = 499/739 (67%), Gaps = 24/739 (3%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +++F FC++SLP + RV+D V+R+TL+EK+ QL + A G+PRLG+P+Y+WW EALHGV++
Sbjct: 39  LAAFPFCNTSLPITDRVEDYVARLTLEEKISQLINTATGIPRLGVPKYQWWQEALHGVAS 98

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
             PG  F   +P ATSFP  I T ASFN SL+  IGQAVSTEARAM+NLG++GLT+WSPN
Sbjct: 99  -SPGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARAMHNLGQSGLTFWSPN 157

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           IN+ RDPRWGR  ETPGEDP +   +A  YVRGLQ+ +         S  LKVS+CCKH 
Sbjct: 158 INIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQA-------GSDKLKVSACCKHM 210

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
            AYDVDNW G DRYHF+A VTEQD+E+T+  PF+ CV++G  SSVMCSYNR+NG+P+CAD
Sbjct: 211 TAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYNRLNGVPTCAD 270

Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
            +LL  TVR  W L+GYIV+DCDS+QV  DN  + A +++ A A  L AGL+L+CG +  
Sbjct: 271 HELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAED-AAADALLAGLNLNCGTFLA 329

Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELA 405
             T +A+QQ KV E  I+++L YL TV MRLG +DG P+   Y SLG  D+C+ E+  LA
Sbjct: 330 KHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDVCTSEHQTLA 389

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
            EAAR+G+VLLKN    LPL+++K+K++AVVGPHANAT AMIGNYAGIPC+Y SP+  F 
Sbjct: 390 LEAARQGMVLLKN-LGALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCKYTSPLQAFQ 448

Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
            YA V+Y  GC +VAC S++ I  A  AA  ADA ++  GLDL++EAESLDR  L LPG 
Sbjct: 449 KYAQVSYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESLDRTSLLLPGK 508

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q +L++QV + AKGPV++VI+SAG +DI FA +++ I  ILWAGYPG+ GG AIA+V+FG
Sbjct: 509 QQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAGGAAIAEVIFG 568

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
             NP G+LP TWY  ++   + +  M +RP  S GYPGRTY+FY GPT++ FG GLSYT 
Sbjct: 569 DHNPSGKLPATWYPQNFTS-ISMLDMNMRPNASTGYPGRTYRFYTGPTIFKFGDGLSYTS 627

Query: 646 FKYNLLSFTKTIQV-NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKV--DFQ 702
                +     + + +   +Q C  L  +S      C  +   D +  +  + +V    +
Sbjct: 628 LSAKFIKAPSFLSIPSTAPMQPCTGLKKSSS-----CFHLDATDEKSCESLKSQVAISVR 682

Query: 703 NVGSTDGSDVVIVYSKPPAEIA-ATYIKQVIGFQRVFVRAGR-NKRIKFVFNACKSLNIV 760
           N G+   S  ++++S PP+  +     +Q++GF ++ +     +  + F  + C+     
Sbjct: 683 NKGAMAISHTLMLFSTPPSAGSDGVPQRQLVGFNKIQIAGDSISNPVIFDLDPCRHFVHA 742

Query: 761 DYAANTLLPAGEHTIFVGN 779
           D     LL +G H +  GN
Sbjct: 743 DRDGKKLLRSGTHVLTAGN 761


>gi|115486595|ref|NP_001068441.1| Os11g0673200 [Oryza sativa Japonica Group]
 gi|113645663|dbj|BAF28804.1| Os11g0673200 [Oryza sativa Japonica Group]
          Length = 822

 Score =  711 bits (1836), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 385/780 (49%), Positives = 497/780 (63%), Gaps = 60/780 (7%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
           ++  FC  SLP   R +DLV+R+T  EKV+ L + A GVPRLG+  YEWWSEALHGVS+ 
Sbjct: 39  ATLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHGVSDT 98

Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQ------------------------ 145
           GPG  F    PGAT+FP VI T ASFN +LW+ IGQ                        
Sbjct: 99  GPGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQVMPILKGGHARCNQRPSCIRISVF 158

Query: 146 --------AVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVN 197
                   AVS E RAMYN G+AGLT+WSPN+N+ RDPRWGR  ETPGEDP V  RYA  
Sbjct: 159 MYVYVCAQAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAA 218

Query: 198 YVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETF 257
           YVRGLQ  +        +S  LK+++CCKH+ AYD+DNW G DR+HF+A VT QD+E+TF
Sbjct: 219 YVRGLQQQQ-------PSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTF 271

Query: 258 LRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV 317
             PF  CV +G A+SVMCSYN+VNG+P+CAD   L  T+R  W L GYIV+DCDS+ V  
Sbjct: 272 NVPFRSCVVDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFY 331

Query: 318 DNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLM 377
            +  +   ++EDAVA TL+AGLDLDCG +   +T  AV QGKV + DID ++    TV M
Sbjct: 332 SDQHY-TRTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQM 390

Query: 378 RLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK-TV 433
           RLG FDG P    +  LG Q +C+  + ELA EAAR+GIVLLKND   LPL+ A  +  V
Sbjct: 391 RLGMFDGDPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAV 450

Query: 434 AVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACK-SNNSIFAAS 491
           AVVGPHA ATVAMIGNYAG PCRY +P+ G + YA    ++ GC DVAC  S   I AA 
Sbjct: 451 AVVGPHAEATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAV 510

Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
           +AA+ ADATI++AGLD  +EAE LDR  L LPG Q +LI+ VA+ +KGPVILV+MS G +
Sbjct: 511 DAARRADATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPI 570

Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSM 611
           DI FA+ +  I  ILWAGYPG+ GG+AIADV+FG  NPGG+LP+TWY  DY+Q +P+T+M
Sbjct: 571 DIGFAQNDPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQKVPMTNM 630

Query: 612 PLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL----NKLQHC 667
            +R   + GYPGRTY+FY GPT++PFG+GLSYT F +++      + V L          
Sbjct: 631 AMRANPAKGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLSAHHAAASAS 690

Query: 668 RNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVY-------SKP 719
            +LN T+  S+     V V   RC++      VD +NVG  DG+  V+VY       +  
Sbjct: 691 ASLNATARLSRAAA--VRVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPASSAAE 748

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            A      ++Q++ F++V V AG   R++   + C  L++ D      +P GEH + +G 
Sbjct: 749 AAAGHGAPVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRLIIGE 808


>gi|85813772|emb|CAJ65922.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
          Length = 757

 Score =  711 bits (1836), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 387/793 (48%), Positives = 501/793 (63%), Gaps = 75/793 (9%)

Query: 5   VSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIR 64
           VS  L FSL   LL  S++ V A   SSPVF CD      L    +SF FC++SL  S R
Sbjct: 13  VSVFLFFSLVCFLLFSSSHVVLAQ--SSPVFACDVVSNPSL----ASFGFCNTSLGVSDR 66

Query: 65  VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATS 124
           V DLV R+TL EK+  L + A  V RLG+P+YEWWSEALHGVS VGPGTHF  V+PGATS
Sbjct: 67  VVDLVKRLTLQEKILFLVNSAGSVSRLGIPKYEWWSEALHGVSYVGPGTHFSSVVPGATS 126

Query: 125 FPTVILTTASFNESLWKKIG----QAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRI 180
           FP VILT ASFN SL+  IG    Q VSTEARAMYN+G AGLT+WSPNIN+ RDPRWGR 
Sbjct: 127 FPQVILTAASFNTSLFVAIGKVISQVVSTEARAMYNVGLAGLTFWSPNINIFRDPRWGRG 186

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            ETPGEDP +  +Y   YV+GLQ  +      D N   LKV++CCKHY AYD+DNWKGVD
Sbjct: 187 QETPGEDPLLSSKYGSGYVKGLQQRD------DGNPDGLKVAACCKHYTAYDLDNWKGVD 240

Query: 241 RYHFDA-RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
           RYHF+A  VT+QDM++TF  PF+ CV +G+ +SVMCSYN+VNGIP+CADP LL+  +RGE
Sbjct: 241 RYHFNAVVVTKQDMDDTFQPPFKSCVVDGNVASVMCSYNKVNGIPTCADPDLLSGVIRGE 300

Query: 300 WDLHG--YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA--GLDLDCGQYYTNFTGNAV 355
           W L+G  YIV DCDSI V  ++  +   + E+A A+ + A  GLDL+CG +    T  AV
Sbjct: 301 WKLNGYVYIVTDCDSIDVFYNSQHY-TKTPEEAAAKAILAGIGLDLNCGSFLGKHTEAAV 359

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREG 412
             G V E+ ID+++   +  LMRLGFFDG P    Y  LG +D+C+ EN ELA EAAR+G
Sbjct: 360 TAGLVNESAIDRAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAENQELAREAARQG 419

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTY 472
           IVLLKN                                 G PC+Y +P+ G +     TY
Sbjct: 420 IVLLKN--------------------------------TGTPCKYTTPLQGLAALVATTY 447

Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQ 532
             GC +VAC S   +  A + A  ADAT+++ G DLS+EAES DR D+ LPG Q  LI  
Sbjct: 448 LPGCSNVAC-STAQVDDAKKIAAAADATVLVMGADLSIEAESRDRVDILLPGQQQLLITA 506

Query: 533 VAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFN---- 588
           VA  + GPVILVIMS GG+D++FA+TN  I +ILW GYPGE GG AIAD++FG +N    
Sbjct: 507 VANASTGPVILVIMSGGGMDVSFAKTNDKITSILWVGYPGEAGGAAIADIIFGSYNPSTH 566

Query: 589 --PGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
             PGGRLP+TWY   YV  +P+T+M +RP  S GYPGRTY+FY G T+Y FG GLSY++F
Sbjct: 567 QPPGGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVYSFGDGLSYSEF 626

Query: 647 KYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGS 706
            + L      + V L +   C    Y+S+     C  V   +  C + F+  +  +N G+
Sbjct: 627 SHELTQAPGLVSVPLEENHVC----YSSE-----CKSVAAAEQTCQN-FDVHLRIKNTGT 676

Query: 707 TDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT 766
           T GS  V ++S PP+ +  +  K ++GF++VF+ A  +  + F  + CK L++VD   + 
Sbjct: 677 TSGSHTVFLFSTPPS-VHNSPQKHLVGFEKVFLHAQTDSHVGFKVDVCKDLSVVDELGSK 735

Query: 767 LLPAGEHTIFVGN 779
            +  GEH + +G+
Sbjct: 736 KVALGEHVLHIGS 748


>gi|302796585|ref|XP_002980054.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
 gi|300152281|gb|EFJ18924.1| hypothetical protein SELMODRAFT_419541 [Selaginella moellendorffii]
          Length = 779

 Score =  710 bits (1832), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 369/765 (48%), Positives = 485/765 (63%), Gaps = 59/765 (7%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           F FC++ LP S RV+DL+SRMTL EK+ QL + A G+PRLGLP+YEWW EALHGV+ V P
Sbjct: 44  FGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRLGLPRYEWWQEALHGVA-VSP 102

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
           G  F    PGATSFP  ILT ASF+         AVSTEARAM+N  RAGLTYWSPN+N+
Sbjct: 103 GVKFGGKFPGATSFPMPILTAASFD---------AVSTEARAMHNYQRAGLTYWSPNVNI 153

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
            RDPRWGR  ETPGEDP +  +YA  YVRGLQD       T+L    LKVS+CCKH  AY
Sbjct: 154 YRDPRWGRGQETPGEDPLLSSKYATFYVRGLQD-------TNLGGDKLKVSACCKHMTAY 206

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
           DVDNWKG  R+ F+A VT+QD+ +T+  PF+ CV++   SSVMCSYNRVNG+P+CAD  L
Sbjct: 207 DVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSVMCSYNRVNGVPTCADYNL 266

Query: 292 LNQTVRGEWDLHG----------------YIVADCDSIQVMVDNHKFLADSKEDAVAQTL 335
           L+ TVR  W+L+G                YIV+DCDS+Q   DN  + A + ED VA  L
Sbjct: 267 LSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFFDNTNY-AKTAEDVVADAL 325

Query: 336 KAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLG 392
            AGL+LDCG +    T +A+  GK+ E +++++L+YLY V MRLG +DG+P+   Y +LG
Sbjct: 326 LAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMRLGLYDGNPRSQPYGNLG 385

Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
            Q +C+ EN +LA +AA+EGIVLLKN+ N LP + + ++TVA +GPHA AT AMIGNY G
Sbjct: 386 PQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAAIGPHAKATRAMIGNYQG 445

Query: 453 IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
           IPC+Y +P  G S YA V Y  GC DVAC SN+ I +A+  A  ADA ++  GLDL+ EA
Sbjct: 446 IPCKYTTPHDGLSAYARVVYSAGCSDVACYSNSLIGSAASTASQADAVVLFVGLDLNQEA 505

Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
           E  DR  L LPG Q +L+ +V + AKGPV+LVI S G VD++FA+ +  ++ +LWAGYPG
Sbjct: 506 EGKDRTSLLLPGKQQELVTEVTKAAKGPVVLVIFSGGSVDVSFAKYDKKVQGMLWAGYPG 565

Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
           E GG AIA V+FG  NPGGRLP+TWY   +  +  L  M +RP  S GYPGRTY+FY G 
Sbjct: 566 EAGGAAIAQVLFGDHNPGGRLPVTWYPESFTGITML-DMNMRPDASRGYPGRTYRFYTGQ 624

Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV------ 686
           ++Y FGYG +Y++  +      K   ++L   +        + A K  C G L       
Sbjct: 625 SVYNFGYGKTYSKLSHKF----KEAPLSLGFPE--------AAAVKRSCDGNLTCFHLNA 672

Query: 687 -NDLRCDDYF-EFKVDFQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFVRAGR 743
            +++ C     + ++   N G    +  V++YS PP A      I+Q+ GF +V V  G 
Sbjct: 673 HDEITCSTLTSKVRILVHNEGDRPSNRAVLLYSSPPNAGRDGAPIRQLAGFGKVSVAPGA 732

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
            + ++   + CK L+        +L  G HT+ VGN     PI L
Sbjct: 733 VENVEIEIDPCKHLSHAGANGVRILHGGIHTLAVGNARHPLPILL 777


>gi|302811516|ref|XP_002987447.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
 gi|300144853|gb|EFJ11534.1| hypothetical protein SELMODRAFT_426207 [Selaginella moellendorffii]
          Length = 779

 Score =  708 bits (1828), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 369/765 (48%), Positives = 483/765 (63%), Gaps = 59/765 (7%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           F FC++ LP S RV+DL+SRMTL EK+ QL + A G+PRLGLP+YEWW EALHGV+ V P
Sbjct: 44  FGFCNTRLPTSTRVEDLISRMTLQEKIIQLVNNAAGIPRLGLPRYEWWQEALHGVA-VSP 102

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
           G  F    PGATSFP  ILT ASF+         AVSTEARAM+N  RAGLTYWSPN+N+
Sbjct: 103 GVKFGGKFPGATSFPMPILTAASFD---------AVSTEARAMHNYQRAGLTYWSPNVNI 153

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
            RDPRWGR  ETPGEDP +  +YA  YVRGLQD       T+L    LKVS+CCKH  AY
Sbjct: 154 YRDPRWGRGQETPGEDPLLSSKYATFYVRGLQD-------TNLGGDKLKVSACCKHMTAY 206

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
           DVDNWKG  R+ F+A VT+QD+ +T+  PF+ CV++   SSVMCSYNRVNG+P+CAD  L
Sbjct: 207 DVDNWKGTTRFKFNAIVTQQDLSDTYNPPFQSCVEDAKVSSVMCSYNRVNGVPTCADYNL 266

Query: 292 LNQTVRGEWDLHG----------------YIVADCDSIQVMVDNHKFLADSKEDAVAQTL 335
           L+ TVR  W+L+G                YIV+DCDS+Q   DN  + A + ED VA  L
Sbjct: 267 LSATVRSSWNLNGSILLTCEVLLLYLPCSYIVSDCDSLQTFFDNTNY-AKTAEDVVADAL 325

Query: 336 KAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLG 392
            AGL+LDCG +    T +A+  GK+ E +++++L+YLY V MRLG +DG+P+   Y +LG
Sbjct: 326 LAGLNLDCGPFLAIHTQSAITNGKITEANVNQALRYLYNVQMRLGLYDGNPRSQPYGNLG 385

Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
            Q +C+ EN +LA +AA+EGIVLLKN+ N LP + + ++TVA +GPHA AT AMIGNY G
Sbjct: 386 PQSVCTGENQQLALDAAKEGIVLLKNNGNVLPFSKSNIRTVAAIGPHAKATRAMIGNYQG 445

Query: 453 IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
           IPC+Y +P  G S YA V Y  GC DVAC S++ I +A   A  ADA ++  GLDL+ EA
Sbjct: 446 IPCKYTTPHDGLSAYARVVYSAGCSDVACYSDSLIGSAVSTASQADAVVLFVGLDLNQEA 505

Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
           E  DR  L LPG Q +L+ +V + AKGP +LVI S G VD++FA+ N  ++ ILWAGYPG
Sbjct: 506 EGKDRTSLLLPGKQQELVTEVTKAAKGPAVLVIFSGGSVDVSFAKYNNKVQGILWAGYPG 565

Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
           E GG AIA V+FG  NPGGRLP+TWY   +  +  L  M +RP  S GYPGRTY+FY G 
Sbjct: 566 EAGGAAIAQVLFGDHNPGGRLPVTWYPESFTGITML-DMNMRPDASRGYPGRTYRFYTGQ 624

Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV------ 686
           ++Y FGYG +Y++  +      K   ++L   +        + A K  C G L       
Sbjct: 625 SVYNFGYGKTYSKLSHKF----KEAPLSLGFPE--------AAAVKRSCDGNLTCFHLNA 672

Query: 687 -NDLRCDDYF-EFKVDFQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFVRAGR 743
            +++ C     + ++   N G    +  V++YS PP A      I+Q+ GF +V V  G 
Sbjct: 673 HDEITCSTLTSKVRILVHNKGDRPSNRAVLLYSSPPNAGRDGAPIRQLAGFGKVSVAPGA 732

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
            + ++   + CK L+        +L  G HT+ VGN     PI L
Sbjct: 733 VENVEIEIDPCKHLSHAGANGVRILHGGIHTLAVGNARHPLPILL 777


>gi|302796583|ref|XP_002980053.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
 gi|300152280|gb|EFJ18923.1| hypothetical protein SELMODRAFT_112087 [Selaginella moellendorffii]
          Length = 772

 Score =  707 bits (1826), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 366/739 (49%), Positives = 497/739 (67%), Gaps = 24/739 (3%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +++F FC++SL  + RV+D V+R+TL+EK+ QL + A G+PRLG+P+Y+WW EALHGV++
Sbjct: 39  LAAFPFCNTSLAITDRVEDYVARLTLEEKISQLINTATGIPRLGVPKYQWWQEALHGVAS 98

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
             PG  F   +P ATSFP  I T ASFN SL+  IGQAVSTEARAM+NLG++GLT+WSPN
Sbjct: 99  -SPGVQFGGSVPAATSFPMPITTAASFNTSLFYGIGQAVSTEARAMHNLGQSGLTFWSPN 157

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           IN+ RDPRWGR  ETPGEDP +   +A  YVRGLQ+ +         S  LKVS+CCKH 
Sbjct: 158 INIYRDPRWGRGQETPGEDPLLSSNFATYYVRGLQESQA-------GSDKLKVSACCKHM 210

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
            AYDVDNW G DRYHF+A VTEQD+E+T+  PF+ CV++G  SSVMCSYNR+NG+P+CAD
Sbjct: 211 TAYDVDNWLGTDRYHFNAIVTEQDLEDTYNAPFKSCVEDGGVSSVMCSYNRLNGVPTCAD 270

Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
            +LL  TVR  W L+GYIV+DCDS+QV  DN  + A +++ A A  L AGL+L+CG +  
Sbjct: 271 HELLTTTVRETWKLNGYIVSDCDSLQVFFDNTNYAATAED-AAADALLAGLNLNCGTFLA 329

Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELA 405
             T +A+QQ KV E  I+++L YL TV MRLG +DG P+   Y SLG  D+C+ E+  LA
Sbjct: 330 KHTLSAIQQKKVTEATINQALTYLVTVQMRLGLYDGDPKSQTYGSLGASDVCTSEHQTLA 389

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
            EAAR+G+VLLKN    LPL+++K+K++AVVGPHANAT AMIGNYAGIPC+Y SP+  F 
Sbjct: 390 LEAARQGMVLLKN-LGALPLSTSKIKSLAVVGPHANATRAMIGNYAGIPCKYTSPLQAFQ 448

Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
            YA V+Y  GC +VAC S++ I  A  AA  ADA ++  GLDL++EAESLDR  L LPG 
Sbjct: 449 KYAQVSYAPGCANVACSSDSLISGAVSAAAAADAVVVAVGLDLTIEAESLDRTSLLLPGK 508

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q +L++QV + AKGPV++VI+SAG +DI FA +++ I  ILWAGYPG+ GG AIA+V+FG
Sbjct: 509 QQELVSQVMQAAKGPVVIVILSAGAIDIPFALSDSRIAGILWAGYPGQAGGAAIAEVIFG 568

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
             NP G+LP TWY  ++   + +  M +RP  S GYPGRTY+FY GPT++ FG GLSYT 
Sbjct: 569 DHNPSGKLPATWYPQNFTS-ISMLDMNMRPNASTGYPGRTYRFYTGPTIFKFGDGLSYTS 627

Query: 646 FKYNLLSFTKTIQV-NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKV--DFQ 702
                +     + + +   +Q C  L  +S      C  +   D +  +  + +V    +
Sbjct: 628 LSAKFIKAPSFLSIPSTAPMQPCTGLKKSSS-----CFHLDATDEKSCESLKSQVAISVR 682

Query: 703 NVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFVRAGR-NKRIKFVFNACKSLNIV 760
           N G+   S  ++++S PP A       +Q++GF ++ +     +  + F  + C+     
Sbjct: 683 NKGAMAISHTLMLFSTPPNAGSDGVPQRQLVGFNKIQIAGDSISNPVIFDLDPCRHFVHA 742

Query: 761 DYAANTLLPAGEHTIFVGN 779
           D     LL +G H +  GN
Sbjct: 743 DPDGKKLLRSGTHVLTAGN 761


>gi|242071935|ref|XP_002451244.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
 gi|241937087|gb|EES10232.1| hypothetical protein SORBIDRAFT_05g026400 [Sorghum bicolor]
          Length = 790

 Score =  705 bits (1819), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 365/762 (47%), Positives = 480/762 (62%), Gaps = 52/762 (6%)

Query: 46  GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
           G   ++  FC  SLP   R +DLVSR+T  EKV+ L + A GV RLG+  YEWWSEALHG
Sbjct: 39  GGPATTLPFCRQSLPLHARARDLVSRLTRAEKVRLLVNNAAGVARLGVGGYEWWSEALHG 98

Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
           VS+ GPG  F    PGAT+FP VI   A+ N +LW+ IG+AVS EARAMYN GRAGLT+W
Sbjct: 99  VSDTGPGVKFGGAFPGATAFPQVIGAAAALNATLWELIGRAVSDEARAMYNGGRAGLTFW 158

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPN+N+ RDPRWGR  ETPGEDP +  RYA  YVRGLQ    H          LK+++CC
Sbjct: 159 SPNVNIFRDPRWGRGQETPGEDPAISSRYAAAYVRGLQQPYDHNR--------LKLAACC 210

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+ AYD+D+W G DR+HF+A V+ QD+E+TF  PF  CV  G A+SVMCSYN+VNG+P+
Sbjct: 211 KHFTAYDLDSWGGTDRFHFNAVVSPQDLEDTFNVPFRACVAGGRAASVMCSYNQVNGVPT 270

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           CAD   L  T+R  W L GYIV+DCDS+ V   +  +   + EDAVA TL+AGLDLDCG 
Sbjct: 271 CADQGFLRGTIRKAWGLDGYIVSDCDSVDVFFRDQHY-TRTAEDAVAATLRAGLDLDCGP 329

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENI 402
           +   +T NAV + KV + D+D +L    TV MRLG FDG P    +  LG  D+C+  + 
Sbjct: 330 FLALYTENAVARKKVSDADVDAALLNTVTVQMRLGMFDGDPASGPFGHLGAADVCTKAHQ 389

Query: 403 ELAAEAAREGIVLLKN-------DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
           +LA +AAR+ +VLLKN       D++ LPL  A  + VAVVGPHA+ATVAMIGNYAG PC
Sbjct: 390 DLALDAARQSVVLLKNQRGRKHRDRDVLPLRPAAHRVVAVVGPHADATVAMIGNYAGKPC 449

Query: 456 RYMSPIAGFSGY-ANVTYKTGCDDVACKSNNS-IFAASEAAKTADATIILAGLDLSVEAE 513
           RY +P+ G + Y A V ++ GC DVAC+  N  I AA +AA+         GL  S    
Sbjct: 450 RYTTPLQGVAAYAARVVHQAGCADVACQGKNQPIAAAVDAARRLTPPSSSPGLTRS---- 505

Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
                 L LPG Q +LI+ VA+ AKGPVILV+MS G +DIAFA+ +  I  ILW GYPG+
Sbjct: 506 ------LLLPGRQAELISAVAKAAKGPVILVLMSGGPIDIAFAQNDPRIDGILWVGYPGQ 559

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
            GG+AIADV+FG+ NPGG+LP+TWY  DY++ +P+T+M +R   + GYPGRTY+FY GPT
Sbjct: 560 AGGQAIADVIFGQHNPGGKLPVTWYPQDYLEKVPMTNMAMRANPARGYPGRTYRFYTGPT 619

Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN--------LNYTSDASKTRCPGVL 685
           ++ FG+GLSYTQF + L      + V L+      +        LN T  +   R     
Sbjct: 620 IHAFGHGLSYTQFTHTLAHAPAQLTVRLSTSSASASASASAASLLNATRPSRAVR----- 674

Query: 686 VNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-------IKQVIGFQRV 737
           V   RC+       VD +NVG  DG+  V+VY   P+  +++         +Q++ F++V
Sbjct: 675 VAHARCEGLTVPVHVDVRNVGDRDGAHAVLVYHVAPSSSSSSAPAGTDAPARQLVAFEKV 734

Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            V AG   R++   + C  L++ D      +P GEH + +G 
Sbjct: 735 HVPAGGVARVEMGIDVCDRLSVADRDGVRRIPVGEHRLMIGE 776


>gi|326489197|dbj|BAK01582.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 709

 Score =  702 bits (1812), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 346/696 (49%), Positives = 466/696 (66%), Gaps = 22/696 (3%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P YEWWSEALHGVS VGPGT F  ++PGATSFP  ILT ASFN SL++ IG+ VST
Sbjct: 21  RLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVST 80

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           EARAM+N+G AGLT+WSPNIN+ RDPRWGR  ETPGEDP +  +YAV YV GLQD     
Sbjct: 81  EARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDA---- 136

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
            A  +    LKV++CCKHY AYDVDNWKGV+RY FDA+V++QD+++TF  PF+ CV +G+
Sbjct: 137 GAGGVTDGALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGN 196

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
            +SVMCSYN+VNG P+CAD  LL   +RG+W L+GYIV+DCDS+ V+     +   + E+
Sbjct: 197 VASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHY-TKTPEE 255

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
           A A T+K+GLDL+CG +    T  AVQ G++ E D+D+++   + +LMRLGFFDG P+  
Sbjct: 256 AAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQL 315

Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
            + SLG +D+C+  N ELA E AR+GIVLLKN    LPL++  +K++AV+GP+ANA+  M
Sbjct: 316 AFGSLGPKDVCTSSNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTM 374

Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAG 505
           IGNY G PC+Y +P+ G     N  Y+ GC +V C  N+  +  A  AA +AD T+++ G
Sbjct: 375 IGNYEGTPCKYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVG 434

Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
            D S+E ESLDR  L LPG QTQL++ VA  + GPVILV+MS G  DI+FA+ +  I AI
Sbjct: 435 ADQSIERESLDRTSLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAI 494

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           LW GYPGE GG A+AD++FG  NP GRLP+TWY   Y   + +T M +RP  S GYPGRT
Sbjct: 495 LWVGYPGEAGGAALADILFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRT 554

Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           Y+FY G T++ FG GLSYT+  ++L+S   + + + L +   CR            C  V
Sbjct: 555 YRFYTGDTVFAFGDGLSYTKMSHSLVSAPPSYVSMRLAEDHPCR---------AEECASV 605

Query: 685 LVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
                 CDD  F+ K+  +N G   G+  V+++S PP    A   K ++GF++V +  G 
Sbjct: 606 EAAGDHCDDLAFDVKLQVRNAGEVAGAHSVLLFSSPPPAHNAP-AKHLLGFEKVSLAPGE 664

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
              + F  + C+ L++VD      +  G HT+ VG+
Sbjct: 665 AGTVAFRVDVCRDLSVVDELGGRKVALGGHTLHVGD 700


>gi|302786474|ref|XP_002975008.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
 gi|300157167|gb|EFJ23793.1| hypothetical protein SELMODRAFT_103038 [Selaginella moellendorffii]
          Length = 772

 Score =  701 bits (1808), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 351/753 (46%), Positives = 471/753 (62%), Gaps = 45/753 (5%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
           SSF FCD SLP   RV DLV RM L EK+ Q+   A G+PRLG+P Y+WW EALHGV+  
Sbjct: 31  SSFPFCDVSLPVPDRVADLVGRMNLSEKIAQIVSNASGIPRLGIPGYQWWEEALHGVAE- 89

Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
            PG  F   +P ATSFP VILT ASFN SLW KI QA+S EA AMYN GR+GLT+WSPNI
Sbjct: 90  SPGVKFAAPVPSATSFPQVILTVASFNSSLWNKIAQAISIEAIAMYNAGRSGLTFWSPNI 149

Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA--TDLNSRP--LKVSSCC 225
           N+ RDPRWGR  ETPGEDP +  +YA  +VRGLQ+ +  E    + +  RP  LKVSSCC
Sbjct: 150 NIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDYDEGTAISTMQRRPTRLKVSSCC 209

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+ AYD++  +G D +HF+A+VT QD+++TF  PF  C+ +G AS +MCSYNRVNG+PS
Sbjct: 210 KHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCIVDGQASGLMCSYNRVNGVPS 269

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           CAD   L +TVR  W   GYIV+DCD++ ++ +   +   + EDAVA  L AG+DL+CG 
Sbjct: 270 CADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINY-TTTAEDAVADVLSAGMDLNCGT 328

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIE 403
           +    T  A++QGKV E  +D++L  + TV MRLG FDG+    Y S+G   +C+ E+ +
Sbjct: 329 FLLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGNSGETYNSIGPDAVCTREHRQ 388

Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
           L+ EAA +GIVLLKN  N LP     + T+AV+GP  NAT  M+GNYAG+PC+Y++P  G
Sbjct: 389 LSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNATETMLGNYAGVPCQYITPFQG 448

Query: 464 FSGYAN-VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
              Y   V ++ GC D+ C       AA  AA+ +DA +I+ GLD   E E LDR  L L
Sbjct: 449 LQEYTKGVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVIVVGLDKDQEREGLDRTSLLL 508

Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
           PGYQ  L+ +V++VAKGPVILV+MS G +D+ FA+ N  I ++LW GYPGE GG+AIA V
Sbjct: 509 PGYQQDLVLEVSKVAKGPVILVVMSGGPIDVTFAKGNCKISSVLWVGYPGEAGGKAIARV 568

Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
           +FG  NP GRLP+TWY   + + + + +M LRP  S G+PGRTY+FY G  +Y FG+GLS
Sbjct: 569 IFGDHNPAGRLPMTWYPQAFAEHVSILNMHLRPNTSTGFPGRTYRFYTGENVYEFGHGLS 628

Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF- 701
           YT F Y   S    I                 +    R P      LR D    F +D+ 
Sbjct: 629 YTNFTYTNFSAPSNITAR--------------NTVAIRTP------LREDGARHFPIDYT 668

Query: 702 -------------QNVGSTDGSDVVIVYSKPPAEIAATYI--KQVIGFQRVFVRAGRNKR 746
                         N G+ D   + ++Y+ PPA  ++     KQ+I F+R  + AGR  +
Sbjct: 669 GCEALAFKVVAYISNTGTRDSDHISLLYAIPPAASSSLSPPRKQLISFKRQHLIAGRCAK 728

Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           ++F  + CK L + + A   +L  G++ + +G+
Sbjct: 729 VEFDVDTCKDLGLTNEAGTKVLVHGDYKLSLGD 761


>gi|449508468|ref|XP_004163321.1| PREDICTED: LOW QUALITY PROTEIN: probable beta-D-xylosidase 7-like
           [Cucumis sativus]
          Length = 783

 Score =  697 bits (1799), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/776 (45%), Positives = 493/776 (63%), Gaps = 38/776 (4%)

Query: 27  ANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAH 86
           A  SS P + CD            +  FC + LP  +R +DLVSR+TLDEKV QL +   
Sbjct: 30  AGSSSQPPYACDSSN-----PLTKTLPFCKTYLPIKLRARDLVSRLTLDEKVLQLVNTVP 84

Query: 87  GVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQA 146
            +PRLG+P YEWWSEALHGV+NVG G   +  I  ATSFP VILT ASF+E+LW +IGQA
Sbjct: 85  PIPRLGIPAYEWWSEALHGVANVGYGIRLNGTITAATSFPQVILTAASFDENLWYQIGQA 144

Query: 147 VSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD- 204
           + TEARA+YN G+A G+T+W+PNIN+ RDPRWGR  ETPGEDP + G+Y+V YVRG+Q  
Sbjct: 145 IGTEARAVYNAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGD 204

Query: 205 -VEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
            +EG +         LK S+CCKH+ AYD+D W G+ RY FDA+VT QDM +T+  PFE 
Sbjct: 205 AIEGGKLGNQ-----LKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFES 259

Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
           CV+EG AS +MC+YNRVNG+PSCAD  LL  T R +W  +GYI +DCD++ ++ D   + 
Sbjct: 260 CVEEGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGY- 318

Query: 324 ADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
           A   EDAVA  L+AG+D++CG Y    T +AV+  KV    ID++L+ L++V MRLG FD
Sbjct: 319 AKIPEDAVADVLRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRMRLGLFD 378

Query: 384 GSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
           G+P    +  +G+  +CS ++  LA +AAREGIVLLKN    LPL+ +   ++AV+G + 
Sbjct: 379 GNPTKLPFGQIGRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLAVIGHNG 438

Query: 441 NATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADA 499
           N    + GNYAGIPC+  +P  G + Y  N  Y  GC+   C +  +I+ A + AK+ D 
Sbjct: 439 NDPKTLRGNYAGIPCKSATPFQGLNNYVKNTVYHRGCNYANC-TEATIYQAVKIAKSVDY 497

Query: 500 TIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
            +++ GLD + E E  DR +L LPG Q +LI +VA+ AK PVILVI+S G VDI+ A+ N
Sbjct: 498 VVLVMGLDQTQEREDFDRTELGLPGKQDKLIAEVAKAAKXPVILVILSGGPVDISSAKYN 557

Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
             I +ILWAGYPG+ GG AIA+++FG  NPGGRLP+TWY  D+++  P+T M +R   S 
Sbjct: 558 EKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIK-FPMTDMRMRADSST 616

Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQ--FKYNLLSFTKTIQVNLNKLQHCRNLN-----Y 672
           GYPGRTY+FYNGP +Y FGYGLSY+   +++  +S +K +  +    Q  +N +      
Sbjct: 617 GYPGRTYRFYNGPKVYEFGYGLSYSNHIYEFTSVSESKLLLSHPKASQPAKNSDLVSYRL 676

Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
            S+  K  C    VN           V  +N G   G   V+++ KP   I  + +KQ++
Sbjct: 677 VSELDKKFCESKTVN---------VTVGVRNEGEMGGKHSVLLFIKPSKPINGSPVKQLV 727

Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
           GF++V + AG  + I+F+ + C  ++        ++  G +++ VG+  V  P+ +
Sbjct: 728 GFKKVEINAGERREIEFLVSPCDHISKASEEGLMIIEEGSYSLVVGD--VEHPLDI 781


>gi|449465962|ref|XP_004150696.1| PREDICTED: probable beta-D-xylosidase 7-like [Cucumis sativus]
          Length = 783

 Score =  697 bits (1799), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/776 (45%), Positives = 493/776 (63%), Gaps = 38/776 (4%)

Query: 27  ANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAH 86
           A  SS P + CD            +  FC + LP  +R +DLVSR+TLDEKV QL +   
Sbjct: 30  AGSSSQPPYACDSSN-----PLTKTLPFCKTYLPIKLRARDLVSRLTLDEKVLQLVNTVP 84

Query: 87  GVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQA 146
            +PRLG+P YEWWSEALHGV+NVG G   +  I  ATSFP VILT ASF+E+LW +IGQA
Sbjct: 85  PIPRLGIPAYEWWSEALHGVANVGYGIRLNGTITAATSFPQVILTAASFDENLWYQIGQA 144

Query: 147 VSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD- 204
           + TEARA+YN G+A G+T+W+PNIN+ RDPRWGR  ETPGEDP + G+Y+V YVRG+Q  
Sbjct: 145 IGTEARAVYNAGQAKGMTFWTPNINIFRDPRWGRGQETPGEDPLMTGKYSVAYVRGIQGD 204

Query: 205 -VEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
            +EG +         LK S+CCKH+ AYD+D W G+ RY FDA+VT QDM +T+  PFE 
Sbjct: 205 AIEGGKLGNQ-----LKASACCKHFTAYDLDRWNGMTRYVFDAKVTMQDMADTYQPPFES 259

Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
           CV+EG AS +MC+YNRVNG+PSCAD  LL  T R +W  +GYI +DCD++ ++ D   + 
Sbjct: 260 CVEEGKASGIMCAYNRVNGVPSCADHHLLTATARKQWKFNGYITSDCDAVSIIHDAQGY- 318

Query: 324 ADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
           A   EDAVA  L+AG+D++CG Y    T +AV+  KV    ID++L+ L++V MRLG FD
Sbjct: 319 AKIPEDAVADVLRAGMDVNCGTYLKEHTKSAVEMKKVPMLHIDRALRNLFSVRMRLGLFD 378

Query: 384 GSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
           G+P    +  +G+  +CS ++  LA +AAREGIVLLKN    LPL+ +   ++AV+G + 
Sbjct: 379 GNPTKLPFGQIGRDQVCSQQHQNLALQAAREGIVLLKNSAKLLPLSKSNTHSLAVIGHNG 438

Query: 441 NATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADA 499
           N    + GNYAGIPC+  +P  G + Y  N  Y  GC+   C +  +I+ A + AK+ D 
Sbjct: 439 NDPKTLRGNYAGIPCKSATPFQGLNNYVKNTVYHRGCNYANC-TEATIYQAVKIAKSVDY 497

Query: 500 TIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
            +++ GLD + E E  DR +L LPG Q +LI +VA+ AK PVILVI+S G VDI+ A+ N
Sbjct: 498 VVLVMGLDQTQEREDFDRTELGLPGKQDKLIAEVAKAAKRPVILVILSGGPVDISSAKYN 557

Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
             I +ILWAGYPG+ GG AIA+++FG  NPGGRLP+TWY  D+++  P+T M +R   S 
Sbjct: 558 EKIGSILWAGYPGQAGGTAIAEIIFGDHNPGGRLPLTWYPHDFIK-FPMTDMRMRADSST 616

Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQ--FKYNLLSFTKTIQVNLNKLQHCRNLN-----Y 672
           GYPGRTY+FYNGP +Y FGYGLSY+   +++  +S +K +  +    Q  +N +      
Sbjct: 617 GYPGRTYRFYNGPKVYEFGYGLSYSNHIYEFTSVSESKLLLSHPKASQPAKNSDLVSYRL 676

Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
            S+  K  C    VN           V  +N G   G   V+++ KP   I  + +KQ++
Sbjct: 677 VSELDKKFCESKTVN---------VTVGVRNEGEMGGKHSVLLFIKPSKPINGSPVKQLV 727

Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
           GF++V + AG  + I+F+ + C  ++        ++  G +++ VG+  V  P+ +
Sbjct: 728 GFKKVEINAGERREIEFLVSPCDHISKASEEGLMIIEEGSYSLVVGD--VEHPLDI 781


>gi|242062502|ref|XP_002452540.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
 gi|241932371|gb|EES05516.1| hypothetical protein SORBIDRAFT_04g027700 [Sorghum bicolor]
          Length = 784

 Score =  697 bits (1798), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 353/764 (46%), Positives = 495/764 (64%), Gaps = 37/764 (4%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           +S P + C  G          +  FCD++LP   RV DLVSR+T+ EK+ QLGD +  +P
Sbjct: 35  ASEPPYTCGAG-------APPNIPFCDTALPIDRRVDDLVSRLTVAEKISQLGDESPAIP 87

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P Y+WWSEALHGV+N G G H D  +  ATSFP VILT ASFN  LW +IGQ +  
Sbjct: 88  RLGVPAYKWWSEALHGVANAGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGV 147

Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
           EARA+YN G+A GLT+W+PNINV RDPRWGR  ETPGEDP + G+YA  +VRG+Q   G+
Sbjct: 148 EARAVYNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQ---GY 204

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
             A  +NS  L+ S+CCKH+ AYD++NWKG+ RY +DA+VT QD+E+T+  PF+ CV++G
Sbjct: 205 GVAGPVNSTDLEASACCKHFTAYDLENWKGITRYVYDAKVTAQDLEDTYNPPFKSCVEDG 264

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
            AS +MCSYNRVNG+P+CAD  LL++T R  W  +GYI +DCD++ ++ D   + A + E
Sbjct: 265 HASGIMCSYNRVNGVPTCADYNLLSKTARQSWGFYGYITSDCDAVSIIHDAQGY-AKTSE 323

Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
           DAVA  LKAG+D++CG Y   +  +A+QQGK+ E DI+++L  L+TV MRLG F+G P+ 
Sbjct: 324 DAVADVLKAGMDVNCGGYVQKYGASALQQGKITEQDINRALHNLFTVRMRLGLFNGDPRR 383

Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
             Y ++G   +C+ E+ +LA EAA++GIVLLKND   LPL+ + V ++AV+G +AN   +
Sbjct: 384 NRYGNIGPDQVCTQEHQDLALEAAQDGIVLLKNDGGALPLSKSGVASLAVIGFNANNATS 443

Query: 446 MIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
           ++GNY G PC  ++P+    GY  + ++  GC+  AC    +I  A +AA +AD+ ++  
Sbjct: 444 LLGNYFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAACNVT-TIPEAVQAASSADSVVLFM 502

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           GLD + E E +DR DL LPG Q  LI  VA  AK PVILV++  G VD++FA+TN  I A
Sbjct: 503 GLDQNQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGA 562

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           ILWAGYPGE GG AIA V+FG+ NPGGRLP+TWY  D+ + +P+T M +R   + GYPGR
Sbjct: 563 ILWAGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTK-VPMTDMRMRADPATGYPGR 621

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           TY+FY GPT++ FGYGLSY+++ +  ++       N+  L+          A  T   GV
Sbjct: 622 TYRFYRGPTVFNFGYGLSYSKYSHRFVTKPPPSMSNVAGLK----------ALATTAGGV 671

Query: 685 LVNDLR------CDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP--AEIAATYIKQVIGFQ 735
              D+       CD   F   V  QN G  DG   V+V+ + P   + +    +Q+IGFQ
Sbjct: 672 ATYDVEAIGSETCDRLKFPAVVRVQNHGPMDGKHPVLVFLRWPNATDGSGRPARQLIGFQ 731

Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            + +RA +   ++F  + CK  +        ++  G H + VG+
Sbjct: 732 SLHLRATQTAHVEFEVSPCKHFSRATEDGRKVIDQGSHFVMVGD 775


>gi|302141935|emb|CBI19138.3| unnamed protein product [Vitis vinifera]
          Length = 1411

 Score =  696 bits (1796), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 359/764 (46%), Positives = 489/764 (64%), Gaps = 52/764 (6%)

Query: 30   SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
            SSSP F CD            S+ FC+++L  S R  DL+SR+TLDEK+ QL   A  +P
Sbjct: 693  SSSPPFACDSS-----DPLTKSYAFCNTTLRISQRASDLISRLTLDEKISQLISSAASIP 747

Query: 90   RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
            RLG+P YEWWSEALHG+ +   G  F+  I  ATSFP VILT ASF+  LW +IGQA+  
Sbjct: 748  RLGIPAYEWWSEALHGIRDRH-GIRFNGTIRSATSFPQVILTAASFDAHLWYRIGQAIGI 806

Query: 150  EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
            E RAMYN G+A G+T+W+PNIN+ RDPRWGR  ETPGEDP V G+YAV+YVRGLQ     
Sbjct: 807  ETRAMYNAGQAMGMTFWAPNINIFRDPRWGRGQETPGEDPVVAGKYAVSYVRGLQGDTFE 866

Query: 209  ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
                D+    L+ S+CCKH+ AYD+DNW  +DRY FDARVT QD+ +T+  PF  C++EG
Sbjct: 867  GGKVDV----LQASACCKHFTAYDLDNWTSIDRYTFDARVTMQDLADTYQPPFRSCIEEG 922

Query: 269  DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
             AS +MC+YN VNG+P+CAD  LL++T RG+W   GYIV+DCD++ ++ D   + A S E
Sbjct: 923  RASGLMCAYNLVNGVPNCADFNLLSKTARGQWGFDGYIVSDCDAVSLVHDVQGY-AKSPE 981

Query: 329  DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
            DAVA  L AG+D+ CG Y      +AV Q K+ E++ID++L  L+TV MRLG F+G+P+ 
Sbjct: 982  DAVAIVLTAGMDVACGGYLQKHAKSAVSQKKLTESEIDRALLNLFTVRMRLGLFNGNPRK 1041

Query: 388  --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
              + ++G   +CS E+  LA EAAR GIVLLKN    LPL+  +  ++AV+GP+ANAT  
Sbjct: 1042 LPFGNIGPDQVCSTEHQTLALEAARSGIVLLKNSDRLLPLSKGETLSLAVIGPNANATDT 1101

Query: 446  MIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
            ++GNYAG PC+++SP+ G   Y N T Y  GC+DVAC S+ SI  A + AK AD  +++ 
Sbjct: 1102 LLGNYAGPPCKFISPLQGLQSYVNNTMYHAGCNDVAC-SSASIENAVDVAKQADYVVLVM 1160

Query: 505  GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
            GLD + E E  DR DL LPG Q QLI  VA+ AK PV+LV++  G VDI+FA+ ++NI +
Sbjct: 1161 GLDQTQEREKYDRLDLVLPGKQEQLITGVAKAAKKPVVLVLLCGGPVDISFAKGSSNIGS 1220

Query: 565  ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
            ILWAGYPGE GG AIA+ +FG  NPGGRLP+TWY  D+++ +P+T M +RP    GYPGR
Sbjct: 1221 ILWAGYPGEAGGAAIAETIFGDHNPGGRLPVTWYPKDFIK-IPMTDMRMRPEPQSGYPGR 1279

Query: 625  TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
            T++FY G T++ FG GLSY+ + Y  LS T       NKL       Y +  S T     
Sbjct: 1280 THRFYTGKTVFEFGNGLSYSPYSYEFLSVTP------NKL-------YLNQPSTTHV--- 1323

Query: 685  LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
                             +N G   G   V+++ K       + +KQ++GFQ VF+ AG +
Sbjct: 1324 ----------------VENSGKMAGKHPVLLFVKQAKAGNGSPMKQLVGFQNVFLDAGES 1367

Query: 745  KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
              ++F+ + C+ L+  +     ++  G H + VG+    +PI +
Sbjct: 1368 SNVEFILSPCEHLSRANKDGLMVMEQGIHLLVVGDK--EYPIAI 1409



 Score =  656 bits (1692), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 341/684 (49%), Positives = 459/684 (67%), Gaps = 37/684 (5%)

Query: 13  LSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRM 72
           L I L+  +   V    + SP F CD    S       S+ FC ++LP   RV+DLVSR+
Sbjct: 7   LLINLIYVTVILVGVESTQSPPFSCDSSNPS-----TKSYHFCKTTLPIPDRVRDLVSRL 61

Query: 73  TLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTT 132
           TLDEK+ QL + A  +PRLG+P YEWWSEALHGV++ GPG  F+  I  ATSFP VILT 
Sbjct: 62  TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVADAGPGIRFNGTIRSATSFPQVILTA 121

Query: 133 ASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVV 191
           ASF+  LW +IG+A+  EARA+YN G+  G+T+W+PNIN+ RDPRWGR  ETPGEDP V 
Sbjct: 122 ASFDVHLWYRIGRAIGVEARAVYNAGQTKGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 181

Query: 192 GRYAVNYVRGLQD--VEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVT 249
           G YAV+YVRG+Q   + G +   +L     + S+CCKH+ AYD+D+WKG+DR+ FDARVT
Sbjct: 182 GSYAVSYVRGVQGDCLRGLKRCGEL-----QASACCKHFTAYDLDDWKGIDRFKFDARVT 236

Query: 250 EQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVAD 309
            QD+ +T+  PF  C++EG AS +MC+YNRVNG+PSCAD  LL  T R  W+  GYI +D
Sbjct: 237 MQDLADTYQPPFHRCIEEGRASGIMCAYNRVNGVPSCADFNLLTNTARKRWNFQGYITSD 296

Query: 310 CDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSL 369
           CD++ ++ D++ F A + EDAV   LKAG+D++CG Y  N T +AV Q K+ E+++D++L
Sbjct: 297 CDAVSLIHDSYGF-AKTPEDAVVDVLKAGMDVNCGTYLLNHTKSAVMQKKLPESELDRAL 355

Query: 370 KYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLN 426
           + L+ V MRLG F+G+P+   Y  +G   +CS E+  LA +AAR+GIVLLKN Q  LPL 
Sbjct: 356 ENLFAVRMRLGLFNGNPKGQPYGDIGPNQVCSVEHQTLALDAARDGIVLLKNSQRLLPLP 415

Query: 427 SAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKSNN 485
             K  ++AV+GP+AN+   +IGNYAG PC++++P+     Y   T Y  GCD VAC S+ 
Sbjct: 416 KGKTMSLAVIGPNANSPKTLIGNYAGPPCKFITPLQALQSYVKSTMYHPGCDAVAC-SSP 474

Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
           SI  A E A+ AD  +++ GLD + E E+ DR DL LPG Q QLI  VA  AK PV+LV+
Sbjct: 475 SIEKAVEIAQKADYVVLVMGLDQTQEREAHDRLDLVLPGKQQQLIICVANAAKKPVVLVL 534

Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
           +S G VDI+FA+ + NI +ILWAGYPG  GG AIA+ +FG  NPGGRLP+TWY  D+ + 
Sbjct: 535 LSGGPVDISFAKYSNNIGSILWAGYPGGAGGAAIAETIFGDHNPGGRLPVTWYPQDFTK- 593

Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL- 664
           +P+T M +RP  + GYPGRTY+FY G  ++ FGYGLSY+ +        +TI V  NKL 
Sbjct: 594 IPMTDMRMRPESNSGYPGRTYRFYTGEKVFEFGYGLSYSTYS------CETIPVTRNKLY 647

Query: 665 ----------QHCRNLNYTSDASK 678
                     ++  ++ YTS A K
Sbjct: 648 FNQSSTAHVYENTDSIRYTSMAGK 671


>gi|302791321|ref|XP_002977427.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
 gi|300154797|gb|EFJ21431.1| hypothetical protein SELMODRAFT_106899 [Selaginella moellendorffii]
          Length = 772

 Score =  694 bits (1790), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 350/754 (46%), Positives = 471/754 (62%), Gaps = 47/754 (6%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
           SSF FCD SLP   RV DLV RM L EK+ Q+   A G+PRLG+P Y+WW EALHGV+  
Sbjct: 31  SSFPFCDVSLPVPDRVADLVGRMNLSEKIAQIVSNASGIPRLGIPGYQWWEEALHGVAE- 89

Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
            PG  F   +P ATSFP VILT ASFN SLW KI QA+S EA AMYN GR+GLT+WSPNI
Sbjct: 90  SPGVKFAAPVPSATSFPQVILTVASFNSSLWNKIAQAISIEAIAMYNAGRSGLTFWSPNI 149

Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA--TDLNSRP--LKVSSCC 225
           N+ RDPRWGR  ETPGEDP +  +YA  +VRGLQ+ +  E    + +   P  LKVSSCC
Sbjct: 150 NIFRDPRWGRGQETPGEDPLLSSKYAAYFVRGLQEGDYDEGTAISTMQGSPTRLKVSSCC 209

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+ AYD++  +G D +HF+A+VT QD+++TF  PF  C+ +G AS +MCSYNRVNG+PS
Sbjct: 210 KHFTAYDMEKSEGTDCFHFNAQVTVQDLQDTFDPPFRSCIVDGQASGLMCSYNRVNGVPS 269

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           CAD   L +TVR  W   GYIV+DCD++ ++ +   +   + EDAVA  L AG+DL+CG 
Sbjct: 270 CADYTFLTETVRNSWGFEGYIVSDCDAVALLYEYINY-TTTAEDAVADVLSAGMDLNCGT 328

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIE 403
           +    T  A++QGKV E  +D++L  + TV MRLG FDG+    Y S+G   +C+ E+ +
Sbjct: 329 FLLRHTAAAIEQGKVTEAAVDRALSNVMTVRMRLGLFDGNSGETYNSIGPDAVCTPEHRQ 388

Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
           L+ EAA +GIVLLKN  N LP     + T+AV+GP  NAT  M+GNYAG+PC+Y++P  G
Sbjct: 389 LSLEAAEQGIVLLKNSGNVLPFPRNDLMTIAVIGPSGNATETMLGNYAGVPCQYITPFQG 448

Query: 464 FSGYAN-VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
              Y   V ++ GC D+ C       AA  AA+ +DA +I+ GLD   E E LDR  L L
Sbjct: 449 LQEYTKCVVFEPGCKDIMCNDTTLFLAAVRAAENSDAVVIVVGLDKDQEREGLDRTSLLL 508

Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
           PG Q  L+ +V++VAKGPVILV+MS G +D+ FA+ N  I  +LW GYPGE GG+AIA V
Sbjct: 509 PGNQQGLVLEVSKVAKGPVILVVMSGGPIDVTFAKENCKISNVLWVGYPGEAGGKAIARV 568

Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
           +FG  NP GRLP+TWY   + + + + +M LRP  S G+PGRTY+FY G  +Y FG+GLS
Sbjct: 569 IFGDHNPAGRLPMTWYPQAFAEHVSILNMHLRPNTSTGFPGRTYRFYTGENVYEFGHGLS 628

Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS-DASKTRCPGVLVNDLRCDDYFEFKVDF 701
           YT F Y                  C   N T+ +    R P      LR D   +F +D+
Sbjct: 629 YTNFTYT---------------NFCAPSNITARNTVAIRTP------LREDGARQFPIDY 667

Query: 702 --------------QNVGSTDGSDVVIVYSKPPAEIAATYI--KQVIGFQRVFVRAGRNK 745
                          N G+ D   + ++Y+ PPA  ++     KQ+I F+R  + AGR  
Sbjct: 668 TGCEALAFKVVAYISNTGTRDSDHISLLYAIPPAASSSLSPPRKQLISFKRQHLIAGRCA 727

Query: 746 RIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           +++F  + CK L + + A   +L  G++ + +G+
Sbjct: 728 KVEFDVDTCKDLGLTNEAGTKVLVHGDYKLSLGD 761


>gi|449496501|ref|XP_004160150.1| PREDICTED: probable beta-D-xylosidase 6-like, partial [Cucumis
           sativus]
          Length = 767

 Score =  692 bits (1786), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 337/743 (45%), Positives = 484/743 (65%), Gaps = 18/743 (2%)

Query: 51  SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
           S+ FC+ SL ++ R + LVS +TLDEK+QQL + A  +PRLG+P Y+WWSE LHG++  G
Sbjct: 19  SYPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNNASSIPRLGIPSYQWWSEGLHGIATNG 78

Query: 111 PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
           PG  F+  I  ATSFP V++T ASFN +LW  IG A++ EARAM+N+G+ GLT W+PNIN
Sbjct: 79  PGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGSAIAVEARAMFNVGQCGLTIWAPNIN 138

Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD---VEGHENATDLNSR-----PLKVS 222
           + RDPRWGR  ETPGEDP V   Y++ +VRGLQ    ++ HE   ++         L VS
Sbjct: 139 IFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNWMKEHEIRNEVLEEDNGMGSLMVS 198

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           +CCKH+ AYD++ W    RY FD+ VTEQD+ +T+  PF  C+++G AS +MCSYN VNG
Sbjct: 199 ACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRSCIQQGKASCLMCSYNAVNG 258

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P+CA+P LL +  R +W L GYI +DCD++  + +  K+  D+ EDA+A  LKAG+D++
Sbjct: 259 VPACANPDLLKKA-RNDWGLKGYITSDCDAVATVYEYQKY-TDTPEDAIADVLKAGMDIN 316

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSD 399
           CG +    T +A+ QGKV+E ++D +L  L++V  RLGFFDG+P   ++  LG QD+C+ 
Sbjct: 317 CGTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGNPREGKFGELGAQDVCTA 376

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           ++  LA EAAR+GIVLLKN+   LPL+   + ++ V+G  AN +  ++G YAG+PC  MS
Sbjct: 377 QHKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLANDSSKLLGGYAGVPCSPMS 436

Query: 460 PIAGFSGYAN-VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRE 518
            + GF  YA  + + +GC DV C S+N    A   AK AD  I +AGLD S E E LDR 
Sbjct: 437 LVEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVIAVAGLDASQETEDLDRV 496

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
            L LPG Q  L++ VA V+K P+ILV++  G +DI+FA+ ++ + +ILW G PGE GG+A
Sbjct: 497 SLLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSRVASILWIGNPGEAGGKA 556

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           +A+V+FG +NPGGRLP+TWY   +   +P+  M +RP  S GYPGRTY+FY G  +Y FG
Sbjct: 557 LAEVIFGDYNPGGRLPVTWYPQSFTN-VPMNDMHMRPNPSRGYPGRTYRFYTGDRIYGFG 615

Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY--FE 696
            GLSYT FKY LLS  K + + L K +  R               + V ++   D   FE
Sbjct: 616 EGLSYTSFKYRLLSAPKKVNL-LGKAETSRRRIIPQVRDGVNMSYMEVEEVESCDLLRFE 674

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
            K+   N+G  DGS VV+++S+ P  +  T  +Q+IGF R++V+  ++     + + C  
Sbjct: 675 VKLSVSNIGEFDGSHVVMMFSEFPKVLTGTPQRQLIGFDRLYVKRNQSAESSIMVDPCNH 734

Query: 757 LNIVDYAANTLLPAGEHTIFVGN 779
           +++ D     ++P G+HTI +G+
Sbjct: 735 VSLADEYGKRVIPLGDHTISLGD 757


>gi|15238197|ref|NP_196618.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
 gi|75264319|sp|Q9LXA8.1|BXL6_ARATH RecName: Full=Probable beta-D-xylosidase 6; Short=AtBXL6; Flags:
           Precursor
 gi|7671447|emb|CAB89387.1| beta-xylosidase-like protein [Arabidopsis thaliana]
 gi|15982753|gb|AAL09717.1| AT5g10560/F12B17_90 [Arabidopsis thaliana]
 gi|332004180|gb|AED91563.1| putative beta-D-xylosidase 6 [Arabidopsis thaliana]
          Length = 792

 Score =  692 bits (1785), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 360/795 (45%), Positives = 487/795 (61%), Gaps = 40/795 (5%)

Query: 11  FSLSIALLVFSTNAVD---ANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKD 67
             L++  L+F T+A+     N  S P F C P  FS       S+ FC+ SL    R   
Sbjct: 3   LQLTLISLLFFTSAIAETFKNLDSHPQFPCKPPHFS-------SYPFCNVSLSIKQRAIS 55

Query: 68  LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
           LVS + L EK+ QL + A  VPRLG+P YEWWSE+LHG+++ GPG  F+  I  ATSFP 
Sbjct: 56  LVSLLMLPEKIGQLSNTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQ 115

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
           VI++ ASFN +LW +IG AV+ E RAMYN G+AGLT+W+PNINV RDPRWGR  ETPGED
Sbjct: 116 VIVSAASFNRTLWYEIGSAVAVEGRAMYNGGQAGLTFWAPNINVFRDPRWGRGQETPGED 175

Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSR-------------PLKVSSCCKHYAAYDVD 234
           P VV  Y V +VRG Q+ +  +      S               L +S+CCKH+ AYD++
Sbjct: 176 PKVVSEYGVEFVRGFQEKKKRKVLKRRFSDDVDDDRHDDDADGKLMLSACCKHFTAYDLE 235

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
            W    RY F+A VTEQDME+T+  PFE C+++G AS +MCSYN VNG+P+CA   LL Q
Sbjct: 236 KWGNFTRYDFNAVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACAQGDLL-Q 294

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
             R EW   GYI +DCD++  +     +   S E+AVA  +KAG+D++CG Y    T +A
Sbjct: 295 KARVEWGFEGYITSDCDAVATIFAYQGY-TKSPEEAVADAIKAGVDINCGTYMLRHTQSA 353

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSDENIELAAEAARE 411
           ++QGKV E  +D++L  L+ V +RLG FDG P   QY  LG  DICS ++ +LA EA R+
Sbjct: 354 IEQGKVSEELVDRALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLALEATRQ 413

Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT 471
           GIVLLKND   LPLN   V ++A+VGP AN    M G Y G PC+  +       Y   T
Sbjct: 414 GIVLLKNDHKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKT 473

Query: 472 -YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLI 530
            Y +GC DV+C S+     A   AK AD  I++AGLDLS E E  DR  L LPG Q  L+
Sbjct: 474 SYASGCSDVSCDSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLV 533

Query: 531 NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
           + VA V+K PVILV+   G VD+ FA+ +  I +I+W GYPGE GG+A+A+++FG FNPG
Sbjct: 534 SHVAAVSKKPVILVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFGDFNPG 593

Query: 591 GRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNL 650
           GRLP TWY   +   + ++ M +R   S GYPGRTY+FY GP +Y FG GLSYT+F+Y +
Sbjct: 594 GRLPTTWYPESFTD-VAMSDMHMRANSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFEYKI 652

Query: 651 LSFTKTIQVNLNKL-----QHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNV 704
           LS    I+++L++L      H + L +  +    +   V+VN   C+   F  +V   N 
Sbjct: 653 LS--APIRLSLSELLPQQSSHKKQLQHGEELRYLQLDDVIVNS--CESLRFNVRVHVSNT 708

Query: 705 GSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAA 764
           G  DGS VV+++SK P  ++    KQ+IG+ RV VR+       FV + CK L++ +   
Sbjct: 709 GEIDGSHVVMLFSKMPPVLSGVPEKQLIGYDRVHVRSNEMMETVFVIDPCKQLSVANDVG 768

Query: 765 NTLLPAGEHTIFVGN 779
             ++P G H +F+G+
Sbjct: 769 KRVIPLGSHVLFLGD 783


>gi|449451581|ref|XP_004143540.1| PREDICTED: probable beta-D-xylosidase 6-like [Cucumis sativus]
          Length = 777

 Score =  691 bits (1784), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 337/743 (45%), Positives = 484/743 (65%), Gaps = 18/743 (2%)

Query: 51  SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
           S+ FC+ SL ++ R + LVS +TLDEK+QQL + A  +PRLG+P Y+WWSE LHG++  G
Sbjct: 29  SYPFCNRSLSFTARAQSLVSLLTLDEKIQQLSNNASSIPRLGIPSYQWWSEGLHGIATNG 88

Query: 111 PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
           PG  F+  I  ATSFP V++T ASFN +LW  IG A++ EARAM+N+G+ GLT W+PNIN
Sbjct: 89  PGVSFNGSITSATSFPQVLVTAASFNRTLWFLIGSAIAVEARAMFNVGQCGLTIWAPNIN 148

Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD---VEGHENATDLNSR-----PLKVS 222
           + RDPRWGR  ETPGEDP V   Y++ +VRGLQ    ++ HE   ++         L VS
Sbjct: 149 IFRDPRWGRGQETPGEDPMVASAYSIQFVRGLQSGNWMKEHEIRNEVLEEDNGMGSLMVS 208

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           +CCKH+ AYD++ W    RY FD+ VTEQD+ +T+  PF  C+++G AS +MCSYN VNG
Sbjct: 209 ACCKHFTAYDLEKWNNFTRYTFDSVVTEQDLGDTYQPPFRSCIQQGKASCLMCSYNAVNG 268

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P+CA+P LL +  R +W L GYI +DCD++  + +  K+  D+ EDA+A  LKAG+D++
Sbjct: 269 VPACANPDLLKKA-RNDWGLKGYITSDCDAVATVYEYQKY-TDTPEDAIADVLKAGMDIN 326

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSD 399
           CG +    T +A+ QGKV+E ++D +L  L++V  RLGFFDG+P   ++  LG QD+C+ 
Sbjct: 327 CGTFMLRGTKSAIDQGKVREEELDSALINLFSVQARLGFFDGNPREGKFGELGAQDVCTA 386

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           ++  LA EAAR+GIVLLKN+   LPL+   + ++ V+G  AN +  ++G YAG+PC  MS
Sbjct: 387 QHKTLALEAARQGIVLLKNENKFLPLDKNAISSLTVIGSLANDSSKLLGGYAGVPCSPMS 446

Query: 460 PIAGFSGYAN-VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRE 518
            + GF  YA  + + +GC DV C S+N    A   AK AD  I +AGLD S E E LDR 
Sbjct: 447 LVEGFQEYAETIFFASGCLDVPCASDNRFEDAILIAKKADFVIAVAGLDASQETEDLDRV 506

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
            L LPG Q  L++ VA V+K P+ILV++  G +DI+FA+ ++ + +ILW G PGE GG+A
Sbjct: 507 SLLLPGKQMDLVSSVASVSKKPIILVLIGGGPLDISFAKKDSRVASILWIGNPGEAGGKA 566

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           +A+V+FG +NPGGRLP+TWY   +   +P+  M +RP  S GYPGRTY+FY G  +Y FG
Sbjct: 567 LAEVIFGDYNPGGRLPVTWYPQSFTN-VPMNDMHMRPNPSRGYPGRTYRFYTGDRIYGFG 625

Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY--FE 696
            GLSYT FKY LLS  K + + L K +  R               + V ++   D   FE
Sbjct: 626 EGLSYTSFKYRLLSAPKKVNL-LGKAETSRRRIIPQVRDGVNMSYMEVEEVESCDLLRFE 684

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
            K+   N+G  DGS VV+++S+ P  +  T  +Q+IGF R++V+  ++     + + C  
Sbjct: 685 VKLSVSNIGEFDGSHVVMMFSEFPKVLTGTPQRQLIGFDRLYVKRNQSAESSIMVDPCNH 744

Query: 757 LNIVDYAANTLLPAGEHTIFVGN 779
           +++ D     ++P G+HTI +G+
Sbjct: 745 VSLADEYGKRVIPLGDHTISLGD 767


>gi|18025342|gb|AAK38482.1| beta-D-xylosidase [Hordeum vulgare]
          Length = 777

 Score =  691 bits (1782), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 349/739 (47%), Positives = 482/739 (65%), Gaps = 19/739 (2%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
           SS  FCD  LP   R  DLVS++TL+EK+ QLGD +  V RLG+P Y+WWSEALHGV+N 
Sbjct: 40  SSAAFCDRRLPIEQRAADLVSKLTLEEKISQLGDESPAVDRLGVPAYKWWSEALHGVANA 99

Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPN 168
           G G H D  +  ATSFP VILT ASFN  LW +IGQ + TEAR +YN G+A GLT+W+PN
Sbjct: 100 GRGVHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARGVYNNGQAEGLTFWAPN 159

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           INV RDPRWGR  ETPGEDP + G+YA  +VRG+Q   G+  +  +NS  L+ S+CCKH+
Sbjct: 160 INVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQ---GYGMSGAINSSDLEASACCKHF 216

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
            AYD++NWKGV R+ FDA+VTEQD+ +T+  PF+ CV++G AS +MCSYNRVNG+P+CAD
Sbjct: 217 TAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIMCSYNRVNGVPTCAD 276

Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
             LL++T RG+W  +GYI +DCD++ ++ D   + A + EDAVA  LKAG+D++CG Y  
Sbjct: 277 HNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGY-AKAPEDAVADVLKAGMDVNCGGYIQ 335

Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELA 405
               +A QQGK+   DID++L+ L+ + MRLG FDG+P+   Y ++G   +CS E+ +LA
Sbjct: 336 THGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFDGNPKYNRYGNIGADQVCSKEHQDLA 395

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
            +AAR+GIVLLKND   LPL+ +KV ++AV+GP+ N    ++GNY G PC  ++P+    
Sbjct: 396 LQAARDGIVLLKNDGAALPLSKSKVSSLAVIGPNGNNASLLLGNYFGPPCISVTPLQALQ 455

Query: 466 GYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
           GY  +  +  GC+   C  +N I  A  AA +AD  ++  GLD + E E +DR +L LPG
Sbjct: 456 GYVKDARFVQGCNAAVCNVSN-IGEAVHAAGSADYVVLFMGLDQNQEREEVDRLELGLPG 514

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q  L+N VA+ AK PVILV++  G VD+ FA+ N  I AI+WAGYPG+ GG AIA V+F
Sbjct: 515 MQESLVNSVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGYPGQAGGIAIAQVLF 574

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G  NPGGRLP+TWY  ++   +P+T M +R   S GYPGRTY+FY G T+Y FGYGLSY+
Sbjct: 575 GDHNPGGRLPVTWYPKEFT-AVPMTDMRMRADPSTGYPGRTYRFYKGKTVYNFGYGLSYS 633

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL---RCDDY-FEFKVD 700
           ++ +   S   T   +++ ++    L  T+ AS        V ++    CD   F   V 
Sbjct: 634 KYSHRFAS-KGTKPPSMSGIE---GLKATARASAAGTVSYDVEEMGAEACDRLRFPAVVR 689

Query: 701 FQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIV 760
            QN G  DG  +V+++ + P         Q+IGFQ V +RA     ++F  + CK L+  
Sbjct: 690 VQNHGPMDGGHLVLLFLRWPNATDGRPASQLIGFQSVHLRADEAAHVEFEVSPCKHLSRA 749

Query: 761 DYAANTLLPAGEHTIFVGN 779
                 ++  G H + VG+
Sbjct: 750 AEDGRKVIDQGSHFVRVGD 768


>gi|297811163|ref|XP_002873465.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297319302|gb|EFH49724.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 796

 Score =  688 bits (1776), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 365/797 (45%), Positives = 494/797 (61%), Gaps = 44/797 (5%)

Query: 13  LSIALLVFSTNAVD---ANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLV 69
           L++  LVF T+A+     N  S P F C P  FS       S+ FC+ SL    R   LV
Sbjct: 5   LTLISLVFFTSAIAETFKNLDSHPQFPCKPPHFS-------SYPFCNVSLSIKQRAISLV 57

Query: 70  SRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVI 129
           S +TL EK+ QL   A  VPRLG+P YEWWSE+LHG+++ GPG  F+  I  ATSFP VI
Sbjct: 58  SLLTLPEKIGQLSTTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVI 117

Query: 130 LTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPF 189
           ++ ASFN +LW +IG AV+ EARAMYN G+AGLT+W+PNIN+ RDPRWGR  ETPGEDP 
Sbjct: 118 VSAASFNRTLWYEIGSAVAVEARAMYNGGQAGLTFWAPNINLFRDPRWGRGQETPGEDPK 177

Query: 190 VVGRYAVNYVRGLQDVE---------GHENATDLNSR------PLKVSSCCKHYAAYDVD 234
           VV  Y V +VRG Q+ +         G +N  D           L +S+CCKH+ AYD++
Sbjct: 178 VVSEYGVEFVRGFQEKKKRKVLKTRFGSDNVDDDARYDDDADGKLMLSACCKHFTAYDLE 237

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
            W    RY F+A VTEQDME+T+  PFE C+K+G AS +MCSYN VNG+P+CA   LL Q
Sbjct: 238 KWGNFTRYDFNAVVTEQDMEDTYQPPFETCIKDGKASCLMCSYNAVNGVPACAQGDLL-Q 296

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
             R EW   GYI +DCD++  + +   +   S E+AVA  +KAG+D++CG Y    T +A
Sbjct: 297 KARVEWGFDGYITSDCDAVATIFEYQGY-TKSPEEAVADAIKAGVDINCGTYMLRNTQSA 355

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAARE 411
           ++QGKV E  +D++L  L+ V +RLG FDG P+   Y  LG  DICS ++ +LA EAAR+
Sbjct: 356 IEQGKVSEELVDRALLNLFAVQLRLGLFDGDPRGGHYGKLGSNDICSSDHRKLALEAARQ 415

Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT 471
           GIVLLKND   LPLN   V ++A+VGP AN    M G Y G PC+  +       Y   T
Sbjct: 416 GIVLLKNDYKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKT 475

Query: 472 -YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLI 530
            Y +GC DV+C S+     A   AK AD  I++AGLDLS E E  DR  L LPG Q  L+
Sbjct: 476 SYASGCSDVSCVSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRFSLSLPGKQKDLV 535

Query: 531 NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
           + VA V+K PVILV+   G VD+ FA+T+  I +I+W GYPGE GG+A+A+++FG FNPG
Sbjct: 536 SSVAAVSKKPVILVLTGGGPVDVTFAKTDPRIGSIIWIGYPGETGGQALAEIIFGDFNPG 595

Query: 591 GRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNL 650
           GRLPITWY   +   +P++ M +R   S GYPGRTY+FY GP +Y FG GLSYT+F Y +
Sbjct: 596 GRLPITWYPESFAD-VPMSDMHMRADSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFDYKI 654

Query: 651 LSFTKTIQVNLNKL-----QHCRNL--NYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQ 702
           +S    I+++L++L      H + L  +        +   V+VN   C+   F  +V+ +
Sbjct: 655 IS--APIRLSLSELLPQQSSHKKQLLQHGEEQLQYIQLDDVMVNS--CESLRFNVRVNVR 710

Query: 703 NVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDY 762
           N G  DGS V++++SK    ++    KQ+IGF RV +R+       FV + CK L++ + 
Sbjct: 711 NTGEIDGSHVLMLFSKMARVLSGVPEKQLIGFDRVHIRSNEMMETVFVIDPCKYLSVAND 770

Query: 763 AANTLLPAGEHTIFVGN 779
               ++P G H +F+G+
Sbjct: 771 VGKRVIPLGIHALFLGD 787


>gi|189380221|gb|ACD93208.1| beta xylosidase [Camellia sinensis]
          Length = 767

 Score =  688 bits (1775), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 357/760 (46%), Positives = 479/760 (63%), Gaps = 46/760 (6%)

Query: 31  SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
           S P F CD            +  FC  SLP   RV+DL+ R+TL EK++ L + A  VPR
Sbjct: 27  SRPAFACDGA--------TRNLPFCRVSLPIQDRVRDLIGRLTLQEKIRLLVNNAAAVPR 78

Query: 91  LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
           LG+  YEWWSEALHGVSN  PG  F    PGATSFP VI T ASFN SLW+ IG+ VS E
Sbjct: 79  LGIKGYEWWSEALHGVSNADPGVKFGGAFPGATSFPQVISTAASFNASLWEHIGRVVSDE 138

Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
           ARAMYN G AGLTYWSPN+N+ RDPRWGR  ETPGEDP + G+YA +YVRGLQ   G++ 
Sbjct: 139 ARAMYNGGMAGLTYWSPNVNIFRDPRWGRGQETPGEDPVLAGKYAASYVRGLQGNSGNQ- 197

Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
                   LKV++CCKHY AYD+DNW  VDRY F+ARV++QD+ +T+  PF+ CV EG  
Sbjct: 198 --------LKVAACCKHYTAYDLDNWNSVDRYRFNARVSKQDLADTYDVPFKACVVEGK- 248

Query: 271 SSVMCSYNRVNGIPSCADPKLLN----QTVRGEWD--LHGYIVADCDSIQVMVDNHKFLA 324
             V C++     I   A+P +L     Q     W   LH + +  C         H  L 
Sbjct: 249 YQVYCAHT----IKLMANPLVLTLISPQHHPWSWHSWLHCFRLYRCWGFIC----HSTLH 300

Query: 325 DSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
            + EDA A T+KAGLDL+CG +    T  AV+QGK+ E D++ +L    +V MRLG FDG
Sbjct: 301 STPEDAAAATIKAGLDLECGPFLAIHTEQAVRQGKLGEADVNGALINTLSVQMRLGMFDG 360

Query: 385 SPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
            P    Y +LG +D+C+  + +LA EAAR+GIVLL+N   +LPL++   +TVAV+GP+++
Sbjct: 361 EPSSQPYGNLGPRDVCTPAHQQLALEAARQGIVLLQNRGRSLPLSTQLHRTVAVIGPNSD 420

Query: 442 ATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASE-AAKTADAT 500
            TV M+GNYAG+ C + +P+ G   Y    +++GCD VAC SNN +F  +E AA+ ADAT
Sbjct: 421 VTVTMLGNYAGVACGFTTPLQGIERYVRTIHQSGCDSVAC-SNNQLFGVAETAARQADAT 479

Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
           +++ GLD S+E E  DR  L LPG Q +L+++VA  ++GPV+LV+MS G +D++FA+ + 
Sbjct: 480 VLVMGLDQSIETEFKDRVGLLLPGPQQELVSRVAMASRGPVVLVLMSGGPIDVSFAKNDP 539

Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
            I AILW GYPG+ GG AIADV+FG+ NPGGRLP+TWY  DY+   P+T+M +R   S G
Sbjct: 540 RIGAILWVGYPGQAGGTAIADVLFGRTNPGGRLPMTWYPQDYLAKAPMTNMAMRANPSSG 599

Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
           YPGRTY+FY GP ++PFG+G+SYT F + L     T+ V L  L   +N       S T 
Sbjct: 600 YPGRTYRFYKGPVVFPFGHGMSYTTFAHELAHAPTTVSVPLTSLYGLQN-------STTF 652

Query: 681 CPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
             G+ V    CD       +D +N G  DG+  V+V+S PP        KQ+IGF++V V
Sbjct: 653 NNGIRVTHTNCDTLILGIHIDVKNTGDMDGTHTVLVFSTPPVGKWGAN-KQLIGFKKVHV 711

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            A   +R+K   + C  L++VD      +P GEH++ +G+
Sbjct: 712 VARGRQRVKIHVHVCNQLSVVDQFGIRRIPIGEHSLHIGD 751


>gi|224082152|ref|XP_002306583.1| predicted protein [Populus trichocarpa]
 gi|222856032|gb|EEE93579.1| predicted protein [Populus trichocarpa]
          Length = 745

 Score =  687 bits (1774), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 343/755 (45%), Positives = 480/755 (63%), Gaps = 46/755 (6%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           S+ P F CD    S       +F FC ++LP S R  DLVSR+TL+EK+ QL + A  +P
Sbjct: 23  STQPPFSCDSSNPS-----TKTFPFCKTTLPISQRANDLVSRLTLEEKISQLVNSAQPIP 77

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P Y+WWSEALHGV+  GPG  F+  I  ATSFP VIL+ ASF+ + W +I QA+  
Sbjct: 78  RLGIPGYQWWSEALHGVAYAGPGIRFNGTIKRATSFPQVILSAASFDANQWYRISQAIGK 137

Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
           EARA+YN G+A G+T+W+PNIN+ RDPRWGR  ETPGEDP + G+YAV+YVRGLQ   G 
Sbjct: 138 EARALYNAGQATGMTFWAPNINIFRDPRWGRGQETPGEDPLMTGKYAVSYVRGLQ---GD 194

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
                    PL+ S+CCKH+ AYD++NW G  RY FDA VT QD+ +T+  PF+ CV+EG
Sbjct: 195 SFKGGEIKGPLQASACCKHFTAYDLENWNGTSRYVFDAYVTAQDLADTYQPPFKSCVEEG 254

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
            AS +MC+YNRVNGIP+CAD   L++T R +W   GYI +DCD++ ++ D   + A + E
Sbjct: 255 RASGIMCAYNRVNGIPNCADSNFLSRTARAQWGFDGYIASDCDAVSIIHDAQGY-AKTPE 313

Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-- 386
           DAV   LKAG+D++CG Y    T  AV Q K+  ++ID++L  L++V MRLG F+G+P  
Sbjct: 314 DAVVAVLKAGMDVNCGSYLQQHTKAAVDQKKLTISEIDRALHNLFSVRMRLGLFNGNPTG 373

Query: 387 -QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
            Q+ ++G   +CS EN  LA +AAR GIVLLKN    LPL+ +K  ++AV+GP+AN+   
Sbjct: 374 QQFGNIGPDQVCSQENQILALDAARNGIVLLKNSAGLLPLSKSKTMSLAVIGPNANSVQT 433

Query: 446 MIGNYAGIPCRYMSPIAGFSGYANVTYK-TGCDDVACKSNNSIFAASEAAKTADATIILA 504
           ++GNYAG PC+ ++P+     Y   T    GCD V C S+ SI  A   AK AD  +++ 
Sbjct: 434 LLGNYAGPPCKLVTPLQALQSYIKHTIPYPGCDSVQC-SSASIVGAVNVAKGADHVVLIM 492

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           GLD + E E LDR DL LPG Q +LI  VA+ AK PV+LV++S G VDI+FA+ + NI +
Sbjct: 493 GLDDTQEKEGLDRRDLVLPGKQQELIISVAKAAKNPVVLVLLSGGPVDISFAKNDKNIGS 552

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           ILWAGYPGE G  A+A+++FG  NPGG+LP+TWY  ++V+ +P+T M +RP  S GYPGR
Sbjct: 553 ILWAGYPGEAGAIALAEIIFGDHNPGGKLPMTWYPQEFVK-VPMTDMRMRPETSSGYPGR 611

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           TY+FY GPT++ FGYGLSY+++ Y L    + I +     + C N+              
Sbjct: 612 TYRFYKGPTVFEFGYGLSYSKYTYEL----RAIYIG---EEQCENIK------------- 651

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
                     F+  V  +N G   G   V+++++         IK+++GFQ V + AG  
Sbjct: 652 ----------FKVTVSVKNEGQMAGKHPVLLFARHAKPGKGRPIKKLVGFQTVKLGAGEK 701

Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             I++  + C+ L+  +     ++  G   + VG+
Sbjct: 702 TEIEYELSPCEHLSSANEDGVMVMEEGSQILLVGD 736


>gi|358349509|ref|XP_003638778.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355504713|gb|AES85916.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 776

 Score =  685 bits (1768), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 358/791 (45%), Positives = 498/791 (62%), Gaps = 44/791 (5%)

Query: 5   VSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIR 64
           +SS   F   I+L +  T +V A     P F CD    S       S+ FC+  LP + R
Sbjct: 3   LSSTFTFVTIISLFLTLTYSVLAQ---LPPFACDYSNPST-----RSYPFCNPKLPITQR 54

Query: 65  VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATS 124
            KDLVSR+TLDEK+ QL + A  +PRLG+P YEWWSEALHG+ NVG G  F+  I  ATS
Sbjct: 55  TKDLVSRLTLDEKLAQLVNSAPPIPRLGIPAYEWWSEALHGIGNVGRGIFFNGSITSATS 114

Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITET 183
           FP VILT ASF+  LW +IGQA+  EARA+YN G+A G+T+W+PNIN+ RDPRWGR  ET
Sbjct: 115 FPQVILTAASFDSHLWYRIGQAIGVEARAIYNGGQAMGMTFWAPNINIFRDPRWGRGQET 174

Query: 184 PGEDPFVVGRYAVNYVRGLQ-------DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNW 236
            GEDP +   YAV+YVRGLQ        + GH          L+ S+CCKH+ AYD+DNW
Sbjct: 175 AGEDPMMTSNYAVSYVRGLQGDSFQGGKLRGH----------LQASACCKHFTAYDLDNW 224

Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
           KGV+R+HFDARV+ QD+ +T+  PF  C+++G AS +MC+YNRVNGIPSCAD  LL  TV
Sbjct: 225 KGVNRFHFDARVSLQDLADTYQPPFRSCIEQGRASGIMCAYNRVNGIPSCADFNLLTNTV 284

Query: 297 RGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQ 356
           R +W+ HGYIV+DC ++ ++ D   + A S EDAVA  L AG+DL+CG Y T+   +AVQ
Sbjct: 285 RKQWEFHGYIVSDCGAVGIIHDEQGY-AKSAEDAVADVLHAGMDLECGSYLTDHAKSAVQ 343

Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVS---LGKQDICSDENIELAAEAAREGI 413
           Q K+    ID++L  L+++ +RLG FDG+P  +    +G   +CS+ ++ LA EAAR GI
Sbjct: 344 QKKLPIVRIDRALHNLFSIRIRLGQFDGNPAKLPFGMIGPNHVCSENHLYLALEAARNGI 403

Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANAT-VAMIGNYAGIPCRYMSPIAGFSGYA-NVT 471
           VLLKN  + LPL    + ++AV+GP+ANA+ + ++GNYAG PC+ ++ + GF  Y  N  
Sbjct: 404 VLLKNTASLLPLPKTSI-SLAVIGPNANASPLTLLGNYAGPPCKSITILQGFQHYVKNAV 462

Query: 472 YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLIN 531
           +  GCD     ++  I  A + AK AD  +++ GLD SVE E  DR  L LPG Q +LIN
Sbjct: 463 FHPGCDGGPKCASAPIDKAVKVAKNADYVVLVMGLDQSVEREERDRVHLDLPGKQLELIN 522

Query: 532 QVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGG 591
            VA+ +K PVILV++  G +DI+ A+ N  I  I+WAGYPGE GG A+A ++FG  NPGG
Sbjct: 523 SVAKASKRPVILVLLCGGPIDISSAKNNDKIGGIIWAGYPGELGGIALAQIIFGDHNPGG 582

Query: 592 RLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLL 651
           RLPITWY  DY++ +P+T M +R   + GYPGRTY+FY GPT+Y FG+GLSYT++ Y  +
Sbjct: 583 RLPITWYPKDYIK-VPMTDMRMRADPTTGYPGRTYRFYKGPTVYEFGHGLSYTKYSYEFV 641

Query: 652 SFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL---RCDDY-FEFKVDFQNVGST 707
           S T       +KL   ++  +    +       LV++L    C        V  +N G+ 
Sbjct: 642 SVTH------DKLHFNQSSTHLMTENSETIRYKLVSELDEETCKSMSVSVTVGVKNHGNI 695

Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
            G   ++++ +P      + +KQ++GF  + + AG    + F  + C+ L+  + A   +
Sbjct: 696 VGRHPILLFMRPQKHRTRSPMKQLVGFHSLLLDAGEMSHVGFELSPCEHLSRANEAGLKI 755

Query: 768 LPAGEHTIFVG 778
           +  G H + VG
Sbjct: 756 IEEGSHLLHVG 766


>gi|296084630|emb|CBI25718.3| unnamed protein product [Vitis vinifera]
          Length = 768

 Score =  684 bits (1766), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 352/783 (44%), Positives = 483/783 (61%), Gaps = 43/783 (5%)

Query: 8   LLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKD 67
            +C  L + L +FS +      S+ P F C P          S + FC++SLP S R + 
Sbjct: 9   FICLFLQV-LPLFSISE-----STHPQFPCMPP-------TNSDYPFCNTSLPISTRAQS 55

Query: 68  LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
           LVS +TL EK+QQL D A  +PRL +P YEWWSE+LHG++  GPG  F+  +  ATSFP 
Sbjct: 56  LVSLLTLSEKIQQLSDEAAAIPRLYIPAYEWWSESLHGIATNGPGVSFNGTVSAATSFPQ 115

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
           V+LT ASFN SLW  IG A++ EARAMYN+G+AGLT+W+PNIN+ RDPRWGR  ETPGED
Sbjct: 116 VLLTAASFNRSLWFSIGSAIAVEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGED 175

Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
           P V   YAV +VRG Q         D +   L +S+CCKH  AYD++ W    RY FDA 
Sbjct: 176 PMVASAYAVEFVRGFQG--------DSDGDGLMLSACCKHLTAYDLEKWGNFSRYSFDAV 227

Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
           V+ QD+E+T+  PF  CV++G AS +MCSYNRVNG+P+CA   L  Q  + EW   GYI 
Sbjct: 228 VSNQDLEDTYQPPFRSCVQQGKASCLMCSYNRVNGVPACARQDLF-QKAKTEWGFKGYIT 286

Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDK 367
           +DCD++  + + ++  A+S EDAVA  LKAG D++CG Y    T +A+ QGKVKE DID+
Sbjct: 287 SDCDAVATVYE-YQHYANSPEDAVADVLKAGTDINCGSYMLRHTQSAIDQGKVKEEDIDR 345

Query: 368 SLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLP 424
           +L  L++V MRLG FDG P    Y +LG +D+C+ E+  LA EAAR+GIVLLKND+  LP
Sbjct: 346 ALFNLFSVQMRLGLFDGDPANGLYGNLGPKDVCTKEHRTLALEAARQGIVLLKNDKKFLP 405

Query: 425 LNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKS 483
           L+ +++ ++A++GP A+    + G Y GIPC+  S + G   Y   T +  GC DV C S
Sbjct: 406 LDKSRISSLAIIGPQADQPF-LGGGYTGIPCKPESLVEGLKTYVEKTSFAAGCVDVPCLS 464

Query: 484 NNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVIL 543
           +     A   A+ AD  +++AGLDLS E E  DR  L LPG Q  LI+ VA   + P++L
Sbjct: 465 DTGFDEAVSIARKADIVVVVAGLDLSQETEDHDRVSLLLPGKQMALISSVASAIQKPLVL 524

Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYV 603
           V+   G +D++FAE +  I +ILW GYPGE G +A+A+++FG FNPGGRLP+TWY   + 
Sbjct: 525 VLTGGGPLDVSFAEQDPRIASILWIGYPGEAGAKALAEIIFGDFNPGGRLPMTWYPESFT 584

Query: 604 QMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNK 663
           + +P+  M +R     GYPGRTY+FY G  +Y FG GLSYT+F Y  +S         NK
Sbjct: 585 R-VPMNDMNMRADPYRGYPGRTYRFYIGHRVYGFGQGLSYTKFAYQFVSAP-------NK 636

Query: 664 LQHCRNLNYTSDASKTRCPGVLVNDLR------CDDY-FEFKVDFQNVGSTDGSDVVIVY 716
           L   R+ +  S  +  R     VN         CD   F  ++   NVG  DGS VV+++
Sbjct: 637 LNLLRSSDTVSSKNLPRQRREEVNYFHIEELDTCDSLRFHVEISVTNVGDMDGSHVVMLF 696

Query: 717 SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
           S+ P  +  T  KQ+IGF RV   + R+     + + C+  +I +     ++P G+HTI 
Sbjct: 697 SRVPKIVKGTPEKQLIGFSRVHTVSRRSTETSIMVDPCEHFSIANEQGKRIMPLGDHTIM 756

Query: 777 VGN 779
           +G+
Sbjct: 757 LGD 759


>gi|218191593|gb|EEC74020.1| hypothetical protein OsI_08964 [Oryza sativa Indica Group]
          Length = 774

 Score =  684 bits (1764), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 351/744 (47%), Positives = 481/744 (64%), Gaps = 28/744 (3%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
           SS  FC+  LP   R  DLVSR+TL+EK+ QLGD +  V RLG+P Y+WWSEALHGVSN 
Sbjct: 36  SSAAFCNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYKWWSEALHGVSNA 95

Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPN 168
           G G H D  +  ATSFP VILT ASFN  LW +IGQ + TEARA+YN G+A GLT+W+PN
Sbjct: 96  GRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNNGQAEGLTFWAPN 155

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           INV RDPRWGR  ETPGEDP V G+YA  +VRG+Q   G+  A  +NS  L+ S+CCKH+
Sbjct: 156 INVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQ---GYALAGAINSTDLEASACCKHF 212

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
            AYD++NWKGV RY FDA+VT QD+ +T+  PF  CV++G AS +MCSYNRVNG+P+CAD
Sbjct: 213 TAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCSYNRVNGVPTCAD 272

Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
             LL++T RG+W  +GYI +DCD++ ++ D   + A + EDAVA  LKAG+D++CG Y  
Sbjct: 273 YNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGY-AKTAEDAVADVLKAGMDVNCGSYVQ 331

Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV---SLGKQDICSDENIELA 405
               +A+QQGK+ E DI+++L  L+ V MRLG F+G+P+Y    ++G   +C+ E+  LA
Sbjct: 332 EHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPDQVCTQEHQNLA 391

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
            EAA+ G+VLLKND N LPL+ ++V ++AV+G +AN    ++GNY G PC  ++P+    
Sbjct: 392 LEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPPCISVTPLQVLQ 451

Query: 466 GYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
           GY   T +  GC+  AC  + SI  A++ A + D  ++  GLD   E E +DR +L LPG
Sbjct: 452 GYVKDTRFLAGCNSAACNVS-SIGEAAQLASSVDYVVLFMGLDQDQEREEVDRLELSLPG 510

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q  LIN VA  AK PVILV++  G VD+ FA+ N  I AILWAGYPGE GG AIA V+F
Sbjct: 511 MQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEAGGIAIAQVLF 570

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G+ NPGGRLP+TWY  ++   +P+T M +R   S GYPGRTY+FY G T+Y FGYGLSY+
Sbjct: 571 GEHNPGGRLPVTWYPKEFTS-VPMTDMRMRADPSTGYPGRTYRFYRGNTVYKFGYGLSYS 629

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR------CDDY-FEF 697
           ++ ++ ++       N  KL    +++    A  T   G +  D+       CD   F  
Sbjct: 630 KYSHHFVA-------NGTKLPSLSSIDGLK-AMATAAAGTVSYDVEEIGTETCDKLKFPA 681

Query: 698 KVDFQNVGSTDGSDVVIVYSKPP--AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACK 755
            V  QN G  DG   V+++ + P  A        Q+IGFQ + +++ +   ++F  + CK
Sbjct: 682 LVRVQNHGPMDGRHPVLLFLRWPNGAADGGRPASQLIGFQSLHLKSMQTVHVEFEVSPCK 741

Query: 756 SLNIVDYAANTLLPAGEHTIFVGN 779
             +        ++  G H + VG+
Sbjct: 742 HFSRATEDGKKVIDHGSHFMMVGD 765


>gi|115448721|ref|NP_001048140.1| Os02g0752200 [Oryza sativa Japonica Group]
 gi|46390122|dbj|BAD15557.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|46390225|dbj|BAD15656.1| putative beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|113537671|dbj|BAF10054.1| Os02g0752200 [Oryza sativa Japonica Group]
 gi|125583710|gb|EAZ24641.1| hypothetical protein OsJ_08409 [Oryza sativa Japonica Group]
          Length = 780

 Score =  684 bits (1764), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 351/744 (47%), Positives = 481/744 (64%), Gaps = 28/744 (3%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
           SS  FC+  LP   R  DLVSR+TL+EK+ QLGD +  V RLG+P Y+WWSEALHGVSN 
Sbjct: 42  SSAAFCNPRLPIEQRADDLVSRLTLEEKISQLGDQSPAVDRLGVPAYKWWSEALHGVSNA 101

Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPN 168
           G G H D  +  ATSFP VILT ASFN  LW +IGQ + TEARA+YN G+A GLT+W+PN
Sbjct: 102 GRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGTEARAVYNNGQAEGLTFWAPN 161

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           INV RDPRWGR  ETPGEDP V G+YA  +VRG+Q   G+  A  +NS  L+ S+CCKH+
Sbjct: 162 INVFRDPRWGRGQETPGEDPTVTGKYAAVFVRGVQ---GYALAGAINSTDLEASACCKHF 218

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
            AYD++NWKGV RY FDA+VT QD+ +T+  PF  CV++G AS +MCSYNRVNG+P+CAD
Sbjct: 219 TAYDLENWKGVTRYAFDAKVTAQDLADTYNPPFRSCVEDGGASGIMCSYNRVNGVPTCAD 278

Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
             LL++T RG+W  +GYI +DCD++ ++ D   + A + EDAVA  LKAG+D++CG Y  
Sbjct: 279 YNLLSKTARGDWRFYGYITSDCDAVSIIHDVQGY-AKTAEDAVADVLKAGMDVNCGSYVQ 337

Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY---VSLGKQDICSDENIELA 405
               +A+QQGK+ E DI+++L  L+ V MRLG F+G+P+Y    ++G   +C+ E+  LA
Sbjct: 338 EHGLSAIQQGKITEQDINRALHNLFAVRMRLGLFNGNPKYNRYGNIGPDQVCTQEHQNLA 397

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
            EAA+ G+VLLKND N LPL+ ++V ++AV+G +AN    ++GNY G PC  ++P+    
Sbjct: 398 LEAAQHGVVLLKNDANALPLSKSQVSSIAVIGHNANDATRLLGNYFGPPCISVTPLQVLQ 457

Query: 466 GYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
           GY   T +  GC+  AC  + SI  A++ A + D  ++  GLD   E E +DR +L LPG
Sbjct: 458 GYVKDTRFLAGCNSAACNVS-SIGEAAQLASSVDYVVLFMGLDQDQEREEVDRLELSLPG 516

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q  LIN VA  AK PVILV++  G VD+ FA+ N  I AILWAGYPGE GG AIA V+F
Sbjct: 517 MQENLINTVANAAKKPVILVLLCGGPVDVTFAKYNPKIGAILWAGYPGEAGGIAIAQVLF 576

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G+ NPGGRLP+TWY  ++   +P+T M +R   S GYPGRTY+FY G T+Y FGYGLSY+
Sbjct: 577 GEHNPGGRLPVTWYPKEFTS-VPMTDMRMRADPSTGYPGRTYRFYRGNTVYKFGYGLSYS 635

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR------CDDY-FEF 697
           ++ ++ ++       N  KL    +++    A  T   G +  D+       CD   F  
Sbjct: 636 KYSHHFVA-------NGTKLPSLSSIDGLK-AMATAAAGTVSYDVEEIGPETCDKLKFPA 687

Query: 698 KVDFQNVGSTDGSDVVIVYSKPP--AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACK 755
            V  QN G  DG   V+++ + P  A        Q+IGFQ + +++ +   ++F  + CK
Sbjct: 688 LVRVQNHGPMDGRHPVLLFLRWPNGAADGGRPASQLIGFQSLHLKSMQTVHVEFEVSPCK 747

Query: 756 SLNIVDYAANTLLPAGEHTIFVGN 779
             +        ++  G H + VG+
Sbjct: 748 HFSRATEDGKKVIDHGSHFMMVGD 771


>gi|168065036|ref|XP_001784462.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162663987|gb|EDQ50724.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 726

 Score =  682 bits (1761), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/743 (47%), Positives = 491/743 (66%), Gaps = 39/743 (5%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           FCD+SL   IRV DLVSR+TL+EKV QL + A  +PRL +P YEWW E LHGV++V    
Sbjct: 3   FCDTSLSDEIRVFDLVSRLTLEEKVTQLVNTASAIPRLSIPAYEWWQEGLHGVAHVS--- 59

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVAR 173
            F   +P ATSFP  ILTTASFN+ LW +IGQA STEARA YN G AGLTYWSP IN+AR
Sbjct: 60  -FGGSLPRATSFPLPILTTASFNKDLWNQIGQAFSTEARAFYNDGIAGLTYWSPVINIAR 118

Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
           DPRWGRI ET GEDP+    YA ++V+G+Q  EG     D NS+ LK+S+CCKH+ AYDV
Sbjct: 119 DPRWGRIQETSGEDPYTTSAYATHFVQGMQ--EG-----DANSKRLKLSACCKHFTAYDV 171

Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
           DNW+G+DRYHFDA+    ++ +T+  PF+ CV+EG ++S+MCSYN+VNG+P+CA+   L 
Sbjct: 172 DNWEGIDRYHFDAKA---NLADTYNPPFQSCVQEGRSASLMCSYNKVNGVPTCANYDFLE 228

Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN 353
            TVR  W L+GYIV+DCDS+ VM ++  + A + EDA A  L AGLDL+CG Y  ++T  
Sbjct: 229 NTVRRAWGLNGYIVSDCDSVLVMHESTNY-APTTEDAAADALNAGLDLNCGDYLASYTEG 287

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSDENIELAAEAAR 410
           AV  GKV  + +D ++  ++ V MRLG FDG+P   ++ ++G  D+C+  + ELA EAAR
Sbjct: 288 AVAMGKVNASRVDNAVYNVFLVRMRLGMFDGNPANQEFGNIGVADVCTPAHQELAVEAAR 347

Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SG 466
           +GIVLLKND N LPL  +K    AV+GP+ANAT  M+GNY GIPC+Y++P+ G     SG
Sbjct: 348 QGIVLLKNDGNILPL--SKNINTAVIGPNANATHTMLGNYEGIPCQYITPLQGLVKFGSG 405

Query: 467 -YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
            Y  V +  GC + AC+ ++ I +A   A  ADA +++ GL    E+E+LDR  L LPGY
Sbjct: 406 DYHKVWFSEGCVNTACQQDDQISSAVSTAAVADAVVLVVGLSQVQESEALDRTSLLLPGY 465

Query: 526 QTQLINQVAEVAKG-PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
           Q  LI++VA  A G PV+LV+M AG VDI FA+ +  I++ILW GYPG+ GG+AIA+V+F
Sbjct: 466 QQTLIDEVAGAAAGRPVVLVLMCAGPVDINFAKNDKRIQSILWVGYPGQSGGQAIAEVIF 525

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G  NPGG+LP++WY  DY + + +T+M +RP     YPGRTY+FY G  +Y FGYGLSYT
Sbjct: 526 GAHNPGGKLPMSWYPEDYTK-ISMTNMNMRPDSRSNYPGRTYRFYTGEKIYDFGYGLSYT 584

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
           ++K++      T+       Q C + + TS  SKT C             F+  ++ +N+
Sbjct: 585 EYKHSFALAPTTVMTPSIHSQLC-DPHQTSAGSKT-C---------SSSNFDVHINVENI 633

Query: 705 GSTDGSDVVIV-YSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
           G+  G+  +++ ++ P A    T +KQ+  F  V++R+G  +++    N C+ L  V   
Sbjct: 634 GAMAGNHTLLLFFTAPSAGKNGTPLKQLAAFDSVYIRSGSQEKVVLTLNPCQHLGTVAED 693

Query: 764 ANTLLPAGEHTIFVGNGGVSFPI 786
              +L AG H + VG+   S  +
Sbjct: 694 GTRMLEAGNHILSVGDAKHSLSV 716


>gi|225459350|ref|XP_002285805.1| PREDICTED: probable beta-D-xylosidase 7-like [Vitis vinifera]
          Length = 774

 Score =  681 bits (1758), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 361/785 (45%), Positives = 501/785 (63%), Gaps = 44/785 (5%)

Query: 13  LSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRM 72
           L I L+  +   V    + SP F CD    S       S+ FC ++LP   RV+DLVSR+
Sbjct: 7   LLINLIYVTVILVGVESTQSPPFSCDSSNPS-----TKSYHFCKTTLPIPDRVRDLVSRL 61

Query: 73  TLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTT 132
           TLDEK+ QL + A  +PRLG+P YEWWSEALHGV++ GPG  F+  I  ATSFP VILT 
Sbjct: 62  TLDEKISQLVNSAPAIPRLGIPAYEWWSEALHGVADAGPGIRFNGTIRSATSFPQVILTA 121

Query: 133 ASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVV 191
           ASF+  LW +IG+A+  EARA+YN G+  G+T+W+PNIN+ RDPRWGR  ETPGEDP V 
Sbjct: 122 ASFDVHLWYRIGRAIGVEARAVYNAGQTKGMTFWAPNINIFRDPRWGRGQETPGEDPLVT 181

Query: 192 GRYAVNYVRGLQD--VEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVT 249
           G YAV+YVRG+Q   + G +   +L     + S+CCKH+ AYD+D+WKG+DR+ FDARVT
Sbjct: 182 GSYAVSYVRGVQGDCLRGLKRCGEL-----QASACCKHFTAYDLDDWKGIDRFKFDARVT 236

Query: 250 EQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVAD 309
            QD+ +T+  PF  C++EG AS +MC+YNRVNG+PSCAD  LL  T R  W+  GYI +D
Sbjct: 237 MQDLADTYQPPFHRCIEEGRASGIMCAYNRVNGVPSCADFNLLTNTARKRWNFQGYITSD 296

Query: 310 CDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSL 369
           CD++ ++ D++ F A + EDAV   LKAG+D++CG Y  N T +AV Q K+ E+++D++L
Sbjct: 297 CDAVSLIHDSYGF-AKTPEDAVVDVLKAGMDVNCGTYLLNHTKSAVMQKKLPESELDRAL 355

Query: 370 KYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLN 426
           + L+ V MRLG F+G+P+   Y  +G   +CS E+  LA +AAR+GIVLLKN Q  LPL 
Sbjct: 356 ENLFAVRMRLGLFNGNPKGQPYGDIGPNQVCSVEHQTLALDAARDGIVLLKNSQRLLPLP 415

Query: 427 SAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKSNN 485
             K  ++AV+GP+AN+   +IGNYAG PC++++P+     Y   T Y  GCD VAC S+ 
Sbjct: 416 KGKTMSLAVIGPNANSPKTLIGNYAGPPCKFITPLQALQSYVKSTMYHPGCDAVAC-SSP 474

Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
           SI  A E A+ AD  +++ GLD + E E+ DR DL LPG Q QLI  VA  AK PV+LV+
Sbjct: 475 SIEKAVEIAQKADYVVLVMGLDQTQEREAHDRLDLVLPGKQQQLIICVANAAKKPVVLVL 534

Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
           +S G VDI+FA+ + NI +ILWAGYPG  GG AIA+ +FG  NPGGRLP+TWY  D+ + 
Sbjct: 535 LSGGPVDISFAKYSNNIGSILWAGYPGGAGGAAIAETIFGDHNPGGRLPVTWYPQDFTK- 593

Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL- 664
           +P+T M +RP  + GYPGRTY+FY G  ++ FGYGLSY+ +        +TI V  NKL 
Sbjct: 594 IPMTDMRMRPESNSGYPGRTYRFYTGEKVFEFGYGLSYSTYS------CETIPVTRNKLY 647

Query: 665 ----------QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
                     ++  ++ YTS A        L  +L   +     +  +N G   G   V+
Sbjct: 648 FNQSSTAHVYENTDSIRYTSVAE-------LGKELCDSNNISISIRVRNDGEMAGKHSVL 700

Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
           ++ +     A + IKQ++ FQ V +  G +  + F+ N C+  +  +     ++  G H 
Sbjct: 701 LFVRRLKASAGSPIKQLVAFQSVHLNGGESADVGFLLNPCEHFSGPNKDGLMVIEEGTHF 760

Query: 775 IFVGN 779
           + VG+
Sbjct: 761 LVVGD 765


>gi|356515806|ref|XP_003526589.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
          Length = 772

 Score =  681 bits (1757), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 356/769 (46%), Positives = 490/769 (63%), Gaps = 34/769 (4%)

Query: 22  TNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL 81
           T  V ++   +P F CD   FS    +  S+ FC+  LP   R KDL+SR+TLDEK+ QL
Sbjct: 14  TVTVQSSKPEAP-FACD---FSNPSSR--SYPFCNPKLPIPQRTKDLLSRLTLDEKLSQL 67

Query: 82  GDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDD--VIPGATSFPTVILTTASFNESL 139
            + A  +PRLG+P Y+WWSEALHGVS VGPG  FD+   I  ATSFP VILT ASF+  L
Sbjct: 68  VNTAPPIPRLGIPAYQWWSEALHGVSGVGPGILFDNNSTISSATSFPQVILTAASFDSRL 127

Query: 140 WKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNY 198
           W +IG A+  EARA++N G+A GLT+W+PNIN+ RDPRWGR  ET GEDP +  RYAV++
Sbjct: 128 WYRIGHAIGIEARAIFNAGQANGLTFWAPNINIFRDPRWGRGQETAGEDPLLTSRYAVSF 187

Query: 199 VRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFL 258
           VRGLQ               L  S+CCKH+ AYD+DNWKGVDR+ FDARV+ QD+ +T+ 
Sbjct: 188 VRGLQ-------GDSFKGAHLLASACCKHFTAYDLDNWKGVDRFVFDARVSLQDLADTYQ 240

Query: 259 RPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD 318
            PF+ CV++G AS +MC+YNRVNG+P+CAD  LL QT R +WD +GYI +DC ++  + D
Sbjct: 241 PPFQSCVQQGRASGIMCAYNRVNGVPNCADYGLLTQTARNQWDFNGYITSDCGAVGFIHD 300

Query: 319 NHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMR 378
             ++ A S ED VA  L+AG+DL+CG Y T    +AV Q K+  ++ID++L+ L+++ MR
Sbjct: 301 RQRY-AKSPEDVVADVLRAGMDLECGSYLTYHAKSAVLQKKLGMSEIDRALQNLFSIRMR 359

Query: 379 LGFFDGSPQYVS---LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL-NSAKVKTVA 434
           LG FDG+P  +S   +G   +CS E+  LA EAAR GIVLLKN    LPL  ++   ++A
Sbjct: 360 LGLFDGNPTRLSFGLIGSNHVCSKEHQYLALEAARNGIVLLKNSPTLLPLPKTSPSISLA 419

Query: 435 VVGPHANAT-VAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASE 492
           V+GP+AN++ + ++GNYAG PC+Y++ + GF  Y  N  Y  GCD     S+  I  A E
Sbjct: 420 VIGPNANSSPLTLLGNYAGPPCKYVTILQGFRHYVKNAFYHPGCDGGPKCSSAQIDQAVE 479

Query: 493 AAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVD 552
            AK  D  +++ GLD S E E  DR  L LPG Q +LIN VAE +K PVILV++S G +D
Sbjct: 480 VAKKVDYVVLVMGLDQSEEREERDRVHLDLPGKQLELINGVAEASKKPVILVLLSGGPLD 539

Query: 553 IAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMP 612
           I  A+ N  I  ILWAGYPGE GG A+A ++FG  NPGGRLP TWY  DY++ +P+T M 
Sbjct: 540 ITSAKYNHKIGGILWAGYPGELGGIALAQIIFGDHNPGGRLPTTWYPKDYIK-VPMTDMR 598

Query: 613 LRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
           +R   S GYPGRTY+FY GP +Y FGYGLSY+++ Y  +S T       +KL   ++  +
Sbjct: 599 MRADPSTGYPGRTYRFYKGPKVYEFGYGLSYSKYSYEFVSVTH------DKLHFNQSSTH 652

Query: 673 TSDASKTRCPGVLVNDL---RCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
               +       LV++L    C        V  QN GS  G   V+++ +P  + + + +
Sbjct: 653 LMVENSETISYKLVSELDEQTCQSMSLSVTVRVQNHGSMVGKHPVLLFIRPKRQKSGSPV 712

Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
           KQ++GF+ V + AG    ++F  + C+ L+  + A   ++  G H + V
Sbjct: 713 KQLVGFESVMLDAGEMAHVEFEVSPCEHLSRANEAGAMIIEEGSHMLLV 761


>gi|225469218|ref|XP_002264031.1| PREDICTED: probable beta-D-xylosidase 6-like [Vitis vinifera]
          Length = 789

 Score =  681 bits (1756), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 353/796 (44%), Positives = 486/796 (61%), Gaps = 48/796 (6%)

Query: 8   LLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKD 67
            +C  L + L +FS +      S+ P F C P          S + FC++SLP S R + 
Sbjct: 9   FICLFLQV-LPLFSISE-----STHPQFPCMPP-------TNSDYPFCNTSLPISTRAQS 55

Query: 68  LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
           LVS +TL EK+QQL D A  +PRL +P YEWWSE+LHG++  GPG  F+  +  ATSFP 
Sbjct: 56  LVSLLTLSEKIQQLSDEAAAIPRLYIPAYEWWSESLHGIATNGPGVSFNGTVSAATSFPQ 115

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
           V+LT ASFN SLW  IG A++ EARAMYN+G+AGLT+W+PNIN+ RDPRWGR  ETPGED
Sbjct: 116 VLLTAASFNRSLWFSIGSAIAVEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGED 175

Query: 188 PFVVGRYAVNYVRGLQ--------DVEGHENAT-----DLNSRPLKVSSCCKHYAAYDVD 234
           P V   YAV +VRG Q        ++ G          D +   L +S+CCKH  AYD++
Sbjct: 176 PMVASAYAVEFVRGFQGGNWKGGDEIRGAVGKKRVLRGDSDGDGLMLSACCKHLTAYDLE 235

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
            W    RY FDA V+ QD+E+T+  PF  CV++G AS +MCSYNRVNG+P+CA   L  Q
Sbjct: 236 KWGNFSRYSFDAVVSNQDLEDTYQPPFRSCVQQGKASCLMCSYNRVNGVPACARQDLF-Q 294

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
             + EW   GYI +DCD++  + + ++  A+S EDAVA  LKAG D++CG Y    T +A
Sbjct: 295 KAKTEWGFKGYITSDCDAVATVYE-YQHYANSPEDAVADVLKAGTDINCGSYMLRHTQSA 353

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAARE 411
           + QGKVKE DID++L  L++V MRLG FDG P    Y +LG +D+C+ E+  LA EAAR+
Sbjct: 354 IDQGKVKEEDIDRALFNLFSVQMRLGLFDGDPANGLYGNLGPKDVCTKEHRTLALEAARQ 413

Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT 471
           GIVLLKND+  LPL+ +++ ++A++GP A+    + G Y GIPC+  S + G   Y   T
Sbjct: 414 GIVLLKNDKKFLPLDKSRISSLAIIGPQADQPF-LGGGYTGIPCKPESLVEGLKTYVEKT 472

Query: 472 -YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLI 530
            +  GC DV C S+     A   A+ AD  +++AGLDLS E E  DR  L LPG Q  LI
Sbjct: 473 SFAAGCVDVPCLSDTGFDEAVSIARKADIVVVVAGLDLSQETEDHDRVSLLLPGKQMALI 532

Query: 531 NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
           + VA   + P++LV+   G +D++FAE +  I +ILW GYPGE G +A+A+++FG FNPG
Sbjct: 533 SSVASAIQKPLVLVLTGGGPLDVSFAEQDPRIASILWIGYPGEAGAKALAEIIFGDFNPG 592

Query: 591 GRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNL 650
           GRLP+TWY   + + +P+  M +R     GYPGRTY+FY G  +Y FG GLSYT+F Y  
Sbjct: 593 GRLPMTWYPESFTR-VPMNDMNMRADPYRGYPGRTYRFYIGHRVYGFGQGLSYTKFAYQF 651

Query: 651 LSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR------CDDY-FEFKVDFQN 703
           +S         NKL   R+ +  S  +  R     VN         CD   F  ++   N
Sbjct: 652 VSAP-------NKLNLLRSSDTVSSKNLPRQRREEVNYFHIEELDTCDSLRFHVEISVTN 704

Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
           VG  DGS VV+++S+ P  +  T  KQ+IGF RV   + R+     + + C+  +I +  
Sbjct: 705 VGDMDGSHVVMLFSRVPKIVKGTPEKQLIGFSRVHTVSRRSTETSIMVDPCEHFSIANEQ 764

Query: 764 ANTLLPAGEHTIFVGN 779
              ++P G+HTI +G+
Sbjct: 765 GKRIMPLGDHTIMLGD 780


>gi|212275712|ref|NP_001130324.1| uncharacterized protein LOC100191418 precursor [Zea mays]
 gi|194688848|gb|ACF78508.1| unknown [Zea mays]
 gi|413938927|gb|AFW73478.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 780

 Score =  680 bits (1754), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 349/762 (45%), Positives = 481/762 (63%), Gaps = 32/762 (4%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           +S P + C  G          +  FCD+ LP   RV DLVSRMT+ EK+ QLGD +  +P
Sbjct: 30  ASEPPYTCGAG-------APPNIPFCDAGLPIDRRVDDLVSRMTVAEKISQLGDQSPAIP 82

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P Y+WWSEALHG+SN G G H D  +  ATSFP VILT ASFN  LW +IGQ +  
Sbjct: 83  RLGVPAYKWWSEALHGISNQGRGIHLDGPLRAATSFPQVILTAASFNPHLWYRIGQVIGV 142

Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
           EARA+YN G+A GLT+W+PNINV RDPRWGR  ETPGEDP + G+YA  +VRG+Q   G+
Sbjct: 143 EARAVYNNGQAEGLTFWAPNINVFRDPRWGRGQETPGEDPTMTGKYAAVFVRGVQ---GY 199

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
             A  +NS  L+ S+CCKH+ AYD++NWKGV RY FDA+VT QD+ +T+  PF+ CV++G
Sbjct: 200 GLAGPVNSTGLEASACCKHFTAYDLENWKGVTRYVFDAKVTAQDLADTYNPPFKSCVEDG 259

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
            AS +MCSYNRVNG+P+CAD  LL+ T R +W  +GYI +DCD++ ++ D   + A + E
Sbjct: 260 HASGIMCSYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDAVAIIHDAQGY-AKTAE 318

Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
           DAVA  LKAG+D++CG Y  +   +A+QQGK+ E DI+++L  L+ V MRLG F+G P+ 
Sbjct: 319 DAVADVLKAGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRR 378

Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKND--QNTLPLNSAKVKTVAVVGPHANAT 443
             Y  +G   +C+ E+ +LA EAA++GIVLLKND     LPL+   V ++AV+G +AN  
Sbjct: 379 NLYGDIGPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDA 438

Query: 444 VAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATII 502
           + + GNY G PC  ++P+    GY  + ++  GC+  AC    +I  A +AA +AD+ ++
Sbjct: 439 IRLRGNYFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAACNVT-TIPEAVQAASSADSVVL 497

Query: 503 LAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
             GLD   E E +DR DL LPG Q  LI  VA  AK PVILV++  G VD++FA+TN  I
Sbjct: 498 FMGLDQDQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKI 557

Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
            AILWAGYPGE GG AIA V+FG+ NPGGRLP+TWY  D+ + +P+T M +R   + GYP
Sbjct: 558 GAILWAGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTR-VPMTDMRMRADPATGYP 616

Query: 623 GRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ--VNLNKLQHCRNLNYTSDASKTR 680
           GRTY+FY GPT++ FGYGLSY+++ +   +          L  ++       + D     
Sbjct: 617 GRTYRFYRGPTVFNFGYGLSYSKYSHRFATKPPPTSNVAGLKAVEATAGGMASYDVEA-- 674

Query: 681 CPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP--AEIAATYIKQVIGFQRV 737
                +    CD   F   V  QN G  DG   V+V+ + P   + +     Q+IGFQ +
Sbjct: 675 -----IGSETCDRLKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSL 729

Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            +RA +   ++F  + CK  +        ++  G H + VG 
Sbjct: 730 HLRATQTAHVEFEVSPCKHFSRATEDGRKVIDQGSHFVMVGE 771


>gi|222618262|gb|EEE54394.1| hypothetical protein OsJ_01415 [Oryza sativa Japonica Group]
          Length = 776

 Score =  677 bits (1748), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 371/799 (46%), Positives = 483/799 (60%), Gaps = 114/799 (14%)

Query: 36  VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
           VCDP RF+  GL M+ F +CD+SLPY+ RV+DLV RMTL+EKV  LGD A G PR+GLP+
Sbjct: 46  VCDPARFAAAGLDMAGFPYCDASLPYADRVRDLVGRMTLEEKVANLGDRAGGAPRVGLPR 105

Query: 96  YEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR--- 152
           Y             G G          T+ PT     ++  + +W++  +     AR   
Sbjct: 106 Y------------CGGGRR-------CTACPT-----SARRDVVWRRRARRHQLPARHQQ 141

Query: 153 ---------------------AMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVV 191
                                 MYNLG A LTYWSPNINV RDPRWGR +ETPGEDPFVV
Sbjct: 142 RRVVQRDAVARHRRRGVDGDQGMYNLGHAELTYWSPNINVVRDPRWGRASETPGEDPFVV 201

Query: 192 GRYAVNYVRGLQDVEGHENATDLN------SRPLKVSSCCKHYAAYDVDNWKGVDRYHFD 245
           GRYAVN+VRG+QD++G   A          SRP+KVSSCCKHYAA               
Sbjct: 202 GRYAVNFVRGMQDIDGATTAASAAAATDAFSRPIKVSSCCKHYAA--------------- 246

Query: 246 ARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGY 305
                                      VMCSYNR+NG+P+CAD +LL +TVR +W LHGY
Sbjct: 247 --------------------------CVMCSYNRINGVPACADARLLTETVRRDWQLHGY 280

Query: 306 IVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY-------YTNFTGNAVQQG 358
           IV+DCDS++VMV + K+L  +  +A A  +KAGLDLDCG +       +T +  +AV+QG
Sbjct: 281 IVSDCDSVRVMVRDAKWLGYTGVEATAAAMKAGLDLDCGMFWEGVHDFFTTYGVDAVRQG 340

Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKN 418
           K+KE+ +D +L  LY  LMRLGFFDG P+  SLG  D+C++E+ ELAA+AAR+G+VLLKN
Sbjct: 341 KLKESAVDNALTNLYLTLMRLGFFDGIPELESLGAADVCTEEHKELAADAARQGMVLLKN 400

Query: 419 DQNTLPLNSAKVKTVAVVGP--HANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGC 476
           D   LPL+  KV +VA+ G   H NAT  M+G+Y G PCR ++P  G     + T    C
Sbjct: 401 DAALLPLSPEKVNSVALFGQLQHINATDVMLGDYRGKPCRVVTPYDGVRKVVSSTSVHAC 460

Query: 477 DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEV 536
           D  +C +      A+ AAKT DATI++AGL++SVE ES DREDL LP  Q   IN VAE 
Sbjct: 461 DKGSCDT------AAAAAKTVDATIVVAGLNMSVERESNDREDLLLPWSQASWINAVAEA 514

Query: 537 AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT 596
           +  P++LVIMSAGGVD++FA+ N  I A++WAGYPGEEGG AIADV+FGK+NPGGRLP+T
Sbjct: 515 SPSPIVLVIMSAGGVDVSFAQDNPKIGAVVWAGYPGEEGGTAIADVLFGKYNPGGRLPLT 574

Query: 597 WYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP-TLYPFGYGLSYTQFKYNLLSFTK 655
           WY  +YV  +P+TSM LRP    GYPGRTYKFY G   LYPFG+GLSYT F Y   +   
Sbjct: 575 WYKNEYVSKIPMTSMALRPDAEHGYPGRTYKFYGGADVLYPFGHGLSYTNFTYASATAAA 634

Query: 656 TIQVNLNKLQHCRNLNYTSD-ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
            + V +   ++C+ L Y +  +S   CP V V    C +   F V   N G  DG+ VV 
Sbjct: 635 PVTVKVGAWEYCKQLTYKAGVSSPPACPAVNVASHACQEEVSFAVTVANTGGRDGTHVVP 694

Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
           +Y+ PPAE+     KQ++ F+RV V AG    + F  N CK+  IV+  A T++P+G   
Sbjct: 695 MYTAPPAEVDGAPRKQLVAFRRVRVAAGAAVEVAFALNVCKAFAIVEETAYTVVPSGVSR 754

Query: 775 IFVGNGG--VSFPIHLNFN 791
           + VG+    +SFP+ ++  
Sbjct: 755 VLVGDDALSLSFPVQIDLQ 773


>gi|85813770|emb|CAJ65921.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
          Length = 704

 Score =  675 bits (1741), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 365/717 (50%), Positives = 465/717 (64%), Gaps = 53/717 (7%)

Query: 3   KVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYS 62
           KVV  L C       LVF +  V A   SSPVF CD      L    +S  FC++S+  +
Sbjct: 14  KVVFLLFCM-----FLVFLSTHVSAQ--SSPVFACDVVSNPSL----ASLGFCNTSIGIN 62

Query: 63  IRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGA 122
            RV DLV R+TL EK+  L + A  V RLG+P+YEWWSEALHGVS VGPGTHF D + GA
Sbjct: 63  DRVVDLVKRLTLQEKIVFLVNSAGNVSRLGIPKYEWWSEALHGVSYVGPGTHFSDDVAGA 122

Query: 123 TSFPTVILTTASFNESLWKKIG-----QAVSTEARAMYNLGRAGLTYWSPNINVARDPRW 177
           TSFP VILT ASFN SL++ IG     Q VSTEARAMYN+G AGLT+WSPNIN+ RDPRW
Sbjct: 123 TSFPQVILTAASFNTSLFEAIGKVYYTQVVSTEARAMYNVGLAGLTFWSPNINIFRDPRW 182

Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWK 237
           GR  ETPGEDP +  +Y   YV+GLQ  +      D +   LKV++CCKHY AYD+DNWK
Sbjct: 183 GRGQETPGEDPLLSSKYGSCYVKGLQQRD------DGDPDKLKVAACCKHYTAYDLDNWK 236

Query: 238 GVDRYHFDARV-TEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
           G DRYHF+A V T+QDM++TF  PF+ CV +G+ +SVMCSYN+VNG P+CADP LL+  +
Sbjct: 237 GSDRYHFNAVVVTKQDMDDTFQPPFKSCVIDGNVASVMCSYNQVNGKPTCADPDLLSGVI 296

Query: 297 RGEWDLHGY-------IVADCDSIQVMVDNHKFLADSKEDA-----VAQTLKAGLDLDCG 344
           RGEW+L+GY       IV DCDS+ V   +  +    +E A        +L  G+DL+CG
Sbjct: 297 RGEWNLNGYQWGCCRYIVTDCDSLDVFYKSQNYTKTPEEAAAAAILAGNSLVTGVDLNCG 356

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDEN 401
            +    T  AV+ G V E  ID ++   +  LMRLGFFDG P    Y  LG +D+C+ EN
Sbjct: 357 SFLGQHTEAAVKGGLVNEHAIDIAVSNNFATLMRLGFFDGDPSKQLYGKLGPKDVCTAEN 416

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY-AGIPCRYMSP 460
            ELA EAAR+GIVLLKN   +LPL+   +K +AV+GP+AN T  MIGNY  G PC+Y +P
Sbjct: 417 QELAREAARQGIVLLKNTAGSLPLSPTAIKNLAVIGPNANVTKTMIGNYEGGTPCKYTTP 476

Query: 461 IAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
           + G +     TY  GC +VAC S   +  A + A  ADAT+++ G DLS+EAES DR D+
Sbjct: 477 LQGLAASVATTYLPGCSNVAC-STAQVDDAKKLAAAADATVLVMGADLSIEAESRDRVDV 535

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
            LPG Q  LI  VA V+ GPVILVIMS GG+D++FA TN  I +ILW GYPGE GG AIA
Sbjct: 536 LLPGQQQLLITAVANVSCGPVILVIMSGGGMDVSFARTNDKITSILWVGYPGEAGGAAIA 595

Query: 581 DVVFGKFNPG----GRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
           D++FG +NP     GRLP+TWY   YV  +P+T+M +RP  S GYPGRTY+FY G T+Y 
Sbjct: 596 DIIFGYYNPSTHQPGRLPMTWYPQSYVDKVPMTNMNMRPDPSNGYPGRTYRFYTGETVYS 655

Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
           FG GLSY+QF + L+   + + V L +   C +         + C  V+ ++  C +
Sbjct: 656 FGDGLSYSQFTHELIQAPQLVYVPLEESHVCHS---------SECQSVVASEQTCQN 703


>gi|384872601|gb|AFI25186.1| putative beta-D-xylosidase [Nicotiana tabacum]
          Length = 791

 Score =  673 bits (1737), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/747 (44%), Positives = 468/747 (62%), Gaps = 28/747 (3%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + FC+ +LP S RV+ L+S +T+DEK+  L D    +PRLGLP YEWWSE+LHG++  GP
Sbjct: 41  YTFCNKNLPISTRVQSLISLLTIDEKILHLSDNTTSIPRLGLPAYEWWSESLHGIATNGP 100

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
             +F+  I G TSFP VILT A+FN +LW  I  A++ EARAMYNLG+AGLT+W+PNIN+
Sbjct: 101 AVNFNGQIKGVTSFPQVILTAAAFNRTLWHSIATAIAVEARAMYNLGQAGLTFWAPNINI 160

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV-----EGHENATDLNSRPLK------ 220
            RDPRWGR  ETPGEDP VV  YA+ YV G Q +     +G+ N      R LK      
Sbjct: 161 LRDPRWGRGQETPGEDPMVVSAYAIEYVTGFQGLNPKAKKGNRNGYGKKRRVLKEDDNDG 220

Query: 221 ----VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
               +S+CCKH+ AYD++ W    RY F+A VT+QDME+TF  PF  C+++G AS +MCS
Sbjct: 221 ERLMLSACCKHFTAYDLEKWGDATRYDFNAVVTKQDMEDTFQAPFRSCIQQGKASCLMCS 280

Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
           YN VNG+P+CAD +LL++ VR +W   GYI +DCD++  + +N K+   + EDAVA  LK
Sbjct: 281 YNSVNGVPACADKELLDK-VRTDWGFDGYITSDCDAVATIYENQKY-TKTPEDAVAVALK 338

Query: 337 AGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QYVSLGK 393
           AG +++CG Y      +A QQG V E D+D++L+YL++V  RLG FDG+P   Q+ + G 
Sbjct: 339 AGTNINCGTYMLRHMKSAFQQGSVLEEDLDRALQYLFSVQFRLGLFDGNPADGQFANFGA 398

Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
           QD+C+  ++ LA +AAR+GIVLLKNDQ  LPL+   V T+A+VGP AN + +  G Y+G+
Sbjct: 399 QDVCTSNHLNLALDAARQGIVLLKNDQKFLPLDKTSVSTLAIVGPMANVS-SPGGTYSGV 457

Query: 454 PCRYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
           PC+  S   GF  + N T Y  GC DV C S      A    K AD  I++AG DLS E 
Sbjct: 458 PCKLKSIREGFHRHINRTLYAAGCLDVGCNSTAGFQDAISIVKEADYVIVVAGSDLSEET 517

Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
           E  DR  L LPG QT L+  +A  +K P+ILV+   G VD++FAE +  I +ILW  YPG
Sbjct: 518 EDHDRYSLLLPGQQTNLVTTLAAASKKPIILVLTGGGPVDVSFAEKDPRIASILWVAYPG 577

Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
           E GG+A+++++FG  NPGG+LP+TWY   + + +P+T M +R   S GYPGRTY+FY G 
Sbjct: 578 ETGGKALSEIIFGYQNPGGKLPMTWYLESFTK-VPMTDMNMRADPSNGYPGRTYRFYTGD 636

Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC- 691
            LY FG+GLSYT F   LLS    + ++L K    R++       ++R   + V+++   
Sbjct: 637 VLYGFGHGLSYTSFSSQLLSAPSRLSLSLAKSNRKRSI---LAKGRSRLGYIHVDEVESC 693

Query: 692 -DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
               F   +   N G  DGS V++++S+          KQ++GF RV V A +      +
Sbjct: 694 HSSKFFVHISVTNDGDMDGSHVLMLFSRVLQNFQGAPQKQLVGFDRVHVPARKYVETSLL 753

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFV 777
            + C+  +  +   N +L  GEHT  +
Sbjct: 754 VDPCELFSFANDQGNRILALGEHTFIL 780


>gi|115485165|ref|NP_001067726.1| Os11g0297800 [Oryza sativa Japonica Group]
 gi|62734696|gb|AAX96805.1| beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|77549999|gb|ABA92796.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
           expressed [Oryza sativa Japonica Group]
 gi|113644948|dbj|BAF28089.1| Os11g0297800 [Oryza sativa Japonica Group]
 gi|125534139|gb|EAY80687.1| hypothetical protein OsI_35869 [Oryza sativa Indica Group]
 gi|215766717|dbj|BAG98945.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 782

 Score =  671 bits (1732), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 345/739 (46%), Positives = 472/739 (63%), Gaps = 34/739 (4%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           FCD++LP   R  DLV+R+T  EKV QLGD A GVPRLG+P Y+WWSEALHG++  G G 
Sbjct: 52  FCDATLPAEQRAADLVARLTAAEKVAQLGDQAAGVPRLGVPAYKWWSEALHGLATSGRGL 111

Query: 114 HFDDVIPG-----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSP 167
           HFD   PG     ATSFP V+LT A+F++ LW +IGQA+ TEARA+YN+G+A GLT WSP
Sbjct: 112 HFD--APGSAARAATSFPQVLLTAAAFDDDLWFRIGQAIGTEARALYNIGQAEGLTMWSP 169

Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
           N+N+ RDPRWGR  ETPGEDP +  +YAV +V+G+Q   G+ +A       L+ S+CCKH
Sbjct: 170 NVNIFRDPRWGRGQETPGEDPTMASKYAVAFVKGMQ---GNSSAI------LQTSACCKH 220

Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
             AYD+++W GV RY+F+A+VT QD+E+T+  PF  CV +  A+ +MC+Y  +NG+P+CA
Sbjct: 221 VTAYDLEDWNGVQRYNFNAKVTAQDLEDTYNPPFRSCVVDAKATCIMCAYTGINGVPACA 280

Query: 288 DPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
           +  LL +TVRG+W L GYI +DCD++ +M D  ++   + EDAVA  LKAGLD++CG Y 
Sbjct: 281 NADLLTKTVRGDWGLDGYIASDCDAVAIMRDAQRY-TQTPEDAVAVALKAGLDMNCGTYM 339

Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDENIE 403
                 A+QQGK+ E DIDK+LK L+ + MRLG FDG P+    Y  LG  DIC+ E+  
Sbjct: 340 QQHATAAIQQGKLTEEDIDKALKNLFAIRMRLGHFDGDPRSNSVYGGLGAADICTPEHRS 399

Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
           LA EAA +GIVLLKND   LPL+   V + AV+GP+AN  +A+IGNY G PC   +P+ G
Sbjct: 400 LALEAAMDGIVLLKNDAGILPLDRTAVASAAVIGPNANDGLALIGNYFGPPCESTTPLNG 459

Query: 464 FSGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
             GY  NV +  GC+  AC    +  AA+ A+ ++D   +  GL    E+E  DR  L L
Sbjct: 460 ILGYIKNVRFLAGCNSAACDVAATDQAAAVAS-SSDYVFLFMGLSQKQESEGRDRTSLLL 518

Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
           PG Q  LI  VA+ AK PVILV+++ G VD+ FA+TN  I AILWAGYPG+ GG AIA V
Sbjct: 519 PGEQQSLITAVADAAKRPVILVLLTGGPVDVTFAQTNPKIGAILWAGYPGQAGGLAIARV 578

Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
           +FG  NPGGRLP+TWY  ++ + +P+T M +R   + GYPGR+Y+FY G T+Y FGYGLS
Sbjct: 579 LFGDHNPGGRLPVTWYPEEFTK-VPMTDMRMRADPATGYPGRSYRFYQGKTVYKFGYGLS 637

Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK---- 698
           Y+ +   L+S  K  +   N L   R    TS+  ++      + ++  D   + K    
Sbjct: 638 YSSYSRQLVSGGKPAESYTNLLASLRTTT-TSEGDES----YHIEEIGTDGCEQLKFPAV 692

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
           V+ QN G  DG   V++Y + P         Q+IGF+   ++ G    I+F  + C+  +
Sbjct: 693 VEVQNHGPMDGKHSVLMYLRWPNAKGGRPTTQLIGFRSQHLKVGEKANIRFDISPCEHFS 752

Query: 759 IVDYAANTLLPAGEHTIFV 777
            V      ++  G H + V
Sbjct: 753 RVRKDGKKVIDRGSHYLMV 771


>gi|357489431|ref|XP_003615003.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355516338|gb|AES97961.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 780

 Score =  671 bits (1732), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 353/798 (44%), Positives = 502/798 (62%), Gaps = 41/798 (5%)

Query: 11  FSLSIALL-VFSTN-----AVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIR 64
           FS++I  + +F T        D+  ++ P + CD            SF FC+ +L  + R
Sbjct: 4   FSITITFIFLFLTRYHRLVHADSLATNVPPYSCDTSN-----PLTKSFPFCNLNLTITQR 58

Query: 65  VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATS 124
            KD+VSR+TLDEK+ QL + A  +PRLG+P Y+WW+EALHGVS VG G   +  I  ATS
Sbjct: 59  AKDIVSRLTLDEKISQLVNTAPAIPRLGIPSYQWWNEALHGVSYVGKGIRLNGSITAATS 118

Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITET 183
           FP +IL  ASF+  LW +I + + TEAR +YN G+A G+T+W+PNIN+ RDPRWGR  ET
Sbjct: 119 FPQIILIAASFDPKLWYRISKVIGTEARGVYNAGQAQGMTFWAPNINIFRDPRWGRGQET 178

Query: 184 PGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYH 243
            GEDP V  +Y V+YVRGLQ  +  E    +  R LK S+CCKH+ AYD++NWKGV+RY 
Sbjct: 179 AGEDPLVNSKYGVSYVRGLQG-DSFEGGKLIGGR-LKASACCKHFTAYDLENWKGVNRYV 236

Query: 244 FDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLH 303
           FDA+VT QD+ +T+   F  CV +G +S +MC+YNRVNG+P+CAD  LL  T R +W+ +
Sbjct: 237 FDAKVTLQDLADTYQPSFHSCVVQGRSSGIMCAYNRVNGVPNCADYNLLTNTARKKWNFN 296

Query: 304 GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKET 363
           GYI +DCD+++ + +   + A + ED VA  L+AG+D++CG Y T    +AV Q K+  +
Sbjct: 297 GYIASDCDAVRFIYEKQGY-AKTPEDVVADVLRAGMDVECGNYMTKHAKSAVLQKKIPIS 355

Query: 364 DIDKSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
            ID++L  L+T+ +RLG FDG+P   QY  +G   +CS EN++LA EAAR GIVLLKN  
Sbjct: 356 QIDRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALEAARSGIVLLKNTA 415

Query: 421 NTLPLNSAKVKTVAVVGPHAN-ATVAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDD 478
           + LPL   +V T+ V+GP+AN +++ ++GNY G PC+ +S + GF  YA+ T Y++GC D
Sbjct: 416 SILPL--PRVNTLGVIGPNANKSSIVLLGNYFGQPCKQVSILKGFYTYASQTHYRSGCTD 473

Query: 479 VACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAK 538
               ++  I  A E AK +D  I++ GLD S E E+LDR+ L LPG Q +LIN VA+ +K
Sbjct: 474 GVKCASAEIDRAVEVAKISDYVILVMGLDQSQETETLDRDHLELPGKQQKLINSVAKASK 533

Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
            PVILVI+  G VDI FA+ N  I  I+WAGYPGE GGRA+A VVFG +NPGGRLP+TWY
Sbjct: 534 KPVILVILCGGPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFGDYNPGGRLPMTWY 593

Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
             D+++ +P+T M +R   S GYPGRTY+FY GP +Y FGYGLSY+ + YN +S  K   
Sbjct: 594 PKDFIK-IPMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSNYSYNFIS-VKNNN 651

Query: 659 VNLNK------LQHCRNLNY--TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGS 710
           +++N+      L++   + Y   S+  K  C  + ++           +   N GS  G 
Sbjct: 652 IHINQSTTHSILENSETIRYKLVSELGKKACKTMSIS---------VTLGITNTGSMAGK 702

Query: 711 DVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA 770
             V+++ KP        +KQ++GF+ V V  G    + F  + C+ L+  + +   ++  
Sbjct: 703 HPVLLFVKPKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVSVCEHLSRANESGVKVIEE 762

Query: 771 GEHTIFVGNGGVSFPIHL 788
           G +   VG    S  I L
Sbjct: 763 GGYLFLVGELEYSINITL 780


>gi|253761874|ref|XP_002489311.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
 gi|241946959|gb|EES20104.1| hypothetical protein SORBIDRAFT_0010s012040 [Sorghum bicolor]
          Length = 791

 Score =  671 bits (1731), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 353/761 (46%), Positives = 465/761 (61%), Gaps = 35/761 (4%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           + +P F C P   SK         FC+  LP S R  DLVSRMT  EK  QLGD A+GVP
Sbjct: 44  AGAPPFSCGPSSPSK------GLPFCNMKLPASQRAADLVSRMTPAEKASQLGDIANGVP 97

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P Y+WW+EALHGV+  G G H +  +  ATSFP V+ T ASFN++LW +IGQA   
Sbjct: 98  RLGVPSYKWWNEALHGVAISGKGIHMNQGVRSATSFPQVLHTAASFNDNLWFRIGQATGK 157

Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
           EARA YN+G+A GLT WSPN+N+ RDPRWGR  ETPGEDP V  RY   +VRGLQ   G 
Sbjct: 158 EARAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYGAAFVRGLQ---GS 214

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
            + T      L+ S+CCKH  AYD+++WKGV RY F A VT QD+ +TF  PF  CV +G
Sbjct: 215 SSNTKSVPPVLQTSACCKHATAYDLEDWKGVSRYSFKATVTIQDLADTFNPPFRSCVVDG 274

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
            AS VMC+Y  VNG+PSCA+  LL +T RG W L GY+ ADCD++ +M  N +F   + E
Sbjct: 275 KASCVMCAYTIVNGVPSCANGDLLTKTFRGSWGLDGYVAADCDAVAIM-RNSQFYRPTAE 333

Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
           D VA TLKAGLD+DCG Y   +   A+Q+GK+ + D+DK++K L T  MRLG FDG P+ 
Sbjct: 334 DTVAATLKAGLDIDCGPYIQQYAMAAIQKGKLTQQDVDKAVKNLLTTRMRLGHFDGDPKT 393

Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
             Y +LG   IC+ E+  LA EAA +GIVLLKN    LPL    V + AV+G +AN  +A
Sbjct: 394 NVYGNLGAGHICTAEHKNLALEAALDGIVLLKNSAGVLPLKRGTVNSAAVIGHNANDVLA 453

Query: 446 MIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
           ++GNY G PC   +P+ G  GY  NV +  GC+  AC    +   A+  A ++DA I+  
Sbjct: 454 LLGNYWGPPCAPTTPLQGIQGYVKNVKFLAGCNKAACNV-AATPQATALASSSDAVILFM 512

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           GL    E+E  DR  L LPG Q  LIN VA  AK PVILV+++ G VDI FA+ N  I A
Sbjct: 513 GLSQEQESEGKDRTTLLLPGNQQSLINAVANAAKRPVILVLLTGGPVDITFAQANPKIGA 572

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           ILWAGYPG+ GG AIA V+FG+ NP G+LP TWY  ++ + +P+T M +R   S  YPGR
Sbjct: 573 ILWAGYPGQAGGLAIAKVLFGEKNPSGKLPNTWYPEEFTR-IPMTDMRMRAAGS--YPGR 629

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC------RNLNYTSDASK 678
           TY+FYNG T+Y FGYGLSY++F + +++  K    N + L          NL+Y  +   
Sbjct: 630 TYRFYNGKTIYKFGYGLSYSKFSHRVVTGRKNPAHNTSLLAAGLAAMTEDNLSYHVEH-- 687

Query: 679 TRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
                  + D+ CD   F   V  QN G  DG    +++ + P+       +Q+IGFQ  
Sbjct: 688 -------IGDVVCDQLKFLAVVKVQNHGPIDGKHTALMFLRWPSATDGRPTRQLIGFQSQ 740

Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
            ++AG    ++F  + C+  + V      ++  G H + VG
Sbjct: 741 HIKAGEKANLRFEVSPCEHFSRVRQDGRKVIDKGSHFLKVG 781


>gi|357156904|ref|XP_003577615.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 767

 Score =  669 bits (1726), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 362/758 (47%), Positives = 476/758 (62%), Gaps = 33/758 (4%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           +  P F C        G   SS+ FCD++LP + R  DLVSR+T  EKV QLGD A GVP
Sbjct: 22  AGDPPFSC--------GQASSSYAFCDAALPVAQRAADLVSRLTAAEKVAQLGDEAAGVP 73

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P Y+WW+EALHG++  G G HFD  +  ATSFP V LT A+F++ LW +IGQA+  
Sbjct: 74  RLGVPGYKWWNEALHGLATSGKGLHFDGAVRSATSFPQVCLTAAAFDDDLWFRIGQAIGR 133

Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
           EARA+YNLG+A GLT WSPN+N+ RDPRWGR  ETPGEDP    RYAV +VRG+Q     
Sbjct: 134 EARALYNLGQAEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYAVAFVRGMQG---- 189

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
            N+T L    L+ S+CCKH  AYD+++W GV RY+FDA+VT QD+E+TF  PF  CV +G
Sbjct: 190 -NSTSL----LQASACCKHATAYDLEDWNGVARYNFDAKVTAQDLEDTFNPPFRSCVVDG 244

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
            AS VMC+Y  +NG+P+CA+  LL +TVRG+W L GY  +DCD++ +M D  ++ A S E
Sbjct: 245 KASCVMCAYTGINGVPACANADLLTKTVRGDWGLDGYTASDCDAVAIMRDAQRY-AQSPE 303

Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
           DAVA  LKAGLD+DCG Y       A+QQGK+ E DIDK+LK L+ + MRLG FDG P+ 
Sbjct: 304 DAVALALKAGLDIDCGTYMQQHAAAAIQQGKITEEDIDKALKNLFAIRMRLGHFDGDPRT 363

Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
             Y  LG  DIC+ E+  LA +AA++GIVLLKND   LPL+ A V + AV+GP+AN   A
Sbjct: 364 NMYGGLGAADICTAEHRSLALDAAQDGIVLLKNDAGILPLDRAAVASTAVIGPNANNPGA 423

Query: 446 MIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
           +I NY G PC   +P+ G  GY  +  +  GC   AC    +  AA+  A T+D   +  
Sbjct: 424 LIANYFGPPCESTTPLKGIQGYVKDARFLAGCSSTACDVATTDQAAA-LASTSDYVFLFM 482

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           GL    E+E  DR  L LPG Q  LI  VA+ A+ PVILV++S G VD+ FA+TN  I A
Sbjct: 483 GLGQRQESEGRDRTSLLLPGKQQSLITAVADAAQRPVILVLLSGGPVDVTFAQTNPKIGA 542

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           ILWAGYPG+ GG AIA V+FG  NP GRLP+TWY  ++   +P+T M +R   + GYPGR
Sbjct: 543 ILWAGYPGQAGGLAIARVLFGDHNPSGRLPVTWYPEEFTN-VPMTDMRMRADPANGYPGR 601

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSF-TKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
           +Y+FY G T+Y FGYGLSY+ +   LLS  T T   N + L    +L  T  +++     
Sbjct: 602 SYRFYQGKTVYKFGYGLSYSSYSRRLLSSGTSTPAPNADLLA---SLTTTMPSAENILGS 658

Query: 684 VLVNDLRCDD----YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
             V  +         F   V+ QN G  DG   V++Y + P   A    +Q+IGF++  +
Sbjct: 659 YHVEQIGAQGCEMLKFPAVVEVQNHGPMDGKQSVLMYLRWPNATAGRPERQLIGFKKEHL 718

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
           +AG    IKF    C+ L+ V    N ++  G H + V
Sbjct: 719 KAGEKAHIKFEIRPCEHLSRVREDGNKVIDRGSHFLRV 756


>gi|371917284|dbj|BAL44718.1| SlArf/Xyl3 [Solanum lycopersicum]
          Length = 777

 Score =  669 bits (1725), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/789 (43%), Positives = 482/789 (61%), Gaps = 35/789 (4%)

Query: 11  FSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVS 70
           F   I +L+F         S+ P F CD           SS+ FC+++LP   RV DLVS
Sbjct: 13  FIFVILVLLFRRTE-----STKPPFSCDSSN-----PNTSSYPFCNAALPIPQRVNDLVS 62

Query: 71  RMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVIL 130
           R+T+DEK+ QL + A  +PRLG+  YEWWSE LHG+S  G GT F+  I  AT FP +IL
Sbjct: 63  RLTVDEKILQLVNGAPEIPRLGISAYEWWSEGLHGISRHGKGTLFNGTIKAATQFPQIIL 122

Query: 131 TTASFNESLWKKIGQAVSTEARAMYNLGR-AGLTYWSPNINVARDPRWGRITETPGEDPF 189
           T +SF+E+LW +I QA+  EARA+YN G+  G+T W+PNIN+ RDPRWGR  ETPGEDP 
Sbjct: 123 TASSFDENLWYRIAQAIGREARAVYNAGQLKGITLWAPNINILRDPRWGRGQETPGEDPM 182

Query: 190 VVGRYAVNYVRGLQ--DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
           +VG+Y V YVRGLQ    EG +    L    L+ S+CCKH+ A D+DNW    RY FDA+
Sbjct: 183 MVGKYGVAYVRGLQGDSFEGGK----LKDGHLQTSACCKHFIAQDMDNWHNFSRYTFDAQ 238

Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
           V +QD+ +++  PF+ CV++G ASSVMC+YN VNGIP+CA+  LL  T RG+W L GYIV
Sbjct: 239 VLKQDLADSYEPPFKDCVEQGKASSVMCAYNLVNGIPNCANFDLLTTTARGKWGLQGYIV 298

Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDK 367
           +DCD++  M     + A   EDAVA TLKAG+D++CG +   +T +A+++ KVKE+DID+
Sbjct: 299 SDCDAVDKMYSEQHY-AKEPEDAVAATLKAGMDVNCGSHLKTYTKSALEKQKVKESDIDR 357

Query: 368 SLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLP 424
           +L  L++V MRLG F+G P   +Y  +   ++CS+E+  LA EAAR G VLLKN    LP
Sbjct: 358 ALHNLFSVRMRLGLFNGDPSKLEYGDISAAEVCSEEHRALAVEAARSGSVLLKNSNRLLP 417

Query: 425 LNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKS 483
           L+  K  ++AV+GP AN +  ++GNY G  C+ ++   G  GY AN  Y  GCD + C S
Sbjct: 418 LSKMKTASLAVIGPKANDSEVLLGNYEGFSCKNVTLFQGLQGYVANTMYHPGCDFINCTS 477

Query: 484 NNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVIL 543
             +I  A   AK AD  +++ GLD ++E E  DR +L LPG Q +LI  +AE A  PVIL
Sbjct: 478 P-AIDEAVNIAKKADYVVLVMGLDQTLEREKFDRTELGLPGMQEKLITSIAEAASKPVIL 536

Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYV 603
           V+M  G VD+ FA+ N  I  ILW GYPGE G  A+A ++FG+ NPGGR P+TWY  ++ 
Sbjct: 537 VLMCGGPVDVTFAKDNPKIGGILWVGYPGEGGAAALAQILFGEHNPGGRSPVTWYPKEF- 595

Query: 604 QMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNK 663
             + +  M +RP  S GYPGRTY+FYNGP ++ FGYGLSYT + Y   S +K      N+
Sbjct: 596 NKVAMNDMRMRPESSSGYPGRTYRFYNGPKVFEFGYGLSYTNYSYTFASVSK------NQ 649

Query: 664 LQHCRNLNYTSDASKTRCPGVLVNDLR---CDD-YFEFKVDFQNVGSTDGSDVVIVYSKP 719
           L   +N        K     + V+D+    C+      KV  +N G   G   V+++ K 
Sbjct: 650 LLF-KNPKINQSTEKGSVLNIAVSDVGPEVCNSAMITVKVAVKNQGEMAGKHPVLLFLKH 708

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            + +     K +IGF+ V + AG N ++ F    C+     +     ++  G+H + +G+
Sbjct: 709 SSTVDEVPKKTLIGFKSVNLEAGANTQVTFDVKPCEHFTRANRDGTLVIDEGKHFLLLGD 768

Query: 780 GGVSFPIHL 788
                P+ L
Sbjct: 769 QEYPIPVSL 777


>gi|224058158|ref|XP_002299457.1| predicted protein [Populus trichocarpa]
 gi|222846715|gb|EEE84262.1| predicted protein [Populus trichocarpa]
          Length = 780

 Score =  669 bits (1725), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 334/739 (45%), Positives = 469/739 (63%), Gaps = 20/739 (2%)

Query: 31  SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
           ++P F C P   +       ++ FC+ SLP + R + L+S +TL EK+QQL D A G+PR
Sbjct: 26  ANPQFPCKPPTHN-------TYSFCNKSLPITRRAQSLISHLTLQEKIQQLSDNASGIPR 78

Query: 91  LGLPQYEWWSEALHGVSNVGPGTHFDDVIP--GATSFPTVILTTASFNESLWKKIGQAVS 148
           LG+P YEWWSE+LHG+S  GPG  F +  P   AT FP VI++ ASFN +LW  IG A++
Sbjct: 79  LGIPHYEWWSESLHGISINGPGVSFKNGGPVTSATGFPQVIVSAASFNRTLWFLIGSAIA 138

Query: 149 TEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
            EARAMYN+G+AGLT+W+PNIN+ RDPRWGR  ETPGEDP V   YA+ +V+G Q     
Sbjct: 139 IEARAMYNVGQAGLTFWAPNINIFRDPRWGRGQETPGEDPMVASAYAIEFVKGFQGGHWK 198

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
               ++N   L +S+CCKH  AYD++ W    RY F+A VTEQDME+T+  PF  C+++G
Sbjct: 199 NEDGEINDDKLMLSACCKHSTAYDLEKWGNFSRYSFNAVVTEQDMEDTYQPPFRSCIQKG 258

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
            AS +MCSYN VNG+P+CA   LL Q  R EW   GYI +DCD++  + +   + + S E
Sbjct: 259 KASCLMCSYNEVNGVPACAREDLL-QKPRTEWGFKGYITSDCDAVATIFEYQNY-SKSPE 316

Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-- 386
           DAVA  LKAG+D++CG Y      +AV++GK++E DID++L  L++V +RLG FDG P  
Sbjct: 317 DAVAIALKAGMDINCGTYVLRNAQSAVEKGKLQEEDIDRALHNLFSVQLRLGLFDGDPRK 376

Query: 387 -QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
            Q+  LG +++C+ E+  LA EAAR+GIVLLKND+  LPLN   V ++A++GP AN   +
Sbjct: 377 GQFGKLGPKNVCTKEHKTLALEAARQGIVLLKNDKKLLPLNKKAVSSLAIIGPLANMANS 436

Query: 446 MIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
           + G+Y G PC   S   G   Y   T Y  GC DVAC S+     A   AK AD  II+A
Sbjct: 437 LGGDYTGYPCDPQSLFEGLKAYVKKTSYAIGCLDVACVSDTQFHKAIIVAKRADFVIIVA 496

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           GLDLS E E  DR  L LPG Q  L++ VA  +K PVILV+   G +D++FA+ +  I +
Sbjct: 497 GLDLSQETEEHDRVSLLLPGKQMSLVSSVAAASKKPVILVLTGGGPLDVSFAKGDPRIAS 556

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           ILW GYPGE G +A+A+++FG++NPGGRLP+TWY   + + + +T M +RP  S GYPGR
Sbjct: 557 ILWIGYPGEAGAKALAEIIFGEYNPGGRLPMTWYPESFTE-VSMTDMNMRPNPSRGYPGR 615

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           TY+FY G  +Y FG GLSYT F Y +LS    + ++ +   + R           R   +
Sbjct: 616 TYRFYTGNRVYGFGGGLSYTNFTYKILSAPSKLSLSGSLSSNSRKRILQQGGE--RLSYI 673

Query: 685 LVNDL-RCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
            +N++  CD   F  ++  +NVG+ DG  VV+++S+ P        KQ++GF RV   + 
Sbjct: 674 NINEITSCDSLRFYMQILVENVGNMDGGHVVMLFSRVPTVFRGAPEKQLVGFDRVHTISH 733

Query: 743 RNKRIKFVFNACKSLNIVD 761
           R+  +  + + C+ L++ +
Sbjct: 734 RSTEMSILVDPCEHLSVAN 752


>gi|413925162|gb|AFW65094.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 774

 Score =  668 bits (1724), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/755 (45%), Positives = 465/755 (61%), Gaps = 20/755 (2%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           +  P F C P              FCD +L  + R  DLVSR+T  EK+ QLGD A GVP
Sbjct: 26  AGDPPFSCGPSSAEA----SEGLAFCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVP 81

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P Y+WW+EALHG++  G G HFD  +  ATSFP V+LT A+F++ LW +IGQA+  
Sbjct: 82  RLGVPGYKWWNEALHGLATSGKGLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGR 141

Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
           EARA++N+G+A GLT WSPN+N+ RDPRWGR  ETPGEDP V  RYAV +VRG+Q     
Sbjct: 142 EARALFNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQG---- 197

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
               + +S  L+ S+CCKH  AYD+++W GV RY F ARVTEQD+E+TF  PF  CV E 
Sbjct: 198 ----NSSSSLLQTSACCKHATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEA 253

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
            AS VMC+Y  +NG+P+CA+  LL  TVRG+W L GY+ +DCD++ +M D  ++ A + E
Sbjct: 254 KASCVMCAYTAINGVPACANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRY-APTPE 312

Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
           DAVA +LKAGLD+DCG Y       A+QQGK+ E DIDK+L  LY V MRLG FDG P+ 
Sbjct: 313 DAVAVSLKAGLDIDCGSYVQQHAAAAIQQGKLTEQDIDKALTNLYAVRMRLGHFDGDPRK 372

Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
             Y  LG  DIC+ E+  LA EAA++GIVLLKND   LPL+ + V + AV+GP+AN  +A
Sbjct: 373 NMYGVLGAADICTPEHRNLALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNANDGMA 432

Query: 446 MIGNYAGIPCRYMSPIAGFSGYAN-VTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
           +I NY G PC   +P+ G   Y N V +  GC+  AC    +  A + A  + D   +  
Sbjct: 433 LIANYFGPPCESTTPLKGLQSYVNDVRFLAGCNSAACDVAATDQAVALAG-SEDYVFLFM 491

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           GL    E+E  DR  L LPG Q  LI  VA+ +K PVILV++S G VDI FA++N  I A
Sbjct: 492 GLSQKQESEGKDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPKIGA 551

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           ILWAGYPG+ GG AIA V+FG  NP GRLP+TWY  ++ + +P+T M +R   + GYPGR
Sbjct: 552 ILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTK-VPMTDMRMRADPTSGYPGR 610

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           +Y+FY G T+Y FGYGLSY+ F   L+  T    ++   L   R      D  ++     
Sbjct: 611 SYRFYQGNTVYKFGYGLSYSTFSRRLVHGTSVPALSSTLLTGLRETMTPQDGDRSYHVDA 670

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
           +  +      F   V+ QN G  DG   V+++ + P         Q+IGF+   ++AG  
Sbjct: 671 IGTEGCEQLKFPAMVEVQNHGPMDGKHSVLMFLRWPNTKQGRPASQLIGFRSQHLKAGET 730

Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            +++F  + CK  + V      ++  G H + V N
Sbjct: 731 AKLRFDISPCKHFSRVRADGRKVIDIGSHFLMVDN 765


>gi|357489441|ref|XP_003615008.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355516343|gb|AES97966.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 798

 Score =  666 bits (1719), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/768 (45%), Positives = 488/768 (63%), Gaps = 46/768 (5%)

Query: 51  SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
           S  FC+ +L  + R KD+VSR+TLDEK+ QL + A  +PRLG+P Y+WW EALHGV+N G
Sbjct: 47  SLPFCNLNLTITQRAKDIVSRLTLDEKISQLVNTAPSIPRLGIPSYQWWDEALHGVANAG 106

Query: 111 PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNI 169
            G   +  + GATSFP VILT ASF+  LW +I + + TEAR +YN G+A G+T+W+PNI
Sbjct: 107 KGIRLNGSVAGATSFPQVILTAASFDSKLWYQISKVIGTEARGVYNAGQAQGMTFWAPNI 166

Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
           N+ RDPRWGR  ET GEDP V  +Y V+YVRGLQ  +  E    +  R LK S+CCKH+ 
Sbjct: 167 NIFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQG-DSFEGGKLIGDR-LKASACCKHFT 224

Query: 230 AYDVDNWKGVDRYHFDARV----------------TEQDMEETFLRPFEMCVKEGDASSV 273
           AYD+DNWKG+DR+ FDA+V                T QD+ +T+  PF  C+ +G +S +
Sbjct: 225 AYDLDNWKGLDRFDFDAKVSFLFSMAYSPWMINYVTLQDLADTYQPPFHSCIVQGRSSGI 284

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           MC+YNRVNG+P+CAD  LL +T R +W+ +GYI +DC++++++ DN  + A + EDAVA 
Sbjct: 285 MCAYNRVNGVPNCADYNLLTKTARQKWNFNGYITSDCEAVRIIYDNQGY-AKTPEDAVAD 343

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QYVS 390
            L+AG+D++CG Y T     AV Q KV  + ID++L  L+T+ +RLG FDG+P   QY  
Sbjct: 344 VLQAGMDVECGDYLTKHAKAAVLQKKVPISQIDRALHNLFTIRIRLGLFDGNPTKLQYGR 403

Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN-ATVAMIGN 449
           +G   +CS EN++LA EAAR GIVLLKN  + LPL   +V T+ V+GP+AN ++  ++GN
Sbjct: 404 IGPNQVCSKENLDLALEAARSGIVLLKNTASILPL--PRVNTLGVIGPNANKSSKVVLGN 461

Query: 450 YAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDL 508
           Y G PCR +  + GF  YA+ T Y++GC D    ++  I  A E AK +D  I++ GLD 
Sbjct: 462 YFGRPCRLVPILKGFYTYASQTHYRSGCLDGTKCASAEIDRAVEVAKISDYVILVMGLDQ 521

Query: 509 SVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
           S E ES DR+DL LPG Q +LIN VA+ +K PVILV++  G VDI FA+ N  I  I+WA
Sbjct: 522 SQERESRDRDDLELPGKQQELINSVAKASKKPVILVLLCGGPVDITFAKNNDKIGGIIWA 581

Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
           GYPGE GGRA+A VVFG +NPGGRLP+TWY  D+++ +P+T M +R   S GYPGRTY+F
Sbjct: 582 GYPGELGGRALAQVVFGDYNPGGRLPMTWYPKDFIK-IPMTDMRMRADPSSGYPGRTYRF 640

Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNK------LQHCRNLNY--TSDASKTR 680
           Y GP +Y FGYGLSY+ + YN +S  K   +++N+      L++   + Y   S+  +  
Sbjct: 641 YTGPKVYEFGYGLSYSNYSYNFIS-VKNNNLHINQSTTHSILENSETIYYKLVSELGEET 699

Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
           C  + ++           +   N GS  G   V+++ KP        +KQ++GF+ V V 
Sbjct: 700 CKTMSIS---------VTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQLVGFESVTVE 750

Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
            G    + F  + C+ L+  + +   ++  G H + VG    S  I L
Sbjct: 751 GGGKGEVGFEVSVCEHLSRANESGVKVIEEGGHLLVVGEEEYSINITL 798


>gi|357152329|ref|XP_003576084.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 779

 Score =  666 bits (1718), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/787 (44%), Positives = 475/787 (60%), Gaps = 34/787 (4%)

Query: 13  LSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRM 72
           LS+  ++     +    +++P F C PG  ++       + FCD +LP   R  DLVSR+
Sbjct: 14  LSLIAMIMPAALLRTAAAATPPFSCGPGSATQ------GYAFCDKALPVERRAADLVSRL 67

Query: 73  TLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTT 132
           TL EKV QLGD A  VPRLG+P Y+WWSE LHG+S  G G HFD  +   TSFP V+LT 
Sbjct: 68  TLAEKVSQLGDEADAVPRLGVPAYKWWSEGLHGLSFWGHGMHFDGAVRAITSFPQVLLTA 127

Query: 133 ASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVV 191
           ASF++ +W +IGQA+ TEARA+YNLG+A GLT WSPN+N+ RDPRWGR  ETPGEDP   
Sbjct: 128 ASFDQDIWYRIGQAIGTEARALYNLGQAQGLTIWSPNVNIYRDPRWGRGQETPGEDPTTA 187

Query: 192 GRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQ 251
            +YAV +V+GLQ           ++  L+ S+CCKH  AYD+++W GV RY+F+A+VT Q
Sbjct: 188 SKYAVAFVKGLQGT---------SATTLQTSACCKHATAYDLEDWNGVVRYNFNAKVTLQ 238

Query: 252 DMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCD 311
           D+ +TF  PF+ CV+EG A+ VMC+Y  +NG+P+CA   L+ +T +G+W L+GY+ +DCD
Sbjct: 239 DLADTFNPPFKSCVEEGKATCVMCAYTNINGVPACASSDLITKTFKGDWGLNGYVSSDCD 298

Query: 312 SIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKY 371
           ++ ++ D  ++ A + ED VA  LKAGLDL+CG Y      +A+QQGK+ E D+D +LK 
Sbjct: 299 AVALLRDAQRYRA-TPEDTVAVALKAGLDLNCGNYTQVHGMSALQQGKMTEQDVDNALKN 357

Query: 372 LYTVLMRLGFFDGSPQ----YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNS 427
           L+ V MRLG FDG P+    Y SLG  D+CS  +  LA EAA+ GIVLLKND   LPL+ 
Sbjct: 358 LFAVRMRLGHFDGDPRTSALYGSLGAADVCSPAHKNLALEAAQSGIVLLKNDAGILPLDP 417

Query: 428 AKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNS 486
           + V + A +G +AN   A+ GNY G PC   +P+ G  GY  NV +  GCD  AC     
Sbjct: 418 SAVASAAAIGHNANDPAALNGNYFGPPCETTTPLQGLQGYVKNVKFLAGCDSAACG---- 473

Query: 487 IFAASEAAKT----ADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVI 542
            FAA+  A T    +D  I+  GL    E E +DR  L LPG Q  LI  VA  +K PVI
Sbjct: 474 -FAATGQAVTLASSSDYVILFMGLSQKEEQEGIDRTSLLLPGKQQNLITAVASASKRPVI 532

Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
           LV+++ G VDI FA++N  I AILWAGYPG+ GG AIA V+FG  NP GRLP+TWY  ++
Sbjct: 533 LVLLTGGSVDITFAKSNPKIGAILWAGYPGQAGGLAIARVLFGDHNPSGRLPVTWYPEEF 592

Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
            + +P+T M +R   + GYPGR+Y+FY G T+Y FG GLSY++F   L+S T T QV   
Sbjct: 593 TK-VPMTDMRMRADPATGYPGRSYRFYQGKTVYKFGDGLSYSKFSRQLVSSTNTHQVPNT 651

Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPA 721
            L         +D   +      +    CD   F   V+ QN G  DG   V+++ + P 
Sbjct: 652 NLLTGLTARTATDGGMSYYHVEEIGVEGCDKLKFPAVVEVQNHGPMDGKHSVMMFLRWPN 711

Query: 722 EIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
                  + Q++GF+   ++AG    + F  + C+           ++  G H + VG  
Sbjct: 712 STGTGRPVSQLVGFRSQHLKAGEKASLTFDVSPCEHFARAREDGKKVIDRGSHFLVVGKD 771

Query: 781 GVSFPIH 787
                 H
Sbjct: 772 EREISFH 778


>gi|26449574|dbj|BAC41913.1| putative beta-xylosidase [Arabidopsis thaliana]
          Length = 732

 Score =  666 bits (1718), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/729 (46%), Positives = 457/729 (62%), Gaps = 30/729 (4%)

Query: 74  LDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTA 133
           L EK+ QL + A  VPRLG+P YEWWSE+LHG+++ GPG  F+  I  ATSFP VI++ A
Sbjct: 2   LPEKIGQLSNTAASVPRLGIPPYEWWSESLHGLADNGPGVSFNGSISAATSFPQVIVSAA 61

Query: 134 SFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGR 193
           SFN +LW +IG AV+ E RAMYN G+AGLT+W+PNINV RDPRWGR  ETPGEDP VV  
Sbjct: 62  SFNRTLWYEIGSAVAVEGRAMYNGGQAGLTFWAPNINVFRDPRWGRGQETPGEDPKVVSE 121

Query: 194 YAVNYVRGLQDVEGHENATDLNSR-------------PLKVSSCCKHYAAYDVDNWKGVD 240
           Y V +VRG Q+ +  +      S               L +S+CCKH+ AYD++ W    
Sbjct: 122 YGVEFVRGFQEKKKRKVLKRRFSDDVDDDRHDDDADGKLMLSACCKHFTAYDLEKWGNFT 181

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
           RY F+A VTEQDME+T+  PFE C+++G AS +MCSYN VNG+P+CA   LL Q  R EW
Sbjct: 182 RYDFNAVVTEQDMEDTYQPPFETCIRDGKASCLMCSYNAVNGVPACAQGDLL-QKARVEW 240

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
              GYI +DCD++  +     +   S E+AVA  +KAG+D++CG Y    T +A++QGKV
Sbjct: 241 GFEGYITSDCDAVATIFAYQGY-TKSPEEAVADAIKAGVDINCGTYMLRHTQSAIEQGKV 299

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSDENIELAAEAAREGIVLLK 417
            E  +D++L  L+ V +RLG FDG P   QY  LG  DICS ++ +LA EA R+GIVLLK
Sbjct: 300 SEELVDRALLNLFAVQLRLGLFDGDPRRGQYGKLGSNDICSSDHRKLALEATRQGIVLLK 359

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGC 476
           ND   LPLN   V ++A+VGP AN    M G Y G PC+  +       Y   T Y +GC
Sbjct: 360 NDHKLLPLNKNHVSSLAIVGPMANNISNMGGTYTGKPCQRKTLFTELLEYVKKTSYASGC 419

Query: 477 DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEV 536
            DV+C S+     A   AK AD  I++AGLDLS E E  DR  L LPG Q  L++ VA V
Sbjct: 420 SDVSCDSDTGFGEAVAIAKGADFVIVVAGLDLSQETEDKDRVSLSLPGKQKDLVSHVAAV 479

Query: 537 AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT 596
           +K PVILV+   G VD+ FA+ +  I +I+W GYPGE GG+A+A+++FG FNPGGRLP T
Sbjct: 480 SKKPVILVLTGGGPVDVTFAKNDPRIGSIIWIGYPGETGGQALAEIIFGDFNPGGRLPTT 539

Query: 597 WYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT 656
           WY   +   + ++ M +R   S GYPGRTY+FY GP +Y FG GLSYT+F+Y +LS    
Sbjct: 540 WYPESFTD-VAMSDMHMRANSSRGYPGRTYRFYTGPQVYSFGTGLSYTKFEYKILS--AP 596

Query: 657 IQVNLNKL-----QHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGS 710
           I+++L++L      H + L +  +    +   V+VN   C+   F  +V   N G  DGS
Sbjct: 597 IRLSLSELLPQQSSHKKQLQHGEELRYLQLDDVIVNS--CESLRFNVRVHVSNTGEIDGS 654

Query: 711 DVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA 770
            VV+++SK P  ++    KQ+IG+ RV VR+       FV + CK L++ +     ++P 
Sbjct: 655 HVVMLFSKMPPVLSGVPEKQLIGYDRVHVRSNEMMETVFVIDPCKQLSVANDVGKRVIPL 714

Query: 771 GEHTIFVGN 779
           G H +F+G+
Sbjct: 715 GSHVLFLGD 723


>gi|414588273|tpg|DAA38844.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 775

 Score =  666 bits (1718), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 345/757 (45%), Positives = 471/757 (62%), Gaps = 22/757 (2%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           +S P+F C P   S+      ++ FCD SLP + R  DLVSR+T+ EKV QLGD A GVP
Sbjct: 25  ASDPMFSCGPSSASR------AYPFCDRSLPAARRAADLVSRLTVAEKVSQLGDEAAGVP 78

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P Y+WWSE LHG++  G G  F+  +   TSFP V+LTTASF+ESLW +IGQA+  
Sbjct: 79  RLGVPPYKWWSEGLHGLAFWGHGMRFNGTVSAVTSFPQVLLTTASFDESLWFRIGQAIGR 138

Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
           EARA+YNLG+A GLT WSPN+N+ RDPRWGR  ETPGEDP V  +YAV +VRG+Q     
Sbjct: 139 EARALYNLGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVAFVRGIQG---- 194

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
            N     + PL+ S+CCKH  AYD+++W GV RY+FDARVT QD+ +TF  PF+ CV +G
Sbjct: 195 SNPAGAAAAPLQASACCKHATAYDLEDWNGVARYNFDARVTLQDLADTFNPPFQSCVVDG 254

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
            AS VMC+Y  +NG+P+CA   LL +T RG W L GY+ +DCD++ +M D  ++   + E
Sbjct: 255 KASCVMCAYTVINGVPACASSDLLTKTFRGAWGLDGYVSSDCDAVAIMRDAQRY-EPTPE 313

Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
           D VA  LKAGLDL+CG Y       A+QQGK+ E D+DK+L  L+ V MRLG FDG P+ 
Sbjct: 314 DTVAVALKAGLDLNCGTYTQQHGMAAIQQGKMTEKDVDKALTNLFAVRMRLGHFDGDPRG 373

Query: 388 ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
              Y  LG  D+C+ ++  LA EAA++GIVLLKND   LPL+ + V + AV+G +AN  +
Sbjct: 374 NALYGRLGAADVCTADHKNLALEAAQDGIVLLKNDAGILPLDRSAVGSAAVIGHNANDPL 433

Query: 445 AMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
            + GNY G  C   +P+ G   Y  NV +  GC   AC    +   A+  A +A+   + 
Sbjct: 434 VLSGNYFGPACETTTPLEGLQSYVRNVRFLAGCSSAAC-GYAATGQAAALASSAEYVFLF 492

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
            GL    E E LDR  L LPG Q  L+  VA  AK PV+LV+++ G VDI FA++N  I 
Sbjct: 493 MGLSQDQEKEGLDRTSLLLPGKQQSLVTAVASAAKRPVVLVLLTGGPVDITFAQSNPKIG 552

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
           AILWAGYPG+ GG AIA V+FG  NP GRLP+TWY  D+ + +P+T M +R   + GYPG
Sbjct: 553 AILWAGYPGQAGGLAIARVLFGDHNPSGRLPVTWYTEDFTK-VPMTDMRMRADPATGYPG 611

Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
           RTY+FY G T+Y FGYGLSY++F   L++  K +  N + L H      T  A+ +    
Sbjct: 612 RTYRFYRGKTIYKFGYGLSYSKFSRQLVTGDKNLAPNTSLLAHLS--AKTQHAATSYYHV 669

Query: 684 VLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
             +  + C+   F  +V+  N G  DG   V+++ + P       ++Q+IGF+   ++AG
Sbjct: 670 DDIGTVGCEQLKFPAEVEVLNHGPMDGKHSVLMFLRWPNATDGRPVRQLIGFRSQHIKAG 729

Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
               ++F  + C+  +        ++  G H + VG 
Sbjct: 730 EKANVRFHVSPCEHFSRTRADGKKVIDRGSHFLMVGK 766


>gi|357485313|ref|XP_003612944.1| Beta-D-xylosidase [Medicago truncatula]
 gi|355514279|gb|AES95902.1| Beta-D-xylosidase [Medicago truncatula]
          Length = 783

 Score =  664 bits (1713), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 336/761 (44%), Positives = 483/761 (63%), Gaps = 26/761 (3%)

Query: 31  SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
           ++P + C P          S + FC+ SLP S R   L+S +TL +K+ QL + A  +  
Sbjct: 27  TTPDYPCKPPH--------SHYPFCNISLPISTRTTSLISLLTLSDKINQLSNTASSISH 78

Query: 91  LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
           LG+P Y+WWSEALHG++  GPG +F+  +  AT+FP VI++ A+FN SLW  IG AV  E
Sbjct: 79  LGIPSYQWWSEALHGIATNGPGVNFNGSVKSATNFPQVIVSAAAFNRSLWFLIGYAVGVE 138

Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE- 209
            RAM+N+G+AGL++W+PN+NV RDPRWGR  ETPGEDP V   YAV +VRG+Q V+G + 
Sbjct: 139 GRAMFNVGQAGLSFWAPNVNVFRDPRWGRGQETPGEDPMVGSAYAVEFVRGIQGVDGIKK 198

Query: 210 --NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKE 267
             N  D +   L VS+CCKH+ AYD++ W    RY+F+A VT+QD+E+T+  PF  CV++
Sbjct: 199 VLNDHDSDDDGLMVSACCKHFTAYDLEKWGEFSRYNFNAVVTQQDLEDTYQPPFRGCVQQ 258

Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSK 327
           G AS +MCSYN VNG+P+CA   LL   VR +W   GYI +DCD++  + +  K+ A S 
Sbjct: 259 GKASCLMCSYNEVNGVPACASKDLLG-LVRNKWGFEGYIASDCDAVATVFEYQKY-AKSA 316

Query: 328 EDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ 387
           EDAVA  LKAG+D++CG +    T +A++QG VKE D+D++L  L++V MRLG F+G P+
Sbjct: 317 EDAVADVLKAGMDINCGTFMLRHTESAIEQGLVKEEDLDRALFNLFSVQMRLGLFNGDPE 376

Query: 388 ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
              +  LG QD+C+ E+ +LA EAAR+GIVLLKND   LPL+     ++A++GP A  T 
Sbjct: 377 KGKFGKLGPQDVCTPEHKKLALEAARQGIVLLKNDNKFLPLDKKDRVSLAIIGPMA-TTS 435

Query: 445 AMIGNYAGIPCRYMSPIAGFSGYAN-VTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
            + G Y+GIPC   S   G   Y   ++Y  GC DV C S++    A + AK AD  +I+
Sbjct: 436 ELGGGYSGIPCSPRSLYDGLKEYVKTISYAFGCSDVKCDSDDGFAVAIDIAKQADFVVIV 495

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
           AGLD ++E E LDR  L LPG Q  L+++VA  +K PVILV+   G +D++FAE+N  I 
Sbjct: 496 AGLDTTLETEDLDRVSLLLPGKQMDLVSRVAAASKRPVILVLTGGGPLDVSFAESNQLIT 555

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
           +ILW GYPGE GG+A+A+++FG+FNP GRLP+TWY   +   +P+  M +R   S GYPG
Sbjct: 556 SILWIGYPGEAGGKALAEIIFGEFNPAGRLPMTWYPESFTN-VPMNDMGMRADPSRGYPG 614

Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC---RNLNYTSDASKTR 680
           RTY+FY G  +Y FG+GLSY+ F Y +LS     +++L+K  +    R+L    +     
Sbjct: 615 RTYRFYTGSRIYGFGHGLSYSDFSYRVLSAPS--KLSLSKTTNGGLRRSLLNKVEKDVFE 672

Query: 681 CPGVLVNDLR-CDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
              V V++L+ C+   F   +   NVG  DGS VV+++SK P  I  +   Q++G  R+ 
Sbjct: 673 VDHVHVDELQNCNSLSFSVHISVMNVGDMDGSHVVMLFSKWPKNIQGSPESQLVGPSRLH 732

Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             + ++     + + C+  +  D     +LP G H + VG+
Sbjct: 733 TVSNKSIETSILADPCEHFSFADEQGKRILPLGNHILNVGD 773


>gi|224066929|ref|XP_002302284.1| predicted protein [Populus trichocarpa]
 gi|222844010|gb|EEE81557.1| predicted protein [Populus trichocarpa]
          Length = 742

 Score =  663 bits (1710), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 340/725 (46%), Positives = 469/725 (64%), Gaps = 32/725 (4%)

Query: 9   LCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDL 68
           LC  + I + + +T+      S+ P + CD    S        + FC + LP S RV+DL
Sbjct: 6   LCLRILILIAIHTTSLHLYVESTQPPYSCDSSDPS-----TKLYPFCQTKLPISQRVEDL 60

Query: 69  VSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV---SNVGPGTHFDDVIPGATSF 125
           VSR+TLDEKV QL D A  +PRLG+P YEWWSEALHGV   + V  G  F+  I  ATSF
Sbjct: 61  VSRLTLDEKVSQLVDTAPAIPRLGIPAYEWWSEALHGVALQTTVRQGIRFNGTIRFATSF 120

Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETP 184
           P VILT ASF+  LW +IGQ +  EAR +YN G+A G+T+W+PNIN+ RDPRWGR  ETP
Sbjct: 121 PQVILTAASFDAHLWYRIGQVIGKEARGIYNAGQATGMTFWAPNINIFRDPRWGRGQETP 180

Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
           GEDP V G+YAV+YVRG+Q   G           L+ S+CCKH+ AYD+D WKG++R+ F
Sbjct: 181 GEDPLVAGKYAVSYVRGVQ---GDSFGGGTLGEQLQASACCKHFTAYDLDKWKGMNRFVF 237

Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
           DA    QD+ +T+  PF+ C++EG AS +MC+YNRVNG+P+CAD  LL++  RG+W  +G
Sbjct: 238 DA----QDLADTYQPPFQSCIQEGKASGIMCAYNRVNGVPNCADYNLLSKKARGQWGFYG 293

Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETD 364
           YI +DCD++ ++ D+  + A S EDAVA  LKAG+D++CG Y  N+T +AV++ K+ E++
Sbjct: 294 YITSDCDAVAIIHDDQGY-AKSPEDAVADVLKAGMDVNCGDYLKNYTKSAVKKKKLPESE 352

Query: 365 IDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQN 421
           ID++L  L+++ MRLG F+G+P    Y ++    +CS E+  LA +AA++GIVLLKN   
Sbjct: 353 IDRALHNLFSIRMRLGLFNGNPTKQPYGNIAPDQVCSQEHQALALKAAQDGIVLLKNPDK 412

Query: 422 TLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVA 480
            LPL+  + K++AV+GP+AN +  ++GNY G PC+ ++P+ G   Y  N  Y  GC  VA
Sbjct: 413 LLPLSKLETKSLAVIGPNANNSTKLLGNYFGPPCKTVTPLQGLQNYIKNTRYHPGCSRVA 472

Query: 481 CKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGP 540
           C S+ SI  A + AK AD  I++ GLD + E E  DR DL LPG Q +LI  VA+ AK P
Sbjct: 473 C-SSASINQAVKIAKGADQVILVMGLDQTQEKEEQDRVDLVLPGKQRELITAVAKAAKKP 531

Query: 541 VILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNG 600
           V+LV+   G VD++FA+ + NI +I+WAGYPGE GG A+A ++FG  NPGGRLP+TWY  
Sbjct: 532 VVLVLFCGGPVDVSFAKYDQNIGSIIWAGYPGEAGGTALAQIIFGDHNPGGRLPMTWYPQ 591

Query: 601 DYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVN 660
           D+ + +P+T M +RP  S GYPGRTY+FYNG  ++ FGYGLSY+ + Y L S T+     
Sbjct: 592 DFTK-VPMTDMRMRPQLSSGYPGRTYRFYNGKKVFEFGYGLSYSNYSYELASDTQ----- 645

Query: 661 LNKLQHCRNLNYTSDASKTRCPGVLVN---DLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
            NKL    + N  +  S T    ++ N   +L     F   V  +N G   G +  I Y 
Sbjct: 646 -NKLYLRASSNQITKNSNTIRHKLISNIGKELCEKTKFTVTVRVKNHGEMAGENAEIQYE 704

Query: 718 KPPAE 722
             P E
Sbjct: 705 LSPCE 709


>gi|15218202|ref|NP_177929.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
 gi|259585708|sp|Q9SGZ5.2|BXL7_ARATH RecName: Full=Probable beta-D-xylosidase 7; Short=AtBXL7; Flags:
           Precursor
 gi|18086336|gb|AAL57631.1| At1g78060/F28K19_32 [Arabidopsis thaliana]
 gi|332197942|gb|AEE36063.1| putative beta-D-xylosidase 7 [Arabidopsis thaliana]
          Length = 767

 Score =  662 bits (1707), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 347/774 (44%), Positives = 486/774 (62%), Gaps = 40/774 (5%)

Query: 30  SSSPVFVCDPGR-FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGV 88
           S+ P   CDP    +KL      + FC + LP   R +DLVSR+T+DEK+ QL + A G+
Sbjct: 19  SAPPPHSCDPSNPTTKL------YQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGI 72

Query: 89  PRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVS 148
           PRLG+P YEWWSEALHGV+  GPG  F+  +  ATSFP VILT ASF+   W +I Q + 
Sbjct: 73  PRLGVPAYEWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIG 132

Query: 149 TEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ--DV 205
            EAR +YN G+A G+T+W+PNIN+ RDPRWGR  ETPGEDP + G YAV YVRGLQ    
Sbjct: 133 KEARGVYNAGQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSF 192

Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
           +G +      S  L+ S+CCKH+ AYD+D WKG+ RY F+A+V+  D+ ET+  PF+ C+
Sbjct: 193 DGRKTL----SNHLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCI 248

Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
           +EG AS +MC+YNRVNGIPSCADP LL +T RG+W   GYI +DCD++ ++ D   + A 
Sbjct: 249 EEGRASGIMCAYNRVNGIPSCADPNLLTRTARGQWAFRGYITSDCDAVSIIYDAQGY-AK 307

Query: 326 SKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
           S EDAVA  LKAG+D++CG Y    T +A+QQ KV ETDID++L  L++V +RLG F+G 
Sbjct: 308 SPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGD 367

Query: 386 PQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
           P    Y ++   ++CS  +  LA +AAR GIVLLKN+   LP +   V ++AV+GP+A+ 
Sbjct: 368 PTKLPYGNISPNEVCSPAHQALALDAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHV 427

Query: 443 TVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATI 501
              ++GNYAG PC+ ++P+     Y  N  Y  GCD VAC SN +I  A   AK AD  +
Sbjct: 428 VKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHQGCDSVAC-SNAAIDQAVAIAKNADHVV 486

Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
           ++ GLD + E E  DR DL LPG Q +LI  VA  AK PV+LV++  G VDI+FA  N  
Sbjct: 487 LIMGLDQTQEKEDFDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFAANNNK 546

Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
           I +I+WAGYPGE GG AI++++FG  NPGGRLP+TWY   +V  + +T M +R   + GY
Sbjct: 547 IGSIIWAGYPGEAGGIAISEIIFGDHNPGGRLPVTWYPQSFVN-IQMTDMRMR--SATGY 603

Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQ-HCRNLNYT--SDAS 677
           PGRTYKFY GP +Y FG+GLSY+ + Y   +  +T + +N +K Q +  ++ YT  S+  
Sbjct: 604 PGRTYKFYKGPKVYEFGHGLSYSAYSYRFKTLAETNLYLNQSKAQTNSDSVRYTLVSEMG 663

Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP--AEIAATYIKQVIGFQ 735
           K  C              +  V+ +N G   G   V+++++     E      KQ++GF+
Sbjct: 664 KEGCDVAKT---------KVTVEVENQGEMAGKHPVLMFARHERGGEDGKRAEKQLVGFK 714

Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLN 789
            + +  G    ++F    C+ L+  +     +L  G++ + VG+     P+ +N
Sbjct: 715 SIVLSNGEKAEMEFEIGLCEHLSRANEFGVMVLEEGKYFLTVGDS--ELPLIVN 766


>gi|357156390|ref|XP_003577440.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 755

 Score =  660 bits (1703), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 343/759 (45%), Positives = 472/759 (62%), Gaps = 40/759 (5%)

Query: 33  PVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
           P F C P        Q + + FC+ +LP   R  DLV+++TL+EKV QLGD A GVPR G
Sbjct: 12  PAFSCGPP-------QQAQYAFCNRALPAEQRAADLVAKLTLEEKVSQLGDQAPGVPRFG 64

Query: 93  LPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR 152
           +P Y WWSE LHGVS  G G HF+  + G T+FP V+LTTASF++S+W +IGQA+ TEAR
Sbjct: 65  VPGYNWWSEGLHGVSMWGHGMHFNGAVRGVTTFPQVLLTTASFDDSIWYRIGQAIGTEAR 124

Query: 153 AMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
           AM+NLG+A GLT WSPN+N+ RDPRWGR  ETPGEDP    +YAV +VRGLQ        
Sbjct: 125 AMFNLGQADGLTIWSPNVNIYRDPRWGRGQETPGEDPATASKYAVAFVRGLQGT------ 178

Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
              ++  L+ S+CCKH  AYD+D+W  + RY+F+A+VT QD+EETF  PF+ CV EG A+
Sbjct: 179 ---STTTLQTSACCKHATAYDLDDWNRIGRYNFNAKVTAQDLEETFNPPFKSCVVEGKAT 235

Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
            VMC+Y  VNGIP+CAD  LL +T++GEW ++GYI +DCD++ ++       + + EDAV
Sbjct: 236 CVMCAYTSVNGIPACADSGLLTKTIKGEWGMNGYISSDCDAVALLYGTR--YSGTPEDAV 293

Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG----SPQ 387
           A  +KAGLD++CG +       A+QQ K+ E D+DK+L+ L+ + MRLG FDG    SP 
Sbjct: 294 AAAIKAGLDMNCGNFSQVHGMAALQQRKMSEQDVDKALRNLFAIRMRLGHFDGDPLQSPL 353

Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLN--SAKVKTVAVVGPHANATVA 445
           Y  LG QD+CS  + +LA EAA+ GIVLLKND  TLPL+  +A   + AV+GP+AN   A
Sbjct: 354 YGRLGAQDVCSPAHKDLALEAAQNGIVLLKNDAATLPLSRPTAASASFAVIGPNANEPGA 413

Query: 446 MIGNYAGIPCRYMSPIAGFSGY--ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
           ++GNY G PC   +P+     +   NV +  GCD  AC   ++ + AS  A T+D TI+ 
Sbjct: 414 LLGNYFGPPCETTTPLQALQKFYSKNVRFVPGCDSAACNVADT-YQASGLAATSDYTILF 472

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
            GL    E E LDR  L LPG Q  LI  VA  AK P+ILV+++ G VDI FA+ N  I 
Sbjct: 473 MGLSQKQEQEGLDRTSLLLPGKQESLITAVAAAAKRPIILVLLTGGPVDITFAKFNPKIG 532

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
           AILWAGYPG+ GG AIA V+FG+ NP GRLP+TWY  +Y + +P+  M +R   + GYPG
Sbjct: 533 AILWAGYPGQAGGLAIAKVLFGEHNPSGRLPVTWYPEEYTK-VPMDDMRMRADPATGYPG 591

Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS-DASKTRCP 682
           R+Y+FY G  +Y FGYGLSY++F   L+  + +     N+  +   L   + D   +R  
Sbjct: 592 RSYRFYKGNAVYKFGYGLSYSKFSRQLVRNSSSN----NRAPNTELLAAAAVDCGASRY- 646

Query: 683 GVLVNDLR---CDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
             LV ++    C+   F   V+ +N G  DG   V+++ + P         Q++GF+   
Sbjct: 647 -YLVEEIGGEVCERLKFPAVVEVENHGPMDGKQSVLLFLRWPTATEGRPASQLVGFRSQD 705

Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
           +RAG    + F  + C+  +        ++  G H + V
Sbjct: 706 LRAGEKASVSFDISPCEHFSRTTVDGTKVIDRGSHFLMV 744


>gi|356548162|ref|XP_003542472.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
          Length = 778

 Score =  658 bits (1698), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/776 (43%), Positives = 491/776 (63%), Gaps = 42/776 (5%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           S+ P + CD    S        + FC++ LP + R +DLVSR+TLDEK+ QL + A  +P
Sbjct: 26  STRPPYSCDSSSNSPY------YSFCNTKLPITKRAQDLVSRLTLDEKLAQLVNTAPAIP 79

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P Y+WWSEALHGV++ G G  F+  I  ATSFP VILT ASF+ +LW +I + +  
Sbjct: 80  RLGIPSYQWWSEALHGVADAGFGIRFNGTIKSATSFPQVILTAASFDPNLWYQISKTIGR 139

Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ--DVE 206
           EARA+YN G+A G+T+W+PNINV RDPRWGR  ET GEDP +  +Y V YVRGLQ    E
Sbjct: 140 EARAVYNAGQATGMTFWAPNINVFRDPRWGRGQETAGEDPLMNAKYGVAYVRGLQGDSFE 199

Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
           G + A  L     + S+CCKH+ AYD+D WKG+DR+ FDARVT QD+ +T+  PF+ C++
Sbjct: 200 GGKLAERL-----QASACCKHFTAYDLDQWKGLDRFVFDARVTSQDLADTYQPPFQSCIE 254

Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
           +G AS +MC+YNRVNG+P+CAD  LL +T R +W   GYI +DC ++ ++ +   + A +
Sbjct: 255 QGRASGIMCAYNRVNGVPNCADFNLLTKTARQQWKFDGYITSDCGAVSIIHEKQGY-AKT 313

Query: 327 KEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
            EDA+A   +AG+D++CG Y T    +AV Q K+  + ID++L+ L+++ +RLG FDG+P
Sbjct: 314 AEDAIADVFRAGMDVECGDYITKHAKSAVFQKKLPISQIDRALQNLFSIRIRLGLFDGNP 373

Query: 387 Q---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
               + ++G  ++CS ++++LA EAAR+GIVLLKN  + LPL      T+A++GP+ANA+
Sbjct: 374 TKLPFGTIGPNEVCSKQSLQLALEAARDGIVLLKNTNSLLPLPKTN-PTIALIGPNANAS 432

Query: 444 VAM-IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATII 502
             + +GNY G PC  ++ + GF GYA   Y  GCDD    +   I  A E AK  D  ++
Sbjct: 433 SKVFLGNYYGRPCNLVTLLQGFEGYAKTVYHPGCDDGPQCAYAQIEEAVEVAKKVDYVVL 492

Query: 503 LAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
           + GLD S E ES DRE L LPG Q +LI  VA  AK PV++V++  G VDI  A+ +  +
Sbjct: 493 VMGLDQSQERESHDREYLGLPGKQEELIKSVARAAKRPVVVVLLCGGPVDITSAKFDDKV 552

Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
             ILWAGYPGE GG A+A VVFG  NPGG+LPITWY  D+++ +P+T M +R   + GYP
Sbjct: 553 GGILWAGYPGELGGVALAQVVFGDHNPGGKLPITWYPKDFIK-VPMTDMRMRADPASGYP 611

Query: 623 GRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNK----LQHCRNLNY--TSD 675
           GRTY+FY GP +Y FGYGLSYT++ Y LLS +  T+ +N +      Q+   + Y   S+
Sbjct: 612 GRTYRFYTGPKVYEFGYGLSYTKYSYKLLSLSHSTLHINQSSTHLMTQNSETIRYKLVSE 671

Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY---SKPPAEIAATYIKQVI 732
            ++  C  +L++           +   N G+  G   V+++    K         +KQ++
Sbjct: 672 LAEETCQTMLLS---------IALGVTNRGNLAGKHPVLLFVRQGKVRNINNGNPVKQLV 722

Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
           GFQ V V AG   ++ F  + C+ L++ + A + ++  G +   VG+    +PI +
Sbjct: 723 GFQSVKVNAGETVQVGFELSPCEHLSVANEAGSMVIEEGSYLFIVGDQ--EYPIEV 776


>gi|356531391|ref|XP_003534261.1| PREDICTED: probable beta-D-xylosidase 6-like [Glycine max]
          Length = 780

 Score =  657 bits (1696), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 333/734 (45%), Positives = 472/734 (64%), Gaps = 12/734 (1%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           FCD+SLP   R + LVS +TL EK+  L + A  +PRLG+P Y+WWSE+LHG++  GPG 
Sbjct: 41  FCDTSLPTLTRARSLVSLLTLPEKILLLSNNASSIPRLGIPAYQWWSESLHGLALNGPGV 100

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVAR 173
            F   +P ATSFP VIL+ ASFN SLW +   A++ EARAM+N+G+AGLT+W+PNIN+ R
Sbjct: 101 SFAGAVPSATSFPQVILSAASFNRSLWLRTAAAIAREARAMFNVGQAGLTFWAPNINLFR 160

Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG-HENATDLNSRPLKVSSCCKHYAAYD 232
           DPRWGR  ETPGEDP +   YAV YVRGLQ + G  +     +   L VS+CCKH+ AYD
Sbjct: 161 DPRWGRGQETPGEDPMLASAYAVEYVRGLQGLSGIQDAVVVDDDDTLMVSACCKHFTAYD 220

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
           +D W    RY+F+A V++QD+E+T+  PF  C+++G AS +MCSYN VNG+P+CA  +LL
Sbjct: 221 LDMWGQFSRYNFNAVVSQQDLEDTYQPPFRSCIQQGKASCLMCSYNEVNGVPACASEELL 280

Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
               R +W   GYI +DCD++  + +  K+ A S+EDAVA  LKAG+D++CG +    T 
Sbjct: 281 G-LARDKWGFKGYITSDCDAVATVYEYQKY-AKSQEDAVADVLKAGMDINCGTFMLRHTE 338

Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSDENIELAAEAA 409
           +A++QGKVKE D+D++L  L++V +RLG FDG P   ++  LG +D+C+ E+  LA +AA
Sbjct: 339 SAIEQGKVKEEDLDRALLNLFSVQLRLGLFDGDPIRGRFGKLGPKDVCTQEHKTLALDAA 398

Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYAN 469
           R+GIVLLKND+  LPL+     ++AV+GP A  T  + G Y+GIPC   S   G   +A 
Sbjct: 399 RQGIVLLKNDKKFLPLDRDIGASLAVIGPLAT-TTKLGGGYSGIPCSSSSLYEGLGEFAE 457

Query: 470 -VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQ 528
            ++Y  GC DV C S++    A + AK AD  +I+AGLD + E E  DR  L LPG Q  
Sbjct: 458 RISYAFGCYDVPCDSDDGFAEAIDTAKQADFVVIVAGLDATQETEDHDRVSLLLPGKQMN 517

Query: 529 LINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFN 588
           L++ VA+ +K PVILV++  G +D++FAE N  I +I+W GYPGE GG+A+A+++FG+FN
Sbjct: 518 LVSSVADASKNPVILVLIGGGPLDVSFAEKNPQIASIIWLGYPGEAGGKALAEIIFGEFN 577

Query: 589 PGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKY 648
           P GRLP+TWY   +   +P+  M +R   S GYPGRTY+FY G  +Y FG+GLS++ F Y
Sbjct: 578 PAGRLPMTWYPEAFTN-VPMNEMSMRADPSRGYPGRTYRFYTGGRVYGFGHGLSFSDFSY 636

Query: 649 NLLSFTKTIQVNLNKLQHCRN-LNYTSDASKTRCPGVLVNDLR-CDDY-FEFKVDFQNVG 705
           N LS    I ++       R  L Y  +        V VN L+ C+   F   +   N+G
Sbjct: 637 NFLSAPSKISLSRTIKDGSRKRLLYQVENEVYGVDYVPVNQLQNCNKLSFSVHISVMNLG 696

Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
             DGS VV+++SK P  +  +   Q++GF R+   + +      + + C+ L+  D    
Sbjct: 697 GLDGSHVVMLFSKGPKVVDGSPETQLVGFSRLHTISSKPTETSILVHPCEHLSFADKQGK 756

Query: 766 TLLPAGEHTIFVGN 779
            +LP G HT+ VG+
Sbjct: 757 RILPLGPHTLSVGD 770


>gi|253761872|ref|XP_002489310.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
 gi|241946958|gb|EES20103.1| hypothetical protein SORBIDRAFT_0010s010920 [Sorghum bicolor]
          Length = 772

 Score =  657 bits (1695), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 346/767 (45%), Positives = 466/767 (60%), Gaps = 39/767 (5%)

Query: 33  PVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
           P F C P              FCD +L  + R  DLVSR+T  EK+ QLGD A GVPRLG
Sbjct: 26  PPFSCGPTSAEA----SEGLAFCDVTLSPAQRAADLVSRLTPAEKIAQLGDQATGVPRLG 81

Query: 93  LPQYEWWSEALHGVSNVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
           +P Y+WW+EALHG++  G G HFD V  +  ATSFP V+LT A+F++ LW +IGQA+  E
Sbjct: 82  VPGYKWWNEALHGLATSGKGLHFDVVGGVRAATSFPQVLLTAAAFDDDLWFRIGQAIGRE 141

Query: 151 ARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           ARA++N+G+A GLT WSPN+N+ RDPRWGR  ETPGEDP V  RYAV +VRG+Q      
Sbjct: 142 ARALFNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQG----- 196

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
              + +S  L+ S+CCKH  AYD+++W GV RY F ARVT QD+E+TF  PF  CV EG 
Sbjct: 197 ---NSSSSLLQTSACCKHATAYDLEDWNGVARYSFVARVTAQDLEDTFNPPFRSCVVEGK 253

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           AS +MC+Y  +NG+P+CA+  LL  TVRG+W L GY+ +DCD++ +M D  ++ A + ED
Sbjct: 254 ASCIMCAYTAINGVPACANTDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRY-APTPED 312

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
           AVA +LKAGLD+DCG Y       A+QQGK+ E DIDK+L  L+ V MRLG FDG P+  
Sbjct: 313 AVAVSLKAGLDIDCGSYIQQHATAAIQQGKLTELDIDKALVNLFAVRMRLGHFDGDPRKN 372

Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
            Y +L   DIC+ E+  LA EAA++GIVLLKND   LPL+ + V + AV+GP++N  +A+
Sbjct: 373 MYGALSAADICTPEHRSLALEAAQDGIVLLKNDGGILPLDRSTVTSAAVIGPNSNDGMAL 432

Query: 447 IGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCD----DVACKSNNSIFAASEAAKTADATI 501
           I NY G PC   +P+ G   Y  NV +  GC     DVA      + + SE     D   
Sbjct: 433 IANYFGPPCESTTPLQGLQSYVNNVRFLAGCSSAACDVAVTDQAVVLSGSE-----DYVF 487

Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
           +  GL    E+E  DR  L LPG Q  LI  VA+ +K PVILV++S G VDI FA++N  
Sbjct: 488 LFMGLSQQQESEGKDRTSLLLPGMQQSLITAVADASKRPVILVLLSGGPVDITFAQSNPK 547

Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
           I AILWAGYPG+ GG AIA V+FG  NP GRLP+TWY  D+ + +P+T M +R   + GY
Sbjct: 548 IGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPMTWYPEDFTK-VPMTDMRMRADPTSGY 606

Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRC 681
           PGR+Y+FY G  +Y FGYGLSY+ F   LL  T    ++   L   R    T +  ++  
Sbjct: 607 PGRSYRFYQGNAVYKFGYGLSYSTFSSRLLYGTSMPALSSTVLAGLRE-TVTEEGDRS-- 663

Query: 682 PGVLVNDLRCDDYFEFK----VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
               ++D+  D   + K    V+ QN G  DG    +++ + P         Q+IGF   
Sbjct: 664 --YHIDDIGTDGCEQLKFPAMVEVQNHGPMDGKHSALMFLRWPNTNGGRPASQLIGFMSQ 721

Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSF 784
            ++AG    ++F  + C+  + V      ++  G H + V N  +  
Sbjct: 722 HLKAGETANLRFDISPCEHFSRVRADGMKVIDIGSHFLTVDNHAIEI 768


>gi|413925164|gb|AFW65096.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 829

 Score =  657 bits (1695), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 351/756 (46%), Positives = 462/756 (61%), Gaps = 28/756 (3%)

Query: 33  PVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
           P F C  G    LGL      FC++ LP + R  DLVSRMT  EK  QLGD A+GVPRLG
Sbjct: 84  PPFSCGGG--PSLGLP-----FCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLG 136

Query: 93  LPQYEWWSEALHGVSNVGPGTHFD-DVIPGATSFPTVILTTASFNESLWKKIGQAVSTEA 151
           +P Y+WW+EALHGV+  G G H D   +  ATSFP V+LT ASFN++LW +IGQA   EA
Sbjct: 137 VPSYKWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEA 196

Query: 152 RAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
           RA YN+G+A GLT WSPN+N+ RDPRWGR  ETPGEDP V  RYA  +VRGLQ      +
Sbjct: 197 RAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQG-----S 251

Query: 211 ATDLNSRP--LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
           +++  S P  L  S+CCKH  AYD+++WKGV RY F A VT QD+ +TF  PF  CV +G
Sbjct: 252 SSNTKSVPPVLLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDG 311

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
            AS VMC+Y  VNG+PSCA+  LL +T RG W L GY+ ADCD++ +M  N +F   + E
Sbjct: 312 KASCVMCAYTSVNGVPSCANADLLTKTFRGSWGLDGYVAADCDAVSIM-RNSQFYRPTAE 370

Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
           D VA TLKAGLD+DCG Y       A+Q+GK+ + D+DK++K L+T  MRLG FDG P+ 
Sbjct: 371 DTVATTLKAGLDIDCGPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPKA 430

Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
             Y +LG   IC+ E+  LA EAA +GIVLLKN    LPL    V + AV+G +AN  +A
Sbjct: 431 HVYGNLGAAHICTQEHKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNANDVLA 490

Query: 446 MIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
           ++GNY G PC   +P+ G  GY  NV +  GC   AC    +  AA+ A+ T+D+ I+  
Sbjct: 491 LLGNYWGPPCAPTTPLQGIQGYVKNVRFLAGCHKAACNVAATPQAAALAS-TSDSVILFM 549

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           GL    E+E  DR  L LPG Q  LI  VA  AK PVILV+++ G VDI FA+ N  I A
Sbjct: 550 GLSQEQESEGKDRTTLLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANPKIGA 609

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           ILWAGYPG+ GG AIA V+FG+ NP GRLP+TWY  ++ + +P+T M +R   S  YPGR
Sbjct: 610 ILWAGYPGQAGGLAIAKVLFGEKNPSGRLPVTWYPEEFTK-VPMTDMRMRSAGS--YPGR 666

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           +Y+FY G T+Y FGYGLSY++F + +++       N   L    +   T D         
Sbjct: 667 SYRFYKGKTIYKFGYGLSYSKFSHRVVTARNNPAHNTTLLLAAGHAATTEDNLSYHVDH- 725

Query: 685 LVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
            + D  C    F   V  QN G  DG    +++ + P        +Q++GFQ   ++AG 
Sbjct: 726 -IGDELCRQLKFLAVVKVQNHGPMDGKHTALMFLRWPNATDGRPARQLVGFQSQHIKAGE 784

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
              ++F  + C+  + V      ++  G H + VG 
Sbjct: 785 KAHLRFEVSPCEDFSRVRDDGRKVIDKGSHFLKVGK 820


>gi|297842585|ref|XP_002889174.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335015|gb|EFH65433.1| glycosyl hydrolase family 3 protein [Arabidopsis lyrata subsp.
           lyrata]
          Length = 766

 Score =  657 bits (1695), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 344/773 (44%), Positives = 485/773 (62%), Gaps = 38/773 (4%)

Query: 30  SSSPVFVCDPGR-FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGV 88
           S+ P   CDP    +KL      + FC + LP S R +DLVSR+ +DEK+ QLG+ A G+
Sbjct: 18  SAPPPHSCDPSNPTTKL------YQFCRTDLPISQRARDLVSRLNIDEKISQLGNTAPGI 71

Query: 89  PRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVS 148
           PRLG+P YEWWSEALHGV+  GPG  F+  +  ATSFP VILT ASF+   W +I Q + 
Sbjct: 72  PRLGVPAYEWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIG 131

Query: 149 TEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ--DV 205
            EAR +YN G+A G+T+W+PNIN+ RDPRWGR  ETPGEDP + G YAV YVRGLQ    
Sbjct: 132 KEARGVYNAGQAQGMTFWAPNINIFRDPRWGRGQETPGEDPIMTGTYAVAYVRGLQGDSF 191

Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
           +G +      S  L+ S+CCKH+ AYD+D WKG+ RY F+A+V+  D+ ET+  PF+ C+
Sbjct: 192 DGRKTL----SIHLQASACCKHFTAYDLDRWKGITRYVFNAQVSLADLAETYQPPFKKCI 247

Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
           +EG AS +MC+YNRVNGIPSCADP LL +T RG W   GYI +DCD++ ++ D   + A 
Sbjct: 248 EEGRASGIMCAYNRVNGIPSCADPNLLTRTARGLWRFRGYITSDCDAVSIIHDAQGY-AK 306

Query: 326 SKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
           + EDAVA  LKAG+D++CG Y    T +A+QQ KV ETDID++L  L++V +RLG F+G 
Sbjct: 307 TPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRALLNLFSVRIRLGLFNGD 366

Query: 386 PQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
           P    Y ++   D+CS  +  LA EAAR GIVLLKN+   LP +   V ++AV+GP+A+ 
Sbjct: 367 PTKLPYGNISPNDVCSPAHQALALEAARNGIVLLKNNLKLLPFSKRSVSSLAVIGPNAHV 426

Query: 443 TVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATI 501
              ++GNYAG PC+ ++P+     Y  N  Y  GCD VAC SN +I  A   A+ AD  +
Sbjct: 427 AKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHNGCDSVAC-SNAAIDQAVAIARNADHVV 485

Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
           ++ GLD + E E +DR DL LPG Q +LI  VA  AK PV+LV++  G VDI+FA  N  
Sbjct: 486 LIMGLDQTQEKEDMDRVDLSLPGKQQELITSVANAAKKPVVLVLICGGPVDISFATNNDK 545

Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
           I +I+WAGYPGE GG A+A+++FG  NPGGRLP+TWY   +V  + +T M +R   + GY
Sbjct: 546 IGSIMWAGYPGEAGGIALAEIIFGDHNPGGRLPVTWYPQSFVN-VQMTDMRMR--SATGY 602

Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQ-HCRNLNYT--SDAS 677
           PGRTYKFY GP ++ FG+GLSY+ + Y   +   T + +N +K Q +  ++ YT  S+  
Sbjct: 603 PGRTYKFYKGPKVFEFGHGLSYSTYSYRFKTLGATNLYLNQSKAQLNSDSVRYTLVSEMG 662

Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP--AEIAATYIKQVIGFQ 735
           +  C       +         V  +N G   G   V+++++     E      KQ++GF+
Sbjct: 663 EEGCNIAKTKVI---------VTVENQGEMAGKHPVLMFARHERGGENGKRAEKQLVGFK 713

Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
            + +  G    ++F    C+ L+  +     ++  G++ + VG+  +   I++
Sbjct: 714 SIVLSNGEKAEMEFEIGLCEHLSRANEVGVMVVEEGKYFLTVGDSELPLTINV 766


>gi|115459584|ref|NP_001053392.1| Os04g0530700 [Oryza sativa Japonica Group]
 gi|38346629|emb|CAD41212.2| OSJNBa0074L08.23 [Oryza sativa Japonica Group]
 gi|38346760|emb|CAE03865.2| OSJNBa0081C01.11 [Oryza sativa Japonica Group]
 gi|113564963|dbj|BAF15306.1| Os04g0530700 [Oryza sativa Japonica Group]
 gi|218195263|gb|EEC77690.1| hypothetical protein OsI_16749 [Oryza sativa Indica Group]
          Length = 770

 Score =  657 bits (1694), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 329/740 (44%), Positives = 476/740 (64%), Gaps = 24/740 (3%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
           S++ FC+++LP+  R + LVS +TLDEK+ QL + A G PRLG+P +EWWSE+LHGV + 
Sbjct: 36  SAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLHGVCDN 95

Query: 110 GPGTHFDD-VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
           GPG +F    +  AT FP VIL+ A+FN SLW+   +A++ EARAM+N G+AGLT+W+PN
Sbjct: 96  GPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAGLTFWAPN 155

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           INV RDPRWGR  ETPGEDP VV  Y+V YV+G Q   G E         + +S+CCKHY
Sbjct: 156 INVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGR-------MMLSACCKHY 208

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
            AYD++ W+G  RY F+A+V  QDME+T+  PF+ C++EG AS +MCSYN+VNG+P+CA 
Sbjct: 209 IAYDLEKWRGFTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSYNQVNGVPACAR 268

Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
             +L Q  R EW   GYI +DCD++ ++ +N  + A S ED++A  LKAG+D++CG +  
Sbjct: 269 KDIL-QRARDEWGFQGYITSDCDAVAIIHENQTYTA-SDEDSIAVVLKAGMDINCGSFLI 326

Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELA 405
             T +A+++GKV+E DI+ +L  L++V +RLGFFD + +   +  LG  ++C+ E+ ELA
Sbjct: 327 RHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTTEHRELA 386

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
           AEA R+G VLLKND   LPL  ++V  +A++GP AN    + G+Y G+PC   + + G  
Sbjct: 387 AEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTTFVKGMQ 446

Query: 466 GYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
            Y    T+  GC DV C S +    A EAAK AD  +++AGL+L+ E E  DR  L LPG
Sbjct: 447 AYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRVSLLLPG 506

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q  LI+ VA V K PV+LV+M  G VD++FA+ +  I +ILW GYPGE GG  + +++F
Sbjct: 507 RQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNVLPEILF 566

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           GK+NPGG+LPITWY   +   +P+  M +R   S GYPGRTY+FY G  +Y FGYGLSY+
Sbjct: 567 GKYNPGGKLPITWYPESFT-AVPMDDMNMRADASRGYPGRTYRFYTGDVVYGFGYGLSYS 625

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG---VLVNDLRCDDYFEFKVDF 701
           ++ Y++L   K I ++ + +        +   + TR  G   V V D+   +  +F V  
Sbjct: 626 KYSYSILQAPKKISLSRSSVPDL----ISRKPAYTRRDGVDYVQVEDIASCEALQFPVHI 681

Query: 702 --QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNI 759
              N G+ DGS  V++++        + IKQ++GF+RV   AGR+  ++   + CK ++ 
Sbjct: 682 SVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITVDPCKLMSF 741

Query: 760 VDYAANTLLPAGEHTIFVGN 779
            +     +L  G H + VG+
Sbjct: 742 ANTEGTRVLFLGTHVLMVGD 761


>gi|222629651|gb|EEE61783.1| hypothetical protein OsJ_16354 [Oryza sativa Japonica Group]
          Length = 771

 Score =  657 bits (1694), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 339/739 (45%), Positives = 466/739 (63%), Gaps = 68/739 (9%)

Query: 88  VPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQA- 146
           +PRLG+P YEWWSEALHGVS VGPGT F  ++PGATSFP  ILT ASFN SL++ IG++ 
Sbjct: 45  LPRLGIPAYEWWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGESA 104

Query: 147 -----------------------------------------VSTEARAMYNLGRAGLTYW 165
                                                    VSTEARAM+N+G AGLT+W
Sbjct: 105 CNNTSQFFFSSKSPFSICIAMENLHCDFRSRLVRFYRGARVVSTEARAMHNVGLAGLTFW 164

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPNIN+ RDPRWGR  ETPGEDP +  +YAV YV GLQD  G  +A       LKV++CC
Sbjct: 165 SPNINIFRDPRWGRGQETPGEDPLLASKYAVGYVTGLQDAGGGSDA-------LKVAACC 217

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHY AYDVDNWKGV+RY FDA V++QD+++TF  PF+ CV +G+ +SVMCSYN+VNG P+
Sbjct: 218 KHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVIDGNVASVMCSYNKVNGKPT 277

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           CAD  LL+  +RG+W L+GYIV+DCDS+ V+ +N  +  +  EDA A T+K+GLDL+CG 
Sbjct: 278 CADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYNNQHYTKN-PEDAAAITIKSGLDLNCGN 336

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENI 402
           +    T  AVQ GK+ E+D+D+++   + VLMRLGFFDG P+   + SLG +D+C+  N 
Sbjct: 337 FLAQHTVAAVQAGKLSESDVDRAITNNFIVLMRLGFFDGDPRKLPFGSLGPKDVCTSSNQ 396

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
           ELA EAAR+GIVLLKN    LPL++  +K++AV+GP+ANA+  MIGNY G PC+Y +P+ 
Sbjct: 397 ELAREAARQGIVLLKN-TGALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTTPLQ 455

Query: 463 GFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAGLDLSVEAESLDREDLW 521
           G        Y+ GC +V C  N+  + AA++AA +AD T+++ G D SVE ESLDR  L 
Sbjct: 456 GLGANVATVYQPGCTNVGCSGNSLQLSAATQAAASADVTVLVVGADQSVERESLDRTSLL 515

Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
           LPG Q QL++ VA  ++GPVILV+MS G  DI+FA+++  I AILW GYP     R    
Sbjct: 516 LPGQQPQLVSAVANASRGPVILVVMSGGPFDISFAKSSDKISAILWVGYPRRSRWRRPRR 575

Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
                      LP+TWY   +   + +T M +RP  S GYPGRTY+FY G T+Y FG GL
Sbjct: 576 HPLRIPQ--SWLPVTWYPASFADKVSMTDMRMRPDSSTGYPGRTYRFYTGDTVYAFGDGL 633

Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLN-YTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
           SYT+F ++L+S  + + V L +   C   + ++ +A+   C  +          F+  + 
Sbjct: 634 SYTKFAHSLVSAPEQVAVQLAEGHACHTEHCFSVEAAGEHCGSL---------SFDVHLR 684

Query: 701 FQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIV 760
            +N G   G   V ++S PP+ + +   K ++GF++V +  G+   + F  + CK L++V
Sbjct: 685 VRNAGGMAGGHTVFLFSSPPS-VHSAPAKHLLGFEKVSLEPGQAGVVAFKVDVCKDLSVV 743

Query: 761 DYAANTLLPAGEHTIFVGN 779
           D   N  +  G HT+ VG+
Sbjct: 744 DELGNRKVALGSHTLHVGD 762


>gi|356552866|ref|XP_003544783.1| PREDICTED: probable beta-D-xylosidase 7-like [Glycine max]
          Length = 776

 Score =  654 bits (1688), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/775 (43%), Positives = 493/775 (63%), Gaps = 41/775 (5%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           S+ P + CD    S        + FC++ LP S R +DLVSR+TLDEK+ QL + A  +P
Sbjct: 25  STQPPYSCDSSSNSPY------YPFCNTRLPISKRAQDLVSRLTLDEKLAQLVNTAPAIP 78

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P Y+WWSEALHGV++ G G  F+  I  ATSFP VILT ASF+ +LW +I + +  
Sbjct: 79  RLGIPSYQWWSEALHGVADAGFGIRFNGTIKSATSFPQVILTAASFDPNLWYQISKTIGK 138

Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ--DVE 206
           EARA+YN G+A G+T+W+PNINV RDPRWGR  ET GEDP +  +Y V YVRGLQ    E
Sbjct: 139 EARAVYNAGQATGMTFWAPNINVFRDPRWGRGQETAGEDPLMNAKYGVAYVRGLQGDSFE 198

Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
           G +    L  R L+ S+CCKH+ AYD+D+WKG+DR+ +DARVT QD+ +T+  PF+ C++
Sbjct: 199 GGK----LGER-LQASACCKHFTAYDLDHWKGLDRFVYDARVTSQDLADTYQPPFQSCIE 253

Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
           +G AS +MC+YNRVNG+P+CA+  LL +T R +W   GYI +DC ++ ++ D   + A +
Sbjct: 254 QGRASGIMCAYNRVNGVPNCANFNLLTKTARQQWKFDGYITSDCGAVSIIHDEQGY-AKT 312

Query: 327 KEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
            EDA+A   +AG+D++CG Y T    +AV Q K+  + ID++L+ L+++ +RLG  DG+P
Sbjct: 313 AEDAIADVFRAGMDVECGDYITKHGKSAVSQKKLPISQIDRALQNLFSIRIRLGLLDGNP 372

Query: 387 Q---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
               + ++G   +CS ++++LA EAAR+GIVLLKN  + LPL      T+A++GP+ANA+
Sbjct: 373 TKLPFGTIGPDQVCSKQSLQLALEAARDGIVLLKNTNSLLPLPKTN-PTIALIGPNANAS 431

Query: 444 VAM-IGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATI 501
             + +GNY G PC  ++ + GF GYA  T Y  GCDD    +   I  A E AK  D  +
Sbjct: 432 SKVFLGNYYGRPCNLVTLLQGFEGYAKDTVYHPGCDDGPQCAYAQIEGAVEVAKKVDYVV 491

Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
           ++ GLD S E ES DRE L LPG Q +LI  VA  +K PV+LV++  G VDI  A+ +  
Sbjct: 492 LVMGLDQSQERESHDREYLGLPGKQEELIKSVARASKRPVVLVLLCGGPVDITSAKFDDK 551

Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
           +  ILWAGYPGE GG A+A VVFG  NPGG+LPITWY  D+++ +P+T M +R   + GY
Sbjct: 552 VGGILWAGYPGELGGVALAQVVFGDHNPGGKLPITWYPKDFIK-VPMTDMRMRADPASGY 610

Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTK-TIQVNLNK----LQHCRNLNY--TS 674
           PGRTY+FY GP +Y FGYGLSYT++ Y LLS +  T+ +N +      Q+   + Y   S
Sbjct: 611 PGRTYRFYTGPKVYEFGYGLSYTKYSYKLLSLSHNTLHINQSSTHLTTQNSETIRYKLVS 670

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP-PAEIAATYIKQVIG 733
           + ++  C  +L++           +   N G+  G   V+++ +          +KQ++G
Sbjct: 671 ELAEETCQTMLLS---------IALGVTNHGNMAGKHPVLLFVRQGKVRNNGNPVKQLVG 721

Query: 734 FQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
           FQ V + AG   ++ F  + C+ L++ + A + ++  G + + VG+    +PI +
Sbjct: 722 FQSVKLNAGETVQVGFELSPCEHLSVANEAGSMVIEEGSYLLLVGD--QEYPIEI 774


>gi|62701898|gb|AAX92971.1| beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|62733926|gb|AAX96035.1| beta-D-xylosidase [Oryza sativa Japonica Group]
 gi|77550045|gb|ABA92842.1| Glycosyl hydrolase family 3 C terminal domain containing protein,
           expressed [Oryza sativa Japonica Group]
 gi|125576900|gb|EAZ18122.1| hypothetical protein OsJ_33667 [Oryza sativa Japonica Group]
          Length = 771

 Score =  652 bits (1683), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 341/756 (45%), Positives = 464/756 (61%), Gaps = 29/756 (3%)

Query: 33  PVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
           P + C P   S      S + FCD+ LP + R  DLVSR+T  EKV QLGD A GVPRLG
Sbjct: 25  PPYSCGPRSPS------SGYAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVPRLG 78

Query: 93  LPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR 152
           +P Y+WWSE LHG+S  G G HF+  +   TSFP V+LT A+F++ LW +IGQA+ TEAR
Sbjct: 79  VPPYKWWSEGLHGLSYWGHGMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEAR 138

Query: 153 AMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
           A+YNLG+A GLT WSPN+N+ RDPRWGR  ETPGEDP    +YAV +V+GLQ   G    
Sbjct: 139 ALYNLGQAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQ---GSTPG 195

Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
           T      L+ S+CCKH  AYD++ W GV RY+F+A+VT QD+ +TF  PF+ CV +  AS
Sbjct: 196 T------LQTSACCKHATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKAS 249

Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
            VMC+Y  +NG+P+CA   LL++T RG+W L GY+ +DCD++ ++ D  ++ A + ED V
Sbjct: 250 CVMCAYTDINGVPACASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRY-APTPEDTV 308

Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---- 387
           A  +KAGLDL+CG Y       A+QQGK++E+D+D++L  L+ V MRLG FDG P+    
Sbjct: 309 AVAIKAGLDLNCGNYTQVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAA 368

Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
           Y  LG  D+C+  + +LA EAA++GIVLLKND   LPL+ A V++ AV+GP+AN   A+ 
Sbjct: 369 YGHLGAADVCTQAHRDLALEAAQDGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALN 428

Query: 448 GNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
           GNY G PC   +P+ G   Y ++V +  GCD  AC    +   A+  A ++D  I+  GL
Sbjct: 429 GNYFGPPCETTTPLQGVQRYISSVRFLAGCDSPAC-GFAATGQAAALASSSDQVIMFMGL 487

Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
               E E LDR  L LPG Q  LI  VA  A+ PVILV+++ G VD+ FA+ N  I AIL
Sbjct: 488 SQDQEKEGLDRTSLLLPGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAIL 547

Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
           WAGYPG+ GG AIA V+FG  NP GRLP+TWY  ++ + +P+T M +R   + GYPGR+Y
Sbjct: 548 WAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTR-IPMTDMRMRADPATGYPGRSY 606

Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV 686
           +FY G  +Y FGYGLSY++F   L++  K  + N N L                     +
Sbjct: 607 RFYQGNPVYKFGYGLSYSKFSRRLVAAAKPRRPNRNLLAGVIPKPAGDGGESYHVE--EI 664

Query: 687 NDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY--IKQVIGFQRVFVRAGR 743
            +  C+   F   V+  N G  DG   V+V+ + P   A      +Q++GF    VRAG 
Sbjct: 665 GEEGCERLKFPATVEVHNHGPMDGKHSVLVFVRWPNATAGASRPARQLVGFSSQHVRAGE 724

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             R+    N C+ L+        ++  G H + VG 
Sbjct: 725 KARLTMEINPCEHLSRAREDGTKVIDRGSHFLKVGE 760


>gi|413925166|gb|AFW65098.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 830

 Score =  652 bits (1683), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 351/757 (46%), Positives = 462/757 (61%), Gaps = 29/757 (3%)

Query: 33  PVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
           P F C  G    LGL      FC++ LP + R  DLVSRMT  EK  QLGD A+GVPRLG
Sbjct: 84  PPFSCGGG--PSLGLP-----FCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLG 136

Query: 93  LPQYEWWSEALHGVSNVGPGTHFD-DVIPGATSFPTVILTTASFNESLWKKIGQAVSTEA 151
           +P Y+WW+EALHGV+  G G H D   +  ATSFP V+LT ASFN++LW +IGQA   EA
Sbjct: 137 VPSYKWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEA 196

Query: 152 RAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
           RA YN+G+A GLT WSPN+N+ RDPRWGR  ETPGEDP V  RYA  +VRGLQ      +
Sbjct: 197 RAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQG-----S 251

Query: 211 ATDLNSRP--LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
           +++  S P  L  S+CCKH  AYD+++WKGV RY F A VT QD+ +TF  PF  CV +G
Sbjct: 252 SSNTKSVPPVLLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDG 311

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG-YIVADCDSIQVMVDNHKFLADSK 327
            AS VMC+Y  VNG+PSCA+  LL +T RG W L G Y+ ADCD++ +M  N +F   + 
Sbjct: 312 KASCVMCAYTSVNGVPSCANADLLTKTFRGSWGLDGRYVAADCDAVSIM-RNSQFYRPTA 370

Query: 328 EDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ 387
           ED VA TLKAGLD+DCG Y       A+Q+GK+ + D+DK++K L+T  MRLG FDG P+
Sbjct: 371 EDTVATTLKAGLDIDCGPYVQQHAMAAIQKGKLTQQDVDKAVKNLFTTRMRLGHFDGDPK 430

Query: 388 ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
              Y +LG   IC+ E+  LA EAA +GIVLLKN    LPL    V + AV+G +AN  +
Sbjct: 431 AHVYGNLGAAHICTQEHKNLALEAALDGIVLLKNSAGVLPLKRGSVASAAVIGHNANDVL 490

Query: 445 AMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
           A++GNY G PC   +P+ G  GY  NV +  GC   AC    +  AA+ A+ T+D+ I+ 
Sbjct: 491 ALLGNYWGPPCAPTTPLQGIQGYVKNVRFLAGCHKAACNVAATPQAAALAS-TSDSVILF 549

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
            GL    E+E  DR  L LPG Q  LI  VA  AK PVILV+++ G VDI FA+ N  I 
Sbjct: 550 MGLSQEQESEGKDRTTLLLPGNQQSLITAVANAAKRPVILVLLTGGPVDITFAQANPKIG 609

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
           AILWAGYPG+ GG AIA V+FG+ NP GRLP+TWY  ++ + +P+T M +R   S  YPG
Sbjct: 610 AILWAGYPGQAGGLAIAKVLFGEKNPSGRLPVTWYPEEFTK-VPMTDMRMRSAGS--YPG 666

Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
           R+Y+FY G T+Y FGYGLSY++F + +++       N   L    +   T D        
Sbjct: 667 RSYRFYKGKTIYKFGYGLSYSKFSHRVVTARNNPAHNTTLLLAAGHAATTEDNLSYHVDH 726

Query: 684 VLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
             + D  C    F   V  QN G  DG    +++ + P        +Q++GFQ   ++AG
Sbjct: 727 --IGDELCRQLKFLAVVKVQNHGPMDGKHTALMFLRWPNATDGRPARQLVGFQSQHIKAG 784

Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
               ++F  + C+  + V      ++  G H + VG 
Sbjct: 785 EKAHLRFEVSPCEDFSRVRDDGRKVIDKGSHFLKVGK 821


>gi|168046596|ref|XP_001775759.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672911|gb|EDQ59442.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 784

 Score =  652 bits (1682), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 341/766 (44%), Positives = 486/766 (63%), Gaps = 41/766 (5%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
           + CDP   + L      F FC++S+    RV+DL+SR+T+ EK++QL + A  V RLG+P
Sbjct: 20  YACDPDGPADL-----LFPFCNTSISDDDRVEDLISRLTIQEKIEQLVNTAANVSRLGIP 74

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            Y+WW E LHGV+ + P  +F    P ATSFP   L+  S+N +LW KIGQ VSTE RAM
Sbjct: 75  PYQWWGEGLHGVA-ISPSVYFGGATPAATSFPLPCLSVCSYNRTLWNKIGQVVSTEGRAM 133

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN---A 211
           YN GR+GLTYWSPNIN+ARDPRWGR  ETPGEDP +   YAV++V+GLQ+ +  +N   A
Sbjct: 134 YNQGRSGLTYWSPNINIARDPRWGRTQETPGEDPKLSSGYAVHFVKGLQEGDYDQNQPQA 193

Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
                R LK+S+CCKH+ A+D+D WK  DR HFD++VT+QD+E+T+   F+ CVKEG +S
Sbjct: 194 VSRGPRRLKISACCKHFTAHDLDRWKDYDRDHFDSKVTQQDLEDTYNPSFKSCVKEGQSS 253

Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
           SVMCSYNR+NGIP C   +LL  TVR +W   GYIV+DCD++ ++ D   + A + EDAV
Sbjct: 254 SVMCSYNRLNGIPMCTHYELLTLTVRNQWGFDGYIVSDCDAVALIHDYINY-APTSEDAV 312

Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---Y 388
           +  + AG+DL+CG         A+ +  + E  ID  L+ L+ V MRLG FDG+P    Y
Sbjct: 313 SYVMLAGMDLNCGSTTLVHGLAALDKKLIWEGLIDMHLRNLFRVRMRLGMFDGNPSTLPY 372

Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
            SLG +D+C+++N  LA EAAR+ +VLLKN++N LP        +AV+G HA+AT  M+G
Sbjct: 373 GSLGPEDMCTEDNQHLALEAARQSLVLLKNEKNALPWKKTHGLKLAVIGHHADATREMLG 432

Query: 449 NYAGIPCRYMSPIAGFSGYAN-----VTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
           NY G PC+++SP+ GF+   +     ++++ GC D AC+    I+AA EAA  ADA +++
Sbjct: 433 NYEGYPCKFVSPLQGFAKVLSDHSPRISHERGCSDAACEDQFYIYAAKEAAAQADAVVLV 492

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAETNTNI 562
            G+  + E E  DR+ L LPG Q +L++ V E + G PV+LV++S   +D++FA  +  I
Sbjct: 493 LGISQAQEKEGRDRDSLLLPGRQMELVSSVVEASAGRPVVLVLLSGSPLDVSFANDDPRI 552

Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
           ++I+WAGYPG+ GG AIA+ +FG  NPGGRL  +WY  +Y   + +++M +RP  S GYP
Sbjct: 553 QSIIWAGYPGQSGGEAIAEAIFGLVNPGGRLAQSWYYENYTN-IDMSNMNMRPNASTGYP 611

Query: 623 GRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP 682
           GRTY+F+    L+ FG+GLSY+ FKY ++S  ++I     + Q C     +SD +     
Sbjct: 612 GRTYRFFTDTPLWEFGHGLSYSDFKYTMVSAPQSIMAPHLRYQLC-----SSDRA----- 661

Query: 683 GVLVNDLRCDDY---------FEFKVDFQNVGSTDGSDVVIVYSKPPAE-IAATYIKQVI 732
            V+ +DL C  Y         F  +V   N G   G   V+++SKPP+  I    +KQ++
Sbjct: 662 -VMTSDLNCLHYEKEACKESSFHVRVWVINHGPLSGDHSVLLFSKPPSRGIDGIPLKQLV 720

Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
            F+RV + AG  + I F  N C+ L  V       +  GEHT+ VG
Sbjct: 721 SFERVHLEAGAGQEILFKVNPCEDLGTVGDDGIRTVELGEHTLMVG 766


>gi|242076578|ref|XP_002448225.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
 gi|241939408|gb|EES12553.1| hypothetical protein SORBIDRAFT_06g023450 [Sorghum bicolor]
          Length = 766

 Score =  651 bits (1679), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 328/755 (43%), Positives = 483/755 (63%), Gaps = 36/755 (4%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
           S++ FCD+SL    R + LVS +TLDEK+ QL + A GVPRLG+P Y+WWSE+LHG+++ 
Sbjct: 32  SAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSESLHGLADN 91

Query: 110 GPGTHFDD-VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
           GPG +F    +  AT+FP VIL+TA+FN SLW+ + +AV+TEA  M+N G+AGLTYW+PN
Sbjct: 92  GPGVNFSSGPVRAATTFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQAGLTYWAPN 151

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           IN+ RDPRWGR  ET GEDP V   Y++ YV+G Q  +G E         +++S+CCKHY
Sbjct: 152 INIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQGEQGEEGR-------IRLSACCKHY 204

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
            AYD++ W+G  RY F+A+V  QD+E+T+  PF+ C++E  AS +MC+YN+VNG+P CA+
Sbjct: 205 TAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQVNGVPMCAN 264

Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
             LL +T R EW   GYI +DCD++ ++ +N  +   S ED++A  LKAG+D++CG +  
Sbjct: 265 KDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTY-TKSDEDSIAIVLKAGMDINCGSFLV 322

Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVSLGKQDICSDENIELA 405
             T +AV++GKV+E DID++L  L++V +RLG FD    +     LG  ++C+ E+ ELA
Sbjct: 323 RHTKSAVEKGKVQEQDIDRALFNLFSVQLRLGIFDKPNNNQWSTQLGPNNVCTKEHRELA 382

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
           AEA R+G VLLKND + LPL  ++V+ VA++GP AN   AM G+Y G+ C   + + G  
Sbjct: 383 AEAVRQGAVLLKNDHSFLPLKRSEVRHVAIIGPSANDVYAMGGDYTGVACNPTTFLKGIQ 442

Query: 466 GYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
            YA   T+  GC DV+C S      A  AAK AD  +++AGL+L+ E E  DR  L LPG
Sbjct: 443 AYATQTTFAAGCKDVSCNSTELFGEAIAAAKRADIVVVVAGLNLTEEREDFDRVSLLLPG 502

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q  LI+ VA VAK P++LV++  G VD++FA+ +  I +ILW GYPGE GG+ + +++F
Sbjct: 503 KQMSLIHAVASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQVLPEILF 562

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G++NPGG+L +TWY   +   +P+T M +R   S GYPGRTY+FY G  +Y FGYGLSY+
Sbjct: 563 GEYNPGGKLAMTWYPESFT-AIPMTDMNMRADPSRGYPGRTYRFYTGDVVYGFGYGLSYS 621

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND----LRCDDY------ 694
           ++ Y++LS  K I ++ + +     L+  S     R P  +  D    ++ +D       
Sbjct: 622 KYSYSILSAPKKITMSRSSV-----LDIIS-----RKPSYIRRDGLDFVKTEDIASCEAL 671

Query: 695 -FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
            F   V   N GS DGS  V+++++  + +    IKQ++GF+RV   AG    ++   + 
Sbjct: 672 AFSVHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFERVHTAAGSASNVEISVDP 731

Query: 754 CKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
           CK ++  +     +L  G+H + VG+      I L
Sbjct: 732 CKHMSAANPEGKRVLLLGDHVLTVGDEEFELFIEL 766


>gi|357164885|ref|XP_003580200.1| PREDICTED: probable beta-D-xylosidase 6-like [Brachypodium
           distachyon]
          Length = 771

 Score =  651 bits (1679), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 325/743 (43%), Positives = 473/743 (63%), Gaps = 32/743 (4%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + FCD+SLP+ +R + LVS +TLDEK+ QL + A GVPRLG+P YEWWSE+LHG+++ GP
Sbjct: 37  YPFCDASLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGIPPYEWWSESLHGLADNGP 96

Query: 112 GTHFDD-VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
           G +F    +  AT FP VIL+ ASFN SLW+ + +AV+ EARAM+N G+AGLTYW+PNIN
Sbjct: 97  GVNFSSGPVGAATIFPQVILSAASFNRSLWRAVAEAVAVEARAMHNAGQAGLTYWAPNIN 156

Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
           V RDPRWGR  ETPGEDP V+  Y+V YV+G Q   G     D     + +S+CCKHY A
Sbjct: 157 VFRDPRWGRGQETPGEDPAVIAAYSVEYVKGFQGEYG-----DGKEGRMMLSACCKHYVA 211

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
           YD++ W    RY F+A+V EQD E+T+  PF+ C++EG AS +MCSYN+VNG+P+CA   
Sbjct: 212 YDLEKWGNFTRYTFNAKVNEQDFEDTYEPPFKSCIQEGRASCLMCSYNQVNGVPACARKD 271

Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
           LL Q VR EW   GY+V+DCD++ ++     +  +S ED++A  LKAG+D++CG +    
Sbjct: 272 LL-QKVRDEWGFQGYVVSDCDAVGIIYGYQNY-TNSDEDSIAIVLKAGMDINCGSFLIRH 329

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVSLGKQDICSDENIELAAE 407
           T +A+Q+GK+ E DI+ +L  L++V +RLG FD   G+  +  LG  +IC+ E+ ELAAE
Sbjct: 330 TKSAIQKGKITEEDINHALFNLFSVQLRLGLFDKTSGNQWFTQLGPSNICTKEHRELAAE 389

Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY 467
           AAR+G VLLKND + LPL  ++V  +A++GP AN    M G+Y G+PC   + + G    
Sbjct: 390 AARQGTVLLKNDNSFLPLKRSEVSHIAIIGPVANDAYIMGGDYTGVPCNPTTFLKGMQAV 449

Query: 468 A-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
               T   GC D++C S +    A E AK AD  +++AGL+L+ E E LDR  L LPG Q
Sbjct: 450 VPQTTIAAGCKDISCNSTDGFGEAIEVAKRADIVVLIAGLNLTQETEDLDRVSLLLPGKQ 509

Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
             LIN +A V K P++LVI   G VD++FA+ +  I ++LW GYPGE GG+ + +++FG+
Sbjct: 510 MDLINSIASVTKKPLVLVITGGGPVDVSFAKQDKRIASVLWIGYPGEVGGQVLPEILFGE 569

Query: 587 FNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
           +NPGG+LPITWY   +   +P+  M +R   S  YPGRTY+FY G  +Y FGYGLSY+++
Sbjct: 570 YNPGGKLPITWYPESFT-AVPMNDMNMRADPSRSYPGRTYRFYTGDVVYGFGYGLSYSKY 628

Query: 647 KYNL------LSFTKTIQVNL--NKLQHCRN--LNYTSDASKTRCPGVLVNDLRCDDYFE 696
            YN+      +S +++  V+    K  H R   L+Y        C  +          F 
Sbjct: 629 SYNIIQAPTKISLSRSSAVDFISTKRAHTRRDGLDYVQVEDIASCESI---------KFS 679

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
             +   N G+ DGS  V+++++  + +    +KQ++GF+R++  AG+   ++   + CK 
Sbjct: 680 VHISVANDGAMDGSHAVLLFTRSKSSVPGFPLKQLVGFERLYAAAGKATNVEITVDPCKL 739

Query: 757 LNIVDYAANTLLPAGEHTIFVGN 779
           ++  +     +L  G H + VG+
Sbjct: 740 MSSANTEGRRVLLLGSHLLMVGD 762


>gi|326517420|dbj|BAK00077.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 781

 Score =  650 bits (1677), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 343/755 (45%), Positives = 470/755 (62%), Gaps = 24/755 (3%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           ++ P F C P   +        + FCD++LP + R  DLV+R+T  EKV QLGD A GVP
Sbjct: 33  AADPPFSCGPSSTAA----TQGYAFCDATLPVAQRAADLVARLTTAEKVAQLGDEAAGVP 88

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P Y+WW+EALHG++  G G HF+  +  ATSFP V LT A+F++ LW +IGQA+  
Sbjct: 89  RLGVPAYKWWNEALHGLATSGKGLHFNGAVRSATSFPQVSLTAAAFDDDLWLRIGQAIGR 148

Query: 150 EARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
           EARA+YN+G+A GLT WSPN+N+ RDPRWGR  ETPGEDP    RY V +V+GLQ     
Sbjct: 149 EARALYNVGQAEGLTMWSPNVNIYRDPRWGRGQETPGEDPTTASRYGVAFVKGLQ----- 203

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
                 +S  L+ S+CCKH  AYD+++W GV RY+FDARVT QD+E+T+  PF  CV +G
Sbjct: 204 --GNSTSSSLLQTSACCKHATAYDLEDWGGVARYNFDARVTAQDLEDTYNPPFRSCVVDG 261

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
            AS VMC+Y  +NG+P+CA+  LL  TVR +W L GY+ +DCD++ +M D  ++ A + E
Sbjct: 262 KASCVMCAYTAINGVPACANSGLLTNTVRADWGLDGYVASDCDAVAIMRDAQRY-APTPE 320

Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
           DAVA  LKAGLD+DCG Y       A+QQGK+ E D+DK+LK L+ + MRLG FDG P+ 
Sbjct: 321 DAVALALKAGLDIDCGTYMQQHAPAALQQGKITEDDVDKALKNLFAIRMRLGHFDGDPRA 380

Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
             Y  L    IC+ E+  LA EAA++GIVLLKND   LPL+ A + + AV+GP+AN    
Sbjct: 381 NIYGGLNAAHICTPEHRSLALEAAQDGIVLLKNDAGILPLDRAAIASAAVIGPNANNPGL 440

Query: 446 MIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
           +IGNY G PC  ++P+ G  GY  +V +  GC   AC   ++  AA+ A  ++D  ++  
Sbjct: 441 LIGNYFGPPCESVTPLKGVQGYVKDVRFMAGCGSAACDVADTDQAATLAG-SSDYVLLFM 499

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           GL    E+E  DR  L LPG Q  LI  VA+ AK PVILV+++ G VD+ FA+ N  I A
Sbjct: 500 GLSQQQESEGRDRTSLLLPGQQQSLITAVADAAKRPVILVLLTGGPVDVTFAKNNPKIGA 559

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           ILWAGYPG+ GG AIA V+FG  NPGGRLP+TWY  ++ + +P+T M +R   + GYPGR
Sbjct: 560 ILWAGYPGQAGGLAIARVLFGDHNPGGRLPVTWYPEEFTK-VPMTDMRMRADPATGYPGR 618

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           +Y+FY G T+Y FGYGLSY+ +   LLS       N + L     +   ++        V
Sbjct: 619 SYRFYQGETVYKFGYGLSYSSYSRRLLSSGTP---NTDLLAGLSTMPTPAEEGGVASYHV 675

Query: 685 LVNDLRCDDYFEFK--VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
                R  +  +F   V+ +N G  DG   V++Y +     A    KQ+IGF+R  ++AG
Sbjct: 676 EHIGARGCEQLKFPAVVEVENHGPMDGKHSVLMYLRWANATAGRPAKQLIGFRRQHLKAG 735

Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
               + F  + C+  + V    N ++  G H + V
Sbjct: 736 EKASLTFDISPCEHFSRVRKDGNKVVDRGSHFLMV 770


>gi|326491679|dbj|BAJ94317.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 772

 Score =  649 bits (1675), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/742 (43%), Positives = 482/742 (64%), Gaps = 22/742 (2%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           + +S+ FCD SLP+ +R + LVS +TLDEK+ QL + A GVPRLG+P YEWWSE+LHG++
Sbjct: 34  EANSYAFCDGSLPFPVRARALVSLLTLDEKIAQLSNTAAGVPRLGVPPYEWWSESLHGLA 93

Query: 108 NVGPGTHFDD-VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
           + GPG +F    +  AT FP VIL+ A+FN SLW+ + +AV+ EARAM+N G+AGLTYW+
Sbjct: 94  DNGPGVNFSSGPVAAATIFPQVILSAAAFNRSLWRAVAEAVAVEARAMHNAGQAGLTYWA 153

Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCK 226
           PNINV RDPRWGR  ETPGEDP ++  Y+V YV+G Q   G     D     + +S+CCK
Sbjct: 154 PNINVFRDPRWGRGQETPGEDPAMIAAYSVEYVKGFQGEYG-----DGREGRMMLSACCK 208

Query: 227 HYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
           HY AYD++ W    RY F+A V  QD E+T+  PF+ C++EG AS +MCSYN+VNG+P+C
Sbjct: 209 HYIAYDLEKWGKFARYTFNAEVNAQDFEDTYEPPFKSCIQEGRASCLMCSYNQVNGVPAC 268

Query: 287 ADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
           A   LL Q +R EW   GYIV+DCD++ ++ +N  + + S ED+VA  LKAG+D++CG +
Sbjct: 269 ARKDLL-QKIRDEWGFKGYIVSDCDAVAIIHENQTYTS-SDEDSVAIVLKAGMDVNCGSF 326

Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIE 403
               T +A+++GK++E DI+ +L  L++V +RLG F+ + +   +  LG  ++C+ E+ E
Sbjct: 327 LIRHTKSAIEKGKIQEEDINHALYNLFSVQLRLGLFEKANENQWFTRLGPSNVCTKEHRE 386

Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
           LAAEA R+G VLLKND + LPL  +KV  +A++G  AN    M G+Y G+PC  ++ + G
Sbjct: 387 LAAEAVRQGTVLLKNDNSFLPLKRSKVSHIALIGAAANDAYIMGGDYTGVPCDPITFLKG 446

Query: 464 FSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
              +    T   GC DV+C S +    A EAAK AD  +++AGL+L+ E+E LDR  L L
Sbjct: 447 MQAFVPQTTVAAGCKDVSCDSPDGFGEAIEAAKRADIVVVIAGLNLTQESEDLDRVTLLL 506

Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
           PG Q  L+N +A V K P++LVI   G VD+AFA+ +  I ++LW GYPGE GG+ + ++
Sbjct: 507 PGRQQDLVNIIASVTKKPIVLVITGGGPVDVAFAKQDPRIASVLWIGYPGEVGGQVLPEI 566

Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
           +FG++NPGG+LP+TWY   +   +P+  M +R   S GYPGRTY+FY G  +Y FGYGLS
Sbjct: 567 LFGEYNPGGKLPMTWYPESFT-AVPMNDMNMRADPSRGYPGRTYRFYTGEVVYGFGYGLS 625

Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG---VLVNDL-RCDDY-FEF 697
           Y+++ YN++   + I ++ + +        +   + TR  G   V V D+  C+   F  
Sbjct: 626 YSKYSYNIVQAPQRISLSHSPVPGL----ISRKPAYTRRDGLDYVQVEDIASCESLVFSV 681

Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
            +   N G+ DGS  V+++++  + +    +KQ++GF+RV+  AG +K +    + CK +
Sbjct: 682 HISVANDGAMDGSHAVLLFARSKSSVPGFPLKQLVGFERVYTAAGSSKNVAITVDPCKYM 741

Query: 758 NIVDYAANTLLPAGEHTIFVGN 779
           +  +     +L  G H + VG+
Sbjct: 742 SAANTEGRRVLLLGSHHLMVGD 763


>gi|125534137|gb|EAY80685.1| hypothetical protein OsI_35867 [Oryza sativa Indica Group]
          Length = 779

 Score =  649 bits (1674), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 332/755 (43%), Positives = 461/755 (61%), Gaps = 29/755 (3%)

Query: 32  SPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL 91
           +P F C P    K       F FC+++LP   R  DLV+R+T  EKV QLGD A GVPRL
Sbjct: 34  NPGFTCGPASAQK------GFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRL 87

Query: 92  GLPQYEWWSEALHGVSNVGPGTHFDD-VIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
           G+P Y+WWSEALHG++  G G HF +     ATSFP VI T A+F++ LW +IGQA+  E
Sbjct: 88  GIPVYKWWSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKE 147

Query: 151 ARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
            RA YNLG+A GL  WSPN+N+ RDPRWGR  ETPGEDP    +Y   +V+GLQ      
Sbjct: 148 GRAFYNLGQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQ------ 201

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
             + L +  L+ S+CCKH  AYD++ WKGV RY+F+A+VT QD+ +T+  PF  CV +G 
Sbjct: 202 -GSSLTN--LQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGK 258

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           AS +MC+Y  +NG+P+CA   LL +TVRGEW L GY  +DCD++ ++  +  F   + E+
Sbjct: 259 ASCIMCAYTLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHF-TRTAEE 317

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
           AVA  LKAGLD++CG Y      +A+QQGK+ E D+DK+LK L+ + MRLG FDG P+  
Sbjct: 318 AVAVALKAGLDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGN 377

Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
             Y  LG  D+C+  +  LA EAAR G+VLLKND   LPL +  V + AV+G +AN  +A
Sbjct: 378 KLYGRLGAADVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVSSAAVIGHNANDILA 437

Query: 446 MIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
           ++GNY G+PC   +P  G   Y  +  +  GC   AC    +   A+  AK++D   ++ 
Sbjct: 438 LLGNYYGLPCETTTPFGGIQKYVKSAKFLPGCSSAACDV-AATDQATALAKSSDYVFLVM 496

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           GL    E E LDR  L LPG Q  LI  VA  +K PVIL++++ G VDI FA+TN  I A
Sbjct: 497 GLSQKQEQEGLDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGA 556

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           ILWAGYPG+ GG+AIADV+FG+FNP G+LP+TWY  ++ +   +T M +RP  + GYPGR
Sbjct: 557 ILWAGYPGQAGGQAIADVLFGEFNPSGKLPVTWYPEEFTKFT-MTDMRMRPDPATGYPGR 615

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
           +Y+FY G T+Y FGYGLSY++F   ++S    +       L   R        +  R   
Sbjct: 616 SYRFYKGKTVYKFGYGLSYSKFACRIVSGAGNSSSYGKAALAGLRAATTPEGDAVYRVD- 674

Query: 684 VLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
             + D RC+   F   V+ QN G  DG   V+++ +  +      ++Q+IGF+   ++ G
Sbjct: 675 -EIGDDRCERLRFPVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVG 733

Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
             K++K   + C+ L+        ++  G H + V
Sbjct: 734 EKKKLKMEISPCEHLSRARVDGEKVIDRGSHFLMV 768


>gi|357489463|ref|XP_003615019.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
 gi|355516354|gb|AES97977.1| hypothetical protein MTR_5g062650 [Medicago truncatula]
          Length = 785

 Score =  649 bits (1673), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 338/756 (44%), Positives = 480/756 (63%), Gaps = 35/756 (4%)

Query: 51  SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
           S+ FC+ +L    R KD+VSR+TLDEK+ QL + A  +PRLG+  Y+WWSEALHGV++ G
Sbjct: 47  SYTFCNLNLTTIQRAKDIVSRLTLDEKLAQLVNTAPAIPRLGIHSYQWWSEALHGVADYG 106

Query: 111 PGTHFDD--VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSP 167
            G   +    I  AT FP VILT ASF+  LW +I + + TEARA+YN G+A G+T+W+P
Sbjct: 107 KGIRLNGNVTIKAATIFPQVILTAASFDSKLWYRISKVIGTEARAVYNAGQAEGMTFWAP 166

Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ--DVEGHENATDLNSRPLKVSSCC 225
           NIN+ RDPRWGR  ET GEDP V  +YAV++VRGLQ    EG +    LN   LK S+CC
Sbjct: 167 NINIFRDPRWGRGQETAGEDPLVSAKYAVSFVRGLQGDSFEGGK----LNEDRLKASACC 222

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+ AYD+DNWKGVDR+ FDA VT QD+ +T+  PF  C+ +G +S +MC+YNRVNGIP+
Sbjct: 223 KHFTAYDLDNWKGVDRFDFDANVTLQDLADTYQPPFHSCIVQGRSSGIMCAYNRVNGIPN 282

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           CAD  LL  T R +W+ +GYI +DC ++ ++ D   + A + EDAVA  L+AG+D++CG 
Sbjct: 283 CADYNLLTNTARKKWNFNGYITSDCSAVDIIHDRQGY-AKAPEDAVADVLQAGMDVECGD 341

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSDENI 402
           Y+T+ + +AV Q KV  + ID++L  L+++ +RLG FDG P   +Y  +G   +CS +N+
Sbjct: 342 YFTSHSKSAVLQKKVPISQIDRALHNLFSIRIRLGLFDGHPTKLKYGKIGPNRVCSKQNL 401

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI-GNYAGIPCRYMSPI 461
            +A EAAR GIVLLKN  + LPL  +   ++ V+GP+AN++  ++ GNY G PC  ++ +
Sbjct: 402 NIALEAARSGIVLLKNAASILPLPKS-TDSIVVIGPNANSSSQVVLGNYFGRPCNLVTIL 460

Query: 462 AGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
            GF  Y+ N+ Y  GC D     +  I  A E AK  D  +++ GLD S E+E  DR+DL
Sbjct: 461 QGFENYSDNLLYHPGCSDGTKCVSAEIDRAVEVAKVVDYVVLVMGLDQSQESEGHDRDDL 520

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
            LPG Q +LIN VA+ +K PVILV+   G VDI+FA+ +  I  ILWAGYPGE GG A+A
Sbjct: 521 ELPGKQQELINSVAKASKRPVILVLFCGGPVDISFAKVDDKIGGILWAGYPGELGGMALA 580

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
            VVFG +NPGGRLP+TWY  D+++ +P+T M +R   S GYPGRTY+FY GP +Y FGYG
Sbjct: 581 QVVFGDYNPGGRLPMTWYPKDFIK-IPMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGYG 639

Query: 641 LSYTQFKYNLLSFTKTIQVNLNK------LQHCRNLNY--TSDASKTRCPGVLVNDLRCD 692
           LSY+ + YN +S  K   +++N+      L+  + ++Y   S+  K  C  + ++     
Sbjct: 640 LSYSNYSYNFIS-VKNNNLHINQSTTYSILEKSQTIHYKLVSELGKKACKTMSIS----- 693

Query: 693 DYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFN 752
                 +   N GS  G   V+++ KP        +KQ++GF+ V V  G    + F  +
Sbjct: 694 ----VTLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVS 749

Query: 753 ACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
            C+ L+  + +   ++  G +   VG    S  I L
Sbjct: 750 VCEHLSRANESGVKVIEEGGYLFLVGELEYSINITL 785


>gi|62734691|gb|AAX96800.1| Glycosyl hydrolase family 3 C terminal domain, putative [Oryza
           sativa Japonica Group]
 gi|77549994|gb|ABA92791.1| beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 853

 Score =  648 bits (1671), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/755 (43%), Positives = 460/755 (60%), Gaps = 29/755 (3%)

Query: 32  SPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL 91
           +P F C P    K       F FC+++LP   R  DLV+R+T  EKV QLGD A GVPRL
Sbjct: 108 NPGFTCGPASAQK------GFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRL 161

Query: 92  GLPQYEWWSEALHGVSNVGPGTHFDD-VIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
           G+P Y+WWSEALHG++  G G HF +     ATSFP VI T A+F++ LW +IGQA+  E
Sbjct: 162 GIPVYKWWSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKE 221

Query: 151 ARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
            RA YNLG+A GL  WSPN+N+ RDPRWGR  ETPGEDP    +Y   +V+GLQ      
Sbjct: 222 GRAFYNLGQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQ------ 275

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
             + L +  L+ S+CCKH  AYD++ WKGV RY+F+A+VT QD+ +T+  PF  CV +G 
Sbjct: 276 -GSSLTN--LQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGK 332

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           AS +MC+Y  +NG+P+CA   LL +TVRGEW L GY  +DCD++ ++  +  F   + E+
Sbjct: 333 ASCIMCAYTLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHF-TRTAEE 391

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
           AVA  LKAGLD++CG Y      +A+QQGK+ E D+DK+LK L+ + MRLG FDG P+  
Sbjct: 392 AVAVALKAGLDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGN 451

Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
             Y  L   D+C+  +  LA EAAR G+VLLKND   LPL +  V + AV+G +AN  +A
Sbjct: 452 KLYGRLSAADVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILA 511

Query: 446 MIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
           ++GNY G+PC   +P  G   Y  +  +  GC   AC    +   A+  AK++D   ++ 
Sbjct: 512 LLGNYYGLPCETTTPFGGIQKYVKSAKFLPGCSSAACDV-AATDQATALAKSSDYVFLVM 570

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           GL    E E LDR  L LPG Q  LI  VA  +K PVIL++++ G VDI FA+TN  I A
Sbjct: 571 GLSQKQEQEGLDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGA 630

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           ILWAGYPG+ GG+AIADV+FG+FNP G+LP+TWY  ++ +   +T M +RP  + GYPGR
Sbjct: 631 ILWAGYPGQAGGQAIADVLFGEFNPSGKLPVTWYPEEFTK-FTMTDMRMRPDPATGYPGR 689

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
           +Y+FY G T+Y FGYGLSY++F   ++S    +       L   R        +  R   
Sbjct: 690 SYRFYKGKTVYKFGYGLSYSKFACRIVSGAGNSSSYGKAALAGLRAATTPEGDAVYRVD- 748

Query: 684 VLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
             + D RC+   F   V+ QN G  DG   V+++ +  +      ++Q+IGF+   ++ G
Sbjct: 749 -EIGDDRCERLRFPVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVG 807

Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
             K++K   + C+ L+        ++  G H + V
Sbjct: 808 EKKKLKMEISPCEHLSRARVDGEKVIDRGSHFLMV 842


>gi|125534112|gb|EAY80660.1| hypothetical protein OsI_35838 [Oryza sativa Indica Group]
          Length = 771

 Score =  647 bits (1670), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 342/756 (45%), Positives = 465/756 (61%), Gaps = 29/756 (3%)

Query: 33  PVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
           P + C P R   LG     + FCD+ LP + R  DLVSR+T  EKV QLGD A GV RLG
Sbjct: 25  PPYSCGP-RSPSLG-----YAFCDARLPPARRAADLVSRLTAAEKVAQLGDEAGGVARLG 78

Query: 93  LPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR 152
           +P Y+WWSE LHG+S  G G HF+  +   TSFP V+LT A+F++ LW +IGQA+ TEAR
Sbjct: 79  VPPYKWWSEGLHGLSYWGHGMHFNGAVTAITSFPQVLLTAAAFDDRLWFRIGQAIGTEAR 138

Query: 153 AMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
           A+YNLG+A GLT WSPN+N+ RDPRWGR  ETPGEDP    +YAV +V+GLQ   G    
Sbjct: 139 ALYNLGQAEGLTIWSPNVNIYRDPRWGRGQETPGEDPTTASKYAVAFVKGLQ---GSTPG 195

Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
           T      L+ S+CCKH  AYD++ W GV RY+F+A+VT QD+ +TF  PF+ CV +  AS
Sbjct: 196 T------LQTSACCKHATAYDLEEWNGVARYNFNAKVTAQDLADTFNPPFKSCVVDAKAS 249

Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
            VMC+Y  +NG+P+CA   LL++T RG+W L GY+ +DCD++ ++ D  ++ A + ED V
Sbjct: 250 CVMCAYTDINGVPACASSDLLSKTFRGQWGLDGYVSSDCDAVALLRDAQRY-APTPEDTV 308

Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---- 387
           A  +KAGLDL+CG Y       A+QQGK++E+D+D++L  L+ V MRLG FDG P+    
Sbjct: 309 AVAIKAGLDLNCGNYTQVHGMAALQQGKMRESDVDRALTNLFAVRMRLGHFDGDPRSNAA 368

Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
           Y  LG  D+C+  + +LA EAA+ GIVLLKND   LPL+ A V++ AV+GP+AN   A+ 
Sbjct: 369 YGHLGAADVCTQAHRDLALEAAQNGIVLLKNDAGALPLDRATVRSAAVIGPNANDPAALN 428

Query: 448 GNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
           GNY G PC   +P+ G   Y ++V +  GCD  AC    +  AA+ A+ ++D  I+  GL
Sbjct: 429 GNYFGPPCETTTPLQGVQRYISSVRFLAGCDSPACGFAATGQAAALAS-SSDQVIMFMGL 487

Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
               E E LDR  L LPG Q  LI  VA  A+ PVILV+++ G VD+ FA+ N  I AIL
Sbjct: 488 SQDQEKEGLDRTSLLLPGKQQSLITAVASAARRPVILVLLTGGPVDVTFAKNNPKIGAIL 547

Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
           WAGYPG+ GG AIA V+FG  NP GRLP+TWY  ++ + +P+T M +R   + GYPGR+Y
Sbjct: 548 WAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTR-IPMTDMRMRADPATGYPGRSY 606

Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV 686
           +FY G  +Y FGYGLSY++F   L++  K  + N N L                     +
Sbjct: 607 RFYQGNPVYKFGYGLSYSKFTRRLVAAAKPRRPNRNLLAGVIPKPAGDGGESYHVE--EI 664

Query: 687 NDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY--IKQVIGFQRVFVRAGR 743
            +  C+   F   V+  N G  DG   V+V+ + P   A      +Q++GF    VRAG 
Sbjct: 665 GEEGCERLKFPATVEVHNHGPMDGKHSVLVFVQWPNATAGASRPARQLVGFSSQHVRAGE 724

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             R+    N C+ L+        ++  G H + VG 
Sbjct: 725 KARLTMEINPCEHLSRARDDGTKVIDRGSHFLKVGE 760


>gi|115485163|ref|NP_001067725.1| Os11g0297300 [Oryza sativa Japonica Group]
 gi|113644947|dbj|BAF28088.1| Os11g0297300 [Oryza sativa Japonica Group]
          Length = 779

 Score =  646 bits (1667), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 331/755 (43%), Positives = 460/755 (60%), Gaps = 29/755 (3%)

Query: 32  SPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL 91
           +P F C P    K       F FC+++LP   R  DLV+R+T  EKV QLGD A GVPRL
Sbjct: 34  NPGFTCGPASAQK------GFAFCNAALPAEQRAADLVARLTTAEKVGQLGDQAPGVPRL 87

Query: 92  GLPQYEWWSEALHGVSNVGPGTHFDD-VIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
           G+P Y+WWSEALHG++  G G HF +     ATSFP VI T A+F++ LW +IGQA+  E
Sbjct: 88  GIPVYKWWSEALHGLAISGKGIHFGNGPARTATSFPQVIHTAAAFDDGLWFRIGQAIGKE 147

Query: 151 ARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
            RA YNLG+A GL  WSPN+N+ RDPRWGR  ETPGEDP    +Y   +V+GLQ      
Sbjct: 148 GRAFYNLGQAEGLAMWSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQ------ 201

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
             + L +  L+ S+CCKH  AYD++ WKGV RY+F+A+VT QD+ +T+  PF  CV +G 
Sbjct: 202 -GSSLTN--LQTSACCKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGK 258

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           AS +MC+Y  +NG+P+CA   LL +TVRGEW L GY  +DCD++ ++  +  F   + E+
Sbjct: 259 ASCIMCAYTLINGVPACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHF-TRTAEE 317

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
           AVA  LKAGLD++CG Y      +A+QQGK+ E D+DK+LK L+ + MRLG FDG P+  
Sbjct: 318 AVAVALKAGLDINCGVYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGN 377

Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
             Y  L   D+C+  +  LA EAAR G+VLLKND   LPL +  V + AV+G +AN  +A
Sbjct: 378 KLYGRLSAADVCTPVHKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILA 437

Query: 446 MIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
           ++GNY G+PC   +P  G   Y  +  +  GC   AC    +   A+  AK++D   ++ 
Sbjct: 438 LLGNYYGLPCETTTPFGGIQKYVKSAKFLPGCSSAACDV-AATDQATALAKSSDYVFLVM 496

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           GL    E E LDR  L LPG Q  LI  VA  +K PVIL++++ G VDI FA+TN  I A
Sbjct: 497 GLSQKQEQEGLDRTSLLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGA 556

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           ILWAGYPG+ GG+AIADV+FG+FNP G+LP+TWY  ++ +   +T M +RP  + GYPGR
Sbjct: 557 ILWAGYPGQAGGQAIADVLFGEFNPSGKLPVTWYPEEFTKFT-MTDMRMRPDPATGYPGR 615

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
           +Y+FY G T+Y FGYGLSY++F   ++S    +       L   R        +  R   
Sbjct: 616 SYRFYKGKTVYKFGYGLSYSKFACRIVSGAGNSSSYGKAALAGLRAATTPEGDAVYRVD- 674

Query: 684 VLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
             + D RC+   F   V+ QN G  DG   V+++ +  +      ++Q+IGF+   ++ G
Sbjct: 675 -EIGDDRCERLRFPVMVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVG 733

Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
             K++K   + C+ L+        ++  G H + V
Sbjct: 734 EKKKLKMEISPCEHLSRARVDGEKVIDRGSHFLMV 768


>gi|195614824|gb|ACG29242.1| auxin-induced beta-glucosidase [Zea mays]
 gi|413920229|gb|AFW60161.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 655

 Score =  638 bits (1645), Expect = e-180,   Method: Compositional matrix adjust.
 Identities = 323/644 (50%), Positives = 420/644 (65%), Gaps = 21/644 (3%)

Query: 154 MYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
           MYN GRAGLT+WSPN+N+ RDPRWGR  ETPGEDP V  RYA  YVRGLQ      N   
Sbjct: 1   MYNGGRAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVSARYAAAYVRGLQQPYAAPNGGH 60

Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
            N   LK+++CCKH+ AYD+D W G DR+HF+A V  QD+E+TF  PF  CV++G A+SV
Sbjct: 61  RNR--LKLAACCKHFTAYDLDKWGGTDRFHFNAVVAAQDLEDTFNVPFRACVEDGRAASV 118

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           MCSYN+VNG+P+CAD   L  T+RG W L GYIV+DCDS+ V   +  +   + EDA A 
Sbjct: 119 MCSYNQVNGVPTCADAAFLRGTIRGRWGLDGYIVSDCDSVDVFFRDQHY-TRTPEDAAAA 177

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVS 390
           TL+AGLDLDCG +   + G+AV  GKV + D+D +L    TV MRLG FDG P    +  
Sbjct: 178 TLRAGLDLDCGPFLALYAGSAVAAGKVADADVDAALLNTVTVQMRLGMFDGDPAAGPFGR 237

Query: 391 LGKQDICSDENIELAAEAAREGIVLLKN------DQNTLPLNSAKVKTVAVVGPHANATV 444
           LG  D+C+ E+ +LA +AAR+G+VLLKN      +++ LPL  A  + VAVVGPHA+ATV
Sbjct: 238 LGPADVCTREHQDLALDAARQGVVLLKNRRGARHNRDVLPLRPAAHRVVAVVGPHADATV 297

Query: 445 AMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
           AMIGNYAG PCRY +P+ G + YA  V ++ GC DVAC+ N  I AA EAA+ ADAT+++
Sbjct: 298 AMIGNYAGKPCRYTTPLQGVAAYAARVAHQAGCTDVACRGNQPIAAAVEAARQADATVVV 357

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
           AGLD  VEAE LDR  L LPG Q +LI+ VA+ +KGPVILV+MS G +DIAFA+ +  I 
Sbjct: 358 AGLDQRVEAEGLDRTTLLLPGRQAELISAVAKASKGPVILVLMSGGPIDIAFAQNDPRID 417

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
            ILW GYPG+ GG+AIADV+FG  NPG +LP+TWY+ DY+Q +P+T+M +R   + GYPG
Sbjct: 418 GILWVGYPGQAGGQAIADVIFGHHNPGAKLPVTWYHQDYLQKVPMTNMAMRANPARGYPG 477

Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP- 682
           RTY+FY GPT+YPFG+GLSYTQF + L      + V L+   H      +   +    P 
Sbjct: 478 RTYRFYTGPTIYPFGHGLSYTQFTHTLAHAPTQLTVRLSGSGHSAASAASLLNATLARPV 537

Query: 683 -GVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP-----AEIAATYIKQVIGFQ 735
             V V   RC+       VD  NVG  DG+  V+VY   P     A  A    +Q++ F+
Sbjct: 538 RAVRVAHARCEGLTVPVHVDVSNVGDRDGAHAVLVYHAAPSPSHAAPGADAPARQLVAFE 597

Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           +V V AG   R++     C  L++ D      +P GEH + +G 
Sbjct: 598 KVHVPAGGVARVEMRIGVCDRLSVADRNGVRRVPVGEHRLMIGE 641


>gi|32488698|emb|CAE03635.1| OSJNBb0003B01.27 [Oryza sativa Japonica Group]
          Length = 839

 Score =  635 bits (1638), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 312/646 (48%), Positives = 436/646 (67%), Gaps = 24/646 (3%)

Query: 139 LWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNY 198
           ++  I   VSTEARAM+N+G AGLT+WSPNIN+ RDPRWGR  ETPGEDP +  +YAV Y
Sbjct: 204 MYNLIVLVVSTEARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLASKYAVGY 263

Query: 199 VRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFL 258
           V GLQD  G  +A       LKV++CCKHY AYDVDNWKGV+RY FDA V++QD+++TF 
Sbjct: 264 VTGLQDAGGGSDA-------LKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQ 316

Query: 259 RPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD 318
            PF+ CV +G+ +SVMCSYN+VNG P+CAD  LL+  +RG+W L+GYIV+DCDS+ V+ +
Sbjct: 317 PPFKSCVIDGNVASVMCSYNKVNGKPTCADKDLLSGVIRGDWKLNGYIVSDCDSVDVLYN 376

Query: 319 NHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMR 378
           N  +   + EDA A T+K+GLDL+CG +    T  AVQ GK+ E+D+D+++   + VLMR
Sbjct: 377 NQHY-TKNPEDAAAITIKSGLDLNCGNFLAQHTVAAVQAGKLSESDVDRAITNNFIVLMR 435

Query: 379 LGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAV 435
           LGFFDG P+   + SLG +D+C+  N ELA EAAR+GIVLLKN    LPL++  +K++AV
Sbjct: 436 LGFFDGDPRKLPFGSLGPKDVCTSSNQELAREAARQGIVLLKN-TGALPLSAKSIKSMAV 494

Query: 436 VGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAA 494
           +GP+ANA+  MIGNY G PC+Y +P+ G        Y+ GC +V C  N+  + AA++AA
Sbjct: 495 IGPNANASFTMIGNYEGTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLSAATQAA 554

Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
            +AD T+++ G D SVE ESLDR  L LPG Q QL++ VA  ++GPVILV+MS G  DI+
Sbjct: 555 ASADVTVLVVGADQSVERESLDRTSLLLPGQQPQLVSAVANASRGPVILVVMSGGPFDIS 614

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           FA+++  I AILW GYPGE GG A+AD++FG  NPGGRLP+TWY   +   + +T M +R
Sbjct: 615 FAKSSDKISAILWVGYPGEAGGAALADILFGYHNPGGRLPVTWYPASFADKVSMTDMRMR 674

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN-YT 673
           P  S GYPGRTY+FY G T+Y FG GLSYT+F ++L+S  + + V L +   C   + ++
Sbjct: 675 PDSSTGYPGRTYRFYTGDTVYAFGDGLSYTKFAHSLVSAPEQVAVQLAEGHACHTEHCFS 734

Query: 674 SDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIG 733
            +A+   C  +          F+  +  +N G   G   V ++S PP+ + +   K ++G
Sbjct: 735 VEAAGEHCGSL---------SFDVHLRVRNAGGMAGGHTVFLFSSPPS-VHSAPAKHLLG 784

Query: 734 FQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           F++V +  G+   + F  + CK L++VD   N  +  G HT+ VG+
Sbjct: 785 FEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGD 830



 Score =  123 bits (308), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 60/116 (51%), Positives = 77/116 (66%), Gaps = 5/116 (4%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           + +PVF CD    +     +S + FCD +   + R  DL+ R+TL EKV  L +    +P
Sbjct: 26  AQTPVFACDASNAT-----VSGYGFCDRTKSSAARAADLLGRLTLAEKVGFLVNKQAALP 80

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQ 145
           RLG+P YEWWSEALHGVS VGPGT F  ++PGATSFP  ILT ASFN SL++ IG+
Sbjct: 81  RLGIPAYEWWSEALHGVSYVGPGTRFSTLVPGATSFPQPILTAASFNASLFRAIGE 136


>gi|297611657|ref|NP_001067709.2| Os11g0291000 [Oryza sativa Japonica Group]
 gi|255680005|dbj|BAF28072.2| Os11g0291000 [Oryza sativa Japonica Group]
          Length = 764

 Score =  634 bits (1636), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 331/746 (44%), Positives = 451/746 (60%), Gaps = 29/746 (3%)

Query: 46  GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
           G Q     FCD+ L    R  DLV+ +TL EKV QLGD A GV RLG+P YEWWSE LHG
Sbjct: 23  GQQQQPHRFCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHG 82

Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTY 164
           +S  G G  F+  +   TSFP VILT A+F+  LW+++G+AV  EARA+YNLG+A GLT 
Sbjct: 83  LSIWGRGIRFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTI 142

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           WSPN+N+ RDPRWGR  ETPGEDP    RYAV +V GLQ + G            + S+C
Sbjct: 143 WSPNVNIFRDPRWGRGQETPGEDPVTASRYAVAFVTGLQGIGG------------EASAC 190

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           CKH  AYD+D W  V RY++D++VT QD+E+T+  PF+ CV EG A+ +MC YN +NG+P
Sbjct: 191 CKHATAYDLDYWNNVVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVP 250

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
           +CA   LL + VR EW ++GY+ +DCD++  + D H +   S ED VA ++K G+D++CG
Sbjct: 251 ACASSDLLTKKVRQEWGMNGYVASDCDAVATIRDAHHYTL-SPEDTVAVSIKVGMDVNCG 309

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDE 400
            Y       AVQ+G + E DID++L  L+ V MRLG FDG P+    Y  LG  D+CS  
Sbjct: 310 NYTQVHAMAAVQKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPA 369

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
           +  LA EAA++GIVLLKND   LPL  + V ++AV+GP+A+   A+ GNY G PC   +P
Sbjct: 370 HKSLALEAAQDGIVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTP 429

Query: 461 IAGFSGYA--NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRE 518
           + G  GY      +  GCD  AC    +  AA+ A+ ++D  ++  GL    E + LDR 
Sbjct: 430 LQGIKGYLGDRARFLAGCDSPACAVAATNEAAALAS-SSDHVVLFMGLSQKQEQDGLDRT 488

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
            L LPG Q  LI  VA  A+ PVILV+++ G VD+ FA+ N  I AILWAGYPG+ GG A
Sbjct: 489 SLLLPGEQQGLITAVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLA 548

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           IA V+FG  NP GRLP+TWY  ++ + +P+T M +R   + GYPGR+Y+FY G T+Y FG
Sbjct: 549 IAKVLFGDHNPSGRLPVTWYPEEFTK-VPMTDMRMRADPATGYPGRSYRFYQGNTVYNFG 607

Query: 639 YGLSYTQFKYNLL-SFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL---RCDDY 694
           YGLSY++F   +  SF+ +   NL+ L          D         LV ++   RC   
Sbjct: 608 YGLSYSKFSRRMFSSFSTSNAGNLSLLAGVMARRAGDDGGGMSS--YLVKEIGVERCSRL 665

Query: 695 -FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
            F   V+ QN G  DG   V++Y + P        +Q+IGF+   V+ G    + F  + 
Sbjct: 666 VFPAVVEVQNHGPMDGKHSVLMYLRWPTTSGGRPARQLIGFRSQHVKVGEKAMVSFEVSP 725

Query: 754 CKSLNIVDYAANTLLPAGEHTIFVGN 779
           C+  + V      ++  G H + VG+
Sbjct: 726 CEHFSWVGEDGERVIDGGAHFLMVGD 751


>gi|357138088|ref|XP_003570630.1| PREDICTED: probable beta-D-xylosidase 7-like [Brachypodium
           distachyon]
          Length = 1026

 Score =  634 bits (1636), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 318/649 (48%), Positives = 434/649 (66%), Gaps = 29/649 (4%)

Query: 13  LSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRM 72
           L + L++ +T A D      P F C            SS+ FCD  LP   R  DL SR+
Sbjct: 12  LPLCLVLQATMATD------PPFSCG---------SPSSYPFCDRKLPIGQRAADLASRL 56

Query: 73  TLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV---GPGTHFDD-VIPGATSFPTV 128
           T++EKV  LGD + GVPRLG+P Y+WWSEALHGV+N      G  FDD  +  ATSFP V
Sbjct: 57  TVEEKVSLLGDVSPGVPRLGVPAYKWWSEALHGVANAPADRAGVRFDDGPVRAATSFPQV 116

Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGED 187
           ++T ASFN  LW +IGQ +  EAR +YN G+A GLT+W+PNINV RDPRWGR  ETPGED
Sbjct: 117 LVTAASFNPHLWYRIGQVIGREARGIYNSGQAEGLTFWAPNINVFRDPRWGRGQETPGED 176

Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
           P + G+YA  +VRG+Q   G+  +  +NS  L+ S+CCKH+ AYD++NW GV R+ F+A+
Sbjct: 177 PTMTGKYAAVFVRGVQ---GYGASGAVNSSGLEASACCKHFTAYDLENWNGVTRFAFNAK 233

Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
           V+EQD+ +T+  PF  CV++G AS +MCSYNRVNG+P+CAD  LL++T RG+W  +GYI 
Sbjct: 234 VSEQDLADTYNPPFRSCVEDGGASGIMCSYNRVNGVPTCADHNLLSKTARGDWRFNGYIT 293

Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDK 367
           +DCD++ ++ D   + A   EDAVA  LKAG+D++CG Y      +A  QGK+ E DID+
Sbjct: 294 SDCDAVAIIHDVQGY-AKEPEDAVADVLKAGMDVNCGDYVQKHGVSAFHQGKITEQDIDR 352

Query: 368 SLKYLYTVLMRLGFFDGSPQYV---SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLP 424
           +L+ L+ + MRLG FDG+P+Y    ++G   +C  E+ +LA EAA++GIVLLKND  TLP
Sbjct: 353 ALQNLFAIRMRLGLFDGNPKYNRYGNIGADQVCKKEHQDLALEAAQDGIVLLKNDAGTLP 412

Query: 425 LNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKS 483
           L   K+ ++AV+G +AN    + GNY G PC  +SP+    GY   T +  GC+   C  
Sbjct: 413 LPKQKISSLAVIGHNANDAQRLQGNYFGPPCISVSPLQALQGYVRETKFVAGCNAAVCNV 472

Query: 484 NNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVIL 543
           ++ I  A++AA  A+  ++  GLD   E E LDR +L LPG Q  L+N VA+ AK PV+L
Sbjct: 473 SD-IAGAAKAASEAEYVVLFMGLDQDQEREDLDRIELGLPGMQESLVNAVADAAKKPVVL 531

Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYV 603
           V++  G VD+ FA+ N  I AI+WAGYPG+ GG AIA V+FG+ NPGGRLP+TWY  +Y 
Sbjct: 532 VLLCGGPVDVTFAKGNPKIGAIIWAGYPGQAGGIAIAQVLFGEHNPGGRLPVTWYPKEYA 591

Query: 604 QMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS 652
             + +T M +R   S GYPGRTY+FY G T+Y FGYGLSY+++ ++ +S
Sbjct: 592 TAVAMTDMRMRADASTGYPGRTYRFYKGKTVYNFGYGLSYSKYSHSFVS 640


>gi|414586138|tpg|DAA36709.1| TPA: putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 769

 Score =  634 bits (1635), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 328/750 (43%), Positives = 478/750 (63%), Gaps = 26/750 (3%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
           S++ FCD+SL    R + LVS +TLDEK+ QL + A GVPRLG+P Y+WWSE+LHG+++ 
Sbjct: 35  SAYPFCDASLSIPARARALVSLLTLDEKIAQLSNTAGGVPRLGIPPYQWWSESLHGLADN 94

Query: 110 GPGTHFDD-VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
           GPG +F    +  AT FP VIL+TA+FN SLW+ + +AV+TEA  M+N G+AGLTYW+PN
Sbjct: 95  GPGVNFSSGPVRAATDFPQVILSTAAFNRSLWRAVAEAVATEALGMHNAGQAGLTYWAPN 154

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           IN+ RDPRWGR  ET GEDP V   Y++ YV+G Q         +     +++S+CCKHY
Sbjct: 155 INIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQ-------GEEGEEGRIRLSACCKHY 207

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
            AYD++ W+G  RY F+A+V  QD+E+T+  PF+ C++E  AS +MC+YN+VNG+P CA 
Sbjct: 208 TAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCLMCAYNQVNGVPMCAH 267

Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
             LL +T R EW   GYI +DCD++ ++ +N  +   S ED++A  LKAG+D++CG +  
Sbjct: 268 KDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTY-TKSGEDSIAIVLKAGMDINCGSFLV 325

Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELA 405
             T +A+++GK++E DID++L  L++V +RLG FD       +  LG   +C+ E+ ELA
Sbjct: 326 RHTKSAIEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQLGPNSVCTKEHRELA 385

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
           AEA R+G VLLKND N LPL  ++V+ VA++GP AN   AM G+Y G+PC   + + G  
Sbjct: 386 AEAVRQGAVLLKNDHNFLPLKRSEVRHVAIIGPSANDAYAMGGDYTGVPCNPTTFLKGIQ 445

Query: 466 GYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
            YA  T +  GC D +C S +    A EAAK AD  +++AGL+L+ E E  DR  L LPG
Sbjct: 446 AYATQTSFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLTEEREDFDRVSLLLPG 505

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q  LI+ +A VAK P++LV++  G VD++FA+ +  I +ILW GYPGE GG+ + +++F
Sbjct: 506 KQMGLIHAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLGYPGEVGGQVLPEILF 565

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G++NPGG+LPITWY   +   +P+T M +R   S GYPGRTY+FY G  +Y FGYGLSY+
Sbjct: 566 GEYNPGGKLPITWYPESFT-AIPMTDMNMRADPSRGYPGRTYRFYTGDVVYGFGYGLSYS 624

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTS-DASKTRCPG---VLVNDL-RCDDY-FEFK 698
           ++ Y++ S  K I V+ +      +L   S   + TR  G   V   D+  C+   F   
Sbjct: 625 KYSYSISSAPKKITVSRSS-----DLGIISRKPAYTRRDGLGSVKTEDIASCEALVFSVH 679

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
           V   N GS DGS  V+++++  + +    IKQ++GF+ V   AG    ++   + CK ++
Sbjct: 680 VAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFESVHTAAGSASNVEITVDPCKQMS 739

Query: 759 IVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
             +     +L  G H + VG+      I L
Sbjct: 740 AANPEGKRVLLLGAHVLTVGDEEFELSIEL 769


>gi|224128360|ref|XP_002320310.1| predicted protein [Populus trichocarpa]
 gi|222861083|gb|EEE98625.1| predicted protein [Populus trichocarpa]
          Length = 635

 Score =  634 bits (1635), Expect = e-179,   Method: Compositional matrix adjust.
 Identities = 310/639 (48%), Positives = 426/639 (66%), Gaps = 26/639 (4%)

Query: 145 QAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD 204
           Q VS EARAM+N G AGLTYWSPN+N+ RDPRWGR  ETPGEDP VVG+YA +YVRGLQ 
Sbjct: 2   QVVSDEARAMFNGGVAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVVGKYAASYVRGLQG 61

Query: 205 VEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMC 264
            +G+          LKV++CCKH+ AYD+DNW GVDR+HF+A V++QDME+TF  PF MC
Sbjct: 62  SDGNR---------LKVAACCKHFTAYDLDNWNGVDRFHFNAEVSKQDMEDTFDVPFRMC 112

Query: 265 VKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLA 324
           VKEG  +SVMCSYN+VNGIP+CADP LL +TVRG       +      ++ ++ ++  L 
Sbjct: 113 VKEGKVASVMCSYNQVNGIPTCADPNLLKKTVRGT------LFQTVTLLEFIMGSNTILQ 166

Query: 325 DSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
             ++       +A LDLDCG +    T +AV++G + E +I+ +L    TV MRLG FDG
Sbjct: 167 PRRKQPRMLLKQASLDLDCGPFLGQHTEDAVKKGLLNEAEINNALLNTLTVQMRLGMFDG 226

Query: 385 SPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
            P    Y +LG  D+C+  + ELA EAAR+GIVLLKN   +LPL++ +  +VA+VGP++N
Sbjct: 227 EPSSQLYGNLGPNDVCTPAHQELALEAARQGIVLLKNHGPSLPLSTRRHLSVAIVGPNSN 286

Query: 442 ATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATI 501
            T  MIGNYAG+ C Y +P+ G   YA   ++ GC DVAC S+    AA +AA+ ADAT+
Sbjct: 287 VTATMIGNYAGLACGYTTPLQGIQRYAQTIHRQGCADVACVSDQQFSAAIDAARQADATV 346

Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
           ++ GLD S+EAE  DR  L LPG Q +L+++VA  +KGP ILV+MS G +D++FAE +  
Sbjct: 347 LVMGLDQSIEAEFRDRTGLLLPGRQQELVSKVAAASKGPTILVLMSGGPIDVSFAENDPK 406

Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
           I +I+WAGYPG+ GG AI+DV+FG  NPGG+LP+TWY  DY+  LP+T+M +R   S GY
Sbjct: 407 IGSIVWAGYPGQAGGAAISDVLFGITNPGGKLPMTWYPQDYITNLPMTNMAMRSSKSKGY 466

Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRC 681
           PGRTY+FY G  +YPFG+G+SYT F + + S    + V L+  +H    N T      R 
Sbjct: 467 PGRTYRFYKGKVVYPFGHGISYTNFVHTIASAPTMVSVPLDGHRHGSG-NATISGKAIR- 524

Query: 682 PGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
               V   RC+      +VD +N GS DG+  ++VYS+PPA   A + KQ++ F++V V 
Sbjct: 525 ----VTHARCNRLSLGMQVDVKNTGSMDGTHTLLVYSRPPARHWAPH-KQLVAFEKVHVA 579

Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           AG  +R+    + CKSL++VD +    +P GEH++ +G+
Sbjct: 580 AGTQQRVGINIHVCKSLSVVDGSGIRRIPMGEHSLHIGD 618


>gi|253761860|ref|XP_002489304.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
 gi|241946952|gb|EES20097.1| hypothetical protein SORBIDRAFT_0010s007570 [Sorghum bicolor]
          Length = 750

 Score =  632 bits (1630), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 337/771 (43%), Positives = 466/771 (60%), Gaps = 46/771 (5%)

Query: 19  VFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKV 78
           VF ++AV    +S P+F C P   S+      ++ FCD SLP + R  DLVSR+T+ EKV
Sbjct: 7   VFFSSAV----ASDPLFSCGPSSPSR------AYPFCDRSLPAARRAADLVSRLTVAEKV 56

Query: 79  QQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNES 138
            QLGD A GVPRLG+P Y+WWSE LHG++  G G  F+  + G TSFP V+LTTASF++ 
Sbjct: 57  SQLGDEAAGVPRLGVPPYKWWSEGLHGLAFWGHGMRFNGTVTGVTSFPQVLLTTASFDDG 116

Query: 139 LWKKIGQAVSTEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVN 197
           LW +IGQA+  EARA+YNLG+A GLT WSPN+N+ RDPRWGR  ETPGEDP V  +YAV 
Sbjct: 117 LWFRIGQAIGREARALYNLGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASKYAVA 176

Query: 198 YVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETF 257
           +VRG+Q      ++    + PL+ S+CCKH  AYD+++W GV RY+FDARVT QD+ +TF
Sbjct: 177 FVRGIQ-----GSSAAGAAAPLQASACCKHATAYDLEDWNGVARYNFDARVTAQDLADTF 231

Query: 258 LRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV 317
             PF+ CV +G A+ VMC+Y  +NG+P+CA   LL +T RG W   GY+ +DCD++ +M 
Sbjct: 232 NPPFQSCVVDGKATCVMCAYTGINGVPACASSDLLTKTFRGAWGHDGYVSSDCDAVAIMH 291

Query: 318 DNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLM 377
           D  +++  + ED VA  LK                 A+QQGK+ E D+DK+L  L+ V M
Sbjct: 292 DAQRYVP-TPEDTVAVALK------------EHGMAAIQQGKMTEKDVDKALTNLFAVRM 338

Query: 378 RLGFFDGSPQ----YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
           RLG FDG P+    Y  LG  D+C+ ++  LA EAA++GIVLLKND   LPL+ + + + 
Sbjct: 339 RLGHFDGDPRGNALYGHLGAADVCTADHKNLALEAAQDGIVLLKNDAGILPLDRSAMGSA 398

Query: 434 AVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSNNSIFAASE 492
           AV+G +AN  + + GNY G  C   +P+ G   Y +NV +  GC   AC    +   A+ 
Sbjct: 399 AVIGHNANDALVLRGNYFGPACETTTPLQGVQSYVSNVRFLAGCSSAAC-GYAATGQAAA 457

Query: 493 AAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVD 552
            A +++   +  GL    E E LDR  L LPG Q  LI  VA  AK PVILV+++ G VD
Sbjct: 458 LASSSEYVFLFMGLSQDQEKEGLDRTSLLLPGKQQSLITAVASAAKRPVILVLLTGGPVD 517

Query: 553 IAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMP 612
           I FA++N  I AILWAGYPG+ GG AIA V+FG  NP GRLP+TWY  ++ + +P+T M 
Sbjct: 518 ITFAQSNPKIGAILWAGYPGQAGGLAIARVLFGDHNPSGRLPVTWYPEEFTK-VPMTDMR 576

Query: 613 LRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
           +R   + GYPGR+Y+FY G T+Y FGYGLSY++F   L++  K    +L  L        
Sbjct: 577 MRADPANGYPGRSYRFYRGNTIYKFGYGLSYSKFSRQLVTGGKNQLASL--LAGLSATTK 634

Query: 673 TSDASKTRCPGVLVNDLRCDD----YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
             DA+        V+D+  D      F  +V+ QN G  DG   V+++ + P       +
Sbjct: 635 DDDATSY----YHVDDIGADGCEQLRFPAEVEVQNHGPMDGKHSVLMFLRWPNATDGRPV 690

Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            Q+IGF    ++AG    ++F    C+  +        ++  G H + VG 
Sbjct: 691 SQLIGFTSQHIKAGEKANVRFDVRPCEHFSRARADGKKVIDRGSHFLMVGK 741


>gi|359473427|ref|XP_002265788.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 1-like
           [Vitis vinifera]
          Length = 464

 Score =  630 bits (1626), Expect = e-178,   Method: Compositional matrix adjust.
 Identities = 293/458 (63%), Positives = 356/458 (77%), Gaps = 2/458 (0%)

Query: 154 MYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
           MYNLG AGLT+WSPNINV RD RWGR  ET  EDPF+VG +AVNYVRGLQDVEG EN TD
Sbjct: 1   MYNLGHAGLTFWSPNINVVRDTRWGRTQETSREDPFMVGEFAVNYVRGLQDVEGTENVTD 60

Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
           LNSRPLKVSSCCKHYAAYD+D+W  +DR+ FDARV+EQDM+ETF+ PFE CV+EGD SSV
Sbjct: 61  LNSRPLKVSSCCKHYAAYDIDSWLNIDRHTFDARVSEQDMKETFVSPFERCVREGDVSSV 120

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           MCS+N++NGIP C+DP+LL   +R EWDLHGYIV+DC  ++V+VDN  +L DSK DAVA+
Sbjct: 121 MCSFNKINGIPPCSDPRLLKGVIRDEWDLHGYIVSDCYGLEVIVDNQNYLNDSKVDAVAK 180

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK 393
           TL+AGLDL+CG YYT+     V  GKV + ++D++LK +Y +LMR+G+FDG P Y SLG 
Sbjct: 181 TLQAGLDLECGHYYTDALNELVLTGKVSQYELDRALKNIYVLLMRVGYFDGIPAYESLGL 240

Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
           +DIC+ ++IELA EAAR+GIVLLKND    PL     K +A+VGPHANAT  MIGNYAG+
Sbjct: 241 KDICAADHIELAREAARQGIVLLKNDYEVFPLKPG--KKLALVGPHANATEVMIGNYAGL 298

Query: 454 PCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE 513
           P +Y+SP+  FS   NVTY TGC D +C ++     A EAAK+A+ TII  G DLS+EAE
Sbjct: 299 PRKYVSPLEAFSAIGNVTYTTGCLDASCSNDTYFSEAKEAAKSAEVTIIFVGTDLSIEAE 358

Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
            +DR D  LPG QT+LI QVAEV+ GPVILV++S   +DI FA+ N  I AILW G+PGE
Sbjct: 359 FVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRISAILWVGFPGE 418

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSM 611
           +GG AIADVVFGK+NPGGRLP+TWY  DYV  L    M
Sbjct: 419 QGGHAIADVVFGKYNPGGRLPVTWYEADYVACLETHIM 456


>gi|318136853|gb|ADV41671.1| alpha-L-arabinofuranosidase/beta-D-xylosidase [Actinidia deliciosa
           var. deliciosa]
          Length = 634

 Score =  629 bits (1622), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 310/636 (48%), Positives = 426/636 (66%), Gaps = 25/636 (3%)

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           EARAMYN G AGLT+WSPN+N+ RDPRWGR  ETPGEDP + G YA +YVRGLQ  +G  
Sbjct: 2   EARAMYNGGMAGLTFWSPNVNIFRDPRWGRGQETPGEDPMLAGNYAASYVRGLQGNDGER 61

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
                    LKV++CCKHY AYD+DNW+GVDR+HF+ARV++QD+++TF  PF  CV  G 
Sbjct: 62  ---------LKVAACCKHYTAYDLDNWRGVDRFHFNARVSKQDIKDTFEIPFRECVLGGK 112

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
            +SVMCSYN+VNGIP+CA+PKLL  T+RG W L+GYIV+DCDS+ V  +N  + +   E+
Sbjct: 113 VASVMCSYNQVNGIPTCANPKLLKGTIRGSWRLNGYIVSDCDSVGVFFENQHYTS-KPEE 171

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--- 386
           AVA  +KAGLDLDCG +    T  AV++G V + +I+ +L    T  MRLG FDG P   
Sbjct: 172 AVAAAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTAQMRLGMFDGEPSAH 231

Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
           QY +LG +D+C+  + +LA EAAR+GIVLL+N   +LPL+  + +TVAV+GP+++ TV M
Sbjct: 232 QYGNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSIRRHRTVAVIGPNSDVTVTM 291

Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
           IGNYAG+ C Y +P+ G   Y    ++ GC DV C  N    AA  AA+ ADAT+++ GL
Sbjct: 292 IGNYAGVACGYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGL 351

Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
           D S+EAE +DR    LPG+Q +L+++VA  ++GP ILV+MS G +D+ FA+ +  I AI+
Sbjct: 352 DQSIEAEFVDRAGPLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAII 411

Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
           W GYPG+ GG AIADV+FG  NPGG+LP+TWY  +YV  LP+T M +R   + GYPGRTY
Sbjct: 412 WVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTY 471

Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV 686
           +FY GP ++PFG GLSYT F +NL      + V L  L+   N    S A       V V
Sbjct: 472 RFYRGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKA-------VRV 524

Query: 687 NDLRCDDY--FEFKVDFQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRVFVRAGR 743
           +   C+     +  VD +N GS DG+  ++V++ PP  + AA+  KQ++GF ++ + AG 
Sbjct: 525 SHADCNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPDGKWAAS--KQLVGFHKIHIAAGS 582

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             R++   + CK L++VD      +P GEH + +G+
Sbjct: 583 ETRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGD 618


>gi|356510699|ref|XP_003524073.1| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Glycine max]
          Length = 613

 Score =  628 bits (1619), Expect = e-177,   Method: Compositional matrix adjust.
 Identities = 305/558 (54%), Positives = 401/558 (71%), Gaps = 16/558 (2%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
           F CD G+   +    + + FCD SL    RVKDLV R+TL EK+  L + A  V RLG+P
Sbjct: 30  FACDVGKSPAV----AGYGFCDKSLGVEARVKDLVGRLTLQEKIGNLVNSAGDVSRLGIP 85

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
           +YEWWSEALHGVSNVG GT F +V+PGATSFP  ILT ASFN SL++ IG+ VSTEA AM
Sbjct: 86  RYEWWSEALHGVSNVGLGTRFSNVVPGATSFPMPILTAASFNTSLFEVIGRVVSTEAGAM 145

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
           YN+G AGLTYWSPNIN+ RDPRWGR  ETPGEDP +  +YA  YV+GLQ  +G +     
Sbjct: 146 YNVGLAGLTYWSPNINIFRDPRWGRGLETPGEDPVLTSKYAAGYVKGLQQTDGGD----- 200

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
               LKV++CCKHY AYDVD WKG+ RY F+A +T+QD+E+TF  PF+ CV +G+ +SVM
Sbjct: 201 -PNKLKVAACCKHYTAYDVDKWKGIQRYTFNAVLTKQDLEDTFQPPFKSCVIDGNVASVM 259

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           CSYN+VNG P+CADP LL   VRGEW L+GY+V+DCDS++V+   ++    + E+A A +
Sbjct: 260 CSYNKVNGKPTCADPDLLKGVVRGEWKLNGYMVSDCDSVEVLY-KYQHYTKTPEEAAAIS 318

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
           + AGLDL+CG++   +T  AV+QG + E+ I+ ++   +  LMRLGFFDG P+   Y +L
Sbjct: 319 ILAGLDLNCGRFLGQYTEGAVKQGLIDES-INNAVSNNFATLMRLGFFDGDPRKQPYGNL 377

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
           G +D+C+  N ELA EAAR+GIV LKN   +LPLN+  +K++AV+GP+ANAT  MIGNY 
Sbjct: 378 GPKDVCTPANQELAREAARQGIVSLKNSPASLPLNAKAIKSLAVIGPNANATRVMIGNYE 437

Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
           GIPC+Y+SP+ G + +   +Y  GC DV C  N  +  A + + + DAT+I+ G  L++E
Sbjct: 438 GIPCKYISPLQGLTAFVPTSYAAGCLDVRC-PNPVLDDAKKISASGDATVIVVGASLAIE 496

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
           AESLDR ++ LPG Q  L+ +VA  +KGPVILVIMS GG+D++FA+ N  I +ILW GYP
Sbjct: 497 AESLDRVNILLPGQQQLLVTEVANASKGPVILVIMSGGGMDVSFAKDNNKITSILWVGYP 556

Query: 572 GEEGGRAIADVVFGKFNP 589
           GE GG AIADV+FG  NP
Sbjct: 557 GEAGGAAIADVIFGFHNP 574


>gi|62701894|gb|AAX92967.1| beta-xylosidase, putative [Oryza sativa Japonica Group]
 gi|77550041|gb|ABA92838.1| Glycosyl hydrolase family 3 C terminal domain containing protein
           [Oryza sativa Japonica Group]
          Length = 793

 Score =  619 bits (1597), Expect = e-174,   Method: Compositional matrix adjust.
 Identities = 331/774 (42%), Positives = 451/774 (58%), Gaps = 57/774 (7%)

Query: 46  GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
           G Q     FCD+ L    R  DLV+ +TL EKV QLGD A GV RLG+P YEWWSE LHG
Sbjct: 24  GQQQQPHRFCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHG 83

Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTY 164
           +S  G G  F+  +   TSFP VILT A+F+  LW+++G+AV  EARA+YNLG+A GLT 
Sbjct: 84  LSIWGRGIRFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTI 143

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           WSPN+N+ RDPRWGR  ETPGEDP    RYAV +V GLQ + G            + S+C
Sbjct: 144 WSPNVNIFRDPRWGRGQETPGEDPVTASRYAVAFVTGLQGIGG------------EASAC 191

Query: 225 CKHYAAYDVDNWKGVDRYHFDAR----------------------------VTEQDMEET 256
           CKH  AYD+D W  V RY++D++                            VT QD+E+T
Sbjct: 192 CKHATAYDLDYWNNVVRYNYDSKDGASTGKSGETSSQVEKKHGPYEKGYFAVTLQDLEDT 251

Query: 257 FLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM 316
           +  PF+ CV EG A+ +MC YN +NG+P+CA   LL + VR EW ++GY+ +DCD++  +
Sbjct: 252 YNPPFKSCVAEGKATCIMCGYNSINGVPACASSDLLTKKVRQEWGMNGYVASDCDAVATI 311

Query: 317 VDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVL 376
            D H +   S ED VA ++K G+D++CG Y       AVQ+G + E DID++L  L+ V 
Sbjct: 312 RDAHHYTL-SPEDTVAVSIKVGMDVNCGNYTQVHAMAAVQKGNLTEKDIDRALVNLFAVR 370

Query: 377 MRLGFFDGSPQ----YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKT 432
           MRLG FDG P+    Y  LG  D+CS  +  LA EAA++GIVLLKND   LPL  + V +
Sbjct: 371 MRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDAGALPLQPSAVTS 430

Query: 433 VAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA--NVTYKTGCDDVACKSNNSIFAA 490
           +AV+GP+A+   A+ GNY G PC   +P+ G  GY      +  GCD  AC    +  AA
Sbjct: 431 LAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGCDSPACAVAATNEAA 490

Query: 491 SEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
           + A+ ++D  ++  GL    E + LDR  L LPG Q  LI  VA  A+ PVILV+++ G 
Sbjct: 491 ALAS-SSDHVVLFMGLSQKQEQDGLDRTSLLLPGEQQGLITAVANAARRPVILVLLTGGP 549

Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
           VD+ FA+ N  I AILWAGYPG+ GG AIA V+FG  NP GRLP+TWY  ++ + +P+T 
Sbjct: 550 VDVTFAKDNPKIGAILWAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWYPEEFTK-VPMTD 608

Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLL-SFTKTIQVNLNKLQHCRN 669
           M +R   + GYPGR+Y+FY G T+Y FGYGLSY++F   +  SF+ +   NL+ L     
Sbjct: 609 MRMRADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFSSFSTSNAGNLSLLAGVMA 668

Query: 670 LNYTSDASKTRCPGVLVNDL---RCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAA 725
                D         LV ++   RC    F   V+ QN G  DG   V++Y + P     
Sbjct: 669 RRAGDDGGGMSS--YLVKEIGVERCSRLVFPAVVEVQNHGPMDGKHSVLMYLRWPTTSGG 726

Query: 726 TYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
              +Q+IGF+   V+ G    + F  + C+  + V      ++  G H + VG+
Sbjct: 727 RPARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVIDGGAHFLMVGD 780


>gi|77552476|gb|ABA95273.1| Beta-D-xylosidase, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 883

 Score =  616 bits (1588), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 330/653 (50%), Positives = 430/653 (65%), Gaps = 28/653 (4%)

Query: 145 QAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD 204
           QAVS E RAMYN G+AGLT+WSPN+N+ RDPRWGR  ETPGEDP V  RYA  YVRGLQ 
Sbjct: 227 QAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQQ 286

Query: 205 VEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMC 264
            +        +S  LK+++CCKH+ AYD+DNW G DR+HF+A VT QD+E+TF  PF  C
Sbjct: 287 QQ-------PSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSC 339

Query: 265 VKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLA 324
           V +G A+SVMCSYN+VNG+P+CAD   L  T+R  W L GYIV+DCDS+ V   +  +  
Sbjct: 340 VVDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHY-T 398

Query: 325 DSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
            ++EDAVA TL+AGLDLDCG +   +T  AV QGKV + DID ++    TV MRLG FDG
Sbjct: 399 RTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDG 458

Query: 385 SPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK-TVAVVGPHA 440
            P    +  LG Q +C+  + ELA EAAR+GIVLLKND   LPL+ A  +  VAVVGPHA
Sbjct: 459 DPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHA 518

Query: 441 NATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACK-SNNSIFAASEAAKTAD 498
            ATVAMIGNYAG PCRY +P+ G + YA    ++ GC DVAC  S   I AA +AA+ AD
Sbjct: 519 EATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRAD 578

Query: 499 ATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAET 558
           ATI++AGLD  +EAE LDR  L LPG Q +LI+ VA+ +KGPVILV+MS G +DI FA+ 
Sbjct: 579 ATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQN 638

Query: 559 NTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDS 618
           +  I  ILWAGYPG+ GG+AIADV+FG  NPGG+LP+TWY  DY+Q +P+T+M +R   +
Sbjct: 639 DPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQKVPMTNMAMRANPA 698

Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL----NKLQHCRNLNYTS 674
            GYPGRTY+FY GPT++PFG+GLSYT F +++      + V L           +LN T+
Sbjct: 699 KGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLSAHHAAASASASLNATA 758

Query: 675 DASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVY-------SKPPAEIAAT 726
             S+     V V   RC++      VD +NVG  DG+  V+VY       +   A     
Sbjct: 759 RLSRAAA--VRVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPASSAAEAAAGHGA 816

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            ++Q++ F++V V AG   R++   + C  L++ D      +P GEH + +G 
Sbjct: 817 PVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRLIIGE 869



 Score =  120 bits (302), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 60/113 (53%), Positives = 74/113 (65%)

Query: 46  GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
           G   ++  FC  SLP   R +DLV+R+T  EKV+ L + A GVPRLG+  YEWWSEALHG
Sbjct: 35  GGPAATLPFCRRSLPARARARDLVARLTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHG 94

Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG 158
           VS+ GPG  F    PGAT+FP VI T ASFN +LW+ IGQ  S+ +     LG
Sbjct: 95  VSDTGPGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQFRSSLSSMDKTLG 147


>gi|125535275|gb|EAY81823.1| hypothetical protein OsI_36995 [Oryza sativa Indica Group]
          Length = 885

 Score =  615 bits (1586), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 330/655 (50%), Positives = 430/655 (65%), Gaps = 30/655 (4%)

Query: 145 QAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD 204
           QAVS E RAMYN G+AGLT+WSPN+N+ RDPRWGR  ETPGEDP V  RYA  YVRGLQ 
Sbjct: 227 QAVSDEGRAMYNGGQAGLTFWSPNVNIFRDPRWGRGQETPGEDPAVAARYAAAYVRGLQQ 286

Query: 205 VEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMC 264
            +        +S  LK+++CCKH+ AYD+DNW G DR+HF+A VT QD+E+TF  PF  C
Sbjct: 287 QQ-------PSSGRLKLAACCKHFTAYDLDNWSGTDRFHFNAVVTRQDLEDTFNVPFRSC 339

Query: 265 VKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLA 324
           V +G A+SVMCSYN+VNG+P+CAD   L  T+R  W L GYIV+DCDS+ V   +  +  
Sbjct: 340 VVDGRAASVMCSYNQVNGVPTCADAAFLRGTIRRRWGLAGYIVSDCDSVDVFYSDQHY-T 398

Query: 325 DSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
            ++EDAVA TL+AGLDLDCG +   +T  AV QGKV + DID ++    TV MRLG FDG
Sbjct: 399 RTREDAVAATLRAGLDLDCGPFLAQYTEGAVAQGKVGDGDIDAAVTNTVTVQMRLGMFDG 458

Query: 385 SPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK-TVAVVGPHA 440
            P    +  LG Q +C+  + ELA EAAR+GIVLLKND   LPL+ A  +  VAVVGPHA
Sbjct: 459 DPAAQPFGHLGPQHVCTAAHQELAVEAARQGIVLLKNDGRALPLSPATARRAVAVVGPHA 518

Query: 441 NATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACK-SNNSIFAASEAAKTAD 498
            ATVAMIGNYAG PCRY +P+ G + YA    ++ GC DVAC  S   I AA +AA+ AD
Sbjct: 519 EATVAMIGNYAGKPCRYTTPLQGVARYAARAAHQPGCTDVACAGSGQPIAAAVDAARRAD 578

Query: 499 ATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAET 558
           ATI++AGLD  +EAE LDR  L LPG Q +LI+ VA+ +KGPVILV+MS G +DI FA+ 
Sbjct: 579 ATIVVAGLDQKIEAEGLDRASLLLPGRQAELISSVAKASKGPVILVLMSGGPIDIGFAQN 638

Query: 559 NTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDS 618
           +  I  ILWAGYPG+ GG+AIADV+FG  NPGG+LP+TWY  DY+Q +P+T+M +R   +
Sbjct: 639 DPKIAGILWAGYPGQAGGQAIADVIFGHHNPGGKLPVTWYPQDYLQKVPMTNMAMRANPA 698

Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL------NKLQHCRNLNY 672
            GYPGRTY+FY GPT++PFG+GLSYT F +++      + V L             +LN 
Sbjct: 699 KGYPGRTYRFYTGPTIHPFGHGLSYTSFTHSIAHAPSQLTVRLAAHHAAASASASASLNA 758

Query: 673 TSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVY-------SKPPAEIA 724
           T+  S+     V V   RC++      VD +NVG  DG+  V+VY       +   A   
Sbjct: 759 TARLSRAAA--VRVAHARCEELRMPVHVDVRNVGERDGAHTVLVYAAAPASSAAEAAAGH 816

Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
              ++Q++ F++V V AG   R++   + C  L++ D      +P GEH + +G 
Sbjct: 817 GAPVRQLVAFEKVHVGAGGTARVEMGIDVCDGLSVADRNGVRRIPVGEHRLIIGE 871



 Score =  122 bits (306), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 61/113 (53%), Positives = 74/113 (65%)

Query: 46  GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
           G   ++  FC  SLP   R +DLV+RMT  EKV+ L + A GVPRLG+  YEWWSEALHG
Sbjct: 35  GGPAATLPFCRRSLPARARARDLVARMTRAEKVRLLVNNAAGVPRLGVAGYEWWSEALHG 94

Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG 158
           VS+ GPG  F    PGAT+FP VI T ASFN +LW+ IGQ  S+ +     LG
Sbjct: 95  VSDTGPGVRFGGAFPGATAFPQVIGTAASFNATLWELIGQFRSSLSSMDKTLG 147


>gi|222629257|gb|EEE61389.1| hypothetical protein OsJ_15562 [Oryza sativa Japonica Group]
          Length = 771

 Score =  602 bits (1553), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 316/748 (42%), Positives = 460/748 (61%), Gaps = 39/748 (5%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR---------LGLPQYEWWS 100
           S++ FC+++LP+  R + LVS +TLDEK+ QL     G P          +G+P     +
Sbjct: 36  SAYPFCNATLPFPARARALVSLLTLDEKIAQLLQHRRGRPPPRRPALRVVVGVPSTASAT 95

Query: 101 EALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA 160
                 S  GP       +  AT FP VIL+ A+FN SLW+   +A++ EARAM+N G+A
Sbjct: 96  TGPGSTSPRGP-------VRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQA 148

Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
           GLT+W+PNINV RDPRWGR  ETPGEDP VV  Y+V YV+G Q   G E         + 
Sbjct: 149 GLTFWAPNINVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGR-------MM 201

Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
           +S+CCKHY AYD++ W+G  RY F+A+V  QDME+T+  PF+ C++EG AS +MCSYN+V
Sbjct: 202 LSACCKHYIAYDLEKWRGFTRYTFNAKVNAQDMEDTYQPPFKSCIQEGRASCLMCSYNQV 261

Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD 340
           NG+P+CA   +L Q  R EW   GYI +DCD++ ++ +N  + A S ED++A  LKAG+D
Sbjct: 262 NGVPACARKDIL-QRARDEWGFQGYITSDCDAVAIIHENQTYTA-SDEDSIAVVLKAGMD 319

Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDIC 397
           ++CG +    T +A+++GKV+E DI+ +L  L++V +RLGFFD + +   +  LG  ++C
Sbjct: 320 INCGSFLIRHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVC 379

Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
           + E+ ELAAEA R+G VLLKND   LPL  ++V  +A++GP AN    + G+Y G+PC  
Sbjct: 380 TTEHRELAAEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHS 439

Query: 458 MSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLD 516
            + + G   Y    T+  GC DV C S +    A EAAK AD  +++AGL+L+ E E  D
Sbjct: 440 TTFVKGMQAYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHD 499

Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
           R  L LPG Q  LI+ VA V K PV+LV+M  G VD++FA+ +  I +ILW GYPGE GG
Sbjct: 500 RVSLLLPGRQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGG 559

Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
             + +++FGK+NPGG+LPITWY   +   +P+  M +R   S GYPGRTY+FY G  +Y 
Sbjct: 560 NVLPEILFGKYNPGGKLPITWYPESFT-AVPMDDMNMRADASRGYPGRTYRFYTGDVVYG 618

Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG---VLVNDLRCDD 693
           FGYGLSY+++ Y++L   K I ++ + +        +   + TR  G   V V D+   +
Sbjct: 619 FGYGLSYSKYSYSILQAPKKISLSRSSVPDL----ISRKPAYTRRDGVDYVQVEDIASCE 674

Query: 694 YFEFKVDF--QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
             +F V     N G+ DGS  V++++        + IKQ++GF+RV   AGR+  ++   
Sbjct: 675 ALQFPVHISVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITV 734

Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGN 779
           + CK ++  +     +L  G H + VG+
Sbjct: 735 DPCKLMSFANTEGTRVLFLGTHVLMVGD 762


>gi|326431595|gb|EGD77165.1| beta-glucosidase [Salpingoeca sp. ATCC 50818]
          Length = 900

 Score =  602 bits (1551), Expect = e-169,   Method: Compositional matrix adjust.
 Identities = 336/771 (43%), Positives = 471/771 (61%), Gaps = 47/771 (6%)

Query: 29  GSSSPVFVCD--PGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAH 86
           GS +P   CD  PG+         S  FC+++L Y  R++DL+SR+   +    L + A 
Sbjct: 167 GSPTPR-TCDVEPGK---------SLPFCNTALSYDDRIRDLISRINDSDLPGLLVNSAT 216

Query: 87  GVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQA 146
           GV  L LP Y+WWSEALHGV +  PG HF   +P ATSFP VI T A+FN++L++KIG  
Sbjct: 217 GVEHLNLPAYQWWSEALHGVGH-SPGVHFGGDVPAATSFPQVIHTGATFNKTLYRKIGTV 275

Query: 147 VSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
           +STEARAM N+ RAG T+W+PNIN+ RDPRWGR  ETPGEDPF  G YA N+V G QD E
Sbjct: 276 ISTEARAMNNVQRAGNTFWAPNINIIRDPRWGRGQETPGEDPFATGEYAANFVSGFQDGE 335

Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
                 D+N   +K SSCCKH+  Y+++NW GVDR+H++A  T+QD+ +T+L  FE CV+
Sbjct: 336 ------DMNY--IKASSCCKHFFDYNLENWHGVDRHHYNAIATDQDIADTYLPSFEACVR 387

Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
            G AS +MCSYN VNG+PSCA+  ++    R  W   GYI +DC ++  ++++HKF  ++
Sbjct: 388 YGRASGLMCSYNAVNGVPSCANGDIMTVMARESWGFDGYITSDCGAVADVLNSHKFTRNT 447

Query: 327 KEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--G 384
            E  +   L+AG+D DCG +   +   A+Q+G V    ++ +L  L+ V  RLG FD   
Sbjct: 448 SE-TIRAVLEAGMDTDCGSFVQQYLAKAMQEGVVPRELVNTALHRLFMVQFRLGLFDPVS 506

Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
              Y +     + +  N +LA EAA++GIVLLKN    LPL +     VA++GP+A+AT 
Sbjct: 507 KQPYTNYSVARVNTPANQQLALEAAQQGIVLLKNTNARLPLKTGL--HVALIGPNADATT 564

Query: 445 AMIGNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
            M GNY G     +SP+ GF  Y A VTY  GCD VACK  +   AA  AAK ADA +++
Sbjct: 565 VMQGNYQGTAPFLISPVRGFKNYSAAVTYAKGCD-VACKDTSGFDAAVAAAKEADAVVVV 623

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
            GLD   E+E  DR  + LPG+Q  L+ QVA  AK P+++ +M+ G VD++  + N N+ 
Sbjct: 624 VGLDQGQESEGHDRTSITLPGHQEDLVAQVAAAAKSPIVVFVMTGGAVDLSTIKANKNVA 683

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
            ILW GYPG+ GG+A+ADVVFG  +PGGRLP T Y G YV    +    +RP  + G PG
Sbjct: 684 GILWCGYPGQSGGQAMADVVFGAVSPGGRLPYTIYPGSYVDACSMLDNGMRPNKTSGNPG 743

Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
           RTY+FY G  +Y +G GLSYT F Y+ + +  T+  +L  +Q      Y  DA +     
Sbjct: 744 RTYRFYTGKPVYEYGTGLSYTSFSYH-IHYLNTMDTSLATVQ-----TYVQDAKQNH--- 794

Query: 684 VLVNDLRCD--DYFEFKVDFQNVGSTDGSDVVIVYSKP--PAEIAATYIKQVIGFQRVFV 739
                +R D  ++   +V+  NVG   G+DVV V+ +P  PAE+ A  IK +IGF+RVF+
Sbjct: 795 ---KFIRYDAPEFTRVEVNVTNVGRVAGADVVQVFVEPKTPAELGAP-IKTLIGFERVFL 850

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGG-VSFPIHLN 789
             G+   ++F  NA   L  VD +   +  AGE  + +G+   ++FP+H+N
Sbjct: 851 NPGQWTIVQFSVNA-HDLTFVDASGKRVARAGEWLVHIGHDSRLTFPVHVN 900


>gi|90399376|emb|CAJ86207.1| B1011H02.4 [Oryza sativa Indica Group]
          Length = 738

 Score =  598 bits (1543), Expect = e-168,   Method: Compositional matrix adjust.
 Identities = 311/740 (42%), Positives = 451/740 (60%), Gaps = 56/740 (7%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
           S++ FC+++LP+  R + LVS +TLDEK+ QL + A G PRLG+P +EWWSE+LHGV + 
Sbjct: 36  SAYPFCNATLPFPARARALVSLLTLDEKIAQLSNTAAGAPRLGVPPFEWWSESLHGVCDN 95

Query: 110 GPGTHFDD-VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
           GPG +F    +  AT FP VIL+ A+FN SLW+   +A++ EARAM+N G+AGLT+W+PN
Sbjct: 96  GPGVNFSSGPVRSATIFPQVILSAAAFNRSLWRAAARAIAVEARAMHNAGQAGLTFWAPN 155

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           INV RDPRWGR  ETPGEDP VV  Y+V YV+G Q   G E         + +S+CCKHY
Sbjct: 156 INVFRDPRWGRGQETPGEDPAVVSAYSVEYVKGFQRDYGEEGR-------MMLSACCKHY 208

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
            AYD++ W+G  RY F+A+V                                NG+P+CA 
Sbjct: 209 IAYDLEKWRGFTRYTFNAKV--------------------------------NGVPACAR 236

Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
             +L Q  R EW   GYI +DCD++ ++ +N  + A S ED++A  LKAG+D++CG +  
Sbjct: 237 KDIL-QRARDEWGFQGYITSDCDAVAIIHENQTYTA-SDEDSIAVVLKAGMDINCGSFLI 294

Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELA 405
             T +A+++GKV+E DI+ +L  L++V +RLGFFD + +   +  LG  ++C+ E+ ELA
Sbjct: 295 RHTKSAIEKGKVQEEDINHALFNLFSVQLRLGFFDKTNENQWFTQLGPNNVCTTEHRELA 354

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
           AEA R+G VLLKND   LPL  ++V  +A++GP AN    + G+Y G+PC   + + G  
Sbjct: 355 AEAVRQGTVLLKNDNGFLPLKRSEVGHIALIGPAANDPYILGGDYTGVPCHSTTFVKGMQ 414

Query: 466 GYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
            Y    T+  GC DV C S +    A EAAK AD  +++AGL+L+ E E  DR  L LPG
Sbjct: 415 AYVPKTTFAAGCKDVPCNSTDGFGEAIEAAKRADVVVLIAGLNLTEETEDHDRVSLLLPG 474

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q  LI+ VA V K PV+LV+M  G VD++FA+ +  I +ILW GYPGE GG  + +++F
Sbjct: 475 RQMDLIHTVASVTKKPVVLVLMGGGPVDVSFAKHDPRIASILWIGYPGEVGGNVLPEILF 534

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           GK+NPGG+LPITWY   +   +P+  M +R   S GYPGRTY+FY G  +Y FGYGLSY+
Sbjct: 535 GKYNPGGKLPITWYPESFTA-VPMDDMNMRADASRGYPGRTYRFYTGDVVYGFGYGLSYS 593

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG---VLVNDLRCDDYFEFKVDF 701
           ++ Y++L   K I ++ + +        +   + TR  G   V V D+   +  +F V  
Sbjct: 594 KYSYSILQAPKKISLSRSSVPDL----ISRKPAYTRRDGVDYVQVEDIASCEALQFPVHI 649

Query: 702 --QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNI 759
              N G+ DGS  V++++        + IKQ++GF+RV   AGR+  ++   + CK ++ 
Sbjct: 650 SVSNDGAMDGSHAVLLFASSKPSFPGSPIKQLVGFERVHTAAGRSTDVEITVDPCKLMSF 709

Query: 760 VDYAANTLLPAGEHTIFVGN 779
            +     +L  G H + VG+
Sbjct: 710 ANTEGTRVLFLGTHVLMVGD 729


>gi|357489437|ref|XP_003615006.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
 gi|355516341|gb|AES97964.1| Xylan 1 4-beta-xylosidase [Medicago truncatula]
          Length = 685

 Score =  595 bits (1535), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 310/692 (44%), Positives = 440/692 (63%), Gaps = 30/692 (4%)

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTYWSPNIN 170
           G   +  IP ATSFP VILT ASF+  LW +I + + TEAR +YN G+A G+ +W+PNIN
Sbjct: 2   GIILNGSIPAATSFPQVILTAASFDPKLWYQISKVIGTEARGVYNAGQAQGMNFWAPNIN 61

Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
           + RDPRWGR  ET GEDP V  +Y V+YVRGLQ  +  E    +  R LK S+CCKH+ A
Sbjct: 62  IFRDPRWGRGQETAGEDPLVNSKYGVSYVRGLQG-DSFEGGKLIGGR-LKASACCKHFTA 119

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
           YD++NWKGV+RY FDA+VT QD+ +T+   F  CV +G +S +MC+YNRVNG+P+CAD  
Sbjct: 120 YDLENWKGVNRYVFDAKVTLQDLADTYQPSFHSCVVQGRSSGIMCAYNRVNGVPNCADYN 179

Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
           LL  T R +W+ +GYI +DCD+++ + +   + A + ED VA  L+AG+DL+CG Y T  
Sbjct: 180 LLTNTARKKWNFNGYIASDCDAVRFIYEKQGY-AKTPEDVVADVLRAGMDLECGNYMTKH 238

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QYVSLGKQDICSDENIELAAE 407
             +AV Q K+  + ID++L  L+T+ +RLG FDG+P   QY  +G   +CS EN++LA E
Sbjct: 239 AKSAVLQKKIPISQIDRALHNLFTIRIRLGLFDGNPTKLQYGRIGPNQVCSKENLDLALE 298

Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN-ATVAMIGNYAGIPCRYMSPIAGFSG 466
           AAR GIVLLKN  + LPL   +V T+ V+GP+AN +++ ++GNY G PC+ +S + GF  
Sbjct: 299 AARSGIVLLKNTASILPL--PRVNTLGVIGPNANKSSIVLLGNYIGPPCKNVSILKGFYT 356

Query: 467 YANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
           YA+ T Y +GC D    ++  I  A E AK +D  I++ GLD S E E+LDR+ L LPG 
Sbjct: 357 YASQTHYHSGCTDGTKCASAEIDRAVEVAKISDYVILVMGLDQSQETETLDRDHLELPGK 416

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q +LIN VA+ +K PVILV++  G VDI FA+ N  I  I+WAGYPGE GGRA+A VVFG
Sbjct: 417 QQKLINSVAKASKKPVILVLLCGGPVDITFAKNNDKIGGIIWAGYPGELGGRALAQVVFG 476

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
            +NPGGRLP+TWY  D+++ +P+T M +R   S GYPGRTY+FY GP +Y FGYGLSY+ 
Sbjct: 477 DYNPGGRLPMTWYPKDFIK-IPMTDMRMRADPSSGYPGRTYRFYTGPKVYEFGYGLSYSN 535

Query: 646 FKYNLLSFTKTIQVNLNK------LQHCRNLNY--TSDASKTRCPGVLVNDLRCDDYFEF 697
           + YN +S  K   +++N+      L++   +NY   S+  +  C  + ++          
Sbjct: 536 YSYNFIS-VKNNNLHINQSTTYSILENSETINYKLVSELGEETCKTMSIS---------V 585

Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
            +   N GS  G   V+++ KP        +KQ++GF+ V V  G    + F  + C+ L
Sbjct: 586 TLGITNTGSMAGKHPVLLFVKPKKGRNGNPVKQLVGFESVTVEGGGKGEVGFEVSVCEHL 645

Query: 758 NIVDYAANTLLPAGEHTIFVGNGGVSFPIHLN 789
           +  + +   ++  G +   VG    S  I L+
Sbjct: 646 SRANESGVKVIEEGGYLFLVGQEEYSINIMLD 677


>gi|222615852|gb|EEE51984.1| hypothetical protein OsJ_33664 [Oryza sativa Japonica Group]
          Length = 753

 Score =  592 bits (1527), Expect = e-166,   Method: Compositional matrix adjust.
 Identities = 318/746 (42%), Positives = 439/746 (58%), Gaps = 40/746 (5%)

Query: 46  GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
           G Q     FCD+ L    R  DLV+ +TL EKV QLGD A GV RLG+P YEWWSE LHG
Sbjct: 23  GQQQQPHRFCDAWLTAEQRAADLVANLTLAEKVSQLGDRAAGVARLGVPAYEWWSEGLHG 82

Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-GLTY 164
           +S  G G  F+  +   TSFP VILT A+F+  LW+++G+AV  EARA+YNLG+A GLT 
Sbjct: 83  LSIWGRGIRFNGTVRAVTSFPQVILTAAAFDAGLWRRVGEAVGAEARALYNLGQANGLTI 142

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           WSPN+N+ RDP   R    PG+      R    +  G Q + G            + S+C
Sbjct: 143 WSPNVNIFRDPSGTR----PGD-----ARRGPRH--GEQGIGG------------EASAC 179

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           CKH  AYD+D W  V RY++D++VT QD+E+T+  PF+ CV EG A+ +MC YN +NG+P
Sbjct: 180 CKHATAYDLDYWNNVVRYNYDSKVTLQDLEDTYNPPFKSCVAEGKATCIMCGYNSINGVP 239

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
           +CA   LL + VR EW ++GY+ +DCD++  + D H +   S ED VA ++K G+D++CG
Sbjct: 240 ACASSDLLTKKVRQEWGMNGYVASDCDAVATIRDAHHYTL-SPEDTVAVSIKVGMDVNCG 298

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDE 400
            Y       AVQ+G + E DID++L  L+ V MRLG FDG P+    Y  LG  D+CS  
Sbjct: 299 NYTQVHAMAAVQKGNLTEKDIDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPA 358

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
           +  LA EAA++GIVLLKND   LPL  + V ++AV+GP+A+   A+ GNY G PC   +P
Sbjct: 359 HKSLALEAAQDGIVLLKNDAGALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTP 418

Query: 461 IAGFSGYA--NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRE 518
           + G  GY      +  GCD  AC  + +  AA+  A ++D  ++  GL    E + LDR 
Sbjct: 419 LQGIKGYLGDRARFLAGCDSPACAVDATNEAAA-LASSSDHVVLFMGLSQKQEQDGLDRT 477

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
            L LPG Q  LI  VA  A+ PVILV+++ G VD+ FA+ N  I AILWAGYPG+ GG A
Sbjct: 478 SLLLPGEQQGLITAVANAARRPVILVLLTGGPVDVTFAKDNPKIGAILWAGYPGQAGGLA 537

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           IA V+FG  NP GRLP+TWY  ++ + +P+T M +R   + GYPGR+Y+FY G T+Y FG
Sbjct: 538 IAKVLFGDHNPSGRLPVTWYPEEFTK-VPMTDMRMRADPATGYPGRSYRFYQGNTVYNFG 596

Query: 639 YGLSYTQFKYNLL-SFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL---RCDDY 694
           YGLSY++F   +  SF+ +   NL+ L          D         LV ++   RC   
Sbjct: 597 YGLSYSKFSRRMFSSFSTSNAGNLSLLAGVMARRAGDDGGGMSS--YLVKEIGVERCSRL 654

Query: 695 -FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
            F   V+ QN G  DG   V++Y + P        +Q+IGF+   V+ G    + F  + 
Sbjct: 655 VFPAVVEVQNHGPMDGKHSVLMYLRWPTTSGGRPARQLIGFRSQHVKVGEKAMVSFEVSP 714

Query: 754 CKSLNIVDYAANTLLPAGEHTIFVGN 779
           C+  + V      ++  G H + VG+
Sbjct: 715 CEHFSWVGEDGERVIDGGAHFLMVGD 740


>gi|37359708|dbj|BAC98299.1| LEXYL2 [Solanum lycopersicum]
          Length = 633

 Score =  587 bits (1513), Expect = e-165,   Method: Compositional matrix adjust.
 Identities = 296/642 (46%), Positives = 424/642 (66%), Gaps = 24/642 (3%)

Query: 143 IGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGL 202
           IG+ VSTE RAMYN+G+AGLTYWSPN+N+ RDPRWGR  ET GEDP +  RY V YV+GL
Sbjct: 2   IGKVVSTEGRAMYNVGQAGLTYWSPNVNIYRDPRWGRGQETAGEDPTLSSRYGVAYVKGL 61

Query: 203 QDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
           Q  +      D     LKV+SCCKHY AYDVD+WKG+ RY+F+A+VT+QD+++TF  PF+
Sbjct: 62  QQRD------DGKKDMLKVASCCKHYTAYDVDDWKGIQRYNFNAKVTQQDLDDTFNPPFK 115

Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
            CV +G+ +SVMCSYN+V+G P+C D  LL   +RG+W L+GYIV DCDS+  M     +
Sbjct: 116 SCVLDGNVASVMCSYNQVDGKPTCGDYDLLAGVIRGQWKLNGYIVTDCDSLNEMYWAQHY 175

Query: 323 LADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
              + E+  A +L AGL L+CG +   +T  AV QG V E+ ID+++   +  LMRLGFF
Sbjct: 176 -TKTPEETAALSLNAGLGLNCGSWLGKYTQGAVNQGLVNESVIDRAVTNNFATLMRLGFF 234

Query: 383 DGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPH 439
           DG+P+   Y +LG +DIC++++ ELA EAAR+GIVLLKN   +LPL+   +K++AV+GP+
Sbjct: 235 DGNPKNQLYGNLGPKDICTEDHQELAREAARQGIVLLKNTAGSLPLSPKSIKSLAVIGPN 294

Query: 440 ANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADA 499
           AN    M+G+Y G PC+Y +P+ G     +  Y+ GCD +AC +   +  A + A  ADA
Sbjct: 295 ANLAYTMVGSYEGSPCKYTTPLDGLGASVSTVYQQGCD-IAC-ATAQVDNAKKVAAAADA 352

Query: 500 TIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
            +++ G D ++E ES DR ++ LPG Q+ L+ +VA V+KGPVILVIMS GG+D+ FA  N
Sbjct: 353 VVLVMGSDQTIERESKDRFNITLPGQQSLLVTEVASVSKGPVILVIMSGGGMDVKFAVDN 412

Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
             + +ILW G+PGE GG A+ADVVFG  NPGGRLP+TWY   YV  + +T+M +R     
Sbjct: 413 PKVTSILWVGFPGEAGGAALADVVFGYHNPGGRLPMTWYPQSYVDKVDMTNMNMRADPKT 472

Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
           G+PGR+Y+FY GPT++ FG GLSYTQ+K++L+   K + + L +   CR+         T
Sbjct: 473 GFPGRSYRFYKGPTVFNFGDGLSYTQYKHHLVKAPKFVSIPLEEGHACRS---------T 523

Query: 680 RCPGV-LVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
           +C  +  VN+  C++   +  +  QNVG   GS  V++++ PP+   A   K ++ FQ++
Sbjct: 524 KCKSIDAVNEQGCNNLGLDIHLKVQNVGKMRGSHTVLLFTSPPSVHNAPQ-KHLLDFQKI 582

Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            +       +KF  + CK L++VD   N  +  G H + +G+
Sbjct: 583 HLTPQSEGVVKFNLDVCKHLSVVDEVGNRKVALGLHVLHIGD 624


>gi|348667575|gb|EGZ07400.1| xylosidase [Phytophthora sojae]
          Length = 751

 Score =  573 bits (1478), Expect = e-161,   Method: Compositional matrix adjust.
 Identities = 306/739 (41%), Positives = 444/739 (60%), Gaps = 59/739 (7%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           ++SS  FCD SLP   RV DLV+R+ L++ V  L + A   P + +P YEWW+EALHGV+
Sbjct: 28  KVSSLPFCDGSLPIDARVSDLVNRIPLEQAVGLLVNKASAAPSVNVPSYEWWNEALHGVA 87

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSP 167
            + PG  F   +  ATSFP V+ T ASFN +L+ +I +A+STEARA YN   AGLT+W+P
Sbjct: 88  -LSPGVTFKGPLTAATSFPQVLSTAASFNRTLFYQIAEAISTEARAFYNEKNAGLTFWTP 146

Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD--VEGHENATDLNSRPLKVSSCC 225
           N+N+ RDPRWGR  ETPGEDP++ G YAV +VRGLQ   +EGHEN  D  ++ LK+SSCC
Sbjct: 147 NVNIFRDPRWGRGQETPGEDPYLTGEYAVAFVRGLQGEAMEGHENKDD--NKFLKISSCC 204

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH++AY  +    V R+  DA VT+QD  +T+   FE CVK G  SS+MCSYN VNGIPS
Sbjct: 205 KHFSAYSQE----VPRHRNDAIVTKQDQADTYFPAFEDCVKRGHVSSIMCSYNAVNGIPS 260

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           CAD  LL   VR +W   GYI +DC+++  ++  H F   S E   A TL AG+DL+CG+
Sbjct: 261 CADKGLLTDLVRNQWKFDGYITSDCEAVADVIYRHHF-TQSPEQTCATTLDAGMDLNCGE 319

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD-GSPQYVSLGKQDICSDENIEL 404
           +      +A++QG V    +  +LK  + V+MRLG F+ G+  + ++ K  + +  + +L
Sbjct: 320 FLRQHLSSAIEQGIVSTEMVHNALKNQFRVMMRLGMFEKGTQPFSNITKDAVDTAAHRQL 379

Query: 405 AAEAAREGIVLLKNDQNTLPLNS---AKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
           A EAAR+ +VLLKN+ NTLPL +   +K  ++A++GPH NA+ A++GNY GIP   ++P+
Sbjct: 380 ALEAARQSVVLLKNEDNTLPLATDVFSKDGSLALIGPHFNASTALLGNYFGIPSHIVTPL 439

Query: 462 AGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
            G S Y  NV Y  GC  V+ +       A E  K AD  ++  GLD S E E +DR  L
Sbjct: 440 KGVSSYVPNVAYSLGCK-VSGEVLPDFDEAIEVVKKADRVVVFMGLDQSQEREEIDRYHL 498

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
            LPG+Q  L+N++   A  P++LV++S G VD++  + +  + AI++ GY G+ GG+A+A
Sbjct: 499 KLPGFQIALLNRILAAASHPIVLVLISGGSVDLSLYKNHPKVGAIVFGGYLGQAGGQALA 558

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
           D++FGK++P GRL  T+Y+ DYV  +P+  M +RP    G PGRTY+F++G  +Y FG+G
Sbjct: 559 DMLFGKYSPAGRLTQTFYDSDYVNTMPIYDMHMRPTFVTGNPGRTYRFFSGAPVYEFGFG 618

Query: 641 LSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
           LSYT F                  + CR+                     C   FE  V 
Sbjct: 619 LSYTTFH-----------------KACRS---------------------CVASFEITV- 639

Query: 701 FQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVIGFQRV-FVRAGRNKRIKFVFNACKSLN 758
             N+G  +G D +++Y++PP A      ++ ++ F+R   V  G+     F   A K+  
Sbjct: 640 -TNLGDVEGEDAILIYAEPPHAGEGGRPLRSLVAFERTALVTTGKTATADFCLEA-KAFA 697

Query: 759 IVDYAANTLLPAGEHTIFV 777
           + +   + ++  G  TI V
Sbjct: 698 LANAEGSWVVEQGNWTIHV 716


>gi|163889365|gb|ABY48135.1| beta-D-xylosidase [Medicago truncatula]
          Length = 776

 Score =  565 bits (1457), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 309/773 (39%), Positives = 452/773 (58%), Gaps = 56/773 (7%)

Query: 31  SSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
           ++P + C P          S + FC+ SLP S R   L+S +TL +K+ QL + A  +  
Sbjct: 26  TTPDYPCKPPH--------SHYPFCNISLPISTRTTSLISLLTLSDKINQLSNTASSISH 77

Query: 91  LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
           LG+P Y+WWSEALHG++  GPG +F+  +  AT+FP VI++ A+FN SLW  IG AV  E
Sbjct: 78  LGIPSYQWWSEALHGIATNGPGVNFNGSVKSATNFPQVIVSAAAFNRSLWFLIGYAVGVE 137

Query: 151 ARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE- 209
            RAM+N+G+AGL++W+PN+NV RDPRWGR  ETPGEDP V   YAV +VRG+Q V+G + 
Sbjct: 138 GRAMFNVGQAGLSFWAPNVNVFRDPRWGRGQETPGEDPMVGSAYAVEFVRGIQGVDGIKK 197

Query: 210 --NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKE 267
             N  D +   L VS+CCKH+ AYD++ W    RY+F+A V       T+  PF  CV++
Sbjct: 198 VLNDHDSDDDGLMVSACCKHFTAYDLEKWGEFSRYNFNAVVN------TYQPPFRGCVQQ 251

Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGY-IVADCDSIQVMVDNHKFLADS 326
           G AS +MCSYN VNG+P+CA   LL   VR +W   G  I+     + ++  + K + + 
Sbjct: 252 GKASCLMCSYNEVNGVPACASKDLLG-LVRNKWGFEGVGILPQTVMLWLLFLSIKSMQNL 310

Query: 327 KEDAVAQTLKA-----------GLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTV 375
            +  +   LK             +D++CG +    T +A++QG VKE D+D++L  L++V
Sbjct: 311 PKMLLLMFLKQVFFYVFENLWFCMDINCGTFMLRHTESAIEQGLVKEEDLDRALFNLFSV 370

Query: 376 LMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKT 432
            MRLG F+G P+   +  LG QD+C+ E+ +LA EAAR+GIVLLKND   LPL+     +
Sbjct: 371 QMRLGLFNGDPEKGKFGKLGPQDVCTPEHKKLALEAARQGIVLLKNDNKFLPLDKKDRVS 430

Query: 433 VAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYAN-VTYKTGCDDVACKSNNSIFAAS 491
           +A++GP A  T  + G Y+GIPC   S   G   Y   ++Y  GC DV C S++    A 
Sbjct: 431 LAIIGPMA-TTSELGGGYSGIPCSPRSLYDGLKEYVKTISYAFGCSDVKCDSDDGFAVAI 489

Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
           + AK AD  +I+AGLD ++E E LDR  L LPG Q  L+++VA  +K PVILV+   G +
Sbjct: 490 DIAKQADFVVIVAGLDTTLETEDLDRVSLLLPGKQMDLVSRVAAASKRPVILVLTGGGPL 549

Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSM 611
           D++FAE+N  I +ILW GYP +             F+  GRLP+TWY   +   +P+  M
Sbjct: 550 DVSFAESNQLITSILWIGYPVD-------------FDAAGRLPMTWYPESFTN-VPMNDM 595

Query: 612 PLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC---R 668
            +R   S GYPGRTY+FY G  +Y FG+GLSY+ F Y +LS     +++L+K  +    R
Sbjct: 596 GMRADPSRGYPGRTYRFYTGSRIYGFGHGLSYSDFSYRVLSAPS--KLSLSKTTNGGLRR 653

Query: 669 NLNYTSDASKTRCPGVLVNDLR-CDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
           +L    +        V V++L+ C+   F   +   NVG  DGS VV+++SK P  I  +
Sbjct: 654 SLLNKVEKDVFEVDHVHVDELQNCNSLSFSVHISVMNVGDMDGSHVVMLFSKWPKNIQGS 713

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
              Q++G  R+   + ++     + + C+  +  D     +LP G H + VG+
Sbjct: 714 PESQLVGPSRLHTVSNKSIETSILADPCEHFSFADEQGKRILPLGNHILNVGD 766


>gi|326513064|dbj|BAK03439.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 694

 Score =  558 bits (1439), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 283/623 (45%), Positives = 396/623 (63%), Gaps = 30/623 (4%)

Query: 171 VARDPRWGRI--------TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           V + P  GR+        +ETPGEDP +  +YAV YV GLQD      A  +    LKV+
Sbjct: 79  VNKQPALGRLGIPAYEWWSETPGEDPLLASKYAVGYVTGLQDA----GAGGVTDGALKVA 134

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           +CCKHY AYDVDNWKGV+RY FDA+V++QD+++TF  PF+ CV +G+ +SVMCSYN+VNG
Sbjct: 135 ACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGNVASVMCSYNKVNG 194

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
            P+CAD  LL   +RG+W L+GYIV+DCDS+ V+     +   + E+A A T+K+GLDL+
Sbjct: 195 KPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHY-TKTPEEAAAITIKSGLDLN 253

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSD 399
           CG +    T  AVQ G++ E D+D+++   + +LMRLGFFDG P+   + SLG +D+C+ 
Sbjct: 254 CGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQLAFGSLGPKDVCTS 313

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
            N ELA E AR+GIVLLKN    LPL++  +K++AV+GP+ANA+  MIGNY G PC+Y +
Sbjct: 314 SNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTMIGNYEGTPCKYTT 372

Query: 460 PIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAGLDLSVEAESLDRE 518
           P+ G     N  Y+ GC +V C  N+  +  A  AA +AD T+++ G D S+E ESLDR 
Sbjct: 373 PLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVGADQSIERESLDRT 432

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
            L LPG QTQL++ VA  + GPVILV+MS G  DI+FA+ +  I AILW GYPGE GG A
Sbjct: 433 SLLLPGQQTQLVSAVANASSGPVILVVMSGGPFDISFAKASDKIAAILWVGYPGEAGGAA 492

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           +AD++FG  NP GRLP+TWY   Y   + +T M +RP  S GYPGRTY+FY G T++ FG
Sbjct: 493 LADILFGSHNPSGRLPVTWYPASYADTVTMTDMRMRPDTSTGYPGRTYRFYTGDTVFAFG 552

Query: 639 YGLSYTQFKYNLLSFTKT-IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FE 696
            GLSYT+  ++L+S   + + + L +   CR            C  V      CDD  F+
Sbjct: 553 DGLSYTKMSHSLVSAPPSYVSMRLAEDHPCR---------AEECASVEAAGDHCDDLAFD 603

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
            K+  +N G   G+  V+++S PP    A   K ++GF++V +  G    + F  + C+ 
Sbjct: 604 VKLQVRNAGEVAGAHSVLLFSSPPPAHNAP-AKHLLGFEKVSLAPGEAGTVAFRVDVCRD 662

Query: 757 LNIVDYAANTLLPAGEHTIFVGN 779
           L++VD      +  G HT+ VG+
Sbjct: 663 LSVVDELGGRKVALGGHTLHVGD 685



 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/72 (41%), Positives = 43/72 (59%), Gaps = 5/72 (6%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           + +PVF CD    +     ++++ FC+     S R +DLVSR+TL EKV  L +    + 
Sbjct: 32  AQAPVFACDASNAT-----LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALG 86

Query: 90  RLGLPQYEWWSE 101
           RLG+P YEWWSE
Sbjct: 87  RLGIPAYEWWSE 98


>gi|301110280|ref|XP_002904220.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
 gi|262096346|gb|EEY54398.1| beta-D-xylosidase, putative [Phytophthora infestans T30-4]
          Length = 709

 Score =  546 bits (1406), Expect = e-152,   Method: Compositional matrix adjust.
 Identities = 294/720 (40%), Positives = 421/720 (58%), Gaps = 55/720 (7%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R    ++R+ LD+ V  L + A   P + +P YEWW+EALHGV+ + PG  F   I  AT
Sbjct: 7   RSLHCLTRIPLDQAVGLLVNKAAPAPSVNIPSYEWWNEALHGVA-LSPGVTFKGSITAAT 65

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITET 183
           SFP V+ T ASFN SL+ +I   +STEARA +N   AGLT+W+PN+N+ RDPRWGR  ET
Sbjct: 66  SFPQVLSTAASFNRSLFYQIADVISTEARAFHNAKDAGLTFWTPNVNIFRDPRWGRGQET 125

Query: 184 PGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYH 243
           PGEDP++ G YAV +VRGLQ  EG E     NS+ LK+SSCCKH++AY  +    V R+ 
Sbjct: 126 PGEDPYLTGEYAVAFVRGLQG-EGMEGREVENSKFLKISSCCKHFSAYSQE----VPRHR 180

Query: 244 FDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLH 303
            +A VT+QD  +T+   FE CVK G  SS+MCSYN VNGIPSCAD  LL   VRG+W   
Sbjct: 181 NNAMVTKQDQADTYFPAFEDCVKRGHVSSIMCSYNAVNGIPSCADKGLLTDLVRGQWKFD 240

Query: 304 GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKET 363
           GYI +DC+++  ++D+H +   S E   A TL AG+DL+CG++       A++QG V   
Sbjct: 241 GYIASDCEAVADVIDHHHY-TQSPEQTCATTLDAGMDLNCGEFLRQHLPKALEQGIVTTE 299

Query: 364 DIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTL 423
            I  +LK  + VLMRLG F+    + ++ K  + +  + +LA EAAR+ IVLLKND NTL
Sbjct: 300 MIHNALKNQFRVLMRLGMFEKVEPFANITKDSVDTTMHRQLALEAARQSIVLLKNDGNTL 359

Query: 424 PLNS---AKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDV 479
           PL +    + +++A++GPH NA+ A++GNY GIP   ++P+ G S +  NV +  GC  V
Sbjct: 360 PLATKDFTRDRSLALIGPHFNASAALLGNYFGIPSHIVTPLEGISQFVPNVAHSLGCK-V 418

Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG 539
           + +       A   AK AD  I+  GLD S E E +DR  + LP +Q+ L+ +V EVA  
Sbjct: 419 SGEVLPDFDDAIAVAKKADRLIVFVGLDQSQEREEIDRYHIGLPAFQSTLLKRVLEVASH 478

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           P++ V++S G VD++  + +  + AI++ GY G+ GG+A+ADV+FGK+NP G+LP T+Y+
Sbjct: 479 PIVFVVISGGCVDLSAYKNHPKVGAIVFGGYLGQAGGQALADVLFGKYNPSGKLPQTFYD 538

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
            +YV  + +  M +RP    G  GRTY+F+ G  +Y FG+GLSYT F  N  +   T   
Sbjct: 539 SEYVNAMSIYDMHMRPTPVTGNSGRTYRFFTGVPVYEFGFGLSYTTFHKNCHACVAT--- 595

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                                F +   N G+  G DV++ Y +P
Sbjct: 596 -------------------------------------FNITVTNAGAISGEDVILTYVEP 618

Query: 720 P-AEIAATYIKQVIGFQRV-FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
           P A      +K ++ F+R   + AG+    K    A K+  + + A N ++  G  TI V
Sbjct: 619 PLAGEGGRPLKSLVAFERTPLIAAGQRATAKICLEA-KAFALANEAGNWVVEPGNWTIHV 677


>gi|326488213|dbj|BAJ89945.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 525

 Score =  541 bits (1394), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 268/505 (53%), Positives = 355/505 (70%), Gaps = 15/505 (2%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           + +PVF CD    +     ++++ FC+     S R +DLVSR+TL EKV  L +    + 
Sbjct: 32  AQAPVFACDASNAT-----LAAYGFCNRKATASARARDLVSRLTLAEKVGFLVNKQPALG 86

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P YEWWSEALHGVS VGPGT F  ++PGATSFP  ILT ASFN SL++ IG+ VST
Sbjct: 87  RLGIPAYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNASLFRAIGEVVST 146

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           EARAM+N+G AGLT+WSPNIN+ RDPRWGR  ETPGEDP +  +YAV YV GLQD  G  
Sbjct: 147 EARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQDA-GAG 205

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
             TD     LKV++CCKHY AYDVDNWKGV+RY FDA+V++QD+++TF  PF+ CV +G+
Sbjct: 206 GVTD---GALKVAACCKHYTAYDVDNWKGVERYTFDAKVSQQDLDDTFQPPFKSCVLDGN 262

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
            +SVMCSYN+VNG P+CAD  LL   +RG+W L+GYIV+DCDS+ V+     +   + E+
Sbjct: 263 VASVMCSYNKVNGKPTCADKDLLEGVIRGDWKLNGYIVSDCDSVDVLYTQQHY-TKTPEE 321

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
           A A T+K+GLDL+CG +    T  AVQ G++ E D+D+++   + +LMRLGFFDG P+  
Sbjct: 322 AAAITIKSGLDLNCGNFLAQHTVAAVQAGELSEEDVDRAITNNFIMLMRLGFFDGDPRQL 381

Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
            + SLG +D+C+  N ELA E AR+GIVLLKN    LPL++  +K++AV+GP+ANA+  M
Sbjct: 382 AFGSLGPKDVCTSSNRELARETARQGIVLLKN-SGALPLSAKSIKSMAVIGPNANASFTM 440

Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAG 505
           IGNY G PC+Y +P+ G     N  Y+ GC +V C  N+  +  A  AA +AD T+++ G
Sbjct: 441 IGNYEGTPCKYTTPLQGLGAKVNTVYQPGCTNVGCSGNSLQLSTAVAAAASADVTVLVVG 500

Query: 506 LDLSVEAESLDREDLWLPGYQTQLI 530
            D S+E ESLDR  L LPG QTQL+
Sbjct: 501 ADQSIERESLDRTSLLLPGQQTQLV 525


>gi|300121549|emb|CBK22068.2| unnamed protein product [Blastocystis hominis]
          Length = 690

 Score =  540 bits (1391), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 304/725 (41%), Positives = 418/725 (57%), Gaps = 56/725 (7%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R + LV+ +TL EK+  +G  A  V RL +P+Y+WWSEALHGV+   PG  F +  P AT
Sbjct: 4   RARALVAELTLAEKMSLMGHTASEVKRLNIPKYQWWSEALHGVA-ASPGVVFQEPTPFAT 62

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITET 183
           +FP V LT  SF++ L+  I   +STEAR M N  RA LTYWSPN+NV RDPRWGR  ET
Sbjct: 63  AFPQVALTAQSFDKPLFHDIASIISTEARVMNNAERANLTYWSPNVNVYRDPRWGRGQET 122

Query: 184 PGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYH 243
           PGEDPF+V  YAV +VRGLQ+ E        + R LKVS+CCKHY+AYD++NW GV+R+ 
Sbjct: 123 PGEDPFLVATYAVEFVRGLQEGE--------DPRYLKVSACCKHYSAYDLENWHGVERFE 174

Query: 244 FDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLH 303
           FDA V+++DM +TF  PFE CVK+G  SS+MCSYN +NGIP+CAD +LL  T RG W   
Sbjct: 175 FDAIVSDRDMTDTFQVPFEQCVKKGHVSSLMCSYNAINGIPACADRELLYGTARGGWGFE 234

Query: 304 GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKET 363
           GYI +DC +I  ++ NH +  D+   A+   ++A  DLDCG +Y     ++V+ G++KE 
Sbjct: 235 GYITSDCGAIDTIIYNHHYTNDTDTTAML-GVRATCDLDCGGFYQQHILHSVESGRLKEA 293

Query: 364 DIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQN 421
           ++D +L  L+ V MRLG FD   Q  Y   G   + + E+  +A  AAREGI LLKN  +
Sbjct: 294 EVDDALANLFKVQMRLGLFDPVEQQVYTHYGLDKLNTKEHQAMALRAAREGIALLKNQND 353

Query: 422 TLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVAC 481
            LPL S K K V V+GP+A     M+GNY GIP  ++  +A   G  NV     CD V  
Sbjct: 354 FLPL-SLKDKHVVVMGPYAEDAGVMLGNYNGIP-EFIVTVA--QGLRNV-----CDHVDV 404

Query: 482 KSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPV 541
             +    +  E     D  ++  GL+  +E E LDREDL LP  Q  L++ +      PV
Sbjct: 405 VKSLEALSKLEG---VDLIVVTVGLNQEIEREGLDREDLLLPASQRALLDGLLAQTDVPV 461

Query: 542 ILVIMSAGG-VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNG 600
           +L ++S GG VDI+  E N ++  +L  GY G  GG+AIA+V+ G  NP GRL  T Y  
Sbjct: 462 VLTLLSGGGSVDISAYEQNEHVVGVLAVGYGGMFGGQAIAEVIVGDVNPSGRLVNTMYYN 521

Query: 601 DYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVN 660
           DYV  L    M +RP +  G+PGRTY+F+ GP ++PFG+GLSYT F +          V 
Sbjct: 522 DYVTNLDYFDMNMRPKEETGFPGRTYRFFAGPVIHPFGFGLSYTTFAH---------AVE 572

Query: 661 LNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP 720
           + ++++ R                L + L  D Y    V   N GS  G + V+++ K P
Sbjct: 573 IGQMRNHR----------------LRSALAIDVY----VKVTNTGSRQGDESVLLFVKSP 612

Query: 721 AEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
                 Y +K +  F RV +  G  + + FV    + L++ +  A  +L  GE  + V  
Sbjct: 613 LAGKQGYPLKSLADFSRVSLAPGETQTVHFVLGE-EQLHLANEQAKYVLLRGEWKVEVEE 671

Query: 780 GGVSF 784
               F
Sbjct: 672 ASARF 676


>gi|320170454|gb|EFW47353.1| beta-xylosidase [Capsaspora owczarzaki ATCC 30864]
          Length = 779

 Score =  537 bits (1384), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 311/801 (38%), Positives = 449/801 (56%), Gaps = 64/801 (7%)

Query: 10  CFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLV 69
           C ++++A LV +  A            C+      L        FC+ +L +  R  DLV
Sbjct: 8   CITIAVAALVVAPTA--------RALTCEDAALRNLP-------FCNPNLAWEQRADDLV 52

Query: 70  SRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVI 129
            R+TL EK+ Q G  A GV RLG+  YEWWSEALHGV+   PG +F    P +T FP +I
Sbjct: 53  GRLTLQEKISQFGTTAPGVARLGVNAYEWWSEALHGVAE-SPGVNFTGNTPVSTCFPQII 111

Query: 130 --------LTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
                      A+FN      + Q +STEARA  N G AGLTY++PNIN+ RDPRWGR  
Sbjct: 112 GNNCSSLSRVGATFNLDSVAAMAQVISTEARAFANAGHAGLTYFTPNINIFRDPRWGRGQ 171

Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
           ETPGEDP++  RY    V+ LQ+ E        ++R LKV + CKHY AYD+++W G+DR
Sbjct: 172 ETPGEDPYLTSRYVETLVQNLQNGE--------DARYLKVVATCKHYTAYDMEDWGGIDR 223

Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
           +HF+A V++QD+ ETF+ PFE CV+ G  +S+MCSYN VNGIPSCAD  + N+  R +W 
Sbjct: 224 FHFNAVVSDQDLVETFMPPFEACVRVGKGASLMCSYNAVNGIPSCADDFINNEIAREQWG 283

Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVK 361
             GYIV+DC +I  +   H +  ++ +   A  ++ G DLDCG +Y +   +A+    + 
Sbjct: 284 FDGYIVSDCGAIDCIQYTHNY-TNTTQATCAAGIQGGCDLDCGDFYQSHLMDAIGNATLH 342

Query: 362 ETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAAREGIVLLKND 419
           E D+D SL+ L+   +RLG FD +    Y  +    I S E+ ELA + ARE IVLL ND
Sbjct: 343 EADLDFSLRRLFGHRIRLGEFDAASIQPYRQIPVSAINSQEHQELALQIARESIVLLGND 402

Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKTGC 476
            NTLP + A V+ +A++GP+A+    ++GNY G     ++P+ GF       ++T+  GC
Sbjct: 403 NNTLPFSLATVRKLAIIGPNADDAETLLGNYYGDAPYLITPLKGFQQLDPTLSITFVKGC 462

Query: 477 DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEV 536
            DV     +   AA+ AAK ADATI++ GL+ +VE+E+LDR  L LPG Q +LI  +   
Sbjct: 463 -DVNSTDTSGFVAAAAAAKAADATIVVVGLNQTVESENLDRTTLVLPGVQAELILALTAA 521

Query: 537 AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT 596
           A+GPVILV+MS   +D+  +     ++A LW GYPG+ GGRA+A+ VFG F+P GRLP T
Sbjct: 522 ARGPVILVVMSGSPIDL--SNVIHPVRAALWIGYPGQAGGRALAEAVFGVFSPAGRLPFT 579

Query: 597 WYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT 656
            Y  DYV  LP+T+M +R       PGRTY+FY G  L+ FG+GLSY+ F+Y   + + +
Sbjct: 580 VYPADYVNQLPMTNMDMR-----AGPGRTYRFYTGTPLFEFGHGLSYSTFQYTWSNSSSS 634

Query: 657 IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY 716
              +             +     R P   V+         F+V  QN G     DVV+ +
Sbjct: 635 SSSSATSQHSLSTAALAAQHLAARAPVEAVS---------FRVLVQNTGKMASDDVVLAF 685

Query: 717 SKPPA---------EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
           +   A         + A+  I+ ++GF+R+ +  G ++ I F   + +   +    A TL
Sbjct: 686 ASFNASSIIDQSSSQFASPPIRSLVGFRRIHLAPGASQEIFFAVTSSQLAQVDSTGAQTL 745

Query: 768 LPAGEHTIFVGNGGVSFPIHL 788
           +P+     F  +  +   I L
Sbjct: 746 VPSRLQVAFGSDARLVAEIQL 766


>gi|340370206|ref|XP_003383637.1| PREDICTED: probable beta-D-xylosidase 5-like [Amphimedon
           queenslandica]
          Length = 728

 Score =  536 bits (1381), Expect = e-149,   Method: Compositional matrix adjust.
 Identities = 294/741 (39%), Positives = 433/741 (58%), Gaps = 60/741 (8%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
            +++ +CD +     RV DL+SRMT+ +K+ QL   A  +P L +P Y+WWSE LHGV+ 
Sbjct: 26  FNTYKYCDYTQSIPERVNDLLSRMTILDKIPQLITSAPAIPSLDIPAYQWWSEGLHGVAG 85

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
             PG HF    P ATSFP VI   A+FN SL   + Q +STEARA  N G+AGLTY++PN
Sbjct: 86  -SPGVHFGGNFPNATSFPQVIGLGATFNMSLVLAMAQVISTEARAFANGGQAGLTYFAPN 144

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           IN+ RDPRWGR  ETPGEDP++  +YA N+V+G+Q     E A D  +R LK  + CKHY
Sbjct: 145 INIFRDPRWGRGQETPGEDPYLSSQYAANFVKGMQ-----EGADD--TRYLKTIATCKHY 197

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
           AAYD++N+  + R+ F+A V++QD EET+   F  CV+EG   S+MCSYN VNG+PSCA+
Sbjct: 198 AAYDLENYLNLSRHTFNAIVSDQDFEETYFPAFRSCVEEGKVGSIMCSYNAVNGVPSCAN 257

Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
             + N+  RG+W   GY+V+DC +I  ++++HK+ +++ +D VA  L+ G DL+CG +Y+
Sbjct: 258 DFINNEVARGKWGFEGYVVSDCGAISDIINSHKYTSNT-DDTVAAGLRGGCDLNCGHFYS 316

Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAA 406
           +    A   G + + DID+++  L+T  MRLG FD      +       + + ++  LA 
Sbjct: 317 DHAQAAYDNGAITDDDIDRAMTRLFTYRMRLGMFDPPSMQPFRDYTNDKVDTKQHEALAL 376

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
           +A+RE IVLL+N+++ LPL+    + +A+VGPH  A  AM GNY G     +SP+ G   
Sbjct: 377 DASRESIVLLQNNKDILPLSLTTHRKIALVGPHGQAQGAMQGNYKGTAPYLISPMQGLQD 436

Query: 467 YA-NVTYKTGCDDVACKSNNSIFAASEAAK-----TADATIILAGLDLSVEAESLDREDL 520
              +VT+  GC  VAC    +I   SE  K     + +A I + GLD S E+E  DR  L
Sbjct: 437 LGLSVTFAAGCTQVACP---TIAGFSEVTKLVEEHSIEAIIAVIGLDESQESEGHDRTSL 493

Query: 521 WLPGYQTQLINQVAE--VAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
            LPG Q QL+  + +  V   P I+V+MS G VD++  +   +  AILWAGYPG+ GG+A
Sbjct: 494 TLPGQQVQLLEDIKKKAVPGIPFIVVVMSGGPVDLSGVKDIAD--AILWAGYPGQSGGQA 551

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           IA+V++GK NP GRLP+T+Y   Y+  +P T+M +R       PGR+YKFY G  ++PFG
Sbjct: 552 IAEVIYGKVNPSGRLPVTFYPASYINEIPYTNMSMRVP-----PGRSYKFYTGTPVFPFG 606

Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
           +GLSYT F+   + +     V   K  H  ++NY                         +
Sbjct: 607 FGLSYTTFE---MKWKNPPNVTHLKTTHDVDVNY-------------------------E 638

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
           V   N G   GS  V+ Y    + +    +K++ GFQ+++++  ++  + FV    K   
Sbjct: 639 VVVTNAGKRSGSVSVLAYIT--STVPGAPMKELFGFQKIYLKPEQSMTLSFVAEP-KVFT 695

Query: 759 IVDYAANTLLPAGEHTIFVGN 779
            VD      +  G + I +G+
Sbjct: 696 TVDKHGERKIRPGTYKITIGD 716


>gi|340370204|ref|XP_003383636.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 755

 Score =  531 bits (1368), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 298/738 (40%), Positives = 434/738 (58%), Gaps = 55/738 (7%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
            +++L+C+ S   + RVKDL+SR+T+ EK+ Q    A  + RL +P Y+WWSE LHG++ 
Sbjct: 53  FNAYLYCNYSASITERVKDLLSRLTVLEKMSQTATNASAIERLDIPAYDWWSECLHGLAQ 112

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
             PG  F++ +  ATSFP VI   A+FN SL   +GQ +STEARA  N G++GLT+++PN
Sbjct: 113 -SPGVFFENDLTSATSFPQVIGLGATFNMSLVLAMGQVISTEARAFANNGQSGLTFFAPN 171

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           IN+ RDPRWGR  ETPGEDP++  +YA N+V+G+Q  EG E     + R LK  + CKHY
Sbjct: 172 INIYRDPRWGRGQETPGEDPYLTSQYAANFVKGIQ--EGSE-----DRRYLKAIATCKHY 224

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
           AAY+++ +  V R +F+A V++QD+EET+L  F+ CV+EG   S+MCSYN +NG+P+CA+
Sbjct: 225 AAYNLERYLDVRRVNFNAIVSDQDLEETYLPAFKACVQEGQVGSIMCSYNAINGVPNCAN 284

Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
             + N+  R  W   GYIV+DC +I  +   H + +D+    VA  LK G DL+CG +Y 
Sbjct: 285 DFINNKIARDTWGFEGYIVSDCGAILDIQYKHNYTSDTN-ITVADALKGGCDLNCGHFYE 343

Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQ----DICSDENIEL 404
            +  +A     + E DIDKSL  L+T  MRLG FD  P  +   +Q    D+ + E  +L
Sbjct: 344 KYMEDAFDNSTITEEDIDKSLTRLFTSRMRLGMFD--PPEIQPFRQYSVKDVNTPEAQDL 401

Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF 464
           A  AAREGIVLL+N  + LPL+  K   +A +GP+A+AT  M GNY GI    +SP+ GF
Sbjct: 402 ALNAAREGIVLLQNKGSVLPLDIVKHSNIAAIGPNADATHIMQGNYHGIAPYLISPLQGF 461

Query: 465 SGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
           S    N TY+ GC  VAC        A +A +  DA I + GL+ + E ES DR  + LP
Sbjct: 462 SNLGINATYQIGC-PVACNDTEGFPDAVKAVQGVDAVIAVIGLNNTQEGESHDRTSIALP 520

Query: 524 GYQTQLINQVAE-VAKG-PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
           G+Q  L+ ++ +  AKG P+I+V+MS G VD+   +   +  AILWAGYPG+ GG+AIA+
Sbjct: 521 GHQEDLLLELKKNAAKGTPLIVVVMSGGSVDLTGVKDIAD--AILWAGYPGQSGGQAIAE 578

Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
           V++GK NP GRLP+T+Y   Y+  +P T+M +R       PGR+YKFY G  ++PFG+GL
Sbjct: 579 VIYGKVNPSGRLPVTFYPASYINEIPYTNMSMRVP-----PGRSYKFYTGTPVFPFGFGL 633

Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
           SYT F+      T T +    K  H   +NY +  +                        
Sbjct: 634 SYTTFEIKWKD-TSTAKDYYLKTTHDEVVNYEATVT------------------------ 668

Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
            N GS  GS  V+ +    + +    +K++  F+++++    +  + FV    K    VD
Sbjct: 669 -NSGSRPGSVSVLAFIT--SSVPGAPMKELFAFKKIYLEPTESVDVSFVAEP-KVFTTVD 724

Query: 762 YAANTLLPAGEHTIFVGN 779
                 +  G + I +G+
Sbjct: 725 IYGIRKIRPGAYKIIIGD 742


>gi|293336530|ref|NP_001167905.1| uncharacterized protein LOC100381616 [Zea mays]
 gi|223944757|gb|ACN26462.1| unknown [Zea mays]
          Length = 630

 Score =  531 bits (1367), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 272/645 (42%), Positives = 401/645 (62%), Gaps = 25/645 (3%)

Query: 154 MYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
           M+N G+AGLTYW+PNIN+ RDPRWGR  ET GEDP V   Y++ YV+G Q         +
Sbjct: 1   MHNAGQAGLTYWAPNINIFRDPRWGRGQETSGEDPAVAAAYSLEYVKGFQ-------GEE 53

Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
                +++S+CCKHY AYD++ W+G  RY F+A+V  QD+E+T+  PF+ C++E  AS +
Sbjct: 54  GEEGRIRLSACCKHYTAYDMEKWEGFSRYTFNAKVNAQDLEDTYQPPFKTCIQEARASCL 113

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           MC+YN+VNG+P CA   LL +T R EW   GYI +DCD++ ++ +N  +   S ED++A 
Sbjct: 114 MCAYNQVNGVPMCAHKDLLQKT-RDEWGFQGYITSDCDAVAIIHENQTY-TKSGEDSIAI 171

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVS 390
            LKAG+D++CG +    T +A+++GK++E DID++L  L++V +RLG FD       +  
Sbjct: 172 VLKAGMDINCGSFLVRHTKSAIEKGKIQEEDIDRALFNLFSVQLRLGIFDKPSNNQWFSQ 231

Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
           LG   +C+ E+ ELAAEA R+G VLLKND N LPL  ++V+ VA++GP AN   AM G+Y
Sbjct: 232 LGPNSVCTKEHRELAAEAVRQGAVLLKNDHNFLPLKRSEVRHVAIIGPSANDAYAMGGDY 291

Query: 451 AGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLS 509
            G+PC   + + G   YA  T +  GC D +C S +    A EAAK AD  +++AGL+L+
Sbjct: 292 TGVPCNPTTFLKGIQAYATQTSFAPGCKDASCNSTDLFGEAVEAAKRADIVVVIAGLNLT 351

Query: 510 VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAG 569
            E E  DR  L LPG Q  LI+ +A VAK P++LV++  G VD++FA+ +  I +ILW G
Sbjct: 352 EEREDFDRVSLLLPGKQMGLIHAIASVAKKPLVLVLLGGGPVDVSFAKQDPRIASILWLG 411

Query: 570 YPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFY 629
           YPGE GG+ + +++FG++NPGG+LPITWY   +   +P+T M +R   S GYPGRTY+FY
Sbjct: 412 YPGEVGGQVLPEILFGEYNPGGKLPITWYPESFT-AIPMTDMNMRADPSRGYPGRTYRFY 470

Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS-DASKTRCPG---VL 685
            G  +Y FGYGLSY+++ Y++ S  K I V+ +      +L   S   + TR  G   V 
Sbjct: 471 TGDVVYGFGYGLSYSKYSYSISSAPKKITVSRSS-----DLGIISRKPAYTRRDGLGSVK 525

Query: 686 VNDL-RCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
             D+  C+   F   V   N GS DGS  V+++++  + +    IKQ++GF+ V   AG 
Sbjct: 526 TEDIASCEALVFSVHVAVSNHGSMDGSHAVLLFARSKSSVPGFPIKQLVGFESVHTAAGS 585

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
              ++   + CK ++  +     +L  G H + VG+      I L
Sbjct: 586 ASNVEITVDPCKQMSAANPEGKRVLLLGAHVLTVGDEEFELSIEL 630


>gi|167525174|ref|XP_001746922.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163774702|gb|EDQ88329.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1620

 Score =  527 bits (1358), Expect = e-147,   Method: Compositional matrix adjust.
 Identities = 292/747 (39%), Positives = 440/747 (58%), Gaps = 51/747 (6%)

Query: 47   LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV 106
            L   +F FC++SL    R++D++SR+++ +KV    + A      GLP Y+WWSEALHGV
Sbjct: 919  LPAKNFPFCNASLDLDTRIRDVISRLSIQDKVALTANTAGAAADAGLPAYQWWSEALHGV 978

Query: 107  SNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
                PG  F   +  ATSFP VI T+ASFN++LW  IG  +STEARAM N+ +AGLT+W+
Sbjct: 979  G-FSPGVTFMGKVQAATSFPQVIHTSASFNKTLWHHIGMTISTEARAMNNVNQAGLTFWA 1037

Query: 167  PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCK 226
            PNIN+ RDPRWGR  ETPGEDP+  G YA N+V G+Q+ E        ++R +K SSCCK
Sbjct: 1038 PNINIIRDPRWGRGQETPGEDPYATGLYAANFVPGMQEGE--------DTRYIKASSCCK 1089

Query: 227  HYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
            H+  Y++++W  VDR+HF+A  T+QD+ +T+L  FE CV+ G ASS+MCSYN VNG+PSC
Sbjct: 1090 HFFDYNLEDWHNVDRHHFNAIATDQDIADTYLPAFESCVRFGRASSLMCSYNAVNGVPSC 1149

Query: 287  ADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
            A+  ++    R  W   GYI +DC +++ +  NHK+  ++    V   L AG+D+DCG +
Sbjct: 1150 ANADIMTTLAREAWGFDGYITSDCGAVEDVYSNHKYY-NTTGATVNGVLSAGMDVDCGSF 1208

Query: 347  YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIEL 404
             +    +A+  G V    +D++L  L+ V  RLG FD +    Y++L    + + E+ +L
Sbjct: 1209 LSQHLADAIDSGDVTNATVDQALYNLFRVQFRLGMFDPAEDQPYLNLTTDAVNTPEHQQL 1268

Query: 405  AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF 464
            A EAAR+G+ LL+N  + LPL+++ +K +A++GP+ANAT  M GNY G     +SP  G 
Sbjct: 1269 ALEAARQGMTLLENRDSRLPLDASSIKQLALIGPNANATGVMQGNYNGKAPFLISPQQGV 1328

Query: 465  SGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
              Y +NV  + G              A  AAK AD  +++ GLD + E+E  DRE + LP
Sbjct: 1329 QQYVSNVALELG--------------AVTAAKAADTVVMVIGLDQTQESEGHDREIIALP 1374

Query: 524  GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
            G Q +L+ QVA  +  P+++V+M+ G VD+   +   N+         G+ GG+A+A+ +
Sbjct: 1375 GMQAELVAQVANASSSPIVVVVMTGGAVDLTPVKDLDNV---------GQAGGQALAETL 1425

Query: 584  FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
            FG  NPGGRLP T Y  D V  + +    +RP  + G PGRTY+FY G  +Y +G GLSY
Sbjct: 1426 FGDNNPGGRLPYTLYPADLVNQVSMFDDGMRPNATSGNPGRTYRFYTGTPVYAYGTGLSY 1485

Query: 644  TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
            T F Y   + T +++V+  +++      + +   +T     + +++  +DY    V  QN
Sbjct: 1486 TSFSYE--TSTPSLRVSAERVRA-----WVAARGQT---SFIRDEVDAEDYITVTV--QN 1533

Query: 704  VGSTDGSDVVIVYSKPPAEIA-ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDY 762
             G+  G+DVV V+ K     A    IK + GF+RVF++ G    I+F       L++V+ 
Sbjct: 1534 NGTVAGADVVQVFIKTTTPGADGNPIKSLCGFERVFLKPGETTSIQFPVTP-HDLSVVNS 1592

Query: 763  AANTLLPAGEHTIFVGNGG-VSFPIHL 788
                +   G  T+ V +   +S PI +
Sbjct: 1593 RGERVAVPGTWTVEVHHEARLSIPISV 1619


>gi|78482949|emb|CAJ41429.1| beta (1,4)-xylosidase [Populus tremula x Populus alba]
          Length = 732

 Score =  516 bits (1330), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 309/755 (40%), Positives = 423/755 (56%), Gaps = 76/755 (10%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
           F CDP   +   L      FC  +LP   RV DL+ RMTL EKV  L + A  VPRLG+ 
Sbjct: 27  FACDPKDGTNRDLP-----FCQVNLPIHTRVNDLIGRMTLQEKVGLLVNNAAAVPRLGIK 81

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            YEWWSEALHGVSNVGPGT F    P ATSFP VI T ASFN +LW+ IG+ VS EARAM
Sbjct: 82  GYEWWSEALHGVSNVGPGTKFGGAFPVATSFPQVITTAASFNATLWEAIGRVVSDEARAM 141

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
           +N G AGLTYWSPN+  +  PRWGR  ETPGEDP VVG+YA +YVRGLQ  +G       
Sbjct: 142 FNGGVAGLTYWSPNVTYSVYPRWGRGQETPGEDPVVVGKYAASYVRGLQGSDGIR----- 196

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
               LKV++CCKH+ AYD+DNW GVDR+HF+A+V++QDM +TF  PF MCVKEG  +SVM
Sbjct: 197 ----LKVAACCKHFTAYDLDNWNGVDRFHFNAKVSKQDMVDTFDVPFRMCVKEGKVASVM 252

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           CSYN+VNGIP+CADP LL +TVRG+W L+GYIV+DCDS  V      F   S   +    
Sbjct: 253 CSYNQVNGIPTCADPNLLKKTVRGQWRLNGYIVSDCDSFGVYYGQQHF--TSPRRSSLGC 310

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-QYVSLGK 393
            KAGLDLDCG +      +AV++   +E +I+ +     T  + LG FDGSP Q V    
Sbjct: 311 YKAGLDLDCGPFLVTHR-DAVKKA-AEEAEINNAWLKTLTFQISLGIFDGSPLQAVGDVV 368

Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA--NATVAMIGNYA 451
             +    N +LA  A +  + + KN      L S +     + GP A   +   M+GNY 
Sbjct: 369 PTMGPPTNQDLAVNAPKR-LFIFKN--RAFLLYSPR----HIFGPVALFKSLPFMLGNYE 421

Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
           G+PC+Y+ P+ G +G+ ++ Y  GC +V C   + + +A + A +ADA +++ G D S+E
Sbjct: 422 GLPCKYLFPLQGLAGFVSLLYLPGCSNVICAVAD-VGSAVDLAASADAVVLVVGADQSIE 480

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
            E  DR D +LPG Q +L+ +VA  AKGPV+LVIM     D+A +    +   +      
Sbjct: 481 REGHDRVDFYLPGKQQELVTRVAMAAKGPVLLVIM-----DLAISGGGCSYNQV------ 529

Query: 572 GEEGGRAIADVVFGK-------FNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
               G  I+DV  G         N  G +P   Y+    + L  T +   P  S     +
Sbjct: 530 ---NGIPISDVCEGSSYRWPSFSNCHGYMPWISYSRAIWETLRFTKVNWVPTWSW---NK 583

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
            +KF           G  +++   +     +     L K  H     +    S+     V
Sbjct: 584 LHKF-----------GSHHSKCTDDGFGTPRRPPPWLRKCNH-----FQGRQSELHMLDV 627

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
           +      D     +VD +N GS DG+  ++VY +PPA   A + KQ++ F++V V AG  
Sbjct: 628 I------DSLLGMQVDVKNTGSMDGTHTLLVYFRPPARHWAPH-KQLVAFEKVHVAAGTQ 680

Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           +R+    + CKSL++VD +    +P GEH++ +G+
Sbjct: 681 QRVGINIHVCKSLSVVDGSGIRRIPMGEHSLHIGD 715


>gi|340370208|ref|XP_003383638.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 732

 Score =  516 bits (1329), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 296/751 (39%), Positives = 434/751 (57%), Gaps = 74/751 (9%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           +  SF +C+ SLP S RVKDL+SRMTL EK+ QLG+ A  + RL +P Y+WWSE LHGV+
Sbjct: 28  KFQSFSYCNYSLPISDRVKDLLSRMTLAEKITQLGNTAGSIDRLDIPAYQWWSEGLHGVA 87

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSP 167
           +  PG HF+ +   ATSFP VI T +SFN++L+ +I   +STEARA  N    G+ Y+  
Sbjct: 88  D-SPGVHFNGMFHNATSFPQVITTASSFNKTLYHEIAAVMSTEARAFAN---QGIVYFKQ 143

Query: 168 NINV--------ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
           +  +         RDPRWGR  ETPGEDP++  +YA+ +V G Q           +S+ L
Sbjct: 144 HQQLLSNYLLFYCRDPRWGRAQETPGEDPYLNSQYAIQFVTGAQG----------DSKYL 193

Query: 220 KVSSCCKHYAAYDVDNW-KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
           KV + CKH+A YD++++  G  R+ F+A++T QD EET+   F+ CV+E + +S+MCSYN
Sbjct: 194 KVVTTCKHFAGYDLEDYVDGETRHSFNAKITPQDFEETYYPAFKACVEEANVASIMCSYN 253

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
            VNG+PSCAD ++ N+  R  W   G+I +DC +I  + + H +  ++ +D VA  LK G
Sbjct: 254 EVNGVPSCADGQINNKLARDTWGFDGFIASDCGAIDDIQNKHHY-TNNTDDTVAAALKGG 312

Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQD 395
            DL+CG YY +   +A   G +   +I+ +L  L+T  M+LG FD  P+   Y ++    
Sbjct: 313 CDLNCGSYYQSHAQSAFLNGTITIGEINLALTRLFTARMKLGMFD-PPELQPYNAISPDV 371

Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
           + S E+  LA  AARE IVLL+N+ + LPLN  K  T+AVVGPHA AT  M GNY G+  
Sbjct: 372 VNSLEHQALALNAARESIVLLQNNNDVLPLNFEKHSTIAVVGPHAMATDVMQGNYNGVAP 431

Query: 456 RYMSPIAGFS--GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE 513
             +SP+ GF   G  +V   +GC DV C+  +    A + A  ADA I + GLD S E+E
Sbjct: 432 YLISPVEGFENLGIDSVLTASGC-DVNCEVTDGFQDAFDIAVKADAVIAVLGLDQSHESE 490

Query: 514 SLDREDLWLPGYQTQLINQVAEVAK-----GPVILVIMSAGGVDIAFAETNTNIKAILWA 568
             DREDL+LP  Q + +  +    K      P+I+V+MS   VD+    T  +  AILWA
Sbjct: 491 GHDREDLFLPNLQDKFVQDLKNTLKAAGTNAPLIVVVMSGSSVDLTV--TKKHADAILWA 548

Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
           GYPG+ GG+AIA++++GK NP GRLP+T+Y G Y+ ++    M +R      YPGRTYKF
Sbjct: 549 GYPGQSGGQAIAEIIYGKVNPSGRLPVTFYPGSYIDLVAFRHMSMRE-----YPGRTYKF 603

Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
           YN    + FG GLSYT F    L ++K +      +   R+++Y         P V+ N 
Sbjct: 604 YNDTPDFSFGDGLSYTTF---YLEWSKPV-----NMSGVRSVSY---------PTVVYN- 645

Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
                     V   N G   G+  V+ Y       A    K++ GF++VF+   ++  + 
Sbjct: 646 ----------VTVTNTGKMPGAISVLAYISYNNSGAPK--KKLFGFEKVFLNPLQSVSVT 693

Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           F  ++ K+ + VD +    +  G++ + +G+
Sbjct: 694 FPADS-KAFSTVDKSGKRSVNPGDYHVTIGD 723


>gi|125576920|gb|EAZ18142.1| hypothetical protein OsJ_33692 [Oryza sativa Japonica Group]
          Length = 618

 Score =  516 bits (1329), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 261/620 (42%), Positives = 373/620 (60%), Gaps = 21/620 (3%)

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           WSPN+N+ RDPRWGR  ETPGEDP    +Y   +V+GLQ        + L +  L+ S+C
Sbjct: 2   WSPNVNIFRDPRWGRGQETPGEDPATASKYGAAFVKGLQ-------GSSLTN--LQTSAC 52

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           CKH  AYD++ WKGV RY+F+A+VT QD+ +T+  PF  CV +G AS +MC+Y  +NG+P
Sbjct: 53  CKHITAYDIEEWKGVSRYNFNAKVTPQDLADTYNPPFRSCVVDGKASCIMCAYTLINGVP 112

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
           +CA   LL +TVRGEW L GY  +DCD++ ++  +  F   + E+AVA  LKAGLD++CG
Sbjct: 113 ACASSDLLTKTVRGEWKLDGYTASDCDAVAILHKSEHF-TRTAEEAVAVALKAGLDINCG 171

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDE 400
            Y      +A+QQGK+ E D+DK+LK L+ + MRLG FDG P+    Y  L   D+C+  
Sbjct: 172 VYMQQNAASALQQGKMTEKDVDKALKNLFAIRMRLGHFDGDPRGNKLYGRLSAADVCTPV 231

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
           +  LA EAAR G+VLLKND   LPL +  V + AV+G +AN  +A++GNY G+PC   +P
Sbjct: 232 HKALALEAARRGVVLLKNDARLLPLRAPTVASAAVIGHNANDILALLGNYYGLPCETTTP 291

Query: 461 IAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRED 519
             G   Y  +  +  GC   AC    +   A+  AK++D   ++ GL    E E LDR  
Sbjct: 292 FGGIQKYVKSAKFLPGCSSAACDV-AATDQATALAKSSDYVFLVMGLSQKQEQEGLDRTS 350

Query: 520 LWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAI 579
           L LPG Q  LI  VA  +K PVIL++++ G VDI FA+TN  I AILWAGYPG+ GG+AI
Sbjct: 351 LLLPGKQQALITAVATASKRPVILILLTGGPVDITFAQTNPKIGAILWAGYPGQAGGQAI 410

Query: 580 ADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGY 639
           ADV+FG+FNP G+LP+TWY  ++ +   +T M +RP  + GYPGR+Y+FY G T+Y FGY
Sbjct: 411 ADVLFGEFNPSGKLPVTWYPEEFTK-FTMTDMRMRPDPATGYPGRSYRFYKGKTVYKFGY 469

Query: 640 GLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY-FEF 697
           GLSY++F   ++S    +       L   R        +  R     + D RC+   F  
Sbjct: 470 GLSYSKFACRIVSGAGNSSSYGKAALAGLRAATTPEGDAVYRVDE--IGDDRCERLRFPV 527

Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
            V+ QN G  DG   V+++ +  +      ++Q+IGF+   ++ G  K++K   + C+ L
Sbjct: 528 MVEVQNHGPMDGKHTVLMFVRWSSTDGGRPVRQLIGFRNQHLKVGEKKKLKMEISPCEHL 587

Query: 758 NIVDYAANTLLPAGEHTIFV 777
           +        ++  G H + V
Sbjct: 588 SRARVDGEKVIDRGSHFLMV 607


>gi|297740661|emb|CBI30843.3| unnamed protein product [Vitis vinifera]
          Length = 401

 Score =  509 bits (1312), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 243/434 (55%), Positives = 313/434 (72%), Gaps = 36/434 (8%)

Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLL 416
           QGK +E D+D SL+ LY VL ++GFFDG P Y SL K+D+C+ E+IELAA+AAR+GIVLL
Sbjct: 2   QGKAREEDVDTSLRNLYIVLTQVGFFDGIPSYESLDKKDLCTKEHIELAADAARQGIVLL 61

Query: 417 KNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGC 476
           KN   TLPL+ AK+K +A++GPHANAT+ M+GNYAG+PC+Y SP+ GFS Y  VTY+ GC
Sbjct: 62  KNINETLPLDPAKLKNLALIGPHANATIEMLGNYAGVPCQYSSPLDGFSAYGKVTYEMGC 121

Query: 477 DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEV 536
           ++V C +   I  A EA+K ADATI+L GLD +VE E LDR DL LPGYQT+LI QV   
Sbjct: 122 NNVTCDNKTFIMPAVEASKNADATILLVGLDKTVEGEGLDRNDLLLPGYQTELILQVIVA 181

Query: 537 AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT 596
           +KGP+ILVIMS   VDI+F++T+  +KAILWAGYPGEEGGRAIADVV+GK+NPGGRLP+T
Sbjct: 182 SKGPIILVIMSGSAVDISFSKTDDRVKAILWAGYPGEEGGRAIADVVYGKYNPGGRLPLT 241

Query: 597 WYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT 656
           W+  DY+ MLP+TSM LRPV++  YPGRTYKF+NG  +YPFG+GLSYT+F Y L S    
Sbjct: 242 WHQNDYLSMLPMTSMSLRPVNN--YPGRTYKFFNGSVVYPFGHGLSYTKFNYTLRS---- 295

Query: 657 IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY 716
                                         +++ C D+FE  ++ +N+G+  G++VV+VY
Sbjct: 296 ------------------------------SNMSCKDHFELDIEVKNIGAKHGNEVVLVY 325

Query: 717 SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
           SKPP  I  T+ KQVIGF+RVFV AG ++ +KF FN CKSL IV Y A  LLP+GEH I 
Sbjct: 326 SKPPTGIVGTHAKQVIGFKRVFVPAGGSQNVKFEFNVCKSLGIVGYNAYKLLPSGEHKII 385

Query: 777 VGNGGVSFPIHLNF 790
           +G+   S PI ++F
Sbjct: 386 IGDSPTSLPIDISF 399


>gi|340377241|ref|XP_003387138.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 733

 Score =  508 bits (1308), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 293/736 (39%), Positives = 431/736 (58%), Gaps = 59/736 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           +C+  L +  RVKDL+SR+TL+EK+ QLG+ A  + RLG+P Y+WWSE LHGV+ V PG 
Sbjct: 37  YCNYRLSFKDRVKDLLSRLTLEEKISQLGNSASAIDRLGIPGYQWWSEGLHGVA-VSPGL 95

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVAR 173
           H    +   TSFP +I T +SFN+SL+ +IG+AVSTEAR   + G+ GLTY++PNIN+ R
Sbjct: 96  HLGGNLTCTTSFPQIITTASSFNKSLFYEIGEAVSTEARGFADNGQGGLTYFTPNINIVR 155

Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
           DPRWGR  ET GEDP++  +YAVN VRG Q   G++      S   K+ + CKH+AAYD+
Sbjct: 156 DPRWGRGQETAGEDPYLTSQYAVNLVRGAQ---GND------SEYKKIIATCKHFAAYDL 206

Query: 234 DNWKGVD-RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
           +++   D R  F+A VT+QD+EET+   F  CV  G   S+MCSYN VNG+PSC D    
Sbjct: 207 ESYINGDVRDSFNAEVTKQDLEETYFPAFRSCVTAGGVGSIMCSYNSVNGVPSCVDGVFN 266

Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
           N+  R +W   GY+V+DC +I  +++ H + + +  D VA  LK G DL+CG +Y     
Sbjct: 267 NKIARNKWKFDGYLVSDCGAIDDVMNKHHYTS-TPTDTVAAGLKGGTDLNCGSFYQTHAM 325

Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY--VSLGKQDIC-SDENIELAAEAA 409
           +A   G + E DID+++  L+T  MRLG FD  P+Y   S    D+  + ++ +LA +AA
Sbjct: 326 DAFLNGSITEVDIDRAVGRLFTARMRLGLFD-LPKYQPYSYFNTDVVNTKQHQDLALQAA 384

Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF-SGYA 468
           RE IVLL+N+   LPL+      +AVVGP+  A V M G    I    +SP+ GF S   
Sbjct: 385 RESIVLLQNN-GKLPLSYEDHHKIAVVGPNILANVTMQGISQVIAPYLISPVDGFKSKGL 443

Query: 469 NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQ 528
           +VTY  GC DV C   +    A +  K A A + + GLD  +E E++DRED++LPG Q +
Sbjct: 444 HVTYSLGC-DVKCIVTDGFHDAFKLVKDAKAVVAVMGLDQGIERETVDREDIFLPGLQDK 502

Query: 529 LINQVAEVAKG-----PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
            +  + +         P+I+VIMS   VD+  +E+ +   AILW GYPG+ GG+AIA+V+
Sbjct: 503 FLLGLRDTLTNLQSPVPLIVVIMSGSSVDL--SESKSLADAILWVGYPGQSGGQAIAEVI 560

Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
           +G+ NP GRLP+T+Y G+Y+ ++    M +R       PGRTY+FY    ++PFG+GLSY
Sbjct: 561 YGEVNPSGRLPLTFYPGEYIDLVAYRHMSMREP-----PGRTYRFYTENPVFPFGHGLSY 615

Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
           T F+   LS+T       NK+ +                 ++++D   D   +F +   N
Sbjct: 616 TTFE---LSWT-------NKMNNVTE--------------IVISD-SVDINIDFDITVVN 650

Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
            G   G+  V+ Y    + I    ++++  F +VF+    +K+I  +F    +   VD  
Sbjct: 651 TGYLSGAVSVLGYVS--SNIPDAPLRELFDFDKVFIDKYESKKIS-LFATNDAFTTVDEK 707

Query: 764 ANTLLPAGEHTIFVGN 779
               +  GE+ I + N
Sbjct: 708 GRRNILPGEYDIAIEN 723


>gi|407922988|gb|EKG16078.1| Glycoside hydrolase family 3 [Macrophomina phaseolina MS6]
          Length = 800

 Score =  498 bits (1281), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 289/735 (39%), Positives = 414/735 (56%), Gaps = 40/735 (5%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L CDSS     R   LV  +TL+EK+   G+ + GVPRLG+P+Y+WW+EALHGV+   PG
Sbjct: 39  LVCDSSATPLARATALVKELTLEEKLNNTGNTSPGVPRLGIPEYQWWNEALHGVAFTYPG 98

Query: 113 THFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
               +      ATSFP  IL  A+F++ L  ++   VSTEARA  N GR+GL YW+PNIN
Sbjct: 99  QPMTESGNFSSATSFPQPILMGAAFDDELIYEVASVVSTEARAYSNGGRSGLDYWTPNIN 158

Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL-KVSSCCKHYA 229
             +DPRWGR  ETPGEDPF +  Y  N +RGL   EG++N       P  K+ + CKH+ 
Sbjct: 159 PYKDPRWGRGQETPGEDPFHLASYVQNLIRGL---EGNQN------DPYKKIVATCKHFT 209

Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
            YD++NW G  RY FDA++  +DM E ++ PF+ C +E    + MCSYN VNG+P+CADP
Sbjct: 210 GYDMENWNGNFRYQFDAQINMRDMVEYYMPPFQACAREAKVGAFMCSYNAVNGVPTCADP 269

Query: 290 KLLNQTVRGEWDLH---GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
            LL   +R  W  +    ++V+DCD+IQ +   H++ A+S+E AVA TL AG DL+CG Y
Sbjct: 270 WLLQTVLREHWGWNQEDQWVVSDCDAIQNVYLPHEW-AESREQAVADTLNAGTDLNCGTY 328

Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDICSDENIEL 404
           Y  +   A +QG + +T +D++L   Y+ L++LG+FD   S  Y  +G QD+ S    EL
Sbjct: 329 YQRYLPGAYEQGLINDTTLDRALTRTYSSLIKLGYFDNADSQPYRQIGWQDVNSQHAQEL 388

Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI-AG 463
           A +AA+EGIVLLKND   LPL+   V ++A++G  ANAT  M GNYAG+     SP+ A 
Sbjct: 389 ALKAAQEGIVLLKND-GLLPLSLDGVSSIALIGSWANATEQMQGNYAGVAPYLHSPLYAA 447

Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
                 V Y  G    +  + +   A   AA+ +D  I++ G+D  +E+E LDR  +   
Sbjct: 448 EQLGVKVNYAEGASQ-SNPTTDQWGAEYTAAENSDVIIVVGGIDNDIESEELDRVAIAWS 506

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
           G Q  +I ++A   K PVI+V M AG +D     +N NI A+LW GYPG++GG A+ D++
Sbjct: 507 GPQLDMITKLATYGK-PVIVVQMGAGQLDSTPLVSNANISALLWGGYPGQDGGTALFDII 565

Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
            G   P GRLPIT Y   Y + + +T M LRP  +    GRTYK+YNG  ++PFG+GL Y
Sbjct: 566 TGAVAPAGRLPITQYPARYTKEVAMTDMSLRPSSTSA--GRTYKWYNGTAVFPFGFGLHY 623

Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR-CPGVLVNDLRCDDYFEFKVDFQ 702
           T F   + S   +     + +  C      +D SK   CP           +    VD  
Sbjct: 624 TNFSAAIPSPPASSFAISDLVASCS----ANDTSKLDLCP-----------FTSLAVDIA 668

Query: 703 NVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDY 762
           N G+     V + +       +      ++ +QR+   A    +   +     SL  VD 
Sbjct: 669 NDGTRASDFVALAFLTGEFGPSPHPKSSLVAYQRLHAIAAGETQTARLNLTLGSLVRVDE 728

Query: 763 AANTLLPAGEHTIFV 777
             + LL  G++++ +
Sbjct: 729 NGDKLLYPGDYSVLI 743


>gi|147857580|emb|CAN78858.1| hypothetical protein VITISV_030325 [Vitis vinifera]
          Length = 699

 Score =  494 bits (1273), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 266/646 (41%), Positives = 378/646 (58%), Gaps = 87/646 (13%)

Query: 138 SLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVN 197
           S + ++ + VSTEARAMYN+G AGLT+WSPN+N+ +DPRWGR  ETPGEDP +  +YA  
Sbjct: 128 SKFMRLRKVVSTEARAMYNVGLAGLTFWSPNVNIFQDPRWGRGQETPGEDPLLSSKYASG 187

Query: 198 YVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETF 257
           YVRGLQ       + D +   LKV++CCKHY AYD+DNWKGVD +HF+A VT QDM++TF
Sbjct: 188 YVRGLQ------QSDDGSPDRLKVAACCKHYTAYDLDNWKGVDCFHFNAVVTNQDMDDTF 241

Query: 258 LRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV 317
             PF+ CV +G+ +SV+                              YIV+DCDS+ V  
Sbjct: 242 QPPFKSCVIDGNVASVI------------------------------YIVSDCDSVDVFY 271

Query: 318 DNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLM 377
           ++  +   + E+A A+ + AGLDL+CG +    T  AV+ G V E+ +DK++   +  LM
Sbjct: 272 NSQHY-TKTPEEAAAKAILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLM 330

Query: 378 RLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVA 434
           RLGFFDG+P    Y  LG +D+C+ E+ E A EA R+GIV                    
Sbjct: 331 RLGFFDGNPSKAIYGKLGPKDVCTSEHQERAREAPRQGIV-------------------- 370

Query: 435 VVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAA 494
                          +AG PC+Y +P+ G +     TY  GC +VAC +   I  A + A
Sbjct: 371 ---------------FAGTPCKYTTPLQGLTALVATTYLPGCSNVACGTAQ-IDEAKKIA 414

Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
             ADAT+++ G+D S+EAE  DR ++ LPG Q  LI +VA+ +KG VILV+MS GG DI+
Sbjct: 415 AAADATVLIVGIDQSIEAEGRDRVNIQLPGQQPLLITEVAKXSKGNVILVVMSGGGFDIS 474

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           FA+ +  I +I W GYPGE GG AIADV+FG +NP G+LP+TWY   YV  +P+T+M +R
Sbjct: 475 FAKNDDKITSIQWVGYPGEAGGAAIADVIFGFYNPSGKLPMTWYPQSYVDKVPMTNMNMR 534

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
           P  + GYPGRTY+FY G T+Y FG GLSYTQF ++L+   K++ + + +   C +     
Sbjct: 535 PDPASGYPGRTYRFYTGETIYTFGDGLSYTQFNHHLVQAPKSVSIPIEEAHSCHS----- 589

Query: 675 DASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIG 733
               ++C  V      C +  F+  +   N G+  GS  V ++S PP+ +  +  K ++G
Sbjct: 590 ----SKCKSVDAVQESCQNLAFDIHLRVNNAGNISGSHTVFLFSSPPS-VHNSPQKHLLG 644

Query: 734 FQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           F++VFV A     ++F  + CK L+IVD      +  G H + VGN
Sbjct: 645 FEKVFVTAKAKALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGN 690


>gi|115436096|ref|NP_001042806.1| Os01g0296700 [Oryza sativa Japonica Group]
 gi|113532337|dbj|BAF04720.1| Os01g0296700, partial [Oryza sativa Japonica Group]
          Length = 522

 Score =  492 bits (1266), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 256/525 (48%), Positives = 345/525 (65%), Gaps = 19/525 (3%)

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
           +NG+P+CAD +LL +TVR +W LHGYIV+DCDS++VMV + K+L  +  +A A  +KAGL
Sbjct: 1   INGVPACADARLLTETVRRDWQLHGYIVSDCDSVRVMVRDAKWLGYTGVEATAAAMKAGL 60

Query: 340 DLDCGQY-------YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG 392
           DLDCG +       +T +  +AV+QGK+KE+ +D +L  LY  LMRLGFFDG P+  SLG
Sbjct: 61  DLDCGMFWEGVHDFFTTYGVDAVRQGKLKESAVDNALTNLYLTLMRLGFFDGIPELESLG 120

Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGP--HANATVAMIGNY 450
             D+C++E+ ELAA+AAR+G+VLLKND   LPL+  KV +VA+ G   H NAT  M+G+Y
Sbjct: 121 AADVCTEEHKELAADAARQGMVLLKNDAALLPLSPEKVNSVALFGQLQHINATDVMLGDY 180

Query: 451 AGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSV 510
            G PCR ++P  G     + T    CD  +C +  +      AAKT DATI++AGL++SV
Sbjct: 181 RGKPCRVVTPYDGVRKVVSSTSVHACDKGSCDTAAA------AAKTVDATIVVAGLNMSV 234

Query: 511 EAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGY 570
           E ES DREDL LP  Q   IN VAE +  P++LVIMSAGGVD++FA+ N  I A++WAGY
Sbjct: 235 ERESNDREDLLLPWSQASWINAVAEASPSPIVLVIMSAGGVDVSFAQDNPKIGAVVWAGY 294

Query: 571 PGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYN 630
           PGEEGG AIADV+FGK+NPGGRLP+TWY  +YV  +P+TSM LRP    GYPGRTYKFY 
Sbjct: 295 PGEEGGTAIADVLFGKYNPGGRLPLTWYKNEYVSKIPMTSMALRPDAEHGYPGRTYKFYG 354

Query: 631 GP-TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD-ASKTRCPGVLVND 688
           G   LYPFG+GLSYT F Y   +    + V +   ++C+ L Y +  +S   CP V V  
Sbjct: 355 GADVLYPFGHGLSYTNFTYASATAAAPVTVKVGAWEYCKQLTYKAGVSSPPACPAVNVAS 414

Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
             C +   F V   N G  DG+ VV +Y+ PPAE+     KQ++ F+RV V AG    + 
Sbjct: 415 HACQEEVSFAVTVANTGGRDGTHVVPMYTAPPAEVDGAPRKQLVAFRRVRVAAGAAVEVA 474

Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVGNGG--VSFPIHLNFN 791
           F  N CK+  IV+  A T++P+G   + VG+    +SFP+ ++  
Sbjct: 475 FALNVCKAFAIVEETAYTVVPSGVSRVLVGDDALSLSFPVQIDLQ 519


>gi|40363751|dbj|BAD06320.1| putative beta-xylosidase [Triticum aestivum]
          Length = 573

 Score =  489 bits (1259), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 249/571 (43%), Positives = 360/571 (63%), Gaps = 13/571 (2%)

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
           NS  L+ S+CCKH+ AYD++NWKGV R+ FDA+VTEQD+ +T+  PF+ CV++G AS +M
Sbjct: 1   NSSDLEASACCKHFTAYDLENWKGVTRFAFDAKVTEQDLADTYNPPFKSCVEDGGASGIM 60

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           CSYNRVNG+P+CAD  LL++T RG+W  +GYI +DCD++ ++ D   + A + EDAVA  
Sbjct: 61  CSYNRVNGVPTCADHNLLSKTARGDWSFNGYITSDCDAVAIIHDVQGY-AKAPEDAVADV 119

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSL 391
           LKAG+D++CG Y      +A QQGK+   DID++L+ L+ + MRLG F+G+P+   Y ++
Sbjct: 120 LKAGMDVNCGGYIQTHGVSAYQQGKITGEDIDRALRNLFAIRMRLGLFNGNPKYNRYGNI 179

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
           G   +C  E+ +LA +AA++GIVLLKND   LPL+ +KV +VAV+GP+ N    ++GNY 
Sbjct: 180 GADQVCKKEHQDLALQAAQDGIVLLKNDAGALPLSKSKVSSVAVIGPNGNNASLLLGNYF 239

Query: 452 GIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSV 510
           G PC  ++P     GY  + T+  GC+   C  +N I  A  AA +AD  ++  GLD + 
Sbjct: 240 GPPCISVTPFQALQGYVKDATFVQGCNAAVCNVSN-IGEAVHAASSADYVVLFMGLDQNQ 298

Query: 511 EAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGY 570
           E E +DR +L LPG Q  L+N+VA+ AK PVILV++  G VD+ FA+ N  I AI+WAGY
Sbjct: 299 EREEVDRLELGLPGMQESLVNKVADAAKKPVILVLLCGGPVDVTFAKNNPKIGAIVWAGY 358

Query: 571 PGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYN 630
           PG+ GG AIA V+FG+ NPGGRLP+TWY  ++   +P+T M +R   S GYPGRTY+FY 
Sbjct: 359 PGQAGGIAIAQVLFGEHNPGGRLPVTWYPKEFT-AVPMTDMRMRADPSTGYPGRTYRFYK 417

Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
           G T+Y FGYGLSY+++ +   S   T   +++ ++    L  T+ A+ T    V      
Sbjct: 418 GKTVYNFGYGLSYSKYSHRFAS-EGTKPPSMSGIE---GLKATASAAGTVSYDVEEMGAE 473

Query: 691 CDDYFEFK--VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
             D   F   V  QN G  DG   V+++ + P         Q+IGFQ V +RA     ++
Sbjct: 474 ACDRLRFPAVVRVQNHGPMDGRHPVLLFLRWPNATDGRPASQLIGFQSVHLRADEAAHVE 533

Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           F  + CK  +        ++  G H + VG+
Sbjct: 534 FEVSPCKHFSRAAEDGRKVIDQGSHFVKVGD 564


>gi|125576923|gb|EAZ18145.1| hypothetical protein OsJ_33695 [Oryza sativa Japonica Group]
          Length = 591

 Score =  487 bits (1254), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 256/597 (42%), Positives = 364/597 (60%), Gaps = 26/597 (4%)

Query: 190 VVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVT 249
           +  +YAV +V+G+Q   G+ +A       L+ S+CCKH  AYD+++W GV RY+F+A+VT
Sbjct: 1   MASKYAVAFVKGMQ---GNSSAI------LQTSACCKHVTAYDLEDWNGVQRYNFNAKVT 51

Query: 250 EQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVAD 309
            QD+E+T+  PF  CV +  A+ +MC+Y  +NG+P+CA+  LL +TVRG+W L GYI +D
Sbjct: 52  AQDLEDTYNPPFRSCVVDAKATCIMCAYTGINGVPACANADLLTKTVRGDWGLDGYIASD 111

Query: 310 CDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSL 369
           CD++ +M D  ++   + EDAVA  LKAGLD++CG Y       A+QQGK+ E DIDK+L
Sbjct: 112 CDAVAIMRDAQRY-TQTPEDAVAVALKAGLDMNCGTYMQQHATAAIQQGKLTEEDIDKAL 170

Query: 370 KYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
           K L+ + MRLG FDG P+    Y  LG  DIC+ E+  LA EAA +GIVLLKND   LPL
Sbjct: 171 KNLFAIRMRLGHFDGDPRSNSVYGGLGAADICTPEHRSLALEAAMDGIVLLKNDAGILPL 230

Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSN 484
           +   V + AV+GP+AN  +A+IGNY G PC   +P+ G  GY  NV +  GC+  AC   
Sbjct: 231 DRTAVASAAVIGPNANDGLALIGNYFGPPCESTTPLNGILGYIKNVRFLAGCNSAACDVA 290

Query: 485 NSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
            +  AA+ A+ ++D   +  GL    E+E  DR  L LPG Q  LI  VA+ AK PVILV
Sbjct: 291 ATDQAAAVAS-SSDYVFLFMGLSQKQESEGRDRTSLLLPGEQQSLITAVADAAKRPVILV 349

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
           +++ G VD+ FA+TN  I AILWAGYPG+ GG AIA V+FG  NPGGRLP+TWY  ++ +
Sbjct: 350 LLTGGPVDVTFAQTNPKIGAILWAGYPGQAGGLAIARVLFGDHNPGGRLPVTWYPEEFTK 409

Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
            +P+T M +R   + GYPGR+Y+FY G T+Y FGYGLSY+ +   L+S  K  +   N L
Sbjct: 410 -VPMTDMRMRADPATGYPGRSYRFYQGKTVYKFGYGLSYSSYSRQLVSGGKPAESYTNLL 468

Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK----VDFQNVGSTDGSDVVIVYSKPP 720
              R    TS+  ++      + ++  D   + K    V+ QN G  DG   V++Y + P
Sbjct: 469 ASLRTTT-TSEGDES----YHIEEIGTDGCEQLKFPAVVEVQNHGPMDGKHSVLMYLRWP 523

Query: 721 AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
                    Q+IGF+   ++ G    I+F  + C+  + V      ++  G H + V
Sbjct: 524 NAKGGRPTTQLIGFRSQHLKVGEKANIRFDISPCEHFSRVRKDGKKVIDRGSHYLMV 580


>gi|398403795|ref|XP_003853364.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
 gi|339473246|gb|EGP88340.1| putative xylan 1,4-beta-Xylosidase [Zymoseptoria tritici IPO323]
          Length = 785

 Score =  487 bits (1253), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 286/731 (39%), Positives = 399/731 (54%), Gaps = 40/731 (5%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD +     R   L++  T++EK+   G  A GVPRLGLP Y WW EALHGV+   PG +
Sbjct: 39  CDFTADPLTRATALIAAFTIEEKINNTGSTAPGVPRLGLPAYTWWQEALHGVAQ-SPGVN 97

Query: 115 FDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           F D      ATSFP  IL  A+F++ L K +   +STEARA  N  R+GL YW+PNIN  
Sbjct: 98  FSDSGDFRYATSFPQPILMGAAFDDDLIKDVATVISTEARAFNNDARSGLDYWTPNINPF 157

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           +D RWGR  ETPGEDP+ +  Y  + + GLQ           + +  KV + CKH+ AYD
Sbjct: 158 KDSRWGRGQETPGEDPYHLSSYVKSLIAGLQG----------DGKYKKVVATCKHFVAYD 207

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
           ++ W G  RY FD  V  Q++ E ++ PF+ C ++ +  + MCSYN +NGIP+CADP LL
Sbjct: 208 LETWNGNFRYQFDPHVGSQELVEYYMPPFQACARDANVGAFMCSYNSLNGIPTCADPYLL 267

Query: 293 NQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
              +R  W+      ++ +DCDSIQ +   H++ + ++E+AVA +LKAG D++CG YY  
Sbjct: 268 QTILREHWNWTSEEQWVTSDCDSIQNVYLPHEYTS-TREEAVAVSLKAGTDVNCGTYYQE 326

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-QYVSLGKQDICSDENIELAAEA 408
           F   A+  G V E DID +L   Y+ L+RLG+FDG+  +Y SL  +D+ +    +LA +A
Sbjct: 327 FLPGALSLGLVTEKDIDMALIRQYSSLVRLGYFDGTAVEYRSLSWKDVSTPYAQQLALKA 386

Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI-AGFSGY 467
           A EGI LLKND   LPL   K   +AV+G  ANAT  M+GNY GIP    SP+ A     
Sbjct: 387 AVEGITLLKND-GILPLAITKDTKIAVIGDWANATEQMLGNYDGIPPYLHSPLWAAQQTG 445

Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
           ANVTY          + N+      A   AD  +   G+D  VEAE +DR  +   G Q 
Sbjct: 446 ANVTYSGNPGGQGDPTTNNWLHIWTAVDEADVILFAGGIDNGVEAEGMDRVSIAWTGAQL 505

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
            +I Q+A   K PVI+  M   GVD      N NI A+LW GYPG++GG A+ D++ GK 
Sbjct: 506 DVIGQLASRGK-PVIVAQMGTNGVDSTPLLNNQNISALLWGGYPGQDGGVALLDIIQGKS 564

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
            P GRLP T Y   Y+  +P+T M LRP  + G+PGRTY +YN   ++ FGYGL YT F 
Sbjct: 565 APAGRLPTTQYPASYISKVPMTDMHLRPNSTTGFPGRTYMWYNEKPVFEFGYGLHYTNFS 624

Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
              +S T T   ++  L      +Y       RCP           + + K+   N G+ 
Sbjct: 625 AT-ISPTDTTSFSIADLTKDCTEHYMD-----RCP-----------FADMKIAVTNTGNV 667

Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDYAANT 766
               V + +       A    K+++ +QR+  + AG ++          SL  VD   NT
Sbjct: 668 TSDYVTLGFLAGEHGPAPCPNKRLVNYQRLHNITAGASQTTSLNL-TLASLARVDDMGNT 726

Query: 767 LLPAGEHTIFV 777
           +L  G + + +
Sbjct: 727 VLYPGSYALLI 737


>gi|452989371|gb|EME89126.1| glycoside hydrolase family 3 protein [Pseudocercospora fijiensis
           CIRAD86]
          Length = 790

 Score =  482 bits (1240), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 284/732 (38%), Positives = 399/732 (54%), Gaps = 38/732 (5%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD++     R K L++  TL EK+   G  + GVPRLGL  YEWW EALHGV++  PG +
Sbjct: 39  CDTAADPLTRAKALIAEFTLAEKINNTGSTSPGVPRLGLLPYEWWQEALHGVAS-SPGVN 97

Query: 115 FDDVIPG----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
           F   + G    ATSFP  IL  A+F++ L   +   +STEARA  N  RAGL +W+PNIN
Sbjct: 98  FS--VSGEFRYATSFPQPILMGAAFDDQLIHDVASVISTEARAFSNDDRAGLDFWTPNIN 155

Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
             +DPRWGR  ETPGEDP+ +  Y  + +RGLQ           N    KV + CKH+ A
Sbjct: 156 PFKDPRWGRGQETPGEDPYHLSSYVHSLIRGLQGD---------NPSYKKVVATCKHFVA 206

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
           YDV+NW G  RY  DA +  QD+ E ++ PF  C ++ +  + MCSYN +NG+P+CADP 
Sbjct: 207 YDVENWNGNFRYQLDAHINSQDLVEYYMPPFRSCARDSNVGAFMCSYNSLNGVPTCADPY 266

Query: 291 LLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
           LL   +R  W+      ++ +DCDS+Q +   H + A S+E+A A +LKAG D++CG YY
Sbjct: 267 LLQTVLREHWNWTAEEQWVTSDCDSVQNVFLYHNY-ASSREEAAAISLKAGTDINCGTYY 325

Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-QYVSLGKQDICSDENIELAA 406
                 A +QG + ETD+D SL   Y  L+RLG+FDG    Y +L   D+ +    +LA 
Sbjct: 326 QEHLPRAYEQGLINETDVDTSLIRQYGSLIRLGYFDGDRVPYRNLTWNDVSTPYAQDLAL 385

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI-AGFS 465
           +AA  GI LLKND   LPL       +A++G  ANAT  M+GNY GIP  + SP+ A   
Sbjct: 386 KAATSGITLLKND-GILPLQITNGTKIALIGDWANATDQMLGNYHGIPPYFHSPLWAAQQ 444

Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
             A VTY  G    +  +  +      AA  +D  I + G+D  VEAE  DR  +   G 
Sbjct: 445 TGAEVTYVQGPGGQSDPTTYTWRPIWSAANKSDVIIYIGGMDERVEAEEKDRVSIAWSGP 504

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q  +I Q+A+    P I+V M  G +D +    N NI+A+LW GYPG++GG+AI D++ G
Sbjct: 505 QLDVIGQLADYYDKPTIVVQMGGGSLDSSPLVKNPNIRALLWGGYPGQDGGKAIFDILQG 564

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
              P GRLPIT Y  DY+  +P+T   LRP  + G PGRTY + N   ++ FGYGL YT 
Sbjct: 565 ISAPAGRLPITQYRADYISKVPMTDTSLRPNATSGSPGRTYIWLNEEPVFEFGYGLHYT- 623

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
                 +FT TI           +  Y+ D+  + C    ++  RC  +  F +D  N G
Sbjct: 624 ------NFTATI-----PDAESSDTTYSIDSLASDCTESYLD--RC-PFKTFSIDVTNTG 669

Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
           S     V + +            K+++ +QR+      + +   +     SL+ VD   N
Sbjct: 670 SVTSDYVTLGFLTGAHGPEPCPNKRLVSYQRLHNITAGSTQTAALNLTLGSLSRVDDKGN 729

Query: 766 TLLPAGEHTIFV 777
           T+L  G + + V
Sbjct: 730 TVLFPGSYALLV 741


>gi|291167620|dbj|BAI82526.1| 1,4-beta-D-xylosidase [Aureobasidium pullulans var. melanogenum]
          Length = 805

 Score =  482 bits (1240), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 277/736 (37%), Positives = 412/736 (55%), Gaps = 37/736 (5%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +S+   CD S     R K LV+  T+ EK+   G+ + GVPRLGLP Y+WW EALHGV++
Sbjct: 38  LSNNTVCDKSADPVARAKALVAAFTVAEKLNLTGNNSPGVPRLGLPVYQWWQEALHGVAS 97

Query: 109 VGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
             PG  F+       ATSFP  IL  A+F+++L + + + VSTEARA  N GRAGL +W+
Sbjct: 98  -SPGVTFNATGQFDSATSFPQPILMGAAFDDALIQSVAEVVSTEARAFNNYGRAGLDFWT 156

Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCK 226
           PNIN  RDPRWGR  ETPGEDP+ +  Y  + + GLQ  E  E          K+++ CK
Sbjct: 157 PNINPYRDPRWGRGQETPGEDPYHLSSYVHSLIMGLQGGEDPEIR--------KITATCK 208

Query: 227 HYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
           H+A YD+++W G  RY  D ++ ++D+ E +L  F  C ++ +  + MC+Y+ +NG+P+C
Sbjct: 209 HFAGYDIESWNGNLRYQNDVQIPQRDLVEYYLPSFRSCARDSNVGAFMCTYSALNGVPTC 268

Query: 287 ADPKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           ADP LLN  +R  W   +   ++ +DCDSIQ +   H F +D+++ A A  L AG DLDC
Sbjct: 269 ADPWLLNDVLREHWGWTNEEQWVTSDCDSIQNIFLPHNF-SDTRQGAAAAALNAGTDLDC 327

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENI 402
           G YY +    A  QG + +T +D++L  LYT L+R G+FDG +  Y +L   D+ +    
Sbjct: 328 GTYYQHHLPLAYSQGLINQTTVDQALVRLYTSLVRTGYFDGPNAMYRNLTWSDVGTTHAQ 387

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI- 461
           +LA +AA EG+VLLKND   LPL+ +    +A++G  ANAT  M GNY G+P    SP+ 
Sbjct: 388 QLALQAAEEGMVLLKND-GLLPLSISNGTKIALIGSWANATTQMQGNYYGVPTYLHSPLY 446

Query: 462 AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLW 521
           A     A V Y  G       + +       AA+ AD  I + G+D+SVEAE +DRED+ 
Sbjct: 447 AAQQTGAQVFYAQGPGGQGDPTTDHWLPVWTAAEKADIIIYIGGVDISVEAEGMDREDIN 506

Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
             G Q  +I ++A   K P++L  M    +D      N NI A++W GYPG++GG A+ +
Sbjct: 507 WTGAQLDIIGELAMYGK-PMVLAQM-GDQLDNTPIVNNANISALIWGGYPGQDGGVALFN 564

Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
           ++ GK  P GRLP+T Y   Y+  +P+T M LRP  + G PGRTYK+YNG  ++ FGYG+
Sbjct: 565 IITGKTAPAGRLPVTQYPAHYIADIPMTDMTLRPNATTGSPGRTYKWYNGTAVFEFGYGM 624

Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
            YT+F  ++   +K+     + L  C      ++  K RC            +    V+ 
Sbjct: 625 HYTKFSADISPMSKSSYDISSLLSGC------NETYKDRCA-----------FESISVNV 667

Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
            N G+       + +       +    K ++ +QR+   AG + +   +     SL+ VD
Sbjct: 668 HNTGNVTSDYAALGFIAGQFGPSPYPKKSLVNYQRLHNIAGGSSQTATLNLTLGSLSRVD 727

Query: 762 YAANTLLPAGEHTIFV 777
              NT L  G++ + +
Sbjct: 728 DHGNTYLYPGDYALMI 743


>gi|115436902|ref|XP_001217674.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|121734342|sp|Q0CB82.1|BXLB_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|114188489|gb|EAU30189.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 765

 Score =  480 bits (1236), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 278/655 (42%), Positives = 383/655 (58%), Gaps = 46/655 (7%)

Query: 2   AKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPY 61
           ++  +S+L   +S+  L F+          SP   C+ G  SK  +       CD++L  
Sbjct: 6   SRRAASILACIVSLTQLGFA---------QSPFPDCENGPLSKNAV-------CDTTLDP 49

Query: 62  SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV--I 119
             R + L++ MTL+EK+      + GVPRLGLP Y WWSEALHGV+   PG HF D    
Sbjct: 50  VTRAQALLAAMTLEEKINNTQYNSPGVPRLGLPAYNWWSEALHGVAG-SPGVHFADSGNF 108

Query: 120 PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGR 179
             ATSFP+ I   A+F++ L K+I   + TE RA  N G AGL YW+PNIN  RDPRWGR
Sbjct: 109 SYATSFPSPITLGAAFDDDLVKQIATVIGTEGRAFGNAGHAGLDYWTPNINPYRDPRWGR 168

Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
             ETPGEDPF   RY  + + GLQD  G E          K+ + CKH+A YD+++W+G 
Sbjct: 169 GQETPGEDPFHTSRYVYHLIDGLQDGIGPEKP--------KIVATCKHFAGYDIEDWEGN 220

Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
           +RY FDA +++QDM E +  PF+ C ++    +VMCSYN VNGIP+CADP LL   +R  
Sbjct: 221 ERYAFDAVISDQDMAEYYFPPFKTCTRDAKVDAVMCSYNSVNGIPTCADPWLLQTVLREH 280

Query: 300 WDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQ 356
           W+  G   ++ +DC +I  +  +HK++A     A A  + AG DLDCG  Y  F G+A+ 
Sbjct: 281 WEWEGVGHWVTSDCGAIDNIYKDHKYVA-DGAHAAAVAVNAGTDLDCGSVYPQFLGSAIS 339

Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIV 414
           QG +    +D++L  LY+ L++LG+FD +    Y S+G  D+ + +  +LA  AA EG V
Sbjct: 340 QGLLGNRTLDRALTRLYSSLVKLGYFDPAADQPYRSIGWSDVATPDAEQLAHTAAVEGTV 399

Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY---MSPIAGFSGYANVT 471
           LLKND  TLPL   K  TVA+VGP+ANAT  + GNY G   +Y   M   A   GY  V 
Sbjct: 400 LLKND-GTLPLK--KNGTVAIVGPYANATTQLQGNYEGT-AKYIHTMLSAAAQQGY-KVK 454

Query: 472 YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLIN 531
           Y  G   +   S +    A  AAK +D  I   G+D  VEAE+LDR  +  PG Q  LI 
Sbjct: 455 YAPGT-GINSNSTSGFEQALNAAKGSDLVIYFGGIDHEVEAEALDRTSIAWPGNQLDLIQ 513

Query: 532 QVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGG 591
           Q++++ K P+++V    G VD +   +N  +  +LWAGYP + GG A+ D++ GK  P G
Sbjct: 514 QLSDLKK-PLVVVQFGGGQVDDSSLLSNAGVNGLLWAGYPSQAGGAAVFDILTGKTAPAG 572

Query: 592 RLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
           RLP+T Y  +YV  +P+T M LRP  S   PGRTY++Y+   + PFGYG+ YT F
Sbjct: 573 RLPVTQYPEEYVDQVPMTDMNLRPGPS--NPGRTYRWYDKAVI-PFGYGMHYTTF 624


>gi|344303941|gb|EGW34190.1| hypothetical protein SPAPADRAFT_65353 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 788

 Score =  479 bits (1233), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 286/733 (39%), Positives = 408/733 (55%), Gaps = 37/733 (5%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV--SNVGPG 112
           C+  LP   R K +V   T+DE +  +G+ + GV RLGLP Y+WWSEALHG+  SN    
Sbjct: 61  CNPHLPTEQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEALHGIARSNFTAS 120

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
             +      ATSFP  IL   +FN  L+K++G  + TEARA  N+GRAGL ++SPNIN  
Sbjct: 121 GEYSH----ATSFPQPILMGGAFNNDLYKQVGNVIGTEARAFNNVGRAGLDFYSPNINPF 176

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           RD RWGR  E   E P +VG YA+NYV+GLQ   G +  ++ N   L+V++ CKH+  YD
Sbjct: 177 RDARWGRGQEVASESPVLVGNYALNYVQGLQG--GLD--SNQNDDTLQVAATCKHFVGYD 232

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
           +++W    R  ++A +++QD+ + +L  F+ CV++  A+  MCSYN VNG+P+CA    L
Sbjct: 233 MESWNQHSRLGYNAIISDQDLADFYLPTFQSCVRDAKAAGAMCSYNAVNGVPACASEFFL 292

Query: 293 NQTVRGEWDLH-GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
           N  +R  +D   G I +DCD+I  + + H +  D    A A  +KAG+D++CG  Y N  
Sbjct: 293 NTVLRDGFDFQNGVIHSDCDAIYNVWNPHLYAQDLG-GAAADAIKAGVDVNCGDTYQNNL 351

Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEA 408
           G A+    + E  I  S+   Y+ L+RLG+FD SPQ   Y      D+ + +  +LA +A
Sbjct: 352 GYALGNKTINENQIRTSVTRQYSNLIRLGYFD-SPQTNKYRKYDWNDVSTPQANQLAYQA 410

Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA 468
           A EGI LLKND  TLP N  KV+ VAV+GP ANAT  M+G+YAG P   +SP+ G     
Sbjct: 411 AVEGIALLKND-GTLPFNKQKVRKVAVIGPWANATTQMLGDYAGTPPYMISPLQGAQSEG 469

Query: 469 -NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
             V Y  G   +     +   AA  AAK ADA +   G+D SVE E+LDRE L  PG Q 
Sbjct: 470 FQVEYALGT-QINTTDTSGYTAALNAAKGADAIVYFGGIDNSVENEALDRESLAWPGNQL 528

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
            L+++++ + K P++++    G +D    + N N+ AI++AGYPG+ GG AI D++ GK+
Sbjct: 529 DLVSKLSGLKK-PLVVLQFGGGQIDDTEIKNNKNVNAIVYAGYPGQSGGTAIWDILSGKY 587

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
            P GRL  T Y   Y   +P+T M LRP    GYPGRT+ +YNG  +Y FGYGL YT F 
Sbjct: 588 APAGRLTTTQYPASYADQVPMTDMTLRPRQ--GYPGRTFMWYNGEPVYEFGYGLHYTTFS 645

Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
            +L +  +    + N  Q        +  S+    G++           F V+ +N G T
Sbjct: 646 ASLANAPRGGHQSFNIEQVVA----AAKRSQYVDTGLITT---------FDVNIKNTGKT 692

Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDYAANT 766
                 ++YSK  A       K ++ F ++  + AG+ +  K       SL   D   N 
Sbjct: 693 TSDYAALLYSKTTAGPGPHPNKILVSFDKLHQIHAGQTQTAKLPV-TIGSLLQTDTNGNK 751

Query: 767 LLPAGEHTIFVGN 779
            L  G +T FV N
Sbjct: 752 WLYPGTYTFFVDN 764


>gi|440799679|gb|ELR20723.1| betaxylosidase [Acanthamoeba castellanii str. Neff]
          Length = 748

 Score =  478 bits (1231), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 292/747 (39%), Positives = 405/747 (54%), Gaps = 98/747 (13%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +    FC++SL    R  DLVSR+TLD+ + Q+G  A  VP LG+P Y WW+E LHGV  
Sbjct: 10  LKDLPFCNTSLTAGQRTDDLVSRLTLDQLIGQMGHQAPAVPSLGIPAYNWWTECLHGVLT 69

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
              GT+        TSFP      A+FN  L  K+ +A+S EARA+ N G  GL +W+PN
Sbjct: 70  KC-GTNC------PTSFPAPCALGAAFNMKLIHKMARAISNEARALNNEGIGGLDFWAPN 122

Query: 169 I-----------------------NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV 205
           I                       ++ RDPRWGR  E PGEDPF+  +Y  +++RGLQ+ 
Sbjct: 123 IKYSTQPTNKTRQESQLRNAMVCISINRDPRWGRNMEVPGEDPFMTAQYVAHFMRGLQEG 182

Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
           E        +SR  +V   CKH+AAY ++ WK  DR+ FDA V++ D  ET+L  F+ C+
Sbjct: 183 E--------DSRYPQVVGTCKHFAAYSLEAWKDYDRFMFDAIVSDYDFVETYLPAFKGCI 234

Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
            EG A S+MCSYN VNG+PSCA+  LL   +R  W   GY+V+DCD++  + +NH F   
Sbjct: 235 VEGRARSIMCSYNSVNGVPSCANDFLLRTILRDSWSFDGYVVSDCDAVDTIYNNHHF-TK 293

Query: 326 SKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
           + E A A  L AG DL+CG +Y    G A  +G+V E ++  ++K L+   M LG +D  
Sbjct: 294 TPEGACAVALHAGTDLNCGDFYQKHLGKAHSEGRVTEDEVRLAVKRLFRQRMELGMWDPP 353

Query: 386 PQ--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
            +  Y       + S E+ +LA +AARE +VLL+N +  LPL  + V+ VAV+GP+ANAT
Sbjct: 354 AEQPYKQYPPSVVGSREHSDLALQAARESMVLLQNRRGVLPLRKS-VRRVAVIGPNANAT 412

Query: 444 VAMIGNYAGIPCR------YMSPIAGFSG---YANVTYKTGCDDVACKSNNSIFAASEAA 494
             M+GNY G  C        +SP          A VTY  GC DV   +   I  A +AA
Sbjct: 413 ETMLGNYYGSRCHDGTYDCIVSPYLAIKAKLPQALVTYNLGC-DVDSTNTTGIPEAVKAA 471

Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           + AD  I++ GL+ SVE+E  DR  + LPG Q  LI  +      P ++V+M  G V I 
Sbjct: 472 QAADVAIVVLGLNTSVESEGKDRVAITLPGMQDHLIKSIV-ATNTPTVVVMMHGGAVAIE 530

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG----------GRLPITWYNGDYVQ 604
           + +    +  I+ A YPGE GG+AIADV+FG +NPG          GRLP+T    +YV 
Sbjct: 531 WIK--DQVDGIVDAFYPGENGGQAIADVLFGDYNPGDNKTDGTTLLGRLPVTVLPANYVD 588

Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPT-LYPFGYGLSYTQFKYNLLSFTKTIQVNLNK 663
           M+PLT+M +R   S   PGRTY++Y GP  L+ FG+GLSYT FK   LS           
Sbjct: 589 MVPLTNMSMRA--SGNNPGRTYRYYTGPAPLWEFGFGLSYTTFKTEWLS----------- 635

Query: 664 LQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY-SKPPAE 722
                          T  P  L +  R D+   F+V   NVG   G +VV+ + ++  A+
Sbjct: 636 ---------------TPQPSALKSYAR-DEAVSFRVRVTNVGPVAGDEVVLAFVTRDNAD 679

Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKF 749
                +KQ+  F+RV +  G +K I F
Sbjct: 680 RGP--LKQLFAFERVHLNPGESKEIFF 704


>gi|413919687|gb|AFW59619.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 451

 Score =  478 bits (1229), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 233/429 (54%), Positives = 304/429 (70%), Gaps = 17/429 (3%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP 89
           + +P F CD    +     ++S+ FC+ S   + R  DLVSR+TL EKV  L D    +P
Sbjct: 35  AQTPAFACDASNAT-----LASYGFCNRSAAAAARAADLVSRLTLAEKVGFLVDKQAALP 89

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P YEWWSEALHGVS VGPGT F  ++PGATSFP  ILT ASFN +L++ IG+ VS 
Sbjct: 90  RLGVPLYEWWSEALHGVSYVGPGTRFSPLVPGATSFPQPILTAASFNATLFRAIGEVVSN 149

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           EARAM+N+G AGLT+WSPNIN+ RDPRWGR  ETPGEDP +  +YAV YV GLQ      
Sbjct: 150 EARAMHNVGLAGLTFWSPNINIFRDPRWGRGQETPGEDPLLTSKYAVGYVTGLQGAVSGA 209

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
            A       LKV++CCKHY AYDVDNWKGV+RY FDA V++QD+++TF  PF+ CV +G+
Sbjct: 210 GA-------LKVAACCKHYTAYDVDNWKGVERYTFDAVVSQQDLDDTFQPPFKSCVVDGN 262

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
            +SVMCSYN+VNG P+CAD  LL+  +RG+W L+GYI +DCDS+ V+ +N  +   + ED
Sbjct: 263 VASVMCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHY-TKTPED 321

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
           A A ++KAGLDL+CG +    T  AVQ GK+ E+D+D+++      LMRLGFFDG P+  
Sbjct: 322 AAAISIKAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPREL 381

Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
            + +LG  D+C+  N ELA EAAR+GIVLLKN    LPL++  +K++AV+GP+ANA+  M
Sbjct: 382 PFGNLGPSDVCTPSNQELAREAARQGIVLLKN-TGKLPLSATSIKSMAVIGPNANASFTM 440

Query: 447 IGNYAGIPC 455
           IGNY G  C
Sbjct: 441 IGNYEGTSC 449


>gi|396473219|ref|XP_003839293.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
 gi|312215862|emb|CBX95814.1| similar to beta-1,4-xylosidase [Leptosphaeria maculans JN3]
          Length = 789

 Score =  476 bits (1225), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 284/737 (38%), Positives = 399/737 (54%), Gaps = 42/737 (5%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
            C++S     R K LV+  TL+EK+      + GVPRLG+P Y+WWSE LHG++  GP T
Sbjct: 34  ICNTSASPLDRAKSLVTLYTLEEKINATSSGSPGVPRLGIPPYQWWSEGLHGIA--GPYT 91

Query: 114 HFDDV---IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
           +F         +TSFP  IL  A+F++ L   + + +STEARA  N  R GL +W+PNIN
Sbjct: 92  NFSTSGIEYSYSTSFPQPILMGAAFDDHLITDVAKVISTEARAFNNANRTGLDFWTPNIN 151

Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
             RDPRWGR  ETPGED F +  Y    + GLQ        TD   R   V + CKH+A 
Sbjct: 152 PFRDPRWGRGQETPGEDAFHLSSYVKALIAGLQG-----ETTDPYKR---VVATCKHFAG 203

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
           YD+++W G  RY FDA++++QD+ E +L+PF+ CV + +  + MCSYN VNG+P+CADP 
Sbjct: 204 YDIEDWNGNLRYQFDAQISQQDLVEYYLQPFQACV-QANVGAFMCSYNAVNGVPTCADPY 262

Query: 291 LLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
           LL   +R  W   +   ++ +DCD++Q +   H++ A ++E AVA  L AG DLDCG Y 
Sbjct: 263 LLQTILREHWGWTNEEQWVTSDCDAVQNIYLPHQWSA-TREQAVADALIAGTDLDCGTYM 321

Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELA 405
                 A  QG V E  +D++L   Y+ L+RLG+FD +    Y   G   + +D +  LA
Sbjct: 322 QEHLPGAFAQGLVNENVLDQALVRQYSSLVRLGWFDDAADQPYRQFGWDSVATDASQALA 381

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
             AA EGIVLLKND   LPL+     ++ V G  ANAT  ++GNYAG+P    SP+    
Sbjct: 382 RRAAVEGIVLLKND-GVLPLSIDSSVSLGVFGDWANATSQLLGNYAGVPTYLHSPLWALQ 440

Query: 466 GYANVTYKTGCDDVACK---SNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
              N+T      +   +   + N   + S A  T+D  I + G+D S+E E  DR  L  
Sbjct: 441 -QENLTINYAGGNPGGQGDPTTNRWSSLSGAIATSDILIYIGGIDNSIEEEGHDRTSLAW 499

Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
            G Q  +I Q+A   K P I+V+M  G +D A    N NI AILWAGYPG++GG AI D+
Sbjct: 500 TGAQLDVIFQLAATGK-PTIVVVMGGGQIDSAPLANNANISAILWAGYPGQDGGPAIVDI 558

Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
           + GK  P GRLP T Y   Y  ++P+T M LRP ++   PGRTYK+YNG   Y FG+GL 
Sbjct: 559 LTGKSPPAGRLPQTQYPASYTSLVPMTDMGLRPSEN--NPGRTYKWYNGTATYEFGHGLH 616

Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ 702
           YT F   + S  +      + +  C+N    +  +  RC            +    +   
Sbjct: 617 YTNFSATVTSPMQQSYRIADLMSTCKN---ATSITLERCA-----------FTSVDISVT 662

Query: 703 NVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDY 762
           N G+     V + Y       A    K ++G+QR+F  A        +    +SL  VD 
Sbjct: 663 NTGAVASDYVTLCYISGSHGPAPHPKKSLVGYQRLFGIAAGASDTARIDLTLESLARVDE 722

Query: 763 AANTLLPAGEHTIFVGN 779
             N +L  GE+++ V N
Sbjct: 723 VGNKVLYPGEYSLMVDN 739


>gi|389748500|gb|EIM89677.1| glycoside hydrolase family 3 protein [Stereum hirsutum FP-91666
           SS1]
          Length = 770

 Score =  475 bits (1222), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 285/747 (38%), Positives = 416/747 (55%), Gaps = 45/747 (6%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           ++S L C++S  +  R K LV+ MTL+E V    + + GVPRLGLP YEWWSEALHGV++
Sbjct: 30  LASNLVCNTSANFLDRAKALVNAMTLEEMVNNTVNTSPGVPRLGLPPYEWWSEALHGVAS 89

Query: 109 VGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
             PG  F+      GATSFP  IL +A+F++ L   +   +STEARA  N   +GL +++
Sbjct: 90  -SPGVTFETSGDFSGATSFPEPILMSAAFDDDLIFSVASTISTEARAFGNTNHSGLDFFT 148

Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCK 226
           PNIN  +DPRWGR  ETPGEDP    RY    + GLQ   G        S   K+ + CK
Sbjct: 149 PNINPFKDPRWGRGQETPGEDPLHTSRYVYQLITGLQGGVGP-------SPYYKIIADCK 201

Query: 227 HYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
           H+AAYD++NW+G +R  F+A V+ QD+ E +   F+ CV++    SVMCSYN VNG+P+C
Sbjct: 202 HFAAYDLENWEGNNRMAFNAIVSTQDLAEFYTPSFQSCVRDAKVGSVMCSYNAVNGVPAC 261

Query: 287 ADPKLLNQTVRGEWDLHG--YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
             P LL   VR  ++L    +I +DCD++  + D H +   +  +A A  L AG D+DCG
Sbjct: 262 GSPYLLQDLVRDYFELGNDTWITSDCDAVGNIFDPHNYTT-TLTNASAVALLAGTDVDCG 320

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENI 402
             Y+   G AV +G V ++D++++L  LY  L+RLG+FD   S  Y +LG  D+ +    
Sbjct: 321 TSYSETLGEAVSEGLVSKSDVERALVRLYGSLVRLGYFDPEDSVPYRALGASDVNTPAAQ 380

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA  AA EGIVLLKND   LPL S+ V  +A++GP ANAT  M GNY GI    +SP+ 
Sbjct: 381 TLAYTAAVEGIVLLKND-GLLPL-SSNVSHIALIGPWANATTQMQGNYEGIAPLLISPLD 438

Query: 463 GFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLW 521
           GF+    NV++  G   ++  S +    A   A  AD  + + G+D +VEAE  DR  + 
Sbjct: 439 GFTSAGFNVSFTNGTT-ISGNSTSGFADALSMASAADVIVYIGGIDDTVEAEGQDRTSIT 497

Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
            PG Q +LI ++    K P +++ M  G VD    + N+++ A+LW GYPG+ GG+A+AD
Sbjct: 498 WPGNQLELIGELGAFGK-PFVVIQMGGGQVDDTELKANSSVNALLWGGYPGQAGGKALAD 556

Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
           ++ G   P GRL  T Y   YV  + +T M +RP +S G PGRTYK+Y G  ++ FG+GL
Sbjct: 557 IITGVQAPAGRLTTTQYPASYVDQVAMTDMSVRPSNSTGSPGRTYKWYTGTPVFEFGFGL 616

Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
            YT F       +     ++  L    N + ++ A         V+    D    F V  
Sbjct: 617 HYTTFDVEWAEGSPAASYSIQDLVASANSSSSAVAH--------VDSAILD---TFTVQV 665

Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNI-- 759
            N G+     V +++S   A  +   +++++ + RV       K I    +A  SLN+  
Sbjct: 666 TNTGNVTSDYVALLFSNTTAGPSPAPLQELVSYARV-------KGITPGVSATASLNVTL 718

Query: 760 -----VDYAANTLLPAGEHTIFVGNGG 781
                VD   N+++  G + ++V   G
Sbjct: 719 GTIARVDEDGNSIIYPGVYNLWVDTTG 745


>gi|403412992|emb|CCL99692.1| predicted protein [Fibroporia radiculosa]
          Length = 760

 Score =  473 bits (1217), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 288/734 (39%), Positives = 405/734 (55%), Gaps = 45/734 (6%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD+S     R   L+   TL+EK+   G+ + GVPRLGLP Y+WW EALHGV+   PG  
Sbjct: 34  CDTSASPVARATALIGLFTLEEKINNTGNTSPGVPRLGLPAYQWWQEALHGVAE-SPGVI 92

Query: 115 FDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           F +      ATSFP  IL  A+F++ L  ++   VSTEARA  N  R+GL +W+PNIN  
Sbjct: 93  FAETGEYSYATSFPQPILMGAAFDDELINQVATIVSTEARAFNNANRSGLDFWTPNINPF 152

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           +DPRWGR  ETPGEDPF +  Y  N + GLQ          L+    ++ + CKHYA YD
Sbjct: 153 KDPRWGRGQETPGEDPFHLQSYVYNLITGLQG--------GLDPEYKRIVATCKHYAGYD 204

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
           ++NW+G  RY FDA ++ QD+ E + R FE C ++ +  + MCSYN VNG+PSCA+  LL
Sbjct: 205 LENWEGNVRYGFDALISIQDLSEFYTRSFETCARDANVGAFMCSYNAVNGVPSCANSYLL 264

Query: 293 NQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
              +RG W+      +I +DCD+IQ + + H + A ++E  VA  L AG DLDCG YY  
Sbjct: 265 QDILRGHWNWTSDDQWITSDCDAIQNIYEPH-YYAPTRELTVADALNAGADLDCGTYYPE 323

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAE 407
             G A  +G   E+ +D++L   Y  L++LG+FD +    Y  +G  ++ + E  ELA  
Sbjct: 324 NLGAAYDEGLFAESTLDRALIRQYASLVKLGYFDPAENQPYRQIGWANVSTPEAEELAYR 383

Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY 467
           AA EGI L+KND  TLPL S  +K++A++GP ANAT  M GNY G P   +SP+      
Sbjct: 384 AAVEGITLIKND-GTLPL-SPSIKSLALIGPWANATTQMQGNYYGQPPYLISPLMAAEAL 441

Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
               Y +    V   + +S  AA  AA+ ADA I + G+D +VEAE++DR  L  PG Q 
Sbjct: 442 NYTVYYSPGPGVDDPTTSSFPAAFAAAQAADAIIYIGGIDTTVEAEAMDRYTLDWPGVQP 501

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
             I+Q+++  K P++++ M  G VD +    NTN+ A++W GYPG+ GG A+ D++ G  
Sbjct: 502 DFIDQLSQFGK-PLVVLQMGGGQVDDSCLLPNTNVNALIWGGYPGQSGGTALMDIIVGNA 560

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
            P GRLP T Y  DYV  + +T M LRP  S   PGRTY +Y G  +  FG+GL YT F 
Sbjct: 561 APAGRLPTTQYPLDYVYQVAMTDMSLRP--SATNPGRTYMWYTGTPIVEFGFGLHYTNFS 618

Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
                          +L      +Y   +    C GV   DL    +  + V+  N+GS 
Sbjct: 619 --------------AELSQPSAPSYDIASLVGACEGVAHLDLCA--FESYTVNVTNIGSK 662

Query: 708 DGSDVV----IVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
             SD V    +     PA I     K +  + R+   A  + +   +     SL+ VD  
Sbjct: 663 VTSDYVALLFVAGEHGPAPIPN---KVLAAYDRLHTIAPLSSQQATLNLTLGSLSRVDEY 719

Query: 764 ANTLLPAGEHTIFV 777
            N +L  GE+T+ +
Sbjct: 720 GNRVLYPGEYTLIL 733


>gi|344302281|gb|EGW32586.1| hypothetical protein SPAPADRAFT_51129 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 788

 Score =  473 bits (1217), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 286/737 (38%), Positives = 409/737 (55%), Gaps = 45/737 (6%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV--SNVGPG 112
           C+  LP + R K +V   T+DE +  +G+ + GV RLGLP Y+WWSE LHG+  SN    
Sbjct: 61  CNPYLPNNQRAKAVVDLFTVDELIANMGNTSPGVERLGLPPYQWWSEGLHGIARSNFTAS 120

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
             +      ATSFP  IL   +FN  L+K++G  + TEARA  N+GRAGL Y+SPNIN  
Sbjct: 121 GEYSH----ATSFPQPILMGGAFNSDLYKQVGNVIGTEARAFNNVGRAGLDYYSPNINPF 176

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           +DPRWGR  E   E P +VG YA+NYV+GLQ   G +  ++ N   L+V++ CKH+A YD
Sbjct: 177 KDPRWGRGQEVASESPVLVGNYALNYVQGLQG--GID--SNPNDDTLQVAATCKHFAGYD 232

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
           +++WK   R  ++A +++QD+ + +   F+ CV++  A+  MCSYN +NGIP CA    L
Sbjct: 233 MESWKQHSRLGYNAIISDQDLADYYFPTFQSCVRDAKAAGAMCSYNAINGIPVCASEFFL 292

Query: 293 NQTVRGEWDLH-GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
              +R  +D   G I +DCDS+  + + H ++ D    A A  +KAG+D++CG  Y N  
Sbjct: 293 GTVIREGFDFQNGVIHSDCDSLYSIWNPHLYVQDLGA-AAADGIKAGVDVNCGDTYQNNL 351

Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEA 408
           G A+    + E  I  S+   Y+ L+RLG+FD SPQ   Y +    D+ + +  +LA +A
Sbjct: 352 GYALGNKTINEDQIRASVTRQYSNLIRLGYFD-SPQTNKYRTYNWSDVSTSQANQLAYQA 410

Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF--SG 466
           A EGI LLKND  TLP N  KVK VAV+GP ANAT  M+G+YAG P   +SP+ G   SG
Sbjct: 411 AVEGITLLKND-GTLPFNKDKVKNVAVIGPWANATTDMLGDYAGTPPYLISPLQGAQDSG 469

Query: 467 YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
           +  V Y  G       + N   AA  AAK ADA +   G+D S+E E+LDRE L  PG Q
Sbjct: 470 F-KVQYAYGTQINTTLTTNYT-AALNAAKGADAIVYFGGIDNSIENEALDRESLAWPGNQ 527

Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
             L+++++ + K P+++V   AG VD    + N N+ +I++AGYPG+ GG AI DV+ G 
Sbjct: 528 LDLVSKLSGLNK-PLVVVQFGAGQVDDTEIKNNNNVNSIVYAGYPGQSGGTAIWDVLNGI 586

Query: 587 FNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
           + P GRL  T Y   Y   +P+T M LRP D  GYPGRT+ +YNG  +Y FGYGL YT F
Sbjct: 587 YAPAGRLSTTQYPASYADQVPMTDMTLRPRD--GYPGRTFMWYNGEPVYEFGYGLHYTTF 644

Query: 647 KYNLLSFTKT---IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
             +L +          N+++    ++  Y   +  T                 F V+ +N
Sbjct: 645 SVSLANAPPKGAPQSFNIDQFIAAKSSQYVDTSLITT----------------FDVNIKN 688

Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDY 762
            G        ++YS   +       K ++ F ++  +  G+ +          SL   D 
Sbjct: 689 TGKVTSDYAALLYSNTTSGPGPHPNKILVSFDKLHQIHPGQIQTASLPV-TIGSLLQTDT 747

Query: 763 AANTLLPAGEHTIFVGN 779
             N  L  G +T FV N
Sbjct: 748 NGNKWLYPGAYTFFVDN 764


>gi|297039776|gb|ADH95739.1| beta-xylosidase [Aspergillus fumigatus]
          Length = 771

 Score =  473 bits (1216), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 303/761 (39%), Positives = 423/761 (55%), Gaps = 56/761 (7%)

Query: 37  CDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQY 96
           C  G  SKL +       CD+SL  + R + LV+ MT +EKV      + GVPRLGLP Y
Sbjct: 32  CSSGPLSKLAV-------CDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAY 84

Query: 97  EWWSEALHGVSNVGPGTHFDDVIPG--ATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            WWSEALHGV+   PG  F D  P   ATSFP  IL  A+F++ L K++   VSTE RA 
Sbjct: 85  NWWSEALHGVAG-SPGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAF 143

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
            N GR+GL +W+PNIN  RD RWGR  ETPGEDP  V RY  + V GLQ+  G  N    
Sbjct: 144 GNAGRSGLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIGPANP--- 200

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
                KV + CKH+AAY +++W GV R+ F+A V+ QD+ E +L PF+ C ++    +VM
Sbjct: 201 -----KVVATCKHFAAYGLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDARVDAVM 255

Query: 275 CSYNRVNGIPSCADPKLLNQTVRG--EWDLHG-YIVADCDSIQVMVDNHKFLADSKEDAV 331
           CSYN +NG+P+CAD  LL   +R   +WD  G +I +DC +I  + + H F   +  +A 
Sbjct: 256 CSYNALNGVPACADSYLLQTILREHWKWDEPGRWITSDCGAIDDIYNGHNFTT-TPAEAA 314

Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YV 389
           A  L AG DLDCG  +  + G A  +G      +D++L  LY+  ++LG+FD +    Y 
Sbjct: 315 ATALNAGTDLDCGTVFPKYLGQAADEGLYSNQTLDRALVRLYSSFVKLGYFDPAEDQPYR 374

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
           S+G  D+ +     LA +AA EGIVLLKND+ TLPL +    T+A++GP+ANAT  M GN
Sbjct: 375 SIGWTDVDTPAVEALAHKAAGEGIVLLKNDK-TLPLKAK--GTLALIGPYANATKQMQGN 431

Query: 450 YAGIPCRYMSPI---AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
           Y G P +Y+  +   A  +GY +V Y  G   +   S     AA  AAK AD  +   G+
Sbjct: 432 YEG-PAKYIRTLLWAATQAGY-DVKYAAGT-AINTNSTAGFDAALSAAKQADVVVYAGGI 488

Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
           D ++EAE  DR  +  PG Q  LI+Q++++ K P+++V    G VD +   +N  + A+L
Sbjct: 489 DNTIEAEGRDRTTIAWPGNQVNLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPRVNALL 547

Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
           WAGYP +EGG AI D++ GK  P GRLP+T Y  DYV  +P+T M LRP  +   PGRTY
Sbjct: 548 WAGYPSQEGGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPMTDMALRPGSNT--PGRTY 605

Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN-YTSDASKTRCPGVL 685
           ++Y+   L PFG+GL YT FK   +S+ +            R L  Y + A  +R P  +
Sbjct: 606 RWYDKAVL-PFGFGLHYTTFK---ISWPR------------RALGPYNTAALVSRSPKNV 649

Query: 686 VNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRN 744
             D    D F  +V   N G T    V +++ K        Y +K ++G+ R        
Sbjct: 650 PIDRAAFDTFHIQV--TNTGKTTSDYVALLFLKTIDAGPKPYPLKTLVGYTRAKQIKPGE 707

Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFP 785
           KR   +  +  SL       + +L  G +T+ V  G   +P
Sbjct: 708 KRSVDIEVSLGSLARTAENGDLVLYPGRYTLEVDVGESQYP 748


>gi|409041356|gb|EKM50841.1| glycoside hydrolase family 3 protein [Phanerochaete carnosa
           HHB-10118-sp]
          Length = 764

 Score =  472 bits (1214), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 296/744 (39%), Positives = 416/744 (55%), Gaps = 52/744 (6%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L C+ S   + R   LV  +TL+E V    + + GVPRLGLP Y WWSEALHGV+ + PG
Sbjct: 36  LVCNPSADPTSRANALVDALTLEELVNNTVNASPGVPRLGLPPYNWWSEALHGVA-LSPG 94

Query: 113 THFDDVIPG-----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSP 167
           T+F   +PG     ATSFP  I+  A+F++ L   I   +STEARA  N GRAGL +++P
Sbjct: 95  TNFS--VPGSPFSSATSFPQPIILGATFDDDLVTSIATVISTEARAFNNAGRAGLDFFTP 152

Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCCK 226
           NIN  +DPRWGR  ETPGEDPF + +Y    V GLQ          L+  P  KV + CK
Sbjct: 153 NINPFKDPRWGRGQETPGEDPFHIAQYVYQLVTGLQG--------GLSPDPYYKVIADCK 204

Query: 227 HYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
           H+A YD++NW+G  R  F+A ++ QD+ E +   F+ CV++    SVMCSYN VNGIPSC
Sbjct: 205 HFAGYDLENWEGNSRMAFNAIISTQDLAEYYTPSFQSCVRDAHVGSVMCSYNAVNGIPSC 264

Query: 287 ADPKLLNQTVRGEWDL-HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           A+  LL   +RG + L  G+I +DCD++  +   H++   +  +A A  LKAG D+DCG 
Sbjct: 265 ANSYLLQDIIRGHFGLGDGWITSDCDAVANIFSPHQYTT-TLVNASAVALKAGTDVDCGT 323

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIE 403
            Y+    +AV Q  V E DI  S+  LY  L+RLG+FD   +  +  LG  D+ +  +  
Sbjct: 324 TYSQTLVDAVDQNLVTEDDIKNSMIRLYRSLVRLGYFDSPAEQPFRQLGWSDVNTPSSQA 383

Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI-- 461
           LA  AA EG+ LLKND  TLPL+SA +K +A+VGP ANAT  M GNY GI    +SP+  
Sbjct: 384 LALTAAEEGVTLLKND-GTLPLSSA-IKRIALVGPWANATTQMQGNYQGIAPFLVSPLQA 441

Query: 462 ---AGFS-GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR 517
              AGF   +AN T     DD      +   AA  A + ADA I   G+D ++E+E  DR
Sbjct: 442 LQDAGFQVTFANGTAINSTDD------SGFAAAVSAVQVADAVIYAGGIDETIESEGNDR 495

Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
           E +  PG Q  L++Q+A V K P +++ M  G VD +  ++N  + A++W GYPG+ GG 
Sbjct: 496 EIITWPGNQLDLVSQLAAVGK-PFVVLQMGGGQVDSSSLKSNKAVNALIWGGYPGQSGGA 554

Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
           AI +++ GK  P GRLPIT Y  DYV  +P+T M LRP  +   PGRTYK++ G  ++ F
Sbjct: 555 AIVNILTGKIAPAGRLPITQYPADYVNEIPMTDMALRPNGT--SPGRTYKWFTGTPIFGF 612

Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
           G+GL YT F    L +  T            +   ++  S+    GV   +L     F F
Sbjct: 613 GFGLHYTTFS---LDWAPT---------PPSSFAISTLVSEANTAGVSFTNLA--PLFTF 658

Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
           +V+ +N G      V +++S   A      +KQ++ + RV   A        +     S+
Sbjct: 659 RVNVKNTGKVGSDYVALLFSNTTAGPQPAPLKQLVSYTRVKGIAPGQTETAELKVTLGSI 718

Query: 758 NIVDYAANTLLPAGEHTIFVGNGG 781
             +D   ++ L  G + I+V   G
Sbjct: 719 ARIDENGDSALYPGRYNIWVDTTG 742


>gi|70986056|ref|XP_748529.1| beta-xylosidase [Aspergillus fumigatus Af293]
 gi|74668295|sp|Q4WFI6.1|BXLB_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|296439536|sp|B0Y0I4.1|BXLB_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|66846158|gb|EAL86491.1| beta-xylosidase, putative [Aspergillus fumigatus Af293]
 gi|159128339|gb|EDP53454.1| beta-xylosidase [Aspergillus fumigatus A1163]
          Length = 771

 Score =  472 bits (1214), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 305/761 (40%), Positives = 425/761 (55%), Gaps = 56/761 (7%)

Query: 37  CDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQY 96
           C  G  SKL +       CD+SL  + R + LV+ MT +EKV      + GVPRLGLP Y
Sbjct: 32  CSSGPLSKLAV-------CDTSLDVTTRAQSLVNAMTFEEKVNNTQYNSPGVPRLGLPAY 84

Query: 97  EWWSEALHGVSNVGPGTHFDDVIPG--ATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            WWSEALHGV+   PG  F D  P   ATSFP  IL  A+F++ L K++   VSTE RA 
Sbjct: 85  NWWSEALHGVAG-SPGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAF 143

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
            N GR+GL +W+PNIN  RD RWGR  ETPGEDP  V RY  + V GLQ+  G  N    
Sbjct: 144 GNAGRSGLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIGPANP--- 200

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
                KV + CKH+AAYD+++W GV R+ F+A V+ QD+ E +L PF+ C ++    +VM
Sbjct: 201 -----KVVATCKHFAAYDLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDARVDAVM 255

Query: 275 CSYNRVNGIPSCADPKLLNQTVRG--EWDLHG-YIVADCDSIQVMVDNHKFLADSKEDAV 331
           CSYN +NG+P+CAD  LL   +R   +WD  G +I +DC +I  + + H F   +  +A 
Sbjct: 256 CSYNALNGVPACADSYLLQTILREHWKWDEPGRWITSDCGAIDDIYNGHNFTT-TPAEAA 314

Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YV 389
           A  L AG DLDCG  +  + G A  +G      +D++L  LY+ L++LG+FD +    Y 
Sbjct: 315 ATALNAGTDLDCGTVFPKYLGQAADEGLYSNQTLDRALVRLYSSLVKLGYFDPAEDQPYR 374

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
           S+G  D+ +     LA +AA EGIVLLKND+ TLPL +    T+A++GP+ANAT  M GN
Sbjct: 375 SIGWTDVDTPAAEALAHKAAGEGIVLLKNDK-TLPLKAK--GTLALIGPYANATKQMQGN 431

Query: 450 YAGIPCRYMSPI---AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
           Y G P +Y+  +   A  +GY +V Y  G   +   S     AA  AAK AD  +   G+
Sbjct: 432 YEG-PAKYIRTLLWAATQAGY-DVKYAAGT-AINTNSTAGFDAALSAAKQADVVVYAGGI 488

Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
           D ++EAE  DR  +  PG Q  LI+Q++++ K P+++V    G VD +   +N  + A+L
Sbjct: 489 DNTIEAEGRDRTTIAWPGNQVNLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPRVNALL 547

Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
           WAGYP +EGG AI D++ GK  P GRLP+T Y  DYV  +P+T M LRP  +   PGRTY
Sbjct: 548 WAGYPSQEGGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPMTDMALRPGSNT--PGRTY 605

Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN-YTSDASKTRCPGVL 685
           ++Y+   L PFG+GL YT FK   +S+ +            R L  Y + A  +R P  +
Sbjct: 606 RWYDKAVL-PFGFGLHYTTFK---ISWPR------------RALGPYNTAALVSRSPKNV 649

Query: 686 VNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRN 744
             D    D F  +V   N G T    V +++ K        Y +K ++G+ R        
Sbjct: 650 PIDRAAFDTFHIQV--TNTGKTTSDYVALLFLKTTDAGPKPYPLKTLVGYTRAKQIKPGE 707

Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFP 785
           KR   +  +  SL       + +L  G +T+ V  G   +P
Sbjct: 708 KRSVDIEVSLGSLARTAENGDLVLYPGRYTLEVDVGESQYP 748


>gi|389748262|gb|EIM89440.1| hypothetical protein STEHIDRAFT_182874, partial [Stereum hirsutum
           FP-91666 SS1]
          Length = 772

 Score =  471 bits (1213), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 289/740 (39%), Positives = 414/740 (55%), Gaps = 46/740 (6%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L C+++  +  R   L+    L + V    + + GV RLGLP Y+WW+EALHGV +  PG
Sbjct: 37  LVCNTTAHFVDRATSLIEEFNLTDLVNNTVNGSPGVDRLGLPPYQWWNEALHGVGS-SPG 95

Query: 113 THF----DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
            ++    D     ATSFP  IL  A+FN+SL   I   +STEARA  N   AGLT+++PN
Sbjct: 96  VNWGSGPDANFTSATSFPAPILLGATFNDSLIASIADVISTEARAFNNFNYAGLTFFTPN 155

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCCKH 227
           IN  RDPRWGR  ETPGEDP+ + RY   YV GLQ          L+  P  KV + CKH
Sbjct: 156 INPFRDPRWGRGQETPGEDPYHLSRYVYQYVVGLQG--------GLSPDPYYKVLANCKH 207

Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
             AYDV+NW+G DR  F+A VT QD+ E +   F+ C+++   +S MCSYN VNG+PSCA
Sbjct: 208 VLAYDVENWEGNDRTGFNAVVTTQDLSEFYTPSFQGCLRDAQGASAMCSYNAVNGVPSCA 267

Query: 288 DPKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
              +L   VR  W L    G+I  DC ++Q +   H +  D+  +A A  + AG DLDCG
Sbjct: 268 SSYILKDLVRDFWGLGEREGWITGDCGAVQNIYQPHGY-TDTLVNATAVAMDAGTDLDCG 326

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENI 402
             Y+     AV +G +    I  +L  LY  L+RLG+FD + Q  Y S    ++ +  + 
Sbjct: 327 DVYSPNLWTAVVEGLITAGQIQTALIRLYGSLIRLGYFDPAEQQPYRSFDWSNVNTPSSQ 386

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
           +LA  AA +GIVLL+ND   LPL S  VK +A++GP ANAT+++ GNYAGI    +SP  
Sbjct: 387 DLAYNAAVQGIVLLEND-GLLPL-STNVKNIALIGPMANATLSLQGNYAGIAPFVISPQQ 444

Query: 463 GF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
            F  +GY NVT+  G   ++   N+    A EAA+ AD  + + G+D S+EAE  DR  +
Sbjct: 445 AFETAGY-NVTFAFGT-GISNSDNSGYSEALEAAQGADVVVFVGGIDNSIEAEGQDRTSI 502

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
             PG Q  LI Q+ E+ K P+++V M  G  D +  + N  + A+LWAGYPG+ GG A+ 
Sbjct: 503 EWPGSQLDLIGQLGELGK-PLVVVRMGGGQCDDSTLKANATVNALLWAGYPGQSGGTALV 561

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
           D++ GK +P GRLP+T Y   YV  + +T M +RP +S G PGRTYK+Y G  +YPFGYG
Sbjct: 562 DIISGKQSPSGRLPVTQYPSSYVSEIDMTDMAIRP-NSSGSPGRTYKWYTGAPIYPFGYG 620

Query: 641 LSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
           + YT F+   L+++ +     N      + N +   +          D    D F   V 
Sbjct: 621 IHYTTFR---LAWSDSSSTTYNIQDIVSSANKSGGFA----------DTEILDTFSLLV- 666

Query: 701 FQNVGSTDGSD-VVIVYSKPPAEIAATYIKQVIGFQRV-FVRAGRNKRIKFVFNACKSLN 758
             N GS   SD V ++++   +  +   +++++G+ RV  +  G     +       S++
Sbjct: 667 -TNTGSNYTSDYVALLFANSTSGPSPAPLQELVGYTRVPHITPGGTATAELNV-TLGSIS 724

Query: 759 IVDYAANTLLPAGEHTIFVG 778
            VD   N +L  G + ++VG
Sbjct: 725 RVDENGNWILYPGTYNLWVG 744


>gi|452846807|gb|EME48739.1| glycoside hydrolase family 3 protein [Dothistroma septosporum
           NZE10]
          Length = 802

 Score =  471 bits (1212), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 274/736 (37%), Positives = 401/736 (54%), Gaps = 36/736 (4%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD++     R   L++  TL EK+   G  + GVPRLGLP Y WW EALHGV++  PG +
Sbjct: 39  CDTTADPLTRATALINAFTLQEKLNNTGSTSPGVPRLGLPAYTWWQEALHGVAS-SPGVN 97

Query: 115 FDDVIPG--ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           F D  P   ATSFP  IL  A+F++ L + +   +STEARA  N  RAGL +W+PNIN  
Sbjct: 98  FSDSGPFRYATSFPQPILMGAAFDDDLIRDVATVISTEARAFNNDKRAGLDFWTPNINPF 157

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           +D RWGR  ETPGEDP+ +  Y    + GLQ           + +  +V + CKH+ AYD
Sbjct: 158 KDSRWGRGQETPGEDPYHLSSYVAALIEGLQGSP--------DDKYKRVVATCKHFVAYD 209

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
           +++W G  RY FDA+V+ QD+ E ++ PF+ C ++ +  + MCSYN +NG+P+CADP LL
Sbjct: 210 MESWNGNFRYQFDAQVSSQDLVEYYMPPFQQCARDSNVGAFMCSYNALNGVPTCADPWLL 269

Query: 293 NQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
              +R +W+      ++ +DCD++Q +   H + A ++E+A A +LKAG D++CG YY +
Sbjct: 270 QTVLREKWNWTSEQQWVTSDCDAVQNVFLPHDY-ASTREEAAALSLKAGTDINCGTYYQD 328

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIELAAEA 408
               A  QG +  TD+D SL   Y+ L+RLG+FDG +  Y +L   D+ +    +LA +A
Sbjct: 329 HLPAAYDQGLINTTDLDISLIRQYSSLVRLGYFDGLAVPYRNLTWNDVSTPHAQQLAYKA 388

Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI-AGFSGY 467
           A EGI LLKND   LPL  +   ++A++G  ANAT  M+GNY GIP  + SP+ A     
Sbjct: 389 AAEGITLLKND-GVLPLTISNGTSIALIGDWANATDQMLGNYDGIPPFFHSPLYAAQQTG 447

Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
           A V + TG       + +       AA  +D  I   G+D SVE+E +DR  L   G Q 
Sbjct: 448 ATVNFATGPGGQGDPTTDHWLPVWAAANKSDVIIYAGGIDNSVESEGMDRVSLTWTGAQL 507

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
            +I Q+A   K PVI++ M  G +D +    N N+ A++W GYPG++GG A+ D++ G  
Sbjct: 508 DMIGQLAMYGK-PVIVLQMGGGQIDSSPLVNNPNVSALIWGGYPGQDGGVALFDIIRGIT 566

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
            P GRLP T Y   Y+  +P+T M LRP  + G PGRTY +YN   ++P+G GL YT F 
Sbjct: 567 APAGRLPTTQYPAKYISQVPMTDMTLRPNSTTGSPGRTYIWYNENAVFPYGLGLHYTNFT 626

Query: 648 YNLL-SFTKTIQVNLNKLQ----HCRNLNYTSDAS-KTRCPGVLVNDLRCDDYFEFKVDF 701
             +  SF  T   + +           L     A+ K  CP           +  F V  
Sbjct: 627 AAIKPSFPSTYDSSSSNSGSASYDISTLTSNCTATYKDLCP-----------FTSFSVSI 675

Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
            N G      V + +       A    K+++ +QR+      + +  ++     SL  VD
Sbjct: 676 TNTGEIMSDYVTLGFLAGIHGPAPHPNKRLVSYQRLHNITAGSSQTAWLNLTLGSLARVD 735

Query: 762 YAANTLLPAGEHTIFV 777
              N +L  G++ + V
Sbjct: 736 EMGNKVLYPGDYALLV 751


>gi|62321271|dbj|BAD94481.1| beta-xylosidase [Arabidopsis thaliana]
          Length = 523

 Score =  471 bits (1212), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 239/528 (45%), Positives = 334/528 (63%), Gaps = 11/528 (2%)

Query: 265 VKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLA 324
           V +G+ +SVMCSYN+VNG P+CADP LL+  +RGEW L+GYIV+DCDS+ V+  N  +  
Sbjct: 3   VVDGNVASVMCSYNQVNGKPTCADPDLLSGVIRGEWKLNGYIVSDCDSVDVLYKNQHYTK 62

Query: 325 DSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
              E A A ++ AGLDL+CG +    T  AV+ G V E  IDK++   +  LMRLGFFDG
Sbjct: 63  TPAE-AAAISILAGLDLNCGSFLGQHTEEAVKSGLVNEAAIDKAISNNFLTLMRLGFFDG 121

Query: 385 SPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
           +P+   Y  LG  D+C+  N ELAA+AAR+GIVLLKN    LPL+   +KT+AV+GP+AN
Sbjct: 122 NPKNQIYGGLGPTDVCTSANQELAADAARQGIVLLKN-TGCLPLSPKSIKTLAVIGPNAN 180

Query: 442 ATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATI 501
            T  MIGNY G PC+Y +P+ G +G  + TY  GC +VAC   + +  A++ A TAD ++
Sbjct: 181 VTKTMIGNYEGTPCKYTTPLQGLAGTVSTTYLPGCSNVACAVAD-VAGATKLAATADVSV 239

Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
           ++ G D S+EAES DR DL LPG Q +L+ QVA+ AKGPV+LVIMS GG DI FA+ +  
Sbjct: 240 LVIGADQSIEAESRDRVDLRLPGQQQELVIQVAKAAKGPVLLVIMSGGGFDITFAKNDPK 299

Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
           I  ILW GYPGE GG AIAD++FG++NP G+LP+TWY   YV+ +P+T M +RP  + GY
Sbjct: 300 IAGILWVGYPGEAGGIAIADIIFGRYNPSGKLPMTWYPQSYVEKVPMTIMNMRPDKASGY 359

Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS-DASKTR 680
           PGRTY+FY G T+Y FG GLSYT+F + L+     + + L +   CR+    S DA    
Sbjct: 360 PGRTYRFYTGETVYAFGDGLSYTKFSHTLVKAPSLVSLGLEENHVCRSSECQSLDAIGPH 419

Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
           C   +         FE  +  +N G  +G   V +++ PPA I  +  K ++GF+++ + 
Sbjct: 420 CENAVSGG---GSAFEVHIKVRNGGDREGIHTVFLFTTPPA-IHGSPRKHLVGFEKIRLG 475

Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
                 ++F    CK L++VD      +  G+H + VG+   S  I +
Sbjct: 476 KREEAVVRFKVEICKDLSVVDEIGKRKIGLGKHLLHVGDLKHSLSIRI 523


>gi|119473971|ref|XP_001258861.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
 gi|292495290|sp|A1DJS5.1|XYND_NEOFI RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|119407014|gb|EAW16964.1| beta-xylosidase [Neosartorya fischeri NRRL 181]
          Length = 771

 Score =  471 bits (1211), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 307/760 (40%), Positives = 423/760 (55%), Gaps = 54/760 (7%)

Query: 37  CDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQY 96
           C  G  SKL +       CD+SL  + R + LV+ MT +EKV      + GVPRLGLP Y
Sbjct: 32  CSSGPLSKLAV-------CDTSLDVTTRARSLVNAMTFEEKVNNTQYNSPGVPRLGLPAY 84

Query: 97  EWWSEALHGVSNVGPGTHFDDVIPG--ATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            WWSEALHGV+   PG  F D  P   ATSFP  IL  A+F++ L K++   VSTE RA 
Sbjct: 85  NWWSEALHGVAG-SPGVEFADSGPFSYATSFPQPILLGATFDDDLIKQVATVVSTEGRAF 143

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
            N GRAGL +W+PNIN  RD RWGR  ETPGEDP  V RY  + V GLQ+  G  N    
Sbjct: 144 GNAGRAGLDFWTPNINPFRDARWGRGQETPGEDPLHVSRYVYHLVDGLQNGIGPANP--- 200

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
                KV + CKH+AAYD+++W GV R+ F+A V+ QD+ E +L PF+ C ++    +VM
Sbjct: 201 -----KVVATCKHFAAYDLEDWNGVVRHSFNAEVSTQDLSEFYLPPFKSCARDAKVDAVM 255

Query: 275 CSYNRVNGIPSCADPKLLNQTVRG--EWDLHG-YIVADCDSIQVMVDNHKFLADSKEDAV 331
           CSYN +NG+P+CAD  LL   +R   +WD  G +I  DC +I  + + H +   +  +A 
Sbjct: 256 CSYNALNGVPACADSYLLQTILREHWKWDEPGHWITGDCGAIDDIYNGHNY-TKTPAEAA 314

Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YV 389
           A  L AG DLDCG  +  + G A  +G      +DK+L  LY+ L++LG+FD +    Y 
Sbjct: 315 ATALNAGTDLDCGTVFPKYLGQAADEGLYTNKTLDKALVRLYSSLVKLGYFDPAEDQPYR 374

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
           S+G +D+ S     LA +AA EGIVLLKND+ TLPL +    T+A++GP+ANAT  M GN
Sbjct: 375 SIGWKDVDSPAAEALAHKAAVEGIVLLKNDK-TLPLKAK--GTLALIGPYANATKQMQGN 431

Query: 450 YAGIP--CRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
           Y G P   R +   A  +GY +V Y  G   +   S     AA  AAK AD  +   G+D
Sbjct: 432 YEGPPKYIRTLLWAATQAGY-DVKYVAGT-AINANSTAGFDAALSAAKQADVVVYAGGID 489

Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
            ++EAE  DR  +  PG Q  LI+Q++++ K P+++V    G VD +   +N ++ A+LW
Sbjct: 490 NTIEAEGHDRTTIVWPGNQLDLIDQLSKIGK-PLVVVQFGGGQVDDSSLLSNPHVNALLW 548

Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
            GYP +EGG AI D++ GK  P GRLP+T Y  DYV  +PLT M LRP  +   PGRTY+
Sbjct: 549 TGYPSQEGGSAIFDILTGKTAPAGRLPVTQYPADYVNQVPLTDMALRPGSNT--PGRTYR 606

Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN-YTSDASKTRCPGVLV 686
           +Y+   L PFG+GL YT FK   +S+ +            R L  Y + A  +R P  + 
Sbjct: 607 WYDKAVL-PFGFGLHYTTFK---ISWPR------------RALGPYDTAALVSRSPKNVP 650

Query: 687 NDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNK 745
            D    D F  +V   N G T    V +++ K        Y +K ++G+ R        K
Sbjct: 651 IDRAAFDTFHIQV--TNTGKTTSDYVALLFLKTIDAGPKPYPLKTLVGYTRAKQIKPGEK 708

Query: 746 RIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFP 785
           R   +  +  SL       + +L  G +T+ V  G   +P
Sbjct: 709 RSVDIKVSLGSLARTAENGDLVLYPGRYTLEVDVGENQYP 748


>gi|393247584|gb|EJD55091.1| beta-xylosidase [Auricularia delicata TFB-10046 SS5]
          Length = 763

 Score =  471 bits (1211), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 273/694 (39%), Positives = 394/694 (56%), Gaps = 42/694 (6%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L C+++  +  R K L+   T +E V    + + GVPRLGLP Y+WWSEALHGV+   PG
Sbjct: 35  LVCNTTANFMDRAKALIDEFTTEELVNNTVNGSPGVPRLGLPPYQWWSEALHGVAGANPG 94

Query: 113 THF---DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
            HF    +    ATSFP  IL  A+F++ L  ++   +STEARA  N G +G+ +++PNI
Sbjct: 95  VHFAPAGEDFDHATSFPQPILMGAAFDDELIHEVATVISTEARAFNNFGFSGIDFFTPNI 154

Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
           N  RDPRWGR  ETPGEDP  + RY    V  LQ   G        S   K+ + CKH+A
Sbjct: 155 NPFRDPRWGRGQETPGEDPLHISRYVFQLVTALQGGLGP-------SPYYKIVADCKHFA 207

Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
            YD+++W+G+DR+HFDA +T QD+ E +   F+ CV++    SVMCSYN VNG+P+CA  
Sbjct: 208 GYDLESWEGIDRFHFDAVITTQDLAEFYTPSFQSCVRDAKVGSVMCSYNSVNGVPACASS 267

Query: 290 KLLNQTVRGEWDL-HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
            LL   VR  + L  G+I +DCD++Q +   H F   ++ +A A +LKAG D+DCG  Y 
Sbjct: 268 YLLQDIVRDFYGLGDGWITSDCDAVQNVFTTHNFTT-TQANASAISLKAGTDVDCGNVYA 326

Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELA 405
              G+A+ QG V+E D+ ++L  LY  L+R G+FD SP+   +  LG  D+ +  +  LA
Sbjct: 327 QSLGDALDQGLVEEDDLKQALVRLYGSLVRTGYFD-SPEEQPFRQLGWADVDTPASRRLA 385

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF- 464
             AA EGIVLLKND   LPL+S  V  V +VGP  NAT  M GNY G     +SP  GF 
Sbjct: 386 LLAAEEGIVLLKND-GLLPLSSRDVPNVIMVGPWGNATTMMQGNYFGNAPYLVSPRQGFV 444

Query: 465 -SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
            +G+ NVT+  G         +    A  AA   D  + + G D  VE ES DR ++  P
Sbjct: 445 DAGF-NVTFFNGTVGTNGTDTSGFDEAVAAAGDTDLIVFVGGPDNVVERESRDRINITWP 503

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
           G Q  LI ++A V K P+I++ M AG VD  + + +  I A++W GYPG+ GG A+A++V
Sbjct: 504 GVQLDLIKELAGVGK-PMIVLQMGAGQVDDTWLKESDAINALIWGGYPGQSGGTALANIV 562

Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
            GK  P  RLPIT Y  DY+  LP+T M +RP +S   PGRTYK++ G  ++ FG+GL Y
Sbjct: 563 TGKTAPAARLPITQYPEDYIS-LPMTDMNVRPSNS--SPGRTYKWFTGEPIFEFGFGLHY 619

Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
           ++F +   ++ +    +        N +   D +                +  F+V+  N
Sbjct: 620 SKFDF---AWAEEPPASFAIGDLVANASSPVDLAT---------------FHTFQVNVTN 661

Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
           +G      V +++    A  +   +K+++G+ R+
Sbjct: 662 LGPVASDFVAMLFGNTTAGPSPAPLKELVGYTRL 695


>gi|242216161|ref|XP_002473890.1| beta-xylosidase [Postia placenta Mad-698-R]
 gi|220726990|gb|EED80923.1| beta-xylosidase [Postia placenta Mad-698-R]
          Length = 741

 Score =  468 bits (1205), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 289/730 (39%), Positives = 402/730 (55%), Gaps = 39/730 (5%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD+S     R   L+S  TL+EK+   G+ A GVPRLGLP Y+WW EALHGV+   PG  
Sbjct: 34  CDTSATPLERATALISLFTLEEKINNTGNTAPGVPRLGLPAYQWWQEALHGVAE-SPGVI 92

Query: 115 F--DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           F        ATSFP  IL  A+F+++L   +   VSTEARA  N  R+G+ +W+PNIN  
Sbjct: 93  FAPSGEYSYATSFPQPILMGAAFDDALINHVATIVSTEARAFNNANRSGIDFWTPNINPF 152

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           +DPRWGR  ETPGEDPF +  Y  N + GLQ          L+    ++ + CKH+AAYD
Sbjct: 153 KDPRWGRGQETPGEDPFHLQSYVYNLITGLQG--------GLDPEYKRIVATCKHFAAYD 204

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
           ++NW+G  RY FDA V+ QD+ E + R F  C ++ +  S MCSYN VNG+PSCA+  LL
Sbjct: 205 LENWEGNVRYGFDALVSLQDLSEFYTRSFRTCARDANVGSFMCSYNAVNGVPSCANSYLL 264

Query: 293 NQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
              +R  W   +   YI +DCD+IQ + + H + A ++ + VA  L AG DLDCG+YY  
Sbjct: 265 QDILRDHWGWTNEDQYITSDCDAIQNIYEPHYYTA-TRAETVADALNAGTDLDCGEYYPE 323

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS--PQYVSLGKQDICSDENIELAAE 407
             G A  QG   E+ ++++L   Y  L++LG+FD +    Y  +G  ++ + E  ELA  
Sbjct: 324 NLGAAYDQGLFTESTLNRALIRQYAALVKLGYFDPADIQPYRQIGWANVSTPEAEELAYT 383

Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY 467
           AA EGI LLKND  TLPL S  +KT+A++GP ANAT  M GNY G+    +SP+      
Sbjct: 384 AAVEGITLLKND-GTLPL-SPSIKTIALIGPWANATTQMQGNYYGVAPYLISPLMAAEEL 441

Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
               Y +    V   + +S  AA  AA+ ADA I   G+D++VEAE++DR  L  PG Q 
Sbjct: 442 GFTVYYSAGPGVDDPTTSSFPAAFAAAEAADAIIYAGGIDITVEAEAMDRYTLDWPGVQP 501

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
             I+Q++ + K P+I++    G +D +    N  + A++W GYPG+ GG+AI D++ G  
Sbjct: 502 DFIDQLSLLGK-PLIVLQFGGGQIDDSALLPNPGVNALVWGGYPGQSGGKAIMDIIVGNA 560

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
            P GRLPIT Y  DYV  + +T M LRP  S   PGRTY +Y G  +  FG+GL YT F 
Sbjct: 561 APAGRLPITQYPLDYVYQVAMTDMSLRP--SPTNPGRTYMWYTGTPIVEFGFGLHYTTFT 618

Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
                           L      +Y      + C GV   DL C  +  +  +  N GS+
Sbjct: 619 --------------ASLSQPSAPSYDIATLVSLCSGVAHPDL-C-PFASYTANVTNTGSS 662

Query: 708 DGSDVVIVYSKPPAEIAATYIKQV-IGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT 766
             SD V +         A Y  +V + + R+   A    +   +     SL+ VD   NT
Sbjct: 663 VTSDFVSLLFLAGEHGPAPYPNKVLVAYDRLHAIAPLASQTTTLNLTLGSLSRVDDYGNT 722

Query: 767 LLPAGEHTIF 776
           +L  GE+T+ 
Sbjct: 723 ILYPGEYTLI 732


>gi|449531013|ref|XP_004172482.1| PREDICTED: beta-D-xylosidase 1-like, partial [Cucumis sativus]
          Length = 534

 Score =  468 bits (1204), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 242/531 (45%), Positives = 342/531 (64%), Gaps = 17/531 (3%)

Query: 253 MEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDS 312
           +E+T+  PF+ CV EG  +SVMCSYN+VNG P+CADP LL  T+RG W L GYIV+DCDS
Sbjct: 1   LEDTYNVPFKACVVEGKVASVMCSYNQVNGKPTCADPDLLKNTIRGAWGLDGYIVSDCDS 60

Query: 313 IQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYL 372
           + V+ D+  F   + E+A A T+KAGLDLDCG +    T  AV +G +KE D++ +L  L
Sbjct: 61  VGVLYDSQHF-TPTPEEAAASTIKAGLDLDCGPFLAVHTATAVGRGLLKEVDLNNALANL 119

Query: 373 YTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAK 429
            +V MRLG FDG P    Y +LG +D+C+  +  LA EAAR+GIVLL+N    LPL+  +
Sbjct: 120 LSVQMRLGMFDGEPAAQPYGNLGPKDVCTPAHKHLALEAARQGIVLLQNRAGALPLSPTR 179

Query: 430 VKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA 489
            +TVAV+GP+++ATV MIGNYAG+ C Y +P+ G S Y    +  GC +VAC  +  I  
Sbjct: 180 HRTVAVIGPNSDATVTMIGNYAGVACEYTTPVQGISKYVKTIHAKGCANVACVGDQLIGE 239

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A  AA+ ADA +++ GLD S+EAES DR  + LPG Q +L+ ++    KGP ++V+MS G
Sbjct: 240 AEAAARVADAAVVVVGLDQSIEAESRDRNGVLLPGKQEELVRRIGLACKGPTVVVLMSGG 299

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
            +D++FA+ +  I  ILW GYPG+ GG AIADV+FG  NPGG+LP+TWY   Y+  +P+T
Sbjct: 300 PIDVSFAKNDGKISGILWVGYPGQAGGAAIADVLFGATNPGGKLPMTWYPQSYLAKVPMT 359

Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQHCR 668
           +M LRP  S GYPGRTY+FY GP ++PFG+GLSY++F     SF +   +++L       
Sbjct: 360 NMGLRPDPSTGYPGRTYRFYKGPVVFPFGFGLSYSKFSQ---SFAEAPTKISLPLSSLSP 416

Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
           N + T   S T C    V+DL         +D +N G+ DGS  ++V+S  P +  +   
Sbjct: 417 NSSATVKVSHTDCAS--VSDL------PIMIDVKNTGTVDGSHTILVFSTVPNQTWSPE- 467

Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           K +IGF++V + AG  KR++   + C  L+ VD      +P GEH + +G+
Sbjct: 468 KHLIGFEKVHLIAGSQKRVRIGIHVCDHLSRVDEFGTRRIPMGEHKLHIGD 518


>gi|296439595|sp|A1CCL9.2|BXLB_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
          Length = 771

 Score =  468 bits (1203), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 305/760 (40%), Positives = 420/760 (55%), Gaps = 54/760 (7%)

Query: 37  CDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQY 96
           C  G  SKL +       CD+S   + R + LV  M+  EKV      A GVPRLGLP Y
Sbjct: 32  CTSGPLSKLAV-------CDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAY 84

Query: 97  EWWSEALHGVSNVGPGTHFDDVIPG--ATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            WWSEALHGV+   PG HF D  P   ATSF   IL  ASF++ L K++   V TE RA 
Sbjct: 85  NWWSEALHGVAGA-PGVHFADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAF 143

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
            N GRAGL YW+PNIN  RDPRWGR  ETPGEDP  V RY  + V GLQ   G       
Sbjct: 144 GNAGRAGLDYWTPNINPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQGGIG------- 196

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
            +RP ++++ CKH+AAYD+++W GV R+ FDARV+ QD+ E +L  F+ CV++    +VM
Sbjct: 197 PARP-QIAATCKHFAAYDMEDWNGVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVM 255

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLH---GYIVADCDSIQVMVDNHKFLADSKEDAV 331
           CSYN +NG+P+CADP LL   +R  WD      ++V+DC +I  +   H +   +  +A 
Sbjct: 256 CSYNALNGVPTCADPYLLQTLLREHWDWDQPGHWVVSDCGAIDDIYIGHNY-TKTGAEAA 314

Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YV 389
           A  L AG DLDCG  +    G A +QG      +D++L  LY+ L++LG+FD + +  Y 
Sbjct: 315 AVALNAGTDLDCGTVFPKHLGEAAEQGLYTNQTLDRALVRLYSSLVKLGYFDPAEKQPYG 374

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
           S+G +D+ +    +LA +AA EGIVLLKNDQ TLPL +    T+A++GP+ANAT  M GN
Sbjct: 375 SIGWKDVDTPAAEQLAHKAAVEGIVLLKNDQ-TLPLKAK--GTLALIGPYANATKQMQGN 431

Query: 450 YAGIP--CRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
           Y G P   R +   A   GY  V Y  G   +   S     AA  AAK AD  +   G+D
Sbjct: 432 YQGPPKYIRTLEWAATQHGY-QVQYSPGT-AINNSSTAGFAAALAAAKDADVVLYAGGID 489

Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
            ++E+E+LDR  +  PG Q  LI++++ + K P+I++    G VD     TN ++ A+LW
Sbjct: 490 NTIESETLDRTTITWPGNQLSLISELSNLHK-PLIVIQFGGGQVDDTPLLTNPHVNALLW 548

Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
           AGYP +EGG AI D++ GK  P GRLPIT Y   Y   +P+T M LR       PGRTY+
Sbjct: 549 AGYPSQEGGAAIFDILTGKAAPAGRLPITQYPAAYTAQVPMTEMGLRAGGD--NPGRTYR 606

Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
           +Y+   + PFG+GL YT F           +V+ ++    R   Y + A   R PG    
Sbjct: 607 WYD-KAVVPFGFGLHYTSF-----------EVSWDR---GRLGPYNTAALVNRAPGGSHV 651

Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRV-FVRAGRNK 745
           D    D   F+V  QN G+     V +++ K        Y +K ++G+ RV  V+ G  +
Sbjct: 652 DRALFD--TFRVQVQNTGTVTSDYVALLFVKTEDAGPEPYPLKTLVGYTRVQQVKPGERR 709

Query: 746 RIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFP 785
            ++                  L P G++T+ V  G   +P
Sbjct: 710 SVEIEVTLGAMARTAANGDLVLYP-GKYTLQVDVGERGYP 748


>gi|121712174|ref|XP_001273702.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
 gi|119401854|gb|EAW12276.1| beta-xylosidase [Aspergillus clavatus NRRL 1]
          Length = 803

 Score =  467 bits (1202), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 305/760 (40%), Positives = 420/760 (55%), Gaps = 54/760 (7%)

Query: 37  CDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQY 96
           C  G  SKL +       CD+S   + R + LV  M+  EKV      A GVPRLGLP Y
Sbjct: 64  CTSGPLSKLAV-------CDTSRDVTTRAQSLVDAMSFAEKVNNTQYEAPGVPRLGLPAY 116

Query: 97  EWWSEALHGVSNVGPGTHFDDVIPG--ATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            WWSEALHGV+   PG HF D  P   ATSF   IL  ASF++ L K++   V TE RA 
Sbjct: 117 NWWSEALHGVAGA-PGVHFADSGPFSYATSFAQPILLGASFDDELVKQVATVVGTEGRAF 175

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
            N GRAGL YW+PNIN  RDPRWGR  ETPGEDP  V RY  + V GLQ   G       
Sbjct: 176 GNAGRAGLDYWTPNINPFRDPRWGRGQETPGEDPLHVSRYVYHLVDGLQGGIG------- 228

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
            +RP ++++ CKH+AAYD+++W GV R+ FDARV+ QD+ E +L  F+ CV++    +VM
Sbjct: 229 PARP-QIAATCKHFAAYDMEDWNGVSRHEFDARVSTQDLAEFYLPSFKSCVRDAQVDAVM 287

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLH---GYIVADCDSIQVMVDNHKFLADSKEDAV 331
           CSYN +NG+P+CADP LL   +R  WD      ++V+DC +I  +   H +   +  +A 
Sbjct: 288 CSYNALNGVPTCADPYLLQTLLREHWDWDQPGHWVVSDCGAIDDIYIGHNY-TKTGAEAA 346

Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YV 389
           A  L AG DLDCG  +    G A +QG      +D++L  LY+ L++LG+FD + +  Y 
Sbjct: 347 AVALNAGTDLDCGTVFPKHLGEAAEQGLYTNQTLDRALVRLYSSLVKLGYFDPAEKQPYG 406

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
           S+G +D+ +    +LA +AA EGIVLLKNDQ TLPL +    T+A++GP+ANAT  M GN
Sbjct: 407 SIGWKDVDTPAAEQLAHKAAVEGIVLLKNDQ-TLPLKAK--GTLALIGPYANATKQMQGN 463

Query: 450 YAGIP--CRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
           Y G P   R +   A   GY  V Y  G   +   S     AA  AAK AD  +   G+D
Sbjct: 464 YQGPPKYIRTLEWAATQHGY-QVQYSPGT-AINNSSTAGFAAALAAAKDADVVLYAGGID 521

Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
            ++E+E+LDR  +  PG Q  LI++++ + K P+I++    G VD     TN ++ A+LW
Sbjct: 522 NTIESETLDRTTITWPGNQLSLISELSNLHK-PLIVIQFGGGQVDDTPLLTNPHVNALLW 580

Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
           AGYP +EGG AI D++ GK  P GRLPIT Y   Y   +P+T M LR       PGRTY+
Sbjct: 581 AGYPSQEGGAAIFDILTGKAAPAGRLPITQYPAAYTAQVPMTEMGLRAGGD--NPGRTYR 638

Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
           +Y+   + PFG+GL YT F           +V+ ++    R   Y + A   R PG    
Sbjct: 639 WYD-KAVVPFGFGLHYTSF-----------EVSWDR---GRLGPYNTAALVNRAPGGSHV 683

Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRV-FVRAGRNK 745
           D    D   F+V  QN G+     V +++ K        Y +K ++G+ RV  V+ G  +
Sbjct: 684 DRALFD--TFRVQVQNTGTVTSDYVALLFVKTEDAGPEPYPLKTLVGYTRVQQVKPGERR 741

Query: 746 RIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFP 785
            ++                  L P G++T+ V  G   +P
Sbjct: 742 SVEIEVTLGAMARTAANGDLVLYP-GKYTLQVDVGERGYP 780


>gi|426198365|gb|EKV48291.1| hypothetical protein AGABI2DRAFT_219902 [Agaricus bisporus var.
           bisporus H97]
          Length = 767

 Score =  467 bits (1201), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 296/807 (36%), Positives = 436/807 (54%), Gaps = 77/807 (9%)

Query: 1   MAKVVSSLLCFSLSIALLVFSTNAVD-ANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSL 59
           M   ++S++C ++ +AL  F+ +  D  NG  S   VCDP +                  
Sbjct: 1   MNPFLASIVCAAIHVALGQFNYSFPDCVNGPLSSTAVCDPTKAP---------------- 44

Query: 60  PYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHF--DD 117
             + R K L+   T +E +Q   + + GVPRLG+P Y+WWSEALHGV+   PG  F    
Sbjct: 45  --AARAKTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWSEALHGVAG-SPGVSFAPSG 101

Query: 118 VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRW 177
               ATSFP  I+  ++F+  L K +   +STEARA  N  RAGL Y++PNIN  +DPRW
Sbjct: 102 EFSSATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFHRAGLDYFTPNINPFKDPRW 161

Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCCKHYAAYDVDNW 236
           GR  ETPGEDPF V +Y  + + GLQ          ++ RP  KV++ CKHYAAYD+D+W
Sbjct: 162 GRGQETPGEDPFHVSQYVYSLIDGLQG--------GIDPRPYFKVAADCKHYAAYDLDSW 213

Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
           +G+DR+HFDA+V+ QD+ E +L  F+ CV++   +SVMCSYN VNGIP+CA+P LL   +
Sbjct: 214 EGIDRFHFDAKVSLQDLSEYYLPSFQSCVRDAKVASVMCSYNSVNGIPACANPYLLQDIL 273

Query: 297 RGEW--DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
           R  W  D   ++ +DCD+I  +   H F  D+  +AVA  LKAG D+DCG  Y+    +A
Sbjct: 274 RDFWGFDDDRWVTSDCDAIGNIFTTHNF-TDTFAEAVADALKAGTDVDCGTSYSTHLPDA 332

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIELAAEAAREG 412
           + Q  +   D++++L   YT LMRLG+FD   S     L   D+   +   LA  AA EG
Sbjct: 333 LNQSLITRDDLERALTRQYTSLMRLGYFDPPESQPLRQLAWSDVNKPDAQALAHTAAVEG 392

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTY 472
           +VLLKND   LP+ SA  KT+A++GP+ANAT  M GNY G     ++P  G         
Sbjct: 393 LVLLKND-GFLPV-SASGKTIAIIGPYANATKDMQGNYFGTAPFIVTPFQG-------AV 443

Query: 473 KTGCDDVACKSNNSIFAASE--------AAKTADATIILAGLDLSVEAESLDREDLWLPG 524
             G ++V   +  SI   SE         A ++D  I   G++ S+E+E+ DR  +   G
Sbjct: 444 DAGFNEVVSAAGTSINGTSEADFAAAIAVANSSDIIIFAGGINNSIESEAKDRLTIAWTG 503

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q  L+ Q+A + K PV++V    G +D +    N  ++A++WAGYPG+ GG AI DV+ 
Sbjct: 504 NQLSLVKQLASLGK-PVVVVQFGGGQLDDSDLLDNDAVRAVIWAGYPGQSGGTAIFDVIT 562

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G   P GRL +T Y  D+V  + +T M LRP  +   PGRTYK+Y G  +  FG+GL +T
Sbjct: 563 GAVAPAGRLSVTQYPEDFVNQVGMTDMALRPGSA--NPGRTYKWYTGRPVLEFGHGLHFT 620

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
            F ++        + N+  L H  +  +         P ++  D        F V+ +N 
Sbjct: 621 TFDFSWRG-RPGRKYNIQHLLHTADKKF---------PDLIPLD-------TFHVNIRNT 663

Query: 705 GSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDYA 763
           G+     V +++ +  A  A    K ++ F R   + AG +  +    N   S+  VD  
Sbjct: 664 GNITSDYVALLFLRSNAGFAPHPKKSLVSFARAHRIDAGSSATVDLGVN-LGSIARVDEH 722

Query: 764 ANTLLPAGEHTIF--VGNGGVSFPIHL 788
            ++ L AG++ +   +G+G +S    L
Sbjct: 723 GDSWLFAGDYQLVLDIGDGVLSHSFSL 749


>gi|409079872|gb|EKM80233.1| hypothetical protein AGABI1DRAFT_57801 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 767

 Score =  466 bits (1198), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 296/807 (36%), Positives = 439/807 (54%), Gaps = 77/807 (9%)

Query: 1   MAKVVSSLLCFSLSIALLVFSTNAVD-ANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSL 59
           M   ++S++C ++ +AL  F+ +  D  NG  S   VCDP +                  
Sbjct: 1   MNPFLASIVCAAIHVALGQFNYSFPDCVNGPLSSTAVCDPTKAP---------------- 44

Query: 60  PYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHF--DD 117
             + R   L+   T +E +Q   + + GVPRLG+P Y+WWSEALHGV+   PG  F    
Sbjct: 45  --AARATTLIQMFTDEELMQNTDNVSPGVPRLGVPSYQWWSEALHGVAG-SPGVSFAPSG 101

Query: 118 VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRW 177
               ATSFP  I+  ++F+  L K +   +STEARA  N  RAGL Y++PNIN  +DPRW
Sbjct: 102 EFSSATSFPQSIVLGSTFDIDLVKAVATVISTEARAFNNFHRAGLDYFTPNINPFKDPRW 161

Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCCKHYAAYDVDNW 236
           GR  ETPGEDPF V +Y  + + GLQ          ++ RP  KV++ CKHYAAYD+D+W
Sbjct: 162 GRGQETPGEDPFHVSQYVYSLIDGLQG--------GIDPRPYFKVAADCKHYAAYDLDSW 213

Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
           +G+DR+HFDA+V+ QD+ E +L  F+ CV++   +SVMCSYN VNGIP+CA+P LL   +
Sbjct: 214 EGIDRFHFDAKVSLQDLSEYYLPSFQSCVRDAKVASVMCSYNSVNGIPACANPYLLQDIL 273

Query: 297 RGEW--DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
           R  W  D   ++ +DCD+I  +   H F  D+  +AVA  LKAG D+DCG  Y+    +A
Sbjct: 274 RDFWGFDDDRWVTSDCDAIGNIFTTHNF-TDTFAEAVADALKAGTDVDCGTSYSTHLPDA 332

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIELAAEAAREG 412
           + Q  +   D++++L   YT LMRLG+FD   S     L   D+   +   LA  AA EG
Sbjct: 333 LNQSLITRDDLERALTRQYTSLMRLGYFDPPESQPLRQLAWSDVNKPDAQALAHTAAVEG 392

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTY 472
           +VLLKND   LP+ SA  KT+A++GP+ANAT  M GNY G     ++P  G         
Sbjct: 393 LVLLKND-GFLPV-SASGKTIAIIGPYANATKDMQGNYFGTAPFIVTPFQG-------AV 443

Query: 473 KTGCDDVACKSNNSIFAASE--------AAKTADATIILAGLDLSVEAESLDREDLWLPG 524
             G ++V   +  SI   SE         A ++D  I   G++ S+E+E+ DR  +   G
Sbjct: 444 DAGFNEVVSAAGTSINGTSEADFAAAIAVANSSDIIIFAGGINNSIESEAKDRLTIAWTG 503

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q  L+ Q+A + K PV++V    G +D +    N  ++A++WAGYPG+ GG AI DV+ 
Sbjct: 504 NQLSLVKQLASLGK-PVVVVQFGGGQLDDSDLLDNDAVRAVIWAGYPGQSGGTAIFDVIT 562

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G   P GRL +T Y  D+V  + +T M LRP  +   PGRTYK+Y G  +  FG+GL +T
Sbjct: 563 GAVAPAGRLSVTQYPEDFVNQVGMTDMALRPGSA--NPGRTYKWYTGRPVLEFGHGLHFT 620

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
            F ++        +    +  + ++L +T+D    + P ++  D        F V+ +N 
Sbjct: 621 TFDFSW-------RGRPGRKYNIQHLLHTAD---KKFPDLIPLD-------TFHVNIRNT 663

Query: 705 GSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDYA 763
           G+     V +++ K  A  A    K ++ F R   + AG +  +    N   S+  VD  
Sbjct: 664 GNITSDYVALLFLKSNAGFAPHPKKSLVSFARAHRIDAGSSATVDLGVN-LGSIARVDEH 722

Query: 764 ANTLLPAGEHTIF--VGNGGVSFPIHL 788
            ++ L AG++ +   +G+G +S    L
Sbjct: 723 GDSWLFAGDYQLVLDIGDGVLSHSFSL 749


>gi|409079878|gb|EKM80239.1| hypothetical protein AGABI1DRAFT_120267 [Agaricus bisporus var.
           burnettii JB137-S8]
          Length = 786

 Score =  463 bits (1191), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 260/611 (42%), Positives = 361/611 (59%), Gaps = 29/611 (4%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           + S   CDS+   + R + L+   T DE +Q   + + GVPRLGLP YEWWSEALHGV +
Sbjct: 32  LKSTPVCDSAKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEWWSEALHGVGH 91

Query: 109 VGPGTHF--DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
             PG  F        ATSFP  I+  A+F++ L K +   VSTEARA  N GRAGL Y++
Sbjct: 92  -SPGVVFAPSGDFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNNFGRAGLNYFT 150

Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCC 225
           PNIN  +DPRWGR  ETPGEDPF + +Y  + V GLQ          ++  P +KV++ C
Sbjct: 151 PNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQG--------GIDPWPYIKVAADC 202

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+AAYD++NW+G+DR+HFDA+V++QD+ E +L PF+ CV++  A+SVMCSYN VNG+P+
Sbjct: 203 KHFAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVNGVPA 262

Query: 286 CADPKLLNQTVRGEW--DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           CA   LL   +R  W  D   ++ +DC ++  + D+H F   S  +A A +LKAG D+DC
Sbjct: 263 CASTYLLQDILRDAWGFDDDRWVTSDCWALDKIFDSHNF-TRSFAEAAAISLKAGTDIDC 321

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDEN 401
           G  + +    A+ Q  +   D+ ++    YT L+RLG+FD   S  Y      D+ + E 
Sbjct: 322 GSTFADHLPAALNQSLISRDDLTRAFIRQYTSLIRLGYFDPSDSQTYRQFDWSDVNTPEA 381

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
             L+  AA EG+VLLKND   LPL +   KT+A++GP+ NAT +M GNY G      SP 
Sbjct: 382 QALSRRAAVEGLVLLKND-GLLPL-APDGKTIAIIGPYTNATSSMQGNYFGNAPIITSP- 438

Query: 462 AGFSGYANVTYK---TGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRE 518
             F G  +V +K        V   S+     A   AK AD  + + G+D ++E E LDR 
Sbjct: 439 --FQGAQDVGFKVVSAAGTTVNGTSSAGFAEAINTAKAADVVVFVGGIDNTLEREGLDRS 496

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
            +  PG Q  L+  +A + K P+I+V    G VD      N  ++AI+WAGYPG+ GG A
Sbjct: 497 SISWPGNQLDLVKDLASLGK-PLIVVQFGGGQVDDTEILANKKVQAIIWAGYPGQSGGTA 555

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           I D++ G   P GRLP+T Y  DY   + +T M LRP  S   PGRTYK+Y  P L  +G
Sbjct: 556 IFDIIVGSTAPAGRLPVTQYPADYTHQVRMTDMSLRP--SSHNPGRTYKWYKTPVL-EYG 612

Query: 639 YGLSYTQFKYN 649
           +GL +T F ++
Sbjct: 613 HGLHFTTFDFS 623


>gi|392590128|gb|EIW79457.1| glycoside hydrolase family 3 protein [Coniophora puteana RWD-64-598
           SS2]
          Length = 770

 Score =  462 bits (1190), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 259/604 (42%), Positives = 368/604 (60%), Gaps = 25/604 (4%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD+SL  + R   LV   T++E +    + + GVPRLGLP Y+WWSE LHGV++  PG +
Sbjct: 37  CDTSLNATQRAAALVELFTVEELINNTVNGSPGVPRLGLPAYQWWSEGLHGVAD-SPGVN 95

Query: 115 FDDVIPG--ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           F    P   ATSFP  I+ +A+F+++L K +G  V  E R+  N G AGL +W+PNIN  
Sbjct: 96  FSTSGPFSYATSFPQPIVMSAAFDDALIKAVGGVVGMEGRSFNNYGHAGLDFWTPNINPF 155

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCCKHYAAY 231
           +DPRWGR  ETPGEDP+ + +Y  N ++GLQ          +N  P  +V + CKH+A Y
Sbjct: 156 KDPRWGRGQETPGEDPYHIAQYVYNLIQGLQG--------GVNPEPYFQVVATCKHFAGY 207

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
           D+++W+   RY FDA +T QD+ E +L  F+ C ++  A + MCSYN VNGIP+CAD  L
Sbjct: 208 DLEDWENNFRYGFDALITTQDLSEFYLPSFQSCYRDAQAGASMCSYNAVNGIPTCADTYL 267

Query: 292 LNQTVRGEW--DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
           L   +R  W  D   ++ +DCD+++ + + H + A  ++ A A  L+AG DLDCG +YT 
Sbjct: 268 LQDILRDYWNFDETRWVTSDCDAVENIYNPHNYTALPQQ-AAADALRAGTDLDCGTFYTE 326

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAE 407
           +   A  Q  + ET++  +L   Y  L+RLG+FD + Q  Y   G  ++ +    +LA  
Sbjct: 327 YLPLAYNQSLITETELRAALTRQYASLVRLGYFDPAAQQPYRQYGWSNVDTPYAQQLAYT 386

Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG--FS 465
           AA EGI LLKND  TLPL S  +K +A++GP ANAT  M GNY G+    +SP+ G   +
Sbjct: 387 AATEGITLLKND-GTLPLPS-TLKNIALIGPWANATNQMQGNYFGVAPYLVSPLQGALAA 444

Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
           GY NVTY  G  ++   S     AA  AA+ ADA +   G+D++VEAE++DR ++  PG 
Sbjct: 445 GY-NVTYVFGT-NITSNSTAGFAAAIAAAREADAVVYAGGIDVTVEAEAMDRYNVTWPGN 502

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q QLI ++A + K P ++     G VD    + N ++ +++WAGYPG+ GG+A+ D++ G
Sbjct: 503 QLQLIGELAALGK-PFVVAQFGGGQVDDTEIKANASVNSLIWAGYPGQSGGQALFDIISG 561

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           K  P GRL  T Y  DYV  +P+T M LRP  +    PGRTYK+Y G  +Y FGYGL YT
Sbjct: 562 KVAPAGRLVTTQYPADYVYEIPMTDMNLRPNANGTTSPGRTYKWYTGAPVYEFGYGLHYT 621

Query: 645 QFKY 648
            F Y
Sbjct: 622 NFTY 625


>gi|242786966|ref|XP_002480909.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
 gi|218721056|gb|EED20475.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
          Length = 757

 Score =  462 bits (1190), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 259/622 (41%), Positives = 373/622 (59%), Gaps = 33/622 (5%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV---IP 120
           RVK L+  +TL+EK+  L D + G  RLGLP YEWW+EA HGV +  PG  F +      
Sbjct: 25  RVKSLIDSLTLEEKILNLVDASAGSERLGLPSYEWWNEATHGVGSA-PGVQFTEKPVNFS 83

Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRI 180
            ATSFP  ILT ASF+++L ++I   +  E RA  N G +G  +W+PNIN  RDPRWGR 
Sbjct: 84  YATSFPAPILTAASFDDALVREIASVIGREGRAFGNNGFSGFDFWAPNINPFRDPRWGRG 143

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            ETPGED FVV  Y  N++ GLQ  +  +          +V + CKHYAAYD++      
Sbjct: 144 QETPGEDSFVVQSYIRNFIPGLQGDDPEDK---------QVIATCKHYAAYDLE----TG 190

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
           RY  D   T+QD+ + FL PF+ CV++    S+MC+YN V+GIP+CA   LL+Q +R  W
Sbjct: 191 RYGNDYNPTQQDLADYFLAPFKTCVRDTGVGSIMCAYNAVDGIPTCASEYLLDQVLRKHW 250

Query: 301 DL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN-AVQ 356
           +    + Y+V+DC ++  +   H F  D++E A + +L AG+DL+CG  Y     + A  
Sbjct: 251 NFTADYNYVVSDCGAVTDIWQYHNF-TDTEEAAASVSLNAGVDLECGSSYLKLNESLAAN 309

Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLL 416
           Q  V+   +D++L  LY+ L  +GFFDG  +Y +LG  D+ + E   LA EAA EG+ LL
Sbjct: 310 QTTVQA--LDQALTRLYSALFTVGFFDGG-KYTALGFADVSTPEAQSLAYEAAVEGMTLL 366

Query: 417 KNDQNTLPLNSA-KVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKT 474
           KND+  LP+ S+ K K+VA++GP ANAT  M G+Y+GIP   +SP+  F G+   V Y  
Sbjct: 367 KNDKRLLPIRSSHKYKSVALIGPFANATTQMQGDYSGIPPFLISPLEAFKGHDWEVNYAM 426

Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVA 534
           G   +  ++     +A  AA+ +D  I L G+D S+EAE+LDR  L  PG Q  L+ Q++
Sbjct: 427 GT-GINNQTTTGFASALAAAEKSDLVIYLGGIDNSIEAETLDRTSLTWPGNQLDLVTQLS 485

Query: 535 EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
           ++ K P+I+V    G +D +    N  ++A++WAGYP + GG A+ DV+ GK +  GRLP
Sbjct: 486 KLHK-PLIVVQFGGGQLDDSALLQNEGVQALVWAGYPSQSGGSALLDVLLGKRSIAGRLP 544

Query: 595 ITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
           +T Y   Y   + +  + +RP DS  YPGRTYK+Y G  + PFGYGL YT+F++     T
Sbjct: 545 VTQYPASYADQVSIFDINIRPNDS--YPGRTYKWYTGMPVVPFGYGLHYTKFEFEWAQ-T 601

Query: 655 KTIQVNLNKL-QHCRNLNYTSD 675
              + N+ +L   C++    SD
Sbjct: 602 LNHEYNIQQLVASCQSTGPISD 623


>gi|343172466|gb|AEL98937.1| beta-xylosidase, partial [Silene latifolia]
 gi|343172468|gb|AEL98938.1| beta-xylosidase, partial [Silene latifolia]
          Length = 374

 Score =  462 bits (1189), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 219/382 (57%), Positives = 278/382 (72%), Gaps = 13/382 (3%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLGL  YEWWSEALHGVSNVGPGT F    P ATSFP VI T ASFN SLW+ IGQAVS 
Sbjct: 1   RLGLQGYEWWSEALHGVSNVGPGTKFQGAFPAATSFPQVITTAASFNASLWQAIGQAVSD 60

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           EARAMYN G AGLTYWSPN+N+ RDPRWGR  ETPGEDP +  +YA +YV GLQ   G+ 
Sbjct: 61  EARAMYNGGTAGLTYWSPNVNIFRDPRWGRGQETPGEDPTLSAQYAASYVTGLQGNYGNR 120

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
                    LKV++CCKHY AYD+DNW G+DR+HF+A+V++QD+E+T+  PF+ CV EG 
Sbjct: 121 ---------LKVAACCKHYTAYDLDNWNGMDRFHFNAKVSKQDLEDTYNVPFKACVLEGK 171

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
            +SVMCSYN+VNG P+CADP +L  T+RG+W L+GYIV+DCDS+ V+ D+  +   + E+
Sbjct: 172 VASVMCSYNQVNGKPTCADPDILRNTIRGQWHLNGYIVSDCDSVGVLYDDQHY-TRTPEE 230

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
           A A T+ AGLDLDCG +    T  A++QG V E  ++++L    TV MRLG FDG P   
Sbjct: 231 AAADTINAGLDLDCGPFLAVHTEGAIRQGLVTEAAVNQALANTITVQMRLGMFDGEPSAQ 290

Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
            + +LG +D+C+  + +LA +AAREGIVLLKN   +LPL++ + + +AV+GP+A AT  M
Sbjct: 291 PFGNLGPRDVCTPAHQDLALQAAREGIVLLKNQVGSLPLSTVRHRNIAVIGPNAQATTTM 350

Query: 447 IGNYAGIPCRYMSPIAGFSGYA 468
           IGNYAGI C Y SP+ G S YA
Sbjct: 351 IGNYAGIACGYTSPLQGISRYA 372


>gi|121797681|sp|Q2TYT2.1|BXLB_ASPOR RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|83775471|dbj|BAE65591.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 797

 Score =  462 bits (1188), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 285/706 (40%), Positives = 396/706 (56%), Gaps = 47/706 (6%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD+SL    R K LV+ MTL+EK+      + G PRLGLP Y WW+EALHGV+  G G  
Sbjct: 62  CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 120

Query: 115 FDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           F D      ATSFP  IL  A+F++ L K++   +STEARA  N G AGL YW+PNIN  
Sbjct: 121 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 180

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           RDPRWGR  ETPGEDP  + RY  + V GLQD  G E       RP KV + CKH+AAYD
Sbjct: 181 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPE-------RP-KVVATCKHFAAYD 232

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
           ++NW+G++RY FDA V+ QD+ E +L  F+ C ++    +VMCSYN +NGIP+CAD  LL
Sbjct: 233 LENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLL 292

Query: 293 NQTVRGEWDLH---GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
              +R  W       ++  DC +I  +  +H ++A     A A  L AG DLDCG  +  
Sbjct: 293 QTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVA-DGAHAAAAALNAGTDLDCGSVFPE 351

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAE 407
           + G+A+QQG      ++ +L  LY+ L++LG+FD +    Y S+G  ++ +    ELA +
Sbjct: 352 YLGSALQQGLYNNQTLNNALIRLYSSLVKLGYFDPADDQPYRSIGWNEVFTPAAEELAHK 411

Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP--CRYMSPIAGFS 465
           A  EGIV+LKND  TLPL S    TVA++GP ANAT  + GNY G P   R +   A  +
Sbjct: 412 ATVEGIVMLKND-GTLPLKSN--GTVAIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHN 468

Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
           GY  V +  G  D+   S+     A  AAK AD  I   G+D ++E ES DR  +  PG 
Sbjct: 469 GY-KVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGN 526

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q  LI Q++++ K P+I+V    G VD +    N  + A+LWAGYP + GG A+ D++ G
Sbjct: 527 QLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTG 585

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
           K  P GRLP+T Y   YV  +P+T M LRP  +   PGRTY++Y+   L PFG+GL YT 
Sbjct: 586 KSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSN--NPGRTYRWYDKAVL-PFGFGLHYTT 642

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
           F            V+ N   H     Y +D+  +      V+    D    F +   N G
Sbjct: 643 FN-----------VSWN---HAEYGPYNTDSVASGTTNAPVDTELFD---TFSITVTNTG 685

Query: 706 STDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIKF 749
           +     + +++          Y IK ++G+ R   +  G+++++K 
Sbjct: 686 NVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIEPGQSQQVKL 731


>gi|426198356|gb|EKV48282.1| hypothetical protein AGABI2DRAFT_67675 [Agaricus bisporus var.
           bisporus H97]
          Length = 763

 Score =  462 bits (1188), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 259/611 (42%), Positives = 366/611 (59%), Gaps = 29/611 (4%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           + S   CDS+   + R + L+   T DE +Q   + + GVPRLGLP YEWWSEALHGV +
Sbjct: 32  LKSTPVCDSTKDPATRAQSLIQMFTDDELIQNGDNASPGVPRLGLPPYEWWSEALHGVGH 91

Query: 109 VGPGTHF--DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
             PG  F        ATSFP  I+  A+F++ L K +   VSTEARA  N GRAGL Y++
Sbjct: 92  -SPGVVFAPSGDFSSATSFPQPIVIGAAFDDDLVKAVANVVSTEARAFNNFGRAGLNYFT 150

Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCC 225
           PNIN  +DPRWGR  ETPGEDPF + +Y  + V GLQ          ++  P +KV++ C
Sbjct: 151 PNINPFKDPRWGRGQETPGEDPFHLSQYVYHLVDGLQG--------GIDPWPYIKVAADC 202

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+AAYD++NW+G+DR+HFDA+V++QD+ E +L PF+ CV++  A+SVMCSYN VNG+P+
Sbjct: 203 KHFAAYDLENWEGIDRFHFDAQVSQQDLSEYYLPPFQSCVRDAKAASVMCSYNSVNGVPA 262

Query: 286 CADPKLLNQTVRGEW--DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           CA   LL   +R  W  D   ++ +DC ++  + D+H F   S  +A A +LKAG D+DC
Sbjct: 263 CASTYLLQDILRDAWGFDDDRWVTSDCWALDKIFDSHNF-TRSFAEAAAISLKAGTDIDC 321

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDEN 401
           G  + +    A+ Q  +   D+ ++    YT L+RLG+FD   S  Y      D+ + E 
Sbjct: 322 GSTFADHLPAALNQSLISRDDLTRAFIRQYTSLIRLGYFDPSHSQTYRQFDWSDVNTPEA 381

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
             L+  AA EG+VLLKND   LPL +   KT+A++GP+ NAT +M GNY G      SP 
Sbjct: 382 QALSRRAAVEGLVLLKND-GLLPL-APDGKTIAIIGPYTNATSSMQGNYFGNAPFITSP- 438

Query: 462 AGFSGYANVTYK--TGCDDVACKSNNSIFA-ASEAAKTADATIILAGLDLSVEAESLDRE 518
             F G  +V +K  +    +   ++++ FA A   A+ AD  + + G+D ++E E LDR 
Sbjct: 439 --FQGAQDVGFKVVSAAGTIVNGTSSAGFAEAINTARAADVVVFVGGIDNTLEREGLDRS 496

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
            +  PG Q  L+  +A + K P+I+V    G VD      N  ++AI+WAGYPG+ GG A
Sbjct: 497 SISWPGNQLDLVKDLASLGK-PLIVVQFGGGQVDDTEILANEKVQAIIWAGYPGQSGGTA 555

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           I D++ G   P GRLP+T Y  DY   + +T M LRP  S   PGRTYK+Y  P L  +G
Sbjct: 556 IFDIIVGATAPAGRLPVTQYPADYTHQVRMTDMSLRP--SSHNPGRTYKWYKTPVL-EYG 612

Query: 639 YGLSYTQFKYN 649
           +GL +T F ++
Sbjct: 613 HGLHFTTFDFS 623


>gi|317158006|ref|XP_001826724.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
          Length = 776

 Score =  461 bits (1187), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 285/706 (40%), Positives = 396/706 (56%), Gaps = 47/706 (6%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD+SL    R K LV+ MTL+EK+      + G PRLGLP Y WW+EALHGV+  G G  
Sbjct: 41  CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 99

Query: 115 FDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           F D      ATSFP  IL  A+F++ L K++   +STEARA  N G AGL YW+PNIN  
Sbjct: 100 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 159

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           RDPRWGR  ETPGEDP  + RY  + V GLQD  G E       RP KV + CKH+AAYD
Sbjct: 160 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPE-------RP-KVVATCKHFAAYD 211

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
           ++NW+G++RY FDA V+ QD+ E +L  F+ C ++    +VMCSYN +NGIP+CAD  LL
Sbjct: 212 LENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLL 271

Query: 293 NQTVRGEWDLH---GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
              +R  W       ++  DC +I  +  +H ++A     A A  L AG DLDCG  +  
Sbjct: 272 QTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVA-DGAHAAAAALNAGTDLDCGSVFPE 330

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAE 407
           + G+A+QQG      ++ +L  LY+ L++LG+FD +    Y S+G  ++ +    ELA +
Sbjct: 331 YLGSALQQGLYNNQTLNNALIRLYSSLVKLGYFDPADDQPYRSIGWNEVFTPAAEELAHK 390

Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP--CRYMSPIAGFS 465
           A  EGIV+LKND  TLPL S    TVA++GP ANAT  + GNY G P   R +   A  +
Sbjct: 391 ATVEGIVMLKND-GTLPLKSN--GTVAIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHN 447

Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
           GY  V +  G  D+   S+     A  AAK AD  I   G+D ++E ES DR  +  PG 
Sbjct: 448 GY-KVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGN 505

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q  LI Q++++ K P+I+V    G VD +    N  + A+LWAGYP + GG A+ D++ G
Sbjct: 506 QLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTG 564

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
           K  P GRLP+T Y   YV  +P+T M LRP  +   PGRTY++Y+   L PFG+GL YT 
Sbjct: 565 KSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSN--NPGRTYRWYDKAVL-PFGFGLHYTT 621

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
           F            V+ N   H     Y +D+  +      V+    D    F +   N G
Sbjct: 622 FN-----------VSWN---HAEYGPYNTDSVASGTTNAPVDTELFD---TFSITVTNTG 664

Query: 706 STDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIKF 749
           +     + +++          Y IK ++G+ R   +  G+++++K 
Sbjct: 665 NVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIEPGQSQQVKL 710


>gi|391864313|gb|EIT73609.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
          Length = 797

 Score =  460 bits (1184), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 285/706 (40%), Positives = 395/706 (55%), Gaps = 47/706 (6%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD+SL    R K LV+ MTL+EK+      + G PRLGLP Y WW+EALHGV+  G G  
Sbjct: 62  CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 120

Query: 115 FDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           F D      ATSFP  IL  A+F++ L K++   +STEARA  N G AGL YW+PNIN  
Sbjct: 121 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 180

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           RDPRWGR  ETPGEDP  + RY  + V GLQD  G E       RP KV + CKH+AAYD
Sbjct: 181 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPE-------RP-KVVATCKHFAAYD 232

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
           ++NW+G++RY FDA V+ QD+ E +L  F+ C ++    +VMCSYN +NGIP+CAD  LL
Sbjct: 233 LENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLL 292

Query: 293 NQTVRGEWDLH---GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
              +R  W       ++  DC +I  +  +H ++A     A A  L AG DLDCG  +  
Sbjct: 293 QTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVA-DGAHAAAAALNAGTDLDCGSVFPE 351

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAE 407
           + G+A+QQG      +  +L  LY+ L++LG+FD +    Y S+G  ++ +    ELA +
Sbjct: 352 YLGSALQQGLYNNQTLYNALIRLYSSLVKLGYFDPADDQPYRSIGWNEVFTPAAEELAHK 411

Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP--CRYMSPIAGFS 465
           A  EGIV+LKND  TLPL S    TVA++GP ANAT  + GNY G P   R +   A  +
Sbjct: 412 ATVEGIVMLKND-GTLPLKSN--GTVAIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHN 468

Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
           GY  V +  G  D+   S+     A  AAK AD  I   G+D ++E ES DR  +  PG 
Sbjct: 469 GY-KVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGN 526

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q  LI Q++++ K P+I+V    G VD +    N  + A+LWAGYP + GG A+ D++ G
Sbjct: 527 QLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTG 585

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
           K  P GRLP+T Y   YV  +P+T M LRP  +   PGRTY++Y+   L PFG+GL YT 
Sbjct: 586 KSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSN--NPGRTYRWYDKAVL-PFGFGLHYTT 642

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
           F            V+ N   H     Y +D+  +      V+    D    F +   N G
Sbjct: 643 FN-----------VSWN---HAEYGPYNTDSVASGTTNAPVDTELFD---TFSITVTNTG 685

Query: 706 STDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIKF 749
           +     + +++          Y IK ++G+ R   +  G+++++K 
Sbjct: 686 NVASDYIALLFLTADGVGPEPYPIKTLVGYSRAKGIEPGQSQQVKL 731


>gi|242813865|ref|XP_002486253.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
 gi|218714592|gb|EED14015.1| beta-xylosidase, putative [Talaromyces stipitatus ATCC 10500]
          Length = 893

 Score =  458 bits (1178), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 266/635 (41%), Positives = 377/635 (59%), Gaps = 35/635 (5%)

Query: 21  STNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQ 80
           S+N +  +GS  P    DP         + S   CD+SL    R K LV  MT +EKVQ 
Sbjct: 140 SSNPIPLSGSVKPNCTLDP---------LCSNPICDTSLDPLTRAKGLVDAMTFEEKVQN 190

Query: 81  LGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV--IPGATSFPTVILTTASFNES 138
             + + G  RLGLP Y+WW+EALHGV+   PG  F        ATSFP  IL +A+F+++
Sbjct: 191 TQNGSPGAARLGLPAYQWWNEALHGVAG-SPGVTFQPSGNFSYATSFPQPILMSAAFDDA 249

Query: 139 LWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNY 198
           L K++G  VS E RA  N G AGL +W+PNIN  RDPRWGR  ETPGEDP+ + RY  N 
Sbjct: 250 LIKEVGTVVSIEGRAFNNYGNAGLDFWTPNINPFRDPRWGRGQETPGEDPYHIARYVYNL 309

Query: 199 VRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFL 258
           V GLQ+     N         +V + CKH+A YD+++W+G  RY F+A ++ QD+ E +L
Sbjct: 310 VDGLQNGIAPANP--------RVVATCKHFAGYDIEDWEGNSRYGFNAIISTQDLSEYYL 361

Query: 259 RPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG---YIVADCDSIQV 315
            PF+ C ++    ++MCSYN VNGIP+CAD  LL+  +R  W+ +    ++ +DCD++  
Sbjct: 362 PPFKSCARDAQVDAIMCSYNAVNGIPTCADSYLLDTILRDHWNWNQTGHWVTSDCDAVDN 421

Query: 316 MVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTV 375
           +  +H++ + S   A A  L AG +LDCG   +N    A  Q   K   ++ +L YLY+ 
Sbjct: 422 IYSDHRYTS-SLAAAAADALNAGTNLDCGTTMSNNLAAAAAQDLFKNATLNSALVYLYSS 480

Query: 376 LMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLKND-QNTLPLNSAKVKTV 433
           L+RLG+FD    QY SLG  D+ +  + +LA  AA EGIVLLKND +  LPL S   +T+
Sbjct: 481 LVRLGWFDSEDSQYSSLGWSDVGTTASQQLANRAAVEGIVLLKNDHKKVLPL-SQHGQTI 539

Query: 434 AVVGPHANATVAMIGNYAGIPCRYMSPIAGFS--GYANVTYKTGCDDVACKSNNSIFAAS 491
           A++GP+ANAT  + GNY G P    + + G    GY  V Y+ G   +     +   AA 
Sbjct: 540 ALIGPYANATTQLQGNYYGTPAYIRTLVWGAEQMGYT-VQYEAGT-GINSTDTSGFAAAV 597

Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
            AAKTAD  I   G+D S+EAE++DR  +   G Q QLI+Q+++V K P++++    G +
Sbjct: 598 AAAKTADIVIYAGGIDNSIEAEAMDRNTIAWTGNQLQLIDQLSQVGK-PLVVLQFGGGQL 656

Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSM 611
           D +    N N+ A+LW GYP + GG+A+ D++ G+  P GRLP+T Y  +Y   +P+T M
Sbjct: 657 DDSALLQNENVNALLWCGYPSQTGGQAVFDILTGQSAPAGRLPVTQYPANYTNAIPMTDM 716

Query: 612 PLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
            LRP  S   PGRTY++Y+   + PFG+GL YT F
Sbjct: 717 SLRPNGST--PGRTYRWYDDAVI-PFGFGLHYTTF 748


>gi|238508313|ref|XP_002385353.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
 gi|296439537|sp|B8NYD8.1|BXLB_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|220688872|gb|EED45224.1| beta-xylosidase, putative [Aspergillus flavus NRRL3357]
          Length = 776

 Score =  458 bits (1178), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 284/706 (40%), Positives = 395/706 (55%), Gaps = 47/706 (6%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD+SL    R K LV+ MTL+EK+      + G PRLGLP Y WW+EALHGV+  G G  
Sbjct: 41  CDTSLDPVSRAKSLVAAMTLEEKINNTKYDSSGAPRLGLPAYNWWNEALHGVAE-GHGVS 99

Query: 115 FDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           F D      ATSFP  IL  A+F++ L K++   +STEARA  N G AGL YW+PNIN  
Sbjct: 100 FSDSGNFSYATSFPMPILLGAAFDDDLVKQVATVISTEARAFANGGHAGLDYWTPNINPF 159

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           RDPRWGR  ETPGEDP  + RY  + V GLQD  G E       RP KV + CKH+AAYD
Sbjct: 160 RDPRWGRGQETPGEDPLHLSRYVYHLVDGLQDGIGPE-------RP-KVVATCKHFAAYD 211

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
           ++NW+G++RY FDA V+ QD+ E +L  F+ C ++    +VMCSYN +NGIP+CAD  LL
Sbjct: 212 LENWEGIERYAFDAVVSPQDLSEYYLPSFKTCTRDAKVDAVMCSYNSLNGIPTCADRWLL 271

Query: 293 NQTVRGEWDLH---GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
              +R  W       ++  DC +I  +  +H ++A     A A  L AG DLDCG  +  
Sbjct: 272 QTLLREHWGWEQTGHWVTGDCGAIDNIYADHHYVA-DGAHAAAAALNAGTDLDCGSVFPE 330

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAE 407
           +  +A+QQG      ++ +L  LY+ L++LG+FD +    Y S+G  ++ +    ELA +
Sbjct: 331 YLRSALQQGLYNNQTLNNALIRLYSSLVKLGYFDPADDQPYRSIGWNEVFTPAAEELAHK 390

Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP--CRYMSPIAGFS 465
           A  EGIV+LKND  TLPL S    TVA++GP ANAT  + GNY G P   R +   A  +
Sbjct: 391 ATVEGIVMLKND-GTLPLKSN--GTVAIIGPFANATTQLQGNYEGPPKYIRTLIWAAVHN 447

Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
           GY  V +  G  D+   S+     A  AAK AD  I   G+D ++E ES DR  +  PG 
Sbjct: 448 GY-KVKFSQGT-DINSNSSAGFAEAISAAKEADTVIYAGGIDNTIEKESQDRTTIVWPGN 505

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q  LI Q++++ K P+I+V    G VD +    N  + A+LWAGYP + GG A+ D++ G
Sbjct: 506 QLDLIEQLSDLEK-PLIVVQFGGGQVDDSSLLANAGVGALLWAGYPSQAGGAAVFDILTG 564

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
           K  P GRLP+T Y   YV  +P+T M LRP  +   PGRTY++Y+   L PFG+GL YT 
Sbjct: 565 KSAPAGRLPVTQYPASYVDEVPMTDMTLRPGSN--NPGRTYRWYDKAVL-PFGFGLHYTT 621

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
           F            V+ N   H     Y +D+  +      V+    D    F +   N G
Sbjct: 622 FN-----------VSWN---HAEYGPYNTDSVASGTTNAPVDTELFD---TFSITVTNTG 664

Query: 706 STDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIKF 749
           +     + +++          Y IK ++G+ R   +  G+++++K 
Sbjct: 665 NVASDYIALLFLTADRVGPEPYPIKTLVGYSRAKGIEPGQSQQVKL 710


>gi|402225863|gb|EJU05924.1| hypothetical protein DACRYDRAFT_113532 [Dacryopinax sp. DJM-731
           SS1]
          Length = 778

 Score =  458 bits (1178), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 260/606 (42%), Positives = 356/606 (58%), Gaps = 28/606 (4%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CDS+L    R + LV  +T+ EK     + + GVPRLGLP Y WWSE LHGV++  PG  
Sbjct: 42  CDSALDPLTRARALVGMLTMAEKFNNTVNASPGVPRLGLPPYNWWSEGLHGVAS-SPGVT 100

Query: 115 FDDV---IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
           F         ATSFP  IL  A+F+++L   I   +STEARA  N   +GL +W+PNIN 
Sbjct: 101 FAPAGQNFSYATSFPEPILMGAAFDDNLIYDIATIISTEARAFNNFNHSGLDFWTPNINP 160

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
            RDPRWGR  ETPGEDPF +  Y    V GLQ   G +       +  K+ + CKHYA Y
Sbjct: 161 VRDPRWGRSLETPGEDPFHLASYVAKLVTGLQ-FGGDD------PKYQKLVATCKHYAGY 213

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
           D++NW G  RY FDA ++ QD+ E FL PF+ C ++ + +SVMCSYN VNGIPSCA+  L
Sbjct: 214 DLENWGGYARYGFDAVISNQDLVEYFLPPFQTCARDVNVTSVMCSYNAVNGIPSCANDYL 273

Query: 292 LNQTVRGEWDLH--------GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           L   +R  W            Y+ +DCD++  +   H +   + E AVA +LKAG DLDC
Sbjct: 274 LQSLLRTYWGWEPDSESLNAHYVTSDCDAVSNIYYPHNYTI-TPEQAVAVSLKAGTDLDC 332

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDEN 401
           G +Y  +  ++ +QG   +TDID++L   Y  L  LG+FD +    Y      +I +D  
Sbjct: 333 GTFYAEWLPSSYEQGLFHQTDIDRALIRSYAALFLLGYFDPAEGQIYRQYNWANINTDYA 392

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
            +LA  AA EGI LLKN  + LPL S  +  +A++GP ANAT  M GNY GI     SP+
Sbjct: 393 QQLAYTAAWEGITLLKNIDDMLPLPS-TMTNIALIGPWANATTQMQGNYQGIAPFLHSPL 451

Query: 462 AGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
                   NVTY  G  ++   S     AA  AA+TAD T+ + G+D++VEAE++DR ++
Sbjct: 452 YALQQRGINVTYVLGT-NITSNSTAGFAAALAAAQTADLTLYIGGIDITVEAEAMDRVNI 510

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
             PG Q  LI Q+A V+   +I+  M  G +D      N  +  +LW GYPG++GG A+ 
Sbjct: 511 TWPGNQLDLIAQLANVSTH-LIVYQMGGGQIDDTVLLENPKVHGLLWGGYPGQDGGTAMI 569

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
           D+++G   P GRLP++ Y  +++  +P+T M L P  +LG PGRTYK+Y+G  + PFGYG
Sbjct: 570 DILYGSRAPAGRLPLSQYPANFINEVPMTDMRLHP--ALGTPGRTYKWYSGDLVLPFGYG 627

Query: 641 LSYTQF 646
           L YT F
Sbjct: 628 LHYTTF 633


>gi|83774566|dbj|BAE64689.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 822

 Score =  457 bits (1176), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 273/652 (41%), Positives = 378/652 (57%), Gaps = 43/652 (6%)

Query: 34  VFVCDPGRFSKLGLQ---MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
           V +    + S +  Q   + S   CD+SL  + RV  LV  +TL+EK+  L D + G  R
Sbjct: 56  VTILTAAKLSTIACQTQPLCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTR 115

Query: 91  LGLPQYEWWSEALHGVSNVGPGTHFDDVIPG---ATSFPTVILTTASFNESLWKKIGQAV 147
           LGLP YEWWSEA HGV +  PG  F         ATSFP  ILT ASF+++L +KI + +
Sbjct: 116 LGLPSYEWWSEATHGVGS-APGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVI 174

Query: 148 STEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG 207
             E RA  N G +G  +W+PNIN  RDPRWGR  ETPGEDP V   Y  N+V GLQ  + 
Sbjct: 175 GREGRAFGNNGFSGFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDP 234

Query: 208 HENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKE 267
                       +V + CKHYA YD++      RY  +   T+QD+ + FL PF+ CV++
Sbjct: 235 KNK---------QVIATCKHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRD 281

Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLA 324
            D  S+MCSYN V+GIP+CA+  LL++ +R  W+ +    Y+V+DC ++  +   H F  
Sbjct: 282 TDVGSIMCSYNSVSGIPACANEYLLSEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNF-T 340

Query: 325 DSKEDAVAQTLKAGLDLDCGQYYTNFTGN-AVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
           D++E A +  L AG+DL+CG  Y     + A  Q  VK   +D+SL  LY+ L  +GFFD
Sbjct: 341 DTEEAAASVALNAGVDLECGSSYLKLNESLAANQTSVKV--MDQSLARLYSALFTVGFFD 398

Query: 384 GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSA-KVKTVAVVGPHANA 442
           G  +Y  L   D+ + +   LA EAA EG+ LLKND + LPL+S  K K+VAV+GP ANA
Sbjct: 399 GG-KYDKLDFSDVSTPDAQALAYEAAVEGMTLLKND-DLLPLDSPHKYKSVAVIGPFANA 456

Query: 443 TVAMIGNYAGIPCRYMSPIAGF-SGYANVTYKTGCDDVACKSNNSIF-AASEAAKTADAT 500
           T  M G+Y+G     +SP+  F      V Y  G        N S F  A  AA  +D  
Sbjct: 457 TTQMQGDYSGDAPYLISPLEAFGDSRWKVNYALGT--AMNNQNTSGFEEALAAANKSDLI 514

Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
           I L G+D S+E+E+LDR  L  PG Q  LI  +++++K P+++V    G VD +    N 
Sbjct: 515 IYLGGIDNSLESETLDRTSLTWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSDILKNK 573

Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
           +I+A++WAGYP + GG A+ DV+ GK +P GRLP+T Y   Y   + +  + LRP DS  
Sbjct: 574 DIQALVWAGYPSQSGGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDINLRPTDS-- 631

Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI--QVNLNKL-QHCRN 669
           YPGRTYK+Y G  + PFGYGL YT+F ++   + KT+  + N+  L   CRN
Sbjct: 632 YPGRTYKWYTGKPVLPFGYGLHYTKFMFD---WEKTLNREYNIQDLVASCRN 680


>gi|336365124|gb|EGN93476.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
           lacrymans S7.3]
          Length = 732

 Score =  457 bits (1176), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 266/625 (42%), Positives = 369/625 (59%), Gaps = 27/625 (4%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD+SL    R   +V   T+DE +      + GVPRLGLP Y+WWSE LHGV++  PG +
Sbjct: 22  CDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLHGVAD-SPGVN 80

Query: 115 FD--DVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           F        ATSFP  I+  A+F++ L K +G  V  E R+  N GRAGL +W+PNIN  
Sbjct: 81  FSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRAGLDFWTPNINPF 140

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCCKHYAAY 231
           +DPRWGR  ETPGEDP+ + +Y  N V+GLQ          L+ +P  +V S CKH+AAY
Sbjct: 141 KDPRWGRGQETPGEDPYHLAQYVYNLVQGLQG--------GLDPKPYYQVISTCKHFAAY 192

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
           D+++W G  RY FDA VT QD+ E +L  F+ C ++    + MCSYN VNGIPSCA+  L
Sbjct: 193 DLEDWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAMCSYNAVNGIPSCANTYL 252

Query: 292 LNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
           L   +R  W      ++ +DCD++  + D H +   + E+AVA  LKAG D+DCG +Y+ 
Sbjct: 253 LQSILRDFWGFAEDRWVTSDCDAVDNIYDPHNY-TKTPEEAVADALKAGTDIDCGTFYSE 311

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS--PQYVSLGKQDICSDENIELAAE 407
           +   A  Q  + ET++ ++L   Y  L+RLG+FD +    Y      ++ + +  +LA +
Sbjct: 312 YLPGAYNQSLITETELRQALIRQYASLVRLGYFDPTDIQPYRQYNWNNVDTPQAQQLAYQ 371

Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG--FS 465
           AA EGIVLLKND  TLPL S+ +K +A++GP  NAT  M GNY G+    +SP+ G   +
Sbjct: 372 AAAEGIVLLKND-GTLPL-SSDIKNIALIGPWGNATGEMQGNYYGVAPYLISPLMGAVAT 429

Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
           GY NVTY  G  ++     +   AA  AA+ AD  I   G+D +VE+E  DR  +  PG 
Sbjct: 430 GY-NVTYVFGT-NITSNDTSGFAAAIAAAQGADVVIYAGGIDETVESEGNDRNYITWPGN 487

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q  L+ ++A V K P+++V    G VD    + N+ + A+LWAGYPG+ GG A+ D++ G
Sbjct: 488 QLDLVGELAAVGK-PLVVVQFGGGQVDDTSLKANSTVNALLWAGYPGQSGGSALFDIISG 546

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
           K  P GRLP+T Y  DYV  +P+T M LRP  +   PGRTYK+Y G  +Y FGYGL YT 
Sbjct: 547 KVAPAGRLPVTQYPADYVYEIPMTDMDLRP--NATSPGRTYKWYTGTPIYDFGYGLHYTT 604

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNL 670
           F Y       +   N+  L    NL
Sbjct: 605 FSYKWAK-APSSTYNIQTLVQSGNL 628


>gi|336377735|gb|EGO18896.1| glycoside hydrolase family 3 protein [Serpula lacrymans var.
           lacrymans S7.9]
          Length = 766

 Score =  457 bits (1175), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 266/625 (42%), Positives = 369/625 (59%), Gaps = 27/625 (4%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD+SL    R   +V   T+DE +      + GVPRLGLP Y+WWSE LHGV++  PG +
Sbjct: 37  CDTSLDPISRATAVVDLFTIDELINNTVSTSPGVPRLGLPPYQWWSEGLHGVAD-SPGVN 95

Query: 115 FDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           F        ATSFP  I+  A+F++ L K +G  V  E R+  N GRAGL +W+PNIN  
Sbjct: 96  FSASGEFSYATSFPQPIIMGAAFDDELIKSVGAIVGMEGRSFNNYGRAGLDFWTPNINPF 155

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCCKHYAAY 231
           +DPRWGR  ETPGEDP+ + +Y  N V+GLQ          L+ +P  +V S CKH+AAY
Sbjct: 156 KDPRWGRGQETPGEDPYHLAQYVYNLVQGLQG--------GLDPKPYYQVISTCKHFAAY 207

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
           D+++W G  RY FDA VT QD+ E +L  F+ C ++    + MCSYN VNGIPSCA+  L
Sbjct: 208 DLEDWDGNYRYGFDAIVTTQDLSEYYLPSFQSCYRDAKVGAAMCSYNAVNGIPSCANTYL 267

Query: 292 LNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
           L   +R  W      ++ +DCD++  + D H +   + E+AVA  LKAG D+DCG +Y+ 
Sbjct: 268 LQSILRDFWGFAEDRWVTSDCDAVDNIYDPHNY-TKTPEEAVADALKAGTDIDCGTFYSE 326

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS--PQYVSLGKQDICSDENIELAAE 407
           +   A  Q  + ET++ ++L   Y  L+RLG+FD +    Y      ++ + +  +LA +
Sbjct: 327 YLPGAYNQSLITETELRQALIRQYASLVRLGYFDPTDIQPYRQYNWNNVDTPQAQQLAYQ 386

Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG--FS 465
           AA EGIVLLKND  TLPL S+ +K +A++GP  NAT  M GNY G+    +SP+ G   +
Sbjct: 387 AAAEGIVLLKND-GTLPL-SSDIKNIALIGPWGNATGEMQGNYYGVAPYLISPLMGAVAT 444

Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
           GY NVTY  G  ++     +   AA  AA+ AD  I   G+D +VE+E  DR  +  PG 
Sbjct: 445 GY-NVTYVFGT-NITSNDTSGFAAAIAAAQGADVVIYAGGIDETVESEGNDRNYITWPGN 502

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q  L+ ++A V K P+++V    G VD    + N+ + A+LWAGYPG+ GG A+ D++ G
Sbjct: 503 QLDLVGELAAVGK-PLVVVQFGGGQVDDTSLKANSTVNALLWAGYPGQSGGSALFDIISG 561

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
           K  P GRLP+T Y  DYV  +P+T M LRP  +   PGRTYK+Y G  +Y FGYGL YT 
Sbjct: 562 KVAPAGRLPVTQYPADYVYEIPMTDMDLRP--NATSPGRTYKWYTGTPIYDFGYGLHYTT 619

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNL 670
           F Y       +   N+  L    NL
Sbjct: 620 FSYKWAK-APSSTYNIQTLVQSGNL 643


>gi|302683060|ref|XP_003031211.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
 gi|300104903|gb|EFI96308.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
          Length = 761

 Score =  456 bits (1174), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 270/612 (44%), Positives = 366/612 (59%), Gaps = 31/612 (5%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           ++S   CD+SL +  R + LV  +T+ E +      A GVPRLGLP Y WW+EALHGV+ 
Sbjct: 29  LASNAVCDTSLGHVERARALVEELTVAEMINNTVHTAPGVPRLGLPPYNWWNEALHGVA- 87

Query: 109 VGPGTHFDDVIPG-----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT 163
             PG  F    PG     ATSFP  I   ++F+++L   +G   STEARA  N G AGL 
Sbjct: 88  ASPGVVF--TSPGEEFSSATSFPMPINMGSAFDDALMLAVGNVTSTEARAFNNAGLAGLD 145

Query: 164 YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
           YW+PNIN  +DPRWGR  ETPGEDP    RY    V GLQ          ++   LKV++
Sbjct: 146 YWTPNINPFKDPRWGRGAETPGEDPLHAARYVRTLVEGLQ--------GGIDPPSLKVAA 197

Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
            CKH+AAYD+++W GV RY FDA VT QD+ E +  PF+ CV++  A+SVMCSYN VNG+
Sbjct: 198 DCKHWAAYDLEDWGGVARYAFDAVVTPQDLAEYYSPPFKSCVRDARAASVMCSYNAVNGV 257

Query: 284 PSCADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL 341
           P+CA P LL   +R  W L    ++ +DCD++  + D H +  D   +  A +LKAG DL
Sbjct: 258 PACASPYLLKTVLRDAWGLAEDRWVTSDCDAVGNVYDPHGYTEDFV-NGSAVSLKAGSDL 316

Query: 342 DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICS 398
           DCG  Y+ +   A  +G + E D+  +L  LY  L+ LG+FD +P+   Y  +   D+ +
Sbjct: 317 DCGTTYSQYLPEAYDRGLIDEDDLKAALTRLYASLVWLGYFD-APEDQPYRQISWADVNT 375

Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT-VAMIGNYAGIPCRY 457
                LA  AA E  VLLKND  TLPL  + + ++A++GP ANA+ V + GNY GIP   
Sbjct: 376 PAAQALAYTAAIESFVLLKND-GTLPLTDSSL-SIALIGPMANASAVQLQGNYNGIPPFA 433

Query: 458 MSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
           ++P+ GF  +G+ NVTY  G + V     + I  A  AA+ AD  I + G+D +VE E+ 
Sbjct: 434 IAPLQGFLDAGF-NVTYVLGTN-VTGNDADDIDGAVAAAEAADVVIYVGGIDSTVEEEAK 491

Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
           DR ++  P  Q  L++ + E  K P+++V M  G +D    + +  + AILWAGYPG+ G
Sbjct: 492 DRTEISWPDNQLALLSALEEAGK-PLVVVQMGGGQLDDTPLKESDAVNAILWAGYPGQSG 550

Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLY 635
           G AIAD V GK  P GRL IT Y   YV  + +T M LRP +S G PGRTYK+Y G  +Y
Sbjct: 551 GTAIADTVMGKVAPAGRLSITQYPASYVDAVAMTDMTLRPDNSTGNPGRTYKWYTGTPVY 610

Query: 636 PFGYGLSYTQFK 647
           P+GYGL YT F 
Sbjct: 611 PYGYGLHYTNFS 622


>gi|391865040|gb|EIT74331.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
          Length = 822

 Score =  456 bits (1173), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 269/651 (41%), Positives = 378/651 (58%), Gaps = 41/651 (6%)

Query: 34  VFVCDPGRFSKLGLQ---MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
           V +    + S +  Q   + S   CD+SL  + RV  LV  +TL+EK+  L D + G  R
Sbjct: 56  VTILTAAKLSTIACQTQPLCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTR 115

Query: 91  LGLPQYEWWSEALHGVSNVGPGTHFDDVIPG---ATSFPTVILTTASFNESLWKKIGQAV 147
           LGLP YEWWSEA HGV +  PG  F         ATSFP  ILT ASF+++L +KI + +
Sbjct: 116 LGLPSYEWWSEATHGVGS-APGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVI 174

Query: 148 STEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG 207
             E RA  N G +G  +W+PNIN  RDPRWGR  ETPGEDP V   Y  N+V GLQ  + 
Sbjct: 175 GREGRAFGNNGFSGFDFWAPNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDP 234

Query: 208 HENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKE 267
                       +V + CKHYA YD++      RY  +   T+QD+ + FL PF+ CV++
Sbjct: 235 KNK---------QVIATCKHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRD 281

Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLA 324
            D  S+MCSYN V+GIP+CA+  LL++ +R  W+ +    Y+V+DC ++  +   H F  
Sbjct: 282 TDVGSIMCSYNSVSGIPACANEYLLDEVLRKHWNFNSDYYYVVSDCGAVTDIWQYHNF-T 340

Query: 325 DSKEDAVAQTLKAGLDLDCGQYYTNFTGN-AVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
           D++E A +  L AG+DL+CG  Y     + A  Q  VK   +D+SL  LY+ L  +GFFD
Sbjct: 341 DTEEAAASVALNAGVDLECGSSYLKLNESLAANQTSVKV--MDRSLARLYSALFTVGFFD 398

Query: 384 GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLN-SAKVKTVAVVGPHANA 442
           G  +Y  L   D+ + +   LA EAA EG+ LLKND + LPL+   K K+VAV+GP ANA
Sbjct: 399 GG-KYDKLDFSDVSTPDAQALAYEAAVEGMTLLKND-DLLPLDFPHKYKSVAVIGPFANA 456

Query: 443 TVAMIGNYAGIPCRYMSPIAGF-SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATI 501
           T  M G+Y+G     +SP+  F      V Y  G   +  ++ +    A  AA  +D  I
Sbjct: 457 TTQMQGDYSGDAPYLISPLEAFGDSRWKVNYALGT-AINNQNTSGFEEALAAANKSDLII 515

Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
            L G+D S+E+E+LDR  L  PG Q  LI  +++++K P+++V    G VD +    N +
Sbjct: 516 YLGGIDNSLESETLDRTSLAWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSAILKNKD 574

Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
           I+A++WAGYP + GG A+ DV+ GK +P GRLP+T Y   Y   + +  + LRP DS  Y
Sbjct: 575 IQALVWAGYPSQSGGTALLDVLVGKRSPAGRLPVTQYPASYADQVNIFDINLRPTDS--Y 632

Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI--QVNLNKL-QHCRN 669
           PGRTYK+Y G  + PFGYGL YT+F ++   + KT+  + N+  L   CRN
Sbjct: 633 PGRTYKWYTGKPVLPFGYGLHYTKFMFD---WEKTLNREYNIQDLVASCRN 680


>gi|317156541|ref|XP_001825822.2| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
          Length = 882

 Score =  456 bits (1173), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 270/634 (42%), Positives = 372/634 (58%), Gaps = 40/634 (6%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           + S   CD+SL  + RV  LV  +TL+EK+  L D + G  RLGLP YEWWSEA HGV +
Sbjct: 134 LCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVGS 193

Query: 109 VGPGTHFDDVIPG---ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
             PG  F         ATSFP  ILT ASF+++L +KI + +  E RA  N G +G  +W
Sbjct: 194 -APGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRAFGNNGFSGFDFW 252

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN  RDPRWGR  ETPGEDP V   Y  N+V GLQ  +             +V + C
Sbjct: 253 APNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKNK---------QVIATC 303

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA YD++      RY  +   T+QD+ + FL PF+ CV++ D  S+MCSYN V+GIP+
Sbjct: 304 KHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPA 359

Query: 286 CADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           CA+  LL++ +R  W+ +    Y+V+DC ++  +   H F  D++E A +  L AG+DL+
Sbjct: 360 CANEYLLSEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNF-TDTEEAAASVALNAGVDLE 418

Query: 343 CGQYYTNFTGN-AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDEN 401
           CG  Y     + A  Q  VK   +D+SL  LY+ L  +GFFDG  +Y  L   D+ + + 
Sbjct: 419 CGSSYLKLNESLAANQTSVKV--MDQSLARLYSALFTVGFFDGG-KYDKLDFSDVSTPDA 475

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSA-KVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
             LA EAA EG+ LLKND + LPL+S  K K+VAV+GP ANAT  M G+Y+G     +SP
Sbjct: 476 QALAYEAAVEGMTLLKND-DLLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISP 534

Query: 461 IAGFS-GYANVTYKTGCDDVACKSNNSIF-AASEAAKTADATIILAGLDLSVEAESLDRE 518
           +  F      V Y  G        N S F  A  AA  +D  I L G+D S+E+E+LDR 
Sbjct: 535 LEAFGDSRWKVNYALGT--AMNNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRT 592

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
            L  PG Q  LI  +++++K P+++V    G VD +    N +I+A++WAGYP + GG A
Sbjct: 593 SLTWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSDILKNKDIQALVWAGYPSQSGGTA 651

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           + DV+ GK +P GRLP+T Y   Y   + +  + LRP DS  YPGRTYK+Y G  + PFG
Sbjct: 652 LLDVLVGKRSPAGRLPVTQYPASYADQVNIFDINLRPTDS--YPGRTYKWYTGKPVLPFG 709

Query: 639 YGLSYTQFKYNLLSFTKTI--QVNLNKL-QHCRN 669
           YGL YT+F ++   + KT+  + N+  L   CRN
Sbjct: 710 YGLHYTKFMFD---WEKTLNREYNIQDLVASCRN 740


>gi|451992719|gb|EMD85198.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
           C5]
          Length = 781

 Score =  455 bits (1171), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 287/742 (38%), Positives = 403/742 (54%), Gaps = 55/742 (7%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
            CD S     R K LV+  TL+EK+    + A GV RLG+P Y+WW+E LHG++  GP T
Sbjct: 36  ICDPSASTLARAKSLVALYTLEEKINATSNSAPGVARLGVPPYQWWNEGLHGIA--GPFT 93

Query: 114 HFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
            F        +TSFP  IL  A+F++ L  ++ + +STEARA  N  R GL +W+PNIN 
Sbjct: 94  SFAKQGDYSYSTSFPQPILMGAAFDDDLITEVAKVISTEARAFNNANRTGLDFWTPNINP 153

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
            RDPRWGR  ETPGED + +  Y    + GLQ      NATD   R   V + CKHYA Y
Sbjct: 154 FRDPRWGRGQETPGEDSYHLSSYVKALIHGLQG-----NATDPYRR---VVATCKHYAGY 205

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
           D++NW G  RY  D ++++QD+ E +L PFE CV + +  + MCSYN VNG P CADP L
Sbjct: 206 DIENWNGNLRYQNDVQISQQDLVEYYLAPFEACV-QANVGAFMCSYNAVNGAPPCADPYL 264

Query: 292 LNQTVRGEW----DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
           L   +R  W    D H ++ +DCD+IQ +   H++ + ++E A A +L AG DLDCG Y 
Sbjct: 265 LQTVLREHWGWSSDDH-WVTSDCDAIQNVYLPHQW-SSTREGAAADSLNAGTDLDCGTYL 322

Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIEL 404
                 AV+QG   ET +DK+L   Y+ L++LG+FD +P+   Y  LG   + +  +  L
Sbjct: 323 QTHLPGAVKQGLTDETTLDKALIRQYSSLIKLGYFD-APENQPYRQLGFDAVATSASQAL 381

Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF 464
           A +AA EGIVLLKND   LP+N    K V + G  ANAT  + GNY G+     SP+   
Sbjct: 382 ALKAAEEGIVLLKND-GVLPINLGS-KQVGIYGDWANATSQLQGNYFGVAKFLTSPLMAL 439

Query: 465 SGYA-NVTYK----TGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRED 519
                +V Y      G  D    + +S+   S    T+D  I + G+D  VE+E  DR  
Sbjct: 440 QNLGVDVKYAGNLPGGQGDPTTGAWSSL---SGVITTSDVHIWVGGIDNGVESEDRDRSW 496

Query: 520 LWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAI 579
           L L G Q  +I Q+A+  K PVI+VIM  G +D +    N  I A+LWAGYPG++GG AI
Sbjct: 497 LTLTGGQLDVIGQLADTGK-PVIVVIMGGGQIDTSPLIRNPKISAVLWAGYPGQDGGTAI 555

Query: 580 ADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGY 639
            +++ GK  P GRLP T Y   YV  +P+T M +RP D    PGRTYK+Y G  ++ FGY
Sbjct: 556 VNILTGKAAPAGRLPQTQYPSKYVSEVPMTDMAMRPSDK--NPGRTYKWYTGEPIFEFGY 613

Query: 640 GLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP--GVLVNDLRCDDYFEF 697
           GL YT F  ++ +  K      + ++ C     ++     RCP  G+ V+     +  + 
Sbjct: 614 GLHYTNFSASITNQPKQSYAISDLVKGCN----STGGFLERCPFTGITVS---VQNTGKI 666

Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
             D+  +G   GS     Y K          K ++ + R+F  A  +     +     SL
Sbjct: 667 SSDYVTLGFLTGSFGPKPYPK----------KSLVAYDRLFNIAAGSSSTATLNLTLASL 716

Query: 758 NIVDYAANTLLPAGEHTIFVGN 779
             VD + N +L  G++ + + N
Sbjct: 717 ARVDESGNKVLYPGDYELQIDN 738


>gi|302683012|ref|XP_003031187.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
 gi|300104879|gb|EFI96284.1| glycoside hydrolase family 3 protein [Schizophyllum commune H4-8]
          Length = 752

 Score =  455 bits (1170), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 295/751 (39%), Positives = 415/751 (55%), Gaps = 53/751 (7%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           ++S   CD+SL +  R + LV   T+ E +    + A GVPRLGLP YEWW+EALHGV  
Sbjct: 30  LASNPVCDASLGHVERARALVEEFTVPEMINNTVNAAFGVPRLGLPPYEWWNEALHGV-G 88

Query: 109 VGPGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSP 167
           + PG  F +  P  ATSFP  I   ++F+++L   +G  +STEARA  N GRAGL YW+P
Sbjct: 89  LSPGVVFFEPEPAVATSFPMPINMGSAFDDALMLAMGDVISTEARAFSNAGRAGLDYWTP 148

Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
           NIN  +DPRWGR  ETPGEDP     +A  YVR L  VEG +   D  S  LKV++ CKH
Sbjct: 149 NINPFKDPRWGRGAETPGEDPL----HAARYVRSL--VEGLQGGIDPPS--LKVAAACKH 200

Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
           +AAYD++NW GV RY FDA VT QD+ E +  PF  CV++  A+S MCSYN VNG+P+CA
Sbjct: 201 WAAYDLENWGGVTRYAFDAVVTPQDLAEYYAPPFRSCVRDARAASAMCSYNAVNGVPACA 260

Query: 288 DPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
            P LL   +R  W L    ++ +DC ++  + D H +  D   +A   +LKAG DL+CG 
Sbjct: 261 SPYLLKTVLRDAWGLAEDRWVTSDCGAVGNVYDPHGYTED-LVNASTVSLKAGTDLNCGT 319

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENI 402
            YT +   A  +G + E D+  +L  LY  L+ LG+FD +P+   Y  +   D+ + E  
Sbjct: 320 NYTQYLPEAYDRGLIDEDDLKAALTRLYASLVWLGYFD-APEDQPYRQITWADVNTPEAQ 378

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT-VAMIGNYAGIPCRYMSPI 461
            LA  AA +  VLLKND  TLPL  + + ++A++GP ANA+ + M+GNY GIP   ++P+
Sbjct: 379 ALAYTAAIKSFVLLKND-GTLPLTDSTL-SLALIGPMANASALQMLGNYFGIPPFVIAPL 436

Query: 462 AGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRED 519
            GF  +G+ NVTY  G  +V      S  AA  AA+ AD  I + G+D ++E E  DR +
Sbjct: 437 QGFLDAGF-NVTYVLGT-NVTGNDAGSFDAAVAAAEAADVVIYVGGIDNTLEMEEKDRTE 494

Query: 520 LWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAI 579
           +  P  Q  L++ +  V K P+++V M  G +D    + +  + AILWAGYPG+ GG AI
Sbjct: 495 ISWPDNQLALLSALEGVGK-PLVVVQMGGGQLDDTPLKESDAVNAILWAGYPGQSGGTAI 553

Query: 580 ADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGY 639
           AD V GK  P GRL        YV  + +T M LRP ++ G PGRTYK+Y G  +YP+GY
Sbjct: 554 ADTVTGKVAPAGRL--------YVDEVAMTDMTLRPDNATGNPGRTYKWYTGTPVYPYGY 605

Query: 640 GLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL-NYTSDASKTRCPGVLVNDLRCDDYFEFK 698
           GL YT       S         +  + C ++ + T +AS          DL   D   F+
Sbjct: 606 GLHYTNISVAWAS---------DAPEACYSIQDLTGEASG-------FVDLAPLD--TFR 647

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNACKSL 757
           V   N G      V +++    A  A   IK+++ + R   V+ G +  ++         
Sbjct: 648 VTVTNEGDIASDFVALLFVSTQAGPAPAPIKEMVAYARASDVQPGNSTEVELEVTLGALA 707

Query: 758 NIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
              +    +L P      F  +G +S    L
Sbjct: 708 RTDESGDASLYPGKYELTFDYDGALSLSFEL 738


>gi|238492365|ref|XP_002377419.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
 gi|220695913|gb|EED52255.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
          Length = 775

 Score =  454 bits (1169), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 266/633 (42%), Positives = 371/633 (58%), Gaps = 38/633 (6%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           + S   CD+SL  + RV  LV  +TL+EK+  L D + G  RLGLP YEWWSEA HGV +
Sbjct: 27  LCSHPVCDTSLSIAERVDSLVKSLTLEEKILNLVDASAGSTRLGLPSYEWWSEATHGVGS 86

Query: 109 VGPGTHFDDVIPG---ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
             PG  F         ATSFP  ILT ASF+++L +KI + +  E R   N G +G  +W
Sbjct: 87  -APGVQFTSKPANFSYATSFPAPILTAASFDDTLIRKIAEVIGREGRVFGNNGFSGFDFW 145

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN  RDPRWGR  ETPGEDP V   Y  N+V GLQ  +             +V + C
Sbjct: 146 APNINGFRDPRWGRGQETPGEDPLVAQNYIRNFVPGLQGDDPKNK---------QVIATC 196

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA YD++      RY  +   T+QD+ E FL PF+ CV++ D  S+MCSYN V+GIP+
Sbjct: 197 KHYAVYDLE----TGRYGNNYNPTQQDLSEYFLAPFKTCVRDTDVGSIMCSYNSVSGIPA 252

Query: 286 CADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           CA+  LL++ +R  W+ +    Y+V+DC ++  +   H F  D++E A +  L AG+DL+
Sbjct: 253 CANEYLLDEVLRKHWNFNSDYHYVVSDCGAVTDIWQYHNF-TDTEEAAASVALNAGVDLE 311

Query: 343 CGQYYTNFTGN-AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDEN 401
           CG  Y     + A  Q  VK   +D+SL  LY+ L  +GFFDG  +Y  L   D+ + + 
Sbjct: 312 CGSSYLKLNESLAANQTSVKV--MDQSLARLYSALFTVGFFDGG-KYDKLDFSDVSTPDA 368

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSA-KVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
             LA EAA EG+ LLKND + LPL+S  K K+VAV+GP ANAT  M G+Y+G     +SP
Sbjct: 369 QALAYEAAVEGMTLLKND-DLLPLDSPHKYKSVAVIGPFANATTQMQGDYSGDAPYLISP 427

Query: 461 IAGF-SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRED 519
           +  F      V Y  G   +  ++ +    A  AA  +D  I L G+D S+E+E+LDR  
Sbjct: 428 LEAFGDSRWKVNYALGT-AINNQNTSGFEEALAAANKSDLIIYLGGIDNSLESETLDRTS 486

Query: 520 LWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAI 579
           L  PG Q  LI  +++++K P+++V    G VD +    N +I+A++WAGYP + GG A+
Sbjct: 487 LAWPGNQLDLITSLSKLSK-PLVVVQFGGGQVDDSAILKNKDIQALVWAGYPSQSGGTAL 545

Query: 580 ADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGY 639
            DV+ GK +P GRLP+T Y   Y   + +  + LRP D   YPGRTYK+Y G  + PFGY
Sbjct: 546 LDVLVGKRSPAGRLPVTQYPASYADQVNIFDINLRPTDL--YPGRTYKWYTGKPVLPFGY 603

Query: 640 GLSYTQFKYNLLSFTKTI--QVNLNKL-QHCRN 669
           GL YT+F ++   + KT+  + N+  L   CRN
Sbjct: 604 GLHYTKFMFD---WEKTLNREYNIQDLVASCRN 633


>gi|395334835|gb|EJF67211.1| beta-xylosidase [Dichomitus squalens LYAD-421 SS1]
          Length = 774

 Score =  454 bits (1168), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 290/712 (40%), Positives = 401/712 (56%), Gaps = 43/712 (6%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +S+   CD+S     R   L+   T +E      + + GVPRLGLP Y WWSE LHGV+ 
Sbjct: 35  LSNNTVCDTSKDPITRATALIDLWTDEELTNNTVNASPGVPRLGLPAYNWWSEGLHGVAQ 94

Query: 109 VGPGTHF--DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
             PG  F        ATSFP  IL  A+F++ L + +   VSTE RA  N+GRAGL YW+
Sbjct: 95  -SPGVTFAPSGNFSYATSFPQPILMGAAFDDPLIQAVASVVSTEGRAFNNVGRAGLDYWT 153

Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCC 225
           PNIN  +DPRWGR  ETPGEDPF +  Y  N + GLQ          L+  P  KV + C
Sbjct: 154 PNINPFKDPRWGRGQETPGEDPFHLQGYVYNLILGLQG--------GLDPTPYFKVVADC 205

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+AAYD+DNW+G  RY F+A VT+QD+ E +L  F+ CV++   +SVMCSYN VNGIPS
Sbjct: 206 KHFAAYDMDNWEGNVRYGFNAVVTQQDLSEYYLPSFQTCVRDAKVASVMCSYNAVNGIPS 265

Query: 286 CADPKLLNQTVRGEW--DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           CA+  LL   +R  W  D   ++ +DCD++Q +   H +  D+   A A  L AG D+DC
Sbjct: 266 CANSFLLQDILRDYWGFDDTRWVTSDCDAVQNIYTPHNY-TDNPAQAAADALLAGTDIDC 324

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDICSDEN 401
           G + + +  +A+ QG V  TD+ ++    Y  L+RLG+FD   S  Y  LG  D+ + E 
Sbjct: 325 GTFSSTYLPDALSQGLVNATDLKRAAIRQYASLVRLGYFDPPESQPYRQLGWSDVNTPEA 384

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
            +LA  AA EG+VLLKND  TLPL S  V+ +A++GP ANAT  M GNYAGI    +SP+
Sbjct: 385 QQLAHTAAVEGMVLLKND-GTLPL-SKHVRKLALIGPWANATTLMQGNYAGIAPYLISPL 442

Query: 462 AGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA-GLDLSVEAESLDRE 518
            G   +G+ +V Y  G  +V   ++ S FAA+ AA      +I A GLD +VE E +DR 
Sbjct: 443 LGAQQAGF-DVEYVFGT-NVTTTNDTSGFAAAVAAAKRADAVIFAGGLDETVEREEVDRL 500

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
           ++  PG Q  L+ ++A V K P+I+     G +D +  ++  ++ AI+W GYPG+ GG A
Sbjct: 501 NVTWPGNQLDLVAELASVGK-PLIVAQFGGGQLDDSALKSKRSVNAIIWGGYPGQSGGTA 559

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           + D++ GK  P GRLPIT Y  +Y   +P+T M LRP  S   PGRTYK+Y G  ++ FG
Sbjct: 560 LFDILTGKAAPAGRLPITQYPAEYANQVPMTDMTLRP--SATNPGRTYKWYTGTPVFEFG 617

Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD---ASKTRCPGVLVNDLRCDDYF 695
           +GL YT F +   S         N   +    +Y+ D   AS  +    L  DL   D F
Sbjct: 618 FGLHYTTFSFAWAS---------NAHANTPAASYSIDALMASGNKSAAFL--DLAPLDTF 666

Query: 696 EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
             +V   N G      V ++++      A    KQ++ + RV   A +   I
Sbjct: 667 AVRV--TNTGKMTSDYVALLFASGTFGPAPHPNKQLVAYTRVHGVAPKQSTI 716


>gi|226491558|ref|NP_001146416.1| uncharacterized protein LOC100279996 [Zea mays]
 gi|223975771|gb|ACN32073.1| unknown [Zea mays]
          Length = 507

 Score =  453 bits (1165), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 231/511 (45%), Positives = 325/511 (63%), Gaps = 18/511 (3%)

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           MCSYN+VNG P+CAD  LL+  +RG+W L+GYI +DCDS+ V+ +N  +   + EDA A 
Sbjct: 1   MCSYNQVNGKPTCADKDLLSGVIRGDWKLNGYISSDCDSVDVLYNNQHY-TKTPEDAAAI 59

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVS 390
           ++KAGLDL+CG +    T  AVQ GK+ E+D+D+++      LMRLGFFDG P+   + +
Sbjct: 60  SIKAGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGN 119

Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
           LG  D+C+  N ELA EAAR+GIVLLKN    LPL++  +K++AV+GP+ANA+  MIGNY
Sbjct: 120 LGPSDVCTPSNQELAREAARQGIVLLKN-TGKLPLSATSIKSMAVIGPNANASFTMIGNY 178

Query: 451 AGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAGLDLS 509
            G PC+Y +P+ G        Y+ GC +V C  N+  + AA++AA +AD T+++ G D S
Sbjct: 179 EGTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQS 238

Query: 510 VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAG 569
           +E ESLDR  L LPG Q QL++ VA  + GP ILV+MS G  DI+FA+++  I AILW G
Sbjct: 239 IERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVG 298

Query: 570 YPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFY 629
           YPGE GG AIADV+FG  NP GRLP+TWY   + + +P+T M +RP  S GYPGRTY+FY
Sbjct: 299 YPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTK-VPMTDMRMRPDPSTGYPGRTYRFY 357

Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
            G T+Y FG GLSYT F ++L+S  K + + L +   C            +CP V     
Sbjct: 358 TGDTVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACLT---------EQCPSVEAEGA 408

Query: 690 RCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
            C+   F+  +  +N G   G   V ++S PPA +     K ++GF++V +  G+   + 
Sbjct: 409 HCEGLAFDVHLRVRNAGERSGGHTVFLFSSPPA-VHNAPAKHLLGFEKVSLEPGQAGVVA 467

Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           F  + CK L++VD   N  +  G HT+ VG+
Sbjct: 468 FKVDVCKDLSVVDELGNRKVALGSHTLHVGD 498


>gi|332982588|ref|YP_004464029.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
 gi|332700266|gb|AEE97207.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
           50-1 BON]
          Length = 714

 Score =  452 bits (1163), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 284/749 (37%), Positives = 396/749 (52%), Gaps = 95/749 (12%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D SL +  R KDLVSRMTL EK+ Q+   A  +PRL +P Y WW+E LHGV+  G   
Sbjct: 13  YKDVSLSFEDRAKDLVSRMTLPEKISQMIYDAPAIPRLDIPAYNWWNECLHGVARAGI-- 70

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG--------RAGLTYW 165
                   AT FP  I   A+FN  L  K+ +A+S EARA ++            GLT+W
Sbjct: 71  --------ATVFPQAIAMAATFNPELIHKVAEAISDEARAKHHEAVRNGDRGIYKGLTFW 122

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPNIN+ RDPRWGR  ET GEDP++  R  V +V+GLQ           + + LKV +  
Sbjct: 123 SPNINIFRDPRWGRGHETYGEDPYLTSRMGVAFVKGLQGD---------DPKYLKVVATP 173

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA   V +     R+ FDARV+++D+ ET+L  FE CVKEG A S+M +YNR NG P 
Sbjct: 174 KHYA---VHSGPESQRHSFDARVSQKDLRETYLPAFEECVKEGKAVSIMGAYNRTNGEPC 230

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           CA   LL   +R EW   GY+V+DC +I  +  +HK    + E A A  +  G +L+CG+
Sbjct: 231 CASKTLLKDILRDEWGFDGYVVSDCGAIDDIHMHHKVTKTAAESA-ALAVNNGCELNCGK 289

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK--QDIC-SDENI 402
            Y  +   AV+QG + E  ID+++  L+T  MRLG FD  P+ V       D+  S E+ 
Sbjct: 290 TY-EYLCQAVEQGLISEETIDQAVIKLFTARMRLGMFD-PPEMVRYAHIPYDVNDSPEHR 347

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
           ELA E AR+ IVLLKND+N LPL S K+KT+AV+GP+A+    ++ NY G P +Y++P+ 
Sbjct: 348 ELALETARQSIVLLKNDENILPL-SKKLKTIAVIGPNADDLDVLLANYFGTPSKYVTPLE 406

Query: 463 GF----SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES---- 514
           G     S    V Y  GC +V   S +    A   A+ AD  I+  GL   +E E     
Sbjct: 407 GIKNKVSPDTKVLYAKGC-EVTGNSVDGFDEAVNIAEMADIVIMCLGLSPRIEGEEGDVA 465

Query: 515 -----LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAG 569
                 DR  + LPG Q QL+  +    K P++LV+++   + I +A  + ++ AI+ A 
Sbjct: 466 DSDGGGDRLHIDLPGMQEQLLETIYGTGK-PIVLVLLNGSAIAINWA--HEHVPAIIEAW 522

Query: 570 YPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFY 629
           YPGEEGG AIADV+FG +NP GRLPIT+          L  +P  P       GRTY+++
Sbjct: 523 YPGEEGGTAIADVLFGDYNPAGRLPITFVRS-------LDDLP--PFTDYNMKGRTYRYF 573

Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
               LYPFGYGLSYT FKY+ L                         S  R P       
Sbjct: 574 EKEPLYPFGYGLSYTSFKYSNLRL-----------------------SAMRLP------- 603

Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
              +  +  VD +N G   G +VV +Y           ++Q+ G Q + +  G+ + + F
Sbjct: 604 -AGNNLDINVDVENTGKLAGREVVQLYISDVEASVEVPMRQLCGIQCITLEPGQKQTVSF 662

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                + +++ DY    +L  G+  I VG
Sbjct: 663 TVEP-QHMSLFDYDGKRILEPGQFIIAVG 690


>gi|297738404|emb|CBI27605.3| unnamed protein product [Vitis vinifera]
          Length = 581

 Score =  452 bits (1163), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 224/403 (55%), Positives = 275/403 (68%), Gaps = 45/403 (11%)

Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
           DVEG EN TDLNSRPLKVSSCCKHYA YD+D+W           V+EQDM+ETF  PFE 
Sbjct: 4   DVEGTENVTDLNSRPLKVSSCCKHYATYDIDSW---------LNVSEQDMKETFFSPFE- 53

Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
                                            R EWDLHGYIV+DC  ++V+VDN  +L
Sbjct: 54  ---------------------------------RDEWDLHGYIVSDCYGLEVIVDNQNYL 80

Query: 324 ADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
            +SK DAVA+TL+AGLDL+CG YYT+    +V  GKV + ++D++LK +Y +LMR+G+FD
Sbjct: 81  NESKVDAVAKTLQAGLDLECGHYYTDALNESVLTGKVSQYELDRALKNIYVLLMRVGYFD 140

Query: 384 GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
           G P Y SLG +DIC+ ++IELA EAAR+GIVLLKND   LPL   K   + +VGPHANAT
Sbjct: 141 GIPAYESLGLKDICAADHIELAREAARQGIVLLKNDYEVLPLKPGK--KLVLVGPHANAT 198

Query: 444 VAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
             MIGNYAG+P +Y+SP+  FS   NVTY TGC D +C ++     A EAAK A+ TII 
Sbjct: 199 EVMIGNYAGLPYKYVSPLEAFSAIGNVTYATGCLDASCSNDTYFSEAKEAAKFAEVTIIF 258

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
            G DLS+EAE +DR D  LPG QT+LI QVAEV+ GPVILV++S   +DI FA+ N  I 
Sbjct: 259 VGTDLSIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILVVLSGSNIDITFAKNNPRIS 318

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           AILW G+PGE+GG AIADVVFGK+NPGGRLP+TWY  DYV  L
Sbjct: 319 AILWVGFPGEQGGHAIADVVFGKYNPGGRLPVTWYEADYVACL 361


>gi|392596548|gb|EIW85871.1| hypothetical protein CONPUDRAFT_80240 [Coniophora puteana
           RWD-64-598 SS2]
          Length = 770

 Score =  451 bits (1159), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 252/603 (41%), Positives = 359/603 (59%), Gaps = 23/603 (3%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN-VGPGT 113
           CD+SL  + R   L+   T+DE +    ++A GVPRLGLP YEWWSE LHGV+N  G   
Sbjct: 37  CDTSLNATQRAAALIDLFTVDELIVNTVNWAPGVPRLGLPAYEWWSEGLHGVANSAGVTW 96

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVAR 173
                   ATSFP  IL +A+F+++L K +G  +  E RA  N G AGL +W+PNIN  +
Sbjct: 97  SITGPFSYATSFPQPILMSAAFDDALIKAVGGVIGMEGRAFNNYGHAGLDFWTPNINPFK 156

Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCCKHYAAYD 232
           DPRWGR  ETPGEDP+ + +Y  N ++GLQ          L+  P  +V + CKH+A YD
Sbjct: 157 DPRWGRGQETPGEDPYHIAQYVYNLIQGLQG--------GLDPEPYFQVVATCKHFAGYD 208

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
           +++W    RY ++A ++ QD+ E +L  F+ C ++  A + MCSYN +NGIP+CAD  LL
Sbjct: 209 LEDWDFNYRYGYNAIISTQDLSEYYLPSFQSCYRDAFAGASMCSYNAINGIPTCADTYLL 268

Query: 293 NQTVRGEW--DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
              +RG W  D   ++  DCDS++ + D H + A  ++ A A  LKAG D+DCG +YT +
Sbjct: 269 QDILRGFWGFDQTRWVTGDCDSVEDIYDFHHYTALPQQ-AAADALKAGSDIDCGIFYTTW 327

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEA 408
              A  +  + E D+  +L   Y  L+RLG+FD + +  Y      ++ +    ELA  A
Sbjct: 328 LPLAYTESLITEQDLRAALTRQYASLVRLGYFDPASEQPYRQYNWSNVDTSYAQELAYTA 387

Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG--FSG 466
           A EGI LLKND  TLP +SA +K +A++GP   AT  M GNY G     +SP  G   +G
Sbjct: 388 AVEGITLLKND-GTLPFSSA-IKNIALIGPWTFATTQMQGNYYGNAPYLISPYQGAQLAG 445

Query: 467 YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
           Y N++Y     +V   + +   AA  AA+ ADA + + G+D +VEAE++DR D+  P +Q
Sbjct: 446 Y-NISYVLET-NVTSNTTDGYAAAFTAAQGADAIVFVGGIDNTVEAEAMDRNDITWPAFQ 503

Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
             LI ++ ++ K P+++V    G VD      N ++ A+LW GYPG+ GG+A+ D++ GK
Sbjct: 504 LWLIGELGKLGK-PLVVVQFGGGQVDDTEINANPDVNALLWGGYPGQSGGQALFDIISGK 562

Query: 587 FNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
             P GRL  T Y  DYV  +P+T+M LRP  +    PGRTYK+Y G  +Y FGYGL YT 
Sbjct: 563 VAPAGRLVSTQYPADYVNEIPMTNMNLRPDANGTTSPGRTYKWYTGTPVYEFGYGLHYTN 622

Query: 646 FKY 648
           F Y
Sbjct: 623 FTY 625


>gi|451849522|gb|EMD62825.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
          Length = 849

 Score =  447 bits (1149), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 276/726 (38%), Positives = 389/726 (53%), Gaps = 43/726 (5%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV--IPG 121
           R K LV+  TL+EK+    + A GV RLG+P Y+WW+E LHG++  GP T F        
Sbjct: 114 RAKSLVALYTLEEKINATSNSAPGVARLGIPPYQWWNEGLHGIA--GPFTSFAKQGDYSY 171

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
           +TSFP  IL  A+F+++L  ++   +STEARA  N+ R GL +W+PNIN  RDPRWGR  
Sbjct: 172 STSFPQPILMGAAFDDNLITEVANVISTEARAFNNVNRTGLDFWTPNINPFRDPRWGRGQ 231

Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
           ETPGED + +  Y    + GLQ      N TD   R   V + CKHYA YD++NW G  R
Sbjct: 232 ETPGEDSYHLSSYVKALIHGLQG-----NETDPYRR---VVATCKHYAGYDIENWNGNLR 283

Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW- 300
           Y  D ++++QD+ E +L PFE CV + +  + MCSYN VNG P CADP +L   +R  W 
Sbjct: 284 YQNDVQISQQDLVEYYLAPFEACV-QANVGAFMCSYNAVNGAPPCADPYMLQTVLREHWG 342

Query: 301 ---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQ 357
              D H ++ +DCDSIQ +   H++ + ++E A A +L AG DLDCG Y  +    AV+Q
Sbjct: 343 WSSDEH-WVTSDCDSIQNVYLPHQW-SSTREGAAADSLNAGTDLDCGTYLQSHLPGAVKQ 400

Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
           G   ET +D +L   Y+ L++LG+FD   +  Y  LG   + +  +  LA +AA EGIVL
Sbjct: 401 GLTNETTLDNALIRQYSSLIKLGYFDIPENQPYRQLGFDAVATSASQALALKAAEEGIVL 460

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKT 474
           LKND   LP+N    K V + G  ANAT  + GNY G+     SP         NV Y  
Sbjct: 461 LKND-GVLPINFGS-KNVGIYGDWANATSQLQGNYFGVAKFLTSPYMALEKLGVNVRYAG 518

Query: 475 GC-DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQV 533
                    +  S    S    T+D  I + G+D  +E+E  DR  L L G Q  +I Q+
Sbjct: 519 NLPGGQGDPTTGSWPRLSGVITTSDVHIWVGGMDNGIESEDRDRSWLTLTGSQLDVIGQL 578

Query: 534 AEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRL 593
           A+  K PVI++IM  G +D +    N  I A+LWAGYPG++GG AI +++ GK  P GRL
Sbjct: 579 ADTGK-PVIVIIMGGGQIDTSPLIKNPKISAVLWAGYPGQDGGTAIVNILTGKAAPAGRL 637

Query: 594 PITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSF 653
           P T Y   YV  +P+T M +RP +    PGRTYK+Y G  ++ FGYGL YT F  ++ + 
Sbjct: 638 PQTQYLYKYVSEVPMTDMAMRPSNK--NPGRTYKWYTGKPIFEFGYGLHYTNFSASITNQ 695

Query: 654 TKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVV 713
            K      + ++ C     ++     RCP   +N           V  QN G T    V 
Sbjct: 696 PKQSYAISDLVKGCN----STGGFLERCPFTGIN-----------VSVQNTGKTSSDYVT 740

Query: 714 IVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEH 773
           + +            K ++ + R+F  A  +     +     SL  VD + N +L  G++
Sbjct: 741 LGFLTGSFGPKPYPKKSLVAYDRLFNIAASSSSTATLNLTLASLARVDESGNKVLYPGDY 800

Query: 774 TIFVGN 779
            + + N
Sbjct: 801 ELQIDN 806


>gi|156062754|ref|XP_001597299.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980]
 gi|154696829|gb|EDN96567.1| hypothetical protein SS1G_01493 [Sclerotinia sclerotiorum 1980
           UF-70]
          Length = 758

 Score =  446 bits (1148), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 256/604 (42%), Positives = 355/604 (58%), Gaps = 31/604 (5%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD++     R   LVS  TL EK+   G+ + GVPR+GLP Y+WW+EALHG++    GTH
Sbjct: 34  CDTTADPYTRATALVSLFTLAEKINNTGNTSPGVPRIGLPAYQWWNEALHGIAY---GTH 90

Query: 115 FDDVIPG---ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
           F         ATSFP  IL  A+F+++L   +   +STEARA  N  R GL +W+PNIN 
Sbjct: 91  FAAAGSNYSYATSFPQPILMGAAFDDALIHDVASQISTEARAFSNANRYGLNFWTPNINP 150

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS-SCCKHYAA 230
            +DPRWGR  ETPGEDPF V  Y    V GLQ          L+  P K   + CKHYA 
Sbjct: 151 YKDPRWGRGQETPGEDPFHVSSYVNALVTGLQG--------GLDDLPYKKGVATCKHYAG 202

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
           YD++N  G+ RY FDA +  QD+ + +L  F+ C ++ +  S+MCSYN VNG+P+CAD  
Sbjct: 203 YDLENGGGIQRYAFDAIINSQDLRDYYLPSFQQCARDSNVQSIMCSYNAVNGVPTCADDW 262

Query: 291 LLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
           LL   +R  W   +   ++ +DCD++Q + D+H + + + E A A  L AG DLDCG ++
Sbjct: 263 LLQSLLREHWGWVEEDQWVTSDCDAVQNIWDSHNYTS-TPEQAAADALNAGTDLDCGGFW 321

Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELA 405
             + G+A  Q     + +D+SL   Y  L+RLG+FD +    Y  LG  D+ +    +LA
Sbjct: 322 PTYLGSAYNQSLYNISTLDRSLTRRYASLVRLGYFDPASIQPYRQLGWSDVSTPSAEQLA 381

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP-IAGF 464
            +AA +GIVLLKND   LPL S  +  VA++GP ANAT  M GNY G      SP IA  
Sbjct: 382 LQAAEDGIVLLKND-GILPLPS-NITNVALIGPWANATTQMQGNYYGQAPYLHSPLIAAQ 439

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
           +   +VTY  G  D+   +     AA  AAK AD  I + G+D S+EAE+ DR+ +  P 
Sbjct: 440 NAGFHVTYVQGA-DIDSTNTTEFTAAIAAAKKADVIIYIGGIDNSIEAEAKDRKTIAWPS 498

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGG-VDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
            Q  L+NQ+A ++   + L+I   G  +D +   TN  +  I+WAGYPG++GG AI +++
Sbjct: 499 SQISLVNQLANLS---IPLIISQMGTMIDSSSLLTNRGVNGIIWAGYPGQDGGTAIFNIL 555

Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
            GK  P GRLPIT Y  DYV  + + +M L P      PGRTYK++NG +++ FG+GL Y
Sbjct: 556 TGKTAPAGRLPITQYPSDYVNEVSMNNMNLHP--GANNPGRTYKWFNGTSIFDFGFGLHY 613

Query: 644 TQFK 647
           T F 
Sbjct: 614 TTFN 617


>gi|340519849|gb|EGR50086.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
          Length = 796

 Score =  445 bits (1145), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 257/610 (42%), Positives = 362/610 (59%), Gaps = 32/610 (5%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD++   + R   +V  MTL+EKV  +G  A G  RLGLP Y+W +EALHGV+    G  
Sbjct: 75  CDTTKSIAERAAAIVKPMTLNEKVANVGSSASGSARLGLPAYQWQNEALHGVAG-STGVQ 133

Query: 115 FDDVI----PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
           F   +      ATSFP  IL +A+F+++L K +  A+STEARA  N G AGL +W+PNIN
Sbjct: 134 FQSPLGANFSAATSFPMPILLSAAFDDALVKSVATAISTEARAFANYGFAGLDFWTPNIN 193

Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
             RDPRWGR  ETPGED F +  Y +  V GLQ          ++    +  S CKH+AA
Sbjct: 194 PFRDPRWGRGMETPGEDAFRIQGYVLALVDGLQG--------GIDPDFYRTLSTCKHFAA 245

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
           YD++N +  +    +   T+QDM + +L  FE CV++   +S+MC+YN V+G+P+CAD  
Sbjct: 246 YDIENGRTAN----NLSPTQQDMADYYLPMFETCVRDAKVASIMCAYNAVDGVPACADSY 301

Query: 291 LLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
           LL   +R  +       Y+V+DCD+++ + D H + A+  + A A ++ AG DLDCG  Y
Sbjct: 302 LLQDVLRDTYGFTEDFNYVVSDCDAVENVFDPHHYAANLTQ-AAAMSINAGTDLDCGSSY 360

Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAE 407
            N    +VQ G   E  +DKSL  LY+ L+++G+FD   +Y SLG  ++ + ++  LA +
Sbjct: 361 -NVLNASVQAGLTTEATLDKSLIRLYSALVKVGYFDQPAEYNSLGWGNVNTTQSQALAHD 419

Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF-SG 466
           AA EG+ LLKND  TLPL S  +  VAV+GP AN T  M GNYAG     ++P++ F   
Sbjct: 420 AATEGMTLLKND-GTLPL-SRTLSNVAVIGPWANVTTQMQGNYAGTAPLLVNPLSVFQQK 477

Query: 467 YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
           + NV Y  G   +  +  +   AA  AA ++D  + L G+D+SVE E  DR  +  PG Q
Sbjct: 478 WRNVKYAQGT-AINSQDTSGFNAALSAASSSDVIVYLGGIDISVENEGFDRSSITWPGNQ 536

Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
             LI+Q+A + K P+++V    G +D +   +N+ + +ILWAGYPG++GG AI DV+ G 
Sbjct: 537 LNLISQLANLGK-PLVIVQFGGGQIDDSALLSNSKVNSILWAGYPGQDGGNAIFDVLTGA 595

Query: 587 FNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
             P GRLP+T Y  +YV    +  M LRP  S G PGRTY +Y G  + PFGYGL YT F
Sbjct: 596 NPPAGRLPVTQYPANYVNNNNIQDMNLRP--SNGIPGRTYAWYTGTPVLPFGYGLHYTNF 653

Query: 647 KYNLLSFTKT 656
               LSF  T
Sbjct: 654 S---LSFQST 660


>gi|343428088|emb|CBQ71612.1| related to Beta-xylosidase [Sporisorium reilianum SRZ2]
          Length = 698

 Score =  443 bits (1139), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 258/619 (41%), Positives = 360/619 (58%), Gaps = 26/619 (4%)

Query: 47  LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV 106
           L +S+   CD+SL +  R   LV++ T  E +    + A GVPRLG+PQY+WW+EALHGV
Sbjct: 27  LPLSTLPVCDTSLDFYTRATSLVAQFTTAELINNTVNHAPGVPRLGIPQYQWWTEALHGV 86

Query: 107 SNVGPGTHFDDVIPG----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGL 162
           +   PG +F+    G    ATSFP VI   A+F+++L++ +   ++ E RA  N GRAGL
Sbjct: 87  AR-SPGVNFNPDAAGEFGCATSFPQVINLGATFDDALYEAVAAHIANETRAFSNAGRAGL 145

Query: 163 TYWSP-NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
             +SP NIN  RDPRWGR  ET GEDP  + RYAV  VRGLQ     + A   N R L +
Sbjct: 146 NMYSPLNINAFRDPRWGRGQETVGEDPLHLSRYAVRVVRGLQGPAAQDEA---NPR-LTL 201

Query: 222 SSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVN 281
           ++ CKHY AYD++   GV+RY FDA V+ QD+ +  L  F  CV++G A+++M SYN VN
Sbjct: 202 AATCKHYLAYDLEASAGVERYQFDALVSNQDLADLHLPQFRACVRDGGATTLMTSYNAVN 261

Query: 282 GIPSCADPKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
           G+P  A    L    R  W L   H Y+ +DCD++  + D H + A     A A +L AG
Sbjct: 262 GVPPSASKYYLETLARDTWGLDKHHNYVTSDCDAVANVYDAHHYAA-DYVHAAAASLNAG 320

Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDI 396
            DLDCG  Y +    A+ Q       I +++  +Y  L+RLG+FD +       LG +D+
Sbjct: 321 TDLDCGATYRDSLAAALAQNLTDVATIRRAVTRMYGSLVRLGYFDAAEAQPLRQLGWKDV 380

Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
            +    +LA EAA   I LLKN Q+TLPL     KT+A++GP+ NAT A+ GNYAG    
Sbjct: 381 NAPAAQKLAYEAAAASITLLKNRQSTLPLRETAGKTIALIGPYTNATFALRGNYAGPSPL 440

Query: 457 YMSPIAG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
            ++P       FS  A++    G          +  AA   AK+AD  +   G+D +VE 
Sbjct: 441 VITPFDAARRTFSD-AHIVSANGTSIAGPYDTATASAALATAKSADIIVYAGGIDPTVEG 499

Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG-VDIAFAETNTNIKAILWAGYP 571
           ESLDR D+  P  Q +LI ++A + K  V++V+   GG VD A  + +  + A++WAGYP
Sbjct: 500 ESLDRRDIAWPANQLRLIQELAALGK--VLVVVQFGGGQVDGALLKGDDGVGALVWAGYP 557

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+ G  A+ D++ GK  P GRLPIT Y  +Y   L  T+M LRP  +  YPGRTYK+Y G
Sbjct: 558 GQSGALALMDILAGKRAPAGRLPITQYPANYTHALRETTMALRPTAT--YPGRTYKWYTG 615

Query: 632 PTLYPFGYGLSYTQFKYNL 650
              +PFG+GL YT F+ ++
Sbjct: 616 TPTFPFGFGLHYTTFRASI 634


>gi|212531051|ref|XP_002145682.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
 gi|210071046|gb|EEA25135.1| beta-xylosidase XylA [Talaromyces marneffei ATCC 18224]
          Length = 799

 Score =  443 bits (1139), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 250/604 (41%), Positives = 357/604 (59%), Gaps = 28/604 (4%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           + CD+S  Y  R + L++  TL+E +    +   GVPRLGLP YE WSE LHG+      
Sbjct: 62  IVCDTSANYVDRAEGLIALFTLEELINNTQNSGPGVPRLGLPPYEVWSEGLHGLDRA--- 118

Query: 113 THF---DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
            HF    D    ATSFP  IL+ A+ N +L  +I   ++T+ARA  N+GR GL  ++PNI
Sbjct: 119 -HFVKSGDEWTWATSFPMPILSMAALNRTLINQIASIIATQARAFNNVGRYGLDAYAPNI 177

Query: 170 NVARDPRWGRITETPGEDP-FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           N  R P WGR  ETPGED  F+   YA  Y+ GLQ     +N        LK+++  KH+
Sbjct: 178 NGFRSPLWGRGQETPGEDANFLTSSYAYEYITGLQGGIDPDN--------LKIAATAKHF 229

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
           A YD++NW G  R  FDAR+T+QD+ E +   F    +   A S MCSYN VN IPSC+ 
Sbjct: 230 AGYDLENWGGNSRLGFDARITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNAIPSCSS 289

Query: 289 PKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
             LL   +R +WD   +GY+ +DCD++  + + H + A ++  A A++L+AG D+DCGQ 
Sbjct: 290 SFLLQTLLREQWDFPEYGYVSSDCDAVYNVFNPHGY-ASNQSSAAAESLRAGTDIDCGQT 348

Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-QYVSLGKQDICSDENIELA 405
           Y+     +  +G V   +I++S+  LY+ L++LG+FDG   +Y  LG  D+ + +   ++
Sbjct: 349 YSWHLNQSFIEGSVTRGEIERSILRLYSNLVKLGYFDGDKNEYRQLGWNDVVTTDAWNIS 408

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
            EAA EGIVLLKND   LPL S  VK+VA+VGP ANAT  + GNY G     ++P+ G S
Sbjct: 409 YEAAVEGIVLLKND-GVLPL-SKNVKSVALVGPWANATKQLQGNYFGTAPYLITPLQGAS 466

Query: 466 --GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
             GY  V Y  G  +++  + +    A  AAK +D  + L G+D ++EAE  DR ++  P
Sbjct: 467 DAGY-KVNYALGT-NISGNTTDGFANALSAAKKSDVIVYLGGIDNTIEAEGTDRMNVTWP 524

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
             Q  LI Q+++  K P++++ M  G VD +  ++N+ + A++W GYPG+ GG+AI D++
Sbjct: 525 RNQLDLIQQLSQTGK-PLVVLQMGGGQVDSSSIKSNSKVNALIWGGYPGQSGGKAIFDIL 583

Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
            GK  P GRL  T Y  +Y    P T M LRP D    PG+TY +Y G  +Y FGYGL Y
Sbjct: 584 KGKRAPAGRLVSTQYPAEYATQFPATDMSLRP-DGKSNPGQTYMWYIGKPVYEFGYGLFY 642

Query: 644 TQFK 647
           T FK
Sbjct: 643 TTFK 646


>gi|392570764|gb|EIW63936.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
           SS1]
          Length = 781

 Score =  442 bits (1136), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 261/602 (43%), Positives = 361/602 (59%), Gaps = 27/602 (4%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD +     R   L+S  T +E      + + GVPRLGLP Y WWSE LHGV+   PG  
Sbjct: 41  CDVTKDPITRATALISIWTDEELTNNTVNASPGVPRLGLPAYNWWSEGLHGVAQ-SPGVT 99

Query: 115 F--DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           F        ATSFP  IL  A+F++ L + I   VSTE RA  N GRAGL YW+PNIN  
Sbjct: 100 FAPSGNFSYATSFPQPILMGAAFDDPLIQAIATIVSTEGRAFNNAGRAGLDYWTPNINPF 159

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCCKHYAAY 231
           +DPRWGR  ETPGEDPF + +Y  N + GLQ          L+ +P  KV + CKH+AAY
Sbjct: 160 KDPRWGRGQETPGEDPFHLSQYVYNLILGLQG--------GLDPKPYFKVVADCKHFAAY 211

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
           D+DNW+GV RY F+A V++QD+ E +L PF+ CV++   +SVMCSYN VNGIPSCA+  L
Sbjct: 212 DMDNWEGVVRYGFNAVVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNGIPSCANSFL 271

Query: 292 LNQTVRGEWDLHG--YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
           L   +R  W      ++ +DCD++Q +   H +  D  + A A  L AG D+DCG + + 
Sbjct: 272 LQDVLRDHWGFTDDRWVTSDCDAVQNIFTPHNYTTDPAQ-AAADALLAGTDIDCGTFSST 330

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIELAAE 407
           +   A+Q+G V  TD+ ++    Y  L+RLG+FD   +  Y  LG  D+ + +  +LA  
Sbjct: 331 YLPEALQRGLVNSTDLRRAAIRQYASLVRLGYFDDPAAQPYRQLGWSDVNTLQAQQLAHT 390

Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF--S 465
           AA EG+VLLKND   LPL S +V+ +A++GP ANAT  + GNY GI    +SP+ G   +
Sbjct: 391 AAVEGMVLLKND-GLLPL-SKRVRKLALIGPWANATRLLQGNYFGIAPYLVSPVQGAQQA 448

Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA-GLDLSVEAESLDREDLWLPG 524
           G+  V Y  G  +V  +++ S FAA+ AA      ++ A GLD +VE E +DR ++  PG
Sbjct: 449 GF-EVEYVFGT-NVTTRNDTSGFAAAVAAAKRADAVVFAGGLDETVEREEIDRLNVTWPG 506

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q  L+ ++  V K P+I+     G +D    + +  + AI+W GYPG+ GG A+ D++ 
Sbjct: 507 NQLDLVAELERVGK-PLIVAQFGGGQLDNTALKRSKAVNAIIWGGYPGQSGGTALFDILT 565

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           GK  P GRLPIT Y   Y + +P+T M LRP  S   PGRTYK+Y+G  ++ FG+GL YT
Sbjct: 566 GKAAPAGRLPITQYPAAYAEQVPMTDMTLRP--SATNPGRTYKWYSGTPVFEFGFGLHYT 623

Query: 645 QF 646
            F
Sbjct: 624 TF 625


>gi|378730020|gb|EHY56479.1| beta-glucosidase, variant [Exophiala dermatitidis NIH/UT8656]
 gi|378730021|gb|EHY56480.1| beta-glucosidase [Exophiala dermatitidis NIH/UT8656]
          Length = 783

 Score =  441 bits (1134), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 284/750 (37%), Positives = 404/750 (53%), Gaps = 45/750 (6%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +S+   C+++   + R K LV+ +T +EK    G+ + GVPRLGL  Y+WW EALHGV++
Sbjct: 29  LSNNTVCNTNASVADRAKALVAALTNEEKFNLTGNTSPGVPRLGLYSYQWWQEALHGVAS 88

Query: 109 VGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
             PG +F        ATSFP  IL +A+F+++L   +   VSTEARA  N+ R+GL +W+
Sbjct: 89  -SPGVNFSTSGDFSHATSFPQPILMSAAFDDALINAVATVVSTEARAFNNVNRSGLDFWT 147

Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCK 226
           PNIN  +DPRWGR  ETPGED F +  Y    + GLQ          LN    KV + CK
Sbjct: 148 PNINPYKDPRWGRGQETPGEDTFHLKSYVAALIDGLQG--------GLNPPIKKVIATCK 199

Query: 227 HYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
           H+ AYD+++W   DRY+FDA V+ QD+ E +++PF+ C ++    S+MCSYN +NG+P+C
Sbjct: 200 HFVAYDLEDWITTDRYNFDAIVSTQDLAEYYMQPFQTCARDARVGSIMCSYNAMNGVPTC 259

Query: 287 ADPKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           ADP +L   +R  W   D   Y+ +DCD+IQ +   H +   ++E AVA  L AG DL+C
Sbjct: 260 ADPYILQTVLREHWNWTDDGQYVTSDCDAIQNIYAPH-YYEPTREQAVADALTAGTDLNC 318

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDEN 401
           G YY      A  +G   +T ID+++  LY+ L++LG+FD   +  Y SL   D+ +   
Sbjct: 319 GTYYQTHLPAAFSEGLFNQTVIDQTITRLYSALIKLGYFDPPSATPYRSLNWSDVSTPAA 378

Query: 402 IELAAEAAREGIVLLKNDQNTLPLN--SAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
             LA +AA EGIVLLKND   LPL+  + K  TVA++G  ANAT  M GNY GI     S
Sbjct: 379 EALALKAAEEGIVLLKND-GLLPLSFPTDKNTTVAIIGGWANATTTMQGNYFGIAPYLHS 437

Query: 460 PIAGFSGYANVT--YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR 517
           P+       N+   Y  G         + +     AA  AD  II  GL  S E+ES DR
Sbjct: 438 PLYALQQLPNINAVYGGGFGVPTTDGWDELLG---AAGEADLIIIADGLTTSDESESNDR 494

Query: 518 EDL-WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
             + W P     +INQ++ + K P + + M    +D      N NI A++W GYPG  GG
Sbjct: 495 YTIGWQPA-AIDIINQLSGMGK-PTVFLQM-GDQLDNTPLLNNPNISALIWGGYPGMAGG 551

Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
            A+ +++ GK  P GRLP+T Y  DYV  + +T M LRP  + G PGRTYK+YN   L P
Sbjct: 552 DALINILTGKAAPAGRLPVTQYPADYVNQVNMTDMELRPNATSGNPGRTYKWYNNAVL-P 610

Query: 637 FGYGLSYTQFKY--NLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
           FGYGL YT F    +     +T     +     +  +Y   +  + C       L    +
Sbjct: 611 FGYGLHYTNFSVAASAQGQAQTQSGPSSNSSQGQGTSYNISSLVSSCDRSQYAYLDLCPF 670

Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY------IKQVIGFQRVF-VRAGRNKRI 747
             F V+  N GS   SD V +       I+ +Y      IKQ++ +QR+F + AG +   
Sbjct: 671 ESFNVNVTNTGSKLASDFVAL-----GFISGSYGPQPYPIKQLVAYQRLFNISAGASATA 725

Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
                   SL   D   N +L  G++ + +
Sbjct: 726 TLNL-TLGSLARHDENGNAVLYPGDYGLLI 754


>gi|367046937|ref|XP_003653848.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
 gi|347001111|gb|AEO67512.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
          Length = 923

 Score =  441 bits (1134), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 265/611 (43%), Positives = 353/611 (57%), Gaps = 38/611 (6%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           C++SLP + RV+ LV ++TL EK+  L D A G  R+GLP YEWWSEALHGV+   PG  
Sbjct: 165 CNTSLPIADRVRWLVGQLTLQEKITNLVDGASGSARVGLPPYEWWSEALHGVA-ASPGVT 223

Query: 115 F----DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
           F          ATSFP  I  +A+F++ L  +I   V  E RA  N G +G  +W+PNIN
Sbjct: 224 FAGPNGTAFSYATSFPMPITISAAFDDDLVSQIAAVVGREGRAFANHGLSGFDFWTPNIN 283

Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL--KVSSCCKHY 228
             RDPRWGR  ETPGED F + +Y  + + GLQ            S PL  ++ + CKHY
Sbjct: 284 PFRDPRWGRGPETPGEDAFRIQQYIRHLIPGLQ-----------GSDPLDKQIIATCKHY 332

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
           A YDV+      RY +D      D+ E +L PF+ CV++    SVMCSYN V+GIP+CA 
Sbjct: 333 AVYDVE----TGRYEYDYDPQPHDLAEYYLAPFKTCVRDVGIGSVMCSYNAVDGIPACAS 388

Query: 289 PKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
             LL   +R  W     + Y+V+DCD+++ +   H F  DS   A A  L AG DL+CG 
Sbjct: 389 EYLLQSVLRDHWGFTEPYQYVVSDCDAVRFIYSPHNF-TDSPAAAAAVALNAGTDLECGS 447

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
            Y N    ++      E  +D++L  LYT L  +GFFDGS +Y  LG   + + +   LA
Sbjct: 448 TYLNLN-QSLASNMTTEAALDRALTRLYTALHTIGFFDGSARYGGLGWDAVGTGDAQVLA 506

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
            +AA +G VLLKN+++ LPL+S +++ +AV+GP ANAT  M GNY G     +SP+A F 
Sbjct: 507 YQAAVDGAVLLKNEKSLLPLDSKRLRKLAVIGPWANATTQMQGNYFGQAAYLVSPLAAFQ 566

Query: 466 ---GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
              G  NV +  G   +A  S     AA  AAK ADA + L G+D SVE+ESLDR  +  
Sbjct: 567 SAWGADNVLFANGT-GIAGNSTAGFAAALAAAKAADAVVFLGGVDNSVESESLDRTAISW 625

Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
           PG Q  LI Q+A V K P+++V    G +D +    N  + A+LWAGYPG+ GG AIAD+
Sbjct: 626 PGNQLDLIAQLAAVGK-PLVVVQCGGGQLDDSALLANPRVGALLWAGYPGQAGGAAIADL 684

Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG------YPGRTYKFYNGPTLYP 636
           + GK  P GRLP+T Y   Y   + L    LRP  S G      +PGRTYK+Y G  + P
Sbjct: 685 LTGKQAPAGRLPVTQYAASYTSEVSLFDPSLRPRRSGGSKSHSTFPGRTYKWYTGKPVLP 744

Query: 637 FGYGLSYTQFK 647
           FGYGL YT F+
Sbjct: 745 FGYGLHYTTFR 755


>gi|358382857|gb|EHK20527.1| hypothetical protein TRIVIDRAFT_192759 [Trichoderma virens Gv29-8]
          Length = 860

 Score =  441 bits (1134), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 256/610 (41%), Positives = 362/610 (59%), Gaps = 30/610 (4%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD++   + R   +V  MTL+EKV  +G  A G  RLGLP Y+W +EALHGV+    G  
Sbjct: 139 CDTTKSIAARAAAIVKPMTLNEKVANVGSSASGSGRLGLPAYQWQNEALHGVAG-STGVQ 197

Query: 115 FDDVI----PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
           F   +      ATSFP  IL +A+F+++L + +  A+STEARA  N G AGL +W+PNIN
Sbjct: 198 FQSPLGANFSAATSFPMPILLSAAFDDALVQSVATAISTEARAFANYGFAGLDFWTPNIN 257

Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
             RDPRWGR  ETPGED F +  Y ++ + GLQ          ++    +  S CKH+AA
Sbjct: 258 PFRDPRWGRGMETPGEDAFRIQGYVLSLINGLQG--------GIDPDFFRTISTCKHFAA 309

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
           YD++N +  +    +   T+QDM + +L  FE CV++    S+MC+YN VNG+P+CAD  
Sbjct: 310 YDIENGRTAN----NLSPTQQDMADYYLPMFETCVRDAKVGSIMCAYNSVNGVPACADSY 365

Query: 291 LLNQTVR---GEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
           LL   +R   G  +   Y+V+DCD+++ + D H + A+  + A A +L AG DLDCG  Y
Sbjct: 366 LLQSVLRDGYGFTEDFNYVVSDCDAVENVYDPHHYAANLTQ-AAAMSLNAGTDLDCGSSY 424

Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAE 407
            N    +VQ G   E  +DKSL  LY+ L+++G+FD   +Y SLG  ++ + +   LA +
Sbjct: 425 -NVLNASVQAGMTTEATLDKSLIRLYSALIKVGWFDQPAKYSSLGWGNVNTTQTRALAHD 483

Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF-SG 466
           AA  G+ LLKND  TLPL S  ++ VAV+GP  NAT  + GNYAG     ++P+  F   
Sbjct: 484 AATGGMTLLKND-GTLPL-SPTLQNVAVIGPWVNATTQLQGNYAGTAPVLVNPLTVFQQK 541

Query: 467 YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
           + NV Y  G   +  +  +   AA  AA ++D  + L G+D+SVE E  DR  +  PG Q
Sbjct: 542 WRNVKYAQGT-AINSQDTSGFNAAISAASSSDVIVYLGGIDISVENEGFDRTAITWPGNQ 600

Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
             LI+Q+A + K P+++V    G +D +   +N+ + +ILWAGYPG+EGG A+ DV+ G 
Sbjct: 601 LSLISQLANLGK-PLVIVQFGGGQIDDSSLLSNSKVNSILWAGYPGQEGGNALFDVLTGA 659

Query: 587 FNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
             P GRLPIT Y  +YV    +  M LRP  S+  PGRTY +Y G  + PFGYGL YT F
Sbjct: 660 NPPAGRLPITQYPANYVNNNNIQDMNLRPSGSI--PGRTYAWYTGTPVLPFGYGLHYTNF 717

Query: 647 KYNLLSFTKT 656
             +  S TKT
Sbjct: 718 SVSFQS-TKT 726


>gi|189203341|ref|XP_001938006.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187985105|gb|EDU50593.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 761

 Score =  439 bits (1129), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 276/740 (37%), Positives = 392/740 (52%), Gaps = 48/740 (6%)

Query: 57  SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFD 116
           +S P   R + LV+  TL+EK+      A GVPRLG+P Y+WWSE LHG++  GP T+F 
Sbjct: 3   TSRPPLARAQSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWSEGLHGIA--GPYTNFS 60

Query: 117 DV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARD 174
           D      +TSFP  IL  A+F++ L   + + +STEARA  N  R GL +W+PNIN  RD
Sbjct: 61  DSGEWSYSTSFPQPILMGAAFDDDLITDVAKVISTEARAFNNANRTGLDFWTPNINPFRD 120

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ETPGED + +  Y    + GLQ       +TD   R   V + CKH+A YDV+
Sbjct: 121 PRWGRGQETPGEDAYHLSSYVQALIHGLQG-----ESTDPYKR---VVATCKHFAGYDVE 172

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +W G  RY  D ++T+Q++ E +L PF+ CV + +  + MCSYN VNG P CADP LL  
Sbjct: 173 DWNGNLRYQNDVQITQQELVEYYLAPFQACV-QANVGAFMCSYNAVNGAPPCADPYLLQT 231

Query: 295 TVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
            +R  W   +   ++  DCD++Q +   H++ + ++  A A +L AG D+ CG Y     
Sbjct: 232 ILREHWGWTNEEQWVTGDCDAVQNVYLPHQW-SPTRAGAAADSLVAGTDVTCGTYMQEHL 290

Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAA 409
             A QQ  + E+ +D++L   Y+ L+RLG+FD S    Y  LG   + ++ +  LA  AA
Sbjct: 291 PAAFQQKLLNESSLDQALIRQYSSLVRLGYFDASENQPYRQLGFDAVATNASQALARRAA 350

Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---- 465
            EGIVLLKND  TLPL+     TV + G  ANAT  ++GNYAG+     SP+        
Sbjct: 351 AEGIVLLKND-GTLPLSLDSSVTVGLFGDWANATSQLLGNYAGVATYLHSPLYALEQTGV 409

Query: 466 --GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
              YA        D    + +N   A S    T+D  I + G+D SVE E  DR  L   
Sbjct: 410 KINYAGGNPGGQGDPTTNRWSNLYGAYS----TSDVLIYVGGIDNSVEEEGRDRGYLTWT 465

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
           G Q  +I Q+A+  K PVI+V+   G +D +    N NI AI+WAGYPG++GG AI D++
Sbjct: 466 GAQLDVIGQLADTGK-PVIVVVTGGGQIDSSPLVNNPNISAIMWAGYPGQDGGSAIIDII 524

Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
            GK  P GRLP T Y  +Y   + + +M LRP ++   PGRTYK+YNG   + FGYG+ Y
Sbjct: 525 GGKTAPAGRLPQTQYPANYTAAVSMMNMNLRPGEN--SPGRTYKWYNGSATFEFGYGMHY 582

Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
           T F   +   T  +Q +          N T    + RCP   VN           V   N
Sbjct: 583 TNFSAEI---TTQMQQSYAISSLASGCNSTGGFLE-RCPFASVN-----------VQVHN 627

Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
            G+     + + Y       A    K ++ ++R+   AG       +     SL  VD  
Sbjct: 628 TGNVTSDYITLGYMAGTFGPAPHPRKTLVSYKRLHSIAGGATSTATLNLTLASLARVDEH 687

Query: 764 ANTLLPAGEHTIFVGNGGVS 783
            N +L  G++++ + N  ++
Sbjct: 688 GNKVLYPGDYSLQIDNNALA 707


>gi|2791278|emb|CAA93248.1| beta-xylosidase [Trichoderma reesei]
 gi|340519464|gb|EGR49702.1| glycoside hydrolase family 3 [Trichoderma reesei QM6a]
          Length = 797

 Score =  438 bits (1127), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 274/733 (37%), Positives = 405/733 (55%), Gaps = 44/733 (6%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG-- 110
           L CDSS  Y  R + L+S  TL+E +    +   GVPRLGLP Y+ W+EALHG+      
Sbjct: 61  LVCDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFA 120

Query: 111 -PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
             G  F+     ATSFP  ILTTA+ N +L  +I   +ST+ARA  N GR GL  ++PN+
Sbjct: 121 TKGGQFE----WATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNV 176

Query: 170 NVARDPRWGRITETPGEDPFVVGR-YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           N  R P WGR  ETPGED F +   Y   Y+ G+Q          ++   LKV++  KH+
Sbjct: 177 NGFRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQG--------GVDPEHLKVAATVKHF 228

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
           A YD++NW    R  FDA +T+QD+ E +   F    +   + S+MC+YN VNG+PSCA+
Sbjct: 229 AGYDLENWNNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCAYNSVNGVPSCAN 288

Query: 289 PKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
              L   +R  W     GY+ +DCD++  + + H + ++    A A +L+AG D+DCGQ 
Sbjct: 289 SFFLQTLLRESWGFPEWGYVSSDCDAVYNVFNPHDYASNQSS-AAASSLRAGTDIDCGQT 347

Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAA 406
           Y      +   G+V   +I++S+  LY  L+RLG+FD   QY SLG +D+   +   ++ 
Sbjct: 348 YPWHLNESFVAGEVSRGEIERSVTRLYANLVRLGYFDKKNQYRSLGWKDVVKTDAWNISY 407

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI--AGF 464
           EAA EGIVLLKND  TLPL S KV+++A++GP ANAT  M GNY G     +SP+  A  
Sbjct: 408 EAAVEGIVLLKND-GTLPL-SKKVRSIALIGPWANATTQMQGNYYGPAPYLISPLEAAKK 465

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
           +GY +V ++ G  ++A  S      A  AAK +DA I L G+D ++E E  DR D+  PG
Sbjct: 466 AGY-HVNFELGT-EIAGNSTTGFAKAIAAAKKSDAIIYLGGIDNTIEQEGADRTDIAWPG 523

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q  LI Q++EV K P++++ M  G VD +  ++N  + +++W GYPG+ GG A+ D++ 
Sbjct: 524 NQLDLIKQLSEVGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILS 582

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           GK  P GRL  T Y  +YV   P   M LRP D    PG+TY +Y G  +Y FG GL YT
Sbjct: 583 GKRAPAGRLVTTQYPAEYVHQFPQNDMNLRP-DGKSNPGQTYIWYTGKPVYEFGSGLFYT 641

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
            FK  L S  K+++ N + +    +  YT      + P            F F+ + +N 
Sbjct: 642 TFKETLASHPKSLKFNTSSILSAPHPGYT---YSEQIP-----------VFTFEANIKNS 687

Query: 705 GSTDGSDVVIVYSKPPAEIAATYI-KQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDY 762
           G T+     +++ +      A Y  K ++GF R+  ++ G + ++        +L  VD 
Sbjct: 688 GKTESPYTAMLFVRTSNAGPAPYPNKWLVGFDRLADIKPGHSSKLSIPI-PVSALARVDS 746

Query: 763 AANTLLPAGEHTI 775
             N ++  G++ +
Sbjct: 747 HGNRIVYPGKYEL 759


>gi|443893988|dbj|GAC71176.1| hypothetical protein PANT_1d00031 [Pseudozyma antarctica T-34]
          Length = 759

 Score =  437 bits (1125), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 252/621 (40%), Positives = 360/621 (57%), Gaps = 36/621 (5%)

Query: 46  GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
           G  +S+   CD+SL Y  R   LV+  T  E +    + A GVPRLG+P Y+WW+EALHG
Sbjct: 27  GTPLSANAVCDTSLDYWTRATSLVAEFTTQELINNTINTAPGVPRLGIPPYQWWTEALHG 86

Query: 106 VSNVGPGTHFDDVIPG----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG 161
           V+   PG +F D +      AT+FP +I   A+F+++L++++   ++ E RA  N G+AG
Sbjct: 87  VAG-SPGVNFADDVEAPYGSATNFPQIINLGATFDDALYEQVATHIANETRAFNNAGKAG 145

Query: 162 LTYWSP-NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
           L  +SP NIN  RDPRWGR  ET GEDP  + RYAV  V+GLQ           N   L+
Sbjct: 146 LNMYSPLNINCFRDPRWGRGQETTGEDPLHMSRYAVKMVQGLQGP---------NQDELR 196

Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
           +++ CKHY AYD++ W GV+RY FDA+V+ Q++ E +L  F  CV++G A ++M SYN V
Sbjct: 197 LAATCKHYLAYDLEKWDGVERYQFDAQVSRQELAEFYLPQFRACVRDGKAVTLMTSYNAV 256

Query: 281 NGIPSCADPKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
           N +P  A    L    R EW L   H Y+ +DCD++  + D H + ADS   A A ++ A
Sbjct: 257 NNVPPSASRYYLETLARKEWGLDKKHNYVTSDCDAVANVFDGHHY-ADSYVQAAADSINA 315

Query: 338 GLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVSLGKQ 394
           G DL+CG  Y++  G A++Q       I  ++  +Y   +RLG FD   G P    LG +
Sbjct: 316 GTDLNCGATYSDNLGQALEQNLTDVETIRTAVARMYASQVRLGLFDPKQGQP-LRELGWE 374

Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
            + +    +LA  +A   + LLKN+  TLP++ A    VAV+GP++NAT A+ GNYAG P
Sbjct: 375 HVNTKAAQDLAYSSAAASVTLLKNN-GTLPVDGA--TKVAVIGPYSNATFALRGNYAG-P 430

Query: 455 CRY---MSPIAG--FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLS 509
             +   M+  A   FS  A ++   G       ++    AA + AK AD  I   G+D +
Sbjct: 431 GPFAITMTEAAQRVFS-QATISSANGTTISGTYNHTDAEAAMQLAKEADLVIFAGGIDPT 489

Query: 510 VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAG 569
           +E+E LDR  +  P  Q QLI+ +  +AK  + +V    G +D A  + + NI A+LWAG
Sbjct: 490 IESEELDRATIAWPPNQLQLIHALGGMAK-KMAVVQFGGGQIDGASIKADGNIGALLWAG 548

Query: 570 YPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFY 629
           YPG+ G  A+ DV+ G   P GRLPIT Y  +Y+  L  T+M LRP  +  YPGRTYK+Y
Sbjct: 549 YPGQSGALAVMDVIAGNTAPAGRLPITQYPAEYIDGLAETTMALRP--NATYPGRTYKWY 606

Query: 630 NGPTLYPFGYGLSYTQFKYNL 650
           +G   YP+ +GL YT+FK  L
Sbjct: 607 SGTPTYPYAHGLHYTEFKAEL 627


>gi|292495634|sp|A1CND4.2|XYND_ASPCL RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
          Length = 792

 Score =  437 bits (1124), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 278/806 (34%), Positives = 429/806 (53%), Gaps = 60/806 (7%)

Query: 1   MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFV---------CDPGRFSKLGLQMSS 51
           +A V++++L   L+ A   ++    +AN   +P  V         CD G  SK       
Sbjct: 7   IATVLAAILPSVLAQANTSYADYNTEANPDLTPQSVATIDLSFPDCDNGPLSKT------ 60

Query: 52  FLFCDS-SLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
            + CD+ + PY  R   L+S  TL+E V   G+ + GVPRLGLP Y+ W+EALHG+    
Sbjct: 61  -IVCDTLTSPYD-RAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNEALHGLDRA- 117

Query: 111 PGTHFDD--VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
              +F D      +TSFP  ILT ++ N +L  ++   +ST+ RA  N GR GL  +SPN
Sbjct: 118 ---YFTDEGQFSWSTSFPMPILTMSALNRTLINQVASIISTQGRAFSNAGRYGLDVYSPN 174

Query: 169 INVARDPRWGRITETPGEDPFVVGR-YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
           IN  R P WGR  ETPGED + +   YA  Y+ G+Q          ++ + LK+ +  KH
Sbjct: 175 INSFRHPVWGRGQETPGEDAYCLSSAYAYEYITGIQG--------GVDPKSLKLVATAKH 226

Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
           YA YD++NW G  R   D  +T+QD+ E +   F +  ++    SVMCSYN VNG+PSCA
Sbjct: 227 YAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMCSYNAVNGVPSCA 286

Query: 288 DPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           +   L   +R  +     GYI +DCDS   + + H++ A+    A A +++AG D+DCG 
Sbjct: 287 NSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSS-AAADSIRAGTDIDCGT 345

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIEL 404
            Y  +   AV Q  +   DI++ +  LY+ LMRLG+FDG S  Y +L   D+ +  +  +
Sbjct: 346 TYQYYFDEAVDQNLLSRADIERGVIRLYSNLMRLGYFDGNSSAYRNLTWNDVVTTNSWNI 405

Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF 464
           + E   EG VLLKND  TLPL S  ++++A+VGP  N +  + GNY G     +SP+  F
Sbjct: 406 SYEV--EGTVLLKND-GTLPL-SESIRSIALVGPWMNVSTQLQGNYFGPAPYLISPLDAF 461

Query: 465 -SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
              + +V Y  G  +++  S +    A  AAK +DA I   G+D S+EAE+LDR ++  P
Sbjct: 462 RDSHLDVNYAFGT-NISSNSTDGFSKALSAAKKSDAIIFAGGIDNSLEAETLDRMNITWP 520

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
           G Q +LI+Q++++ K P+I++ M  G VD +  ++N N+ +++W GYPG+ GG+A+ D++
Sbjct: 521 GKQLELIDQLSQLGK-PLIVLQMGGGQVDSSLLKSNKNVNSLIWGGYPGQSGGQALLDII 579

Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
            GK  P GRL +T Y  +Y    P T M LRP  +   PG+TY +Y G  +Y FG+GL Y
Sbjct: 580 TGKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGN--NPGQTYMWYTGTPVYEFGHGLFY 637

Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
           T F+   +S  + +     K++   N+    D      PG +   +    +  F VD  N
Sbjct: 638 TTFR---VSHARAV-----KIKPTYNIQ---DLLAQPHPGYI--HVEQMPFLNFTVDITN 684

Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
            G        ++++   A  A    K ++GF R+        ++  +     S+   D  
Sbjct: 685 TGKASSDYTAMLFANTTAGPAPYPKKWLVGFDRLPTLGPSTSKLMTIPVTINSMARTDEL 744

Query: 764 ANTLLPAGEHTIFVGNG-GVSFPIHL 788
            N +L  G++ + + N   V  P+ L
Sbjct: 745 GNRVLYPGKYELALNNERSVVLPLSL 770


>gi|348604625|dbj|BAK96214.1| beta-xylosidase [Acremonium cellulolyticus]
          Length = 797

 Score =  436 bits (1122), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 248/601 (41%), Positives = 350/601 (58%), Gaps = 22/601 (3%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           + CD+S  Y  R + L++  TL+E +    + A GVPRLGLP Y+ WSEALHG+      
Sbjct: 62  IVCDTSANYVDRAEGLIALFTLEELINNTQNTAPGVPRLGLPPYQVWSEALHGLDRANFA 121

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           T  D+    ATSFP  IL+ A+ N +L  +I   + T+ARA  N GR GL  ++PNIN  
Sbjct: 122 TSGDEWT-WATSFPMPILSMAALNRTLINQIAGIIGTQARAFNNAGRYGLDAYAPNINGF 180

Query: 173 RDPRWGRITETPGEDP-FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
           R P WGR  ETPGED  F+   YA  Y+ GLQ          ++   LKV +  KH+A Y
Sbjct: 181 RSPLWGRGQETPGEDANFLSSSYAYEYITGLQG--------GVDPDHLKVVATAKHFAGY 232

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
           D++NW G  R  FDA +T+QD+ E +   F    +   A S MCSYN VNG+PSC+   L
Sbjct: 233 DLENWGGNSRLGFDASITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNGVPSCSSSFL 292

Query: 292 LNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
           L   +R  WD   +GY+ +DCD++  + + H + A ++  A A +L+AG D+DCGQ Y  
Sbjct: 293 LQTLLRDNWDFPEYGYVSSDCDAVYNVFNPHGY-ASNQSAAAADSLRAGTDIDCGQTYPW 351

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIELAAEA 408
               +  +G V   +I++S+  LY+ L++LG+FDG   +Y  LG  D+ + +   ++ EA
Sbjct: 352 NLNQSFIEGSVTRGEIERSIVRLYSNLVKLGYFDGDKSEYRQLGWNDVVTTDAWNISYEA 411

Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS--G 466
           A EGIVLLKND   LPL S  VK++A++GP ANAT  + GNY G     ++P+ G S  G
Sbjct: 412 AVEGIVLLKND-GILPL-SKHVKSIALIGPWANATEQLQGNYYGTAPYLITPLQGASDAG 469

Query: 467 YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
           Y  V Y  G  ++   +      A  AAK +D  + L G+D ++EAE  DR ++  PG Q
Sbjct: 470 Y-KVNYALGT-NILGNTTEGFADALSAAKKSDVIVYLGGIDNTIEAEGTDRMNVTWPGNQ 527

Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
             LI Q+++  K P++++ M  G VD +  + N+ + A++W GYPG+ GG AI D++ GK
Sbjct: 528 LDLIQQLSQTGK-PLVVLQMGGGQVDSSSIKANSKVNALVWGGYPGQSGGTAIFDILSGK 586

Query: 587 FNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
             P GRL  T Y  +Y    P T M LRP D    PG+TY +Y G  +Y FGYGL YT F
Sbjct: 587 RVPAGRLVTTQYPAEYATQFPATDMNLRP-DGASNPGQTYMWYTGTPVYDFGYGLFYTTF 645

Query: 647 K 647
           K
Sbjct: 646 K 646


>gi|392560759|gb|EIW53941.1| glycoside hydrolase family 3 protein [Trametes versicolor FP-101664
           SS1]
          Length = 783

 Score =  436 bits (1121), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 257/611 (42%), Positives = 363/611 (59%), Gaps = 27/611 (4%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           + S   CD +     R   L+   T +E      + + GVPRLGLP Y WWSE LHGV+ 
Sbjct: 35  LKSNAVCDITKDPITRATALIGLWTDEELTSNTVNASPGVPRLGLPAYNWWSEGLHGVAQ 94

Query: 109 VGPGTHF--DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
             PG  F        ATSFP  IL  A+F+++L + I   VSTE RA  N GRAGL YW+
Sbjct: 95  -SPGVTFAPSGNFSHATSFPQPILMGAAFDDTLIQAIATIVSTEGRAFNNAGRAGLDYWT 153

Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP-LKVSSCC 225
           PNIN  +DPRWGR  ETPGEDPF + +Y  N + GLQ          L+ +P  KV + C
Sbjct: 154 PNINPFKDPRWGRGQETPGEDPFHLSQYVYNLILGLQG--------GLDPKPYFKVVADC 205

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+AAYD++NW+G+ R  FDA V++QD+ E +L PF+ CV++   +SVMCSYN VNGIPS
Sbjct: 206 KHFAAYDLENWEGIVRNGFDAIVSQQDLSEFYLPPFQTCVRDAKVASVMCSYNAVNGIPS 265

Query: 286 CADPKLLNQTVRGEWDLHG--YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           CA+  LL   +R  W      ++ +DCD+++ ++  HK+  D  + A A  L AG D+DC
Sbjct: 266 CANSFLLQDVLRDHWGFTDDRWVTSDCDAVENILTPHKYTTDPAQ-AAADALLAGTDIDC 324

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDEN 401
           G + + +   A+Q+G V  TD+ ++    Y  L+RLG+FD   +  Y  LG  D+ + + 
Sbjct: 325 GTFSSTYLPEALQRGLVNSTDLRRAAIRQYASLVRLGYFDDPAAQPYRQLGWSDVNTPQA 384

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
            +LA  AA EGIVLLKND   LP  S  V+ +A++GP ANAT  + G+Y G+    +SP+
Sbjct: 385 QQLAHTAAVEGIVLLKND-GVLPF-SKHVRKLALIGPWANATSLLQGSYIGVAPYLVSPL 442

Query: 462 AGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA-GLDLSVEAESLDRE 518
            G   +G+  V Y  G  +V  +++ S FAA+ AA      ++ A GLD +VE E  DR 
Sbjct: 443 QGAQEAGF-EVEYVLGT-NVTTQNDMSGFAAAVAAVRRADAVVFAGGLDETVECEGTDRL 500

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
           ++  PG Q  L+ ++  V K P+I+     G +D    + +  + AI+W GYPG+ GG A
Sbjct: 501 NVTWPGNQLDLVAELERVGK-PLIVAQFGGGQLDDTALKHSKAVNAIIWGGYPGQSGGTA 559

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           + D++ GK  P GRLPIT Y   Y + +P+T M LRP  S   PGRTYK+Y+G  ++ FG
Sbjct: 560 LFDILTGKAAPAGRLPITQYPAAYTKQVPMTDMSLRP--SATNPGRTYKWYSGTPVFEFG 617

Query: 639 YGLSYTQFKYN 649
           +GL YT F ++
Sbjct: 618 FGLHYTTFVFS 628


>gi|358397360|gb|EHK46735.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
           206040]
          Length = 865

 Score =  435 bits (1118), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 252/613 (41%), Positives = 357/613 (58%), Gaps = 29/613 (4%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           + S   CD++L  + R   +V  MTLDEKV  +G  A G  RLGLP Y+W +EALHGV+ 
Sbjct: 138 LCSNAICDTTLSMAERAAAIVKPMTLDEKVANVGSSASGSARLGLPAYQWQNEALHGVAG 197

Query: 109 VGPGTHFDDVI----PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
              G  F   +      ATSFP  IL +A+F+++L + +  A+STEARA  N G AGL +
Sbjct: 198 -STGVQFQSPLGANFSAATSFPMPILLSAAFDDALVQNVATAISTEARAFANYGFAGLDF 256

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           W+PNIN  RDPRWGR  ETPGED F +  Y +  + GLQ          +N    ++ + 
Sbjct: 257 WTPNINPFRDPRWGRGMETPGEDAFRIQGYVLALISGLQG--------GINPDFFRIIAT 308

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           CKH+AAYD++N     R   +   T+QDM + +L  FE CV++    SVMC+YN V+GIP
Sbjct: 309 CKHFAAYDIEN----GRTGNNLNPTQQDMADYYLPMFETCVRDAKVGSVMCAYNAVDGIP 364

Query: 285 SCADPKLLNQTVR---GEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL 341
           +CA   LL   +R   G  +   Y+V+DCD++  + D H + ++  E A A +L AG DL
Sbjct: 365 ACASEYLLQDVLRDGFGFTEDFNYVVSDCDAVDNVFDPHHYASNLTE-AAALSLNAGTDL 423

Query: 342 DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDEN 401
           DCG  Y N    +V+     E  +++SL  LY+ L+++G+FD   +Y SL   ++ + +N
Sbjct: 424 DCGSSY-NVLNASVEAALTSEAALNQSLVRLYSALIKVGYFDQPSEYKSLSWANVNTTQN 482

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
             LA +AA  G+ LLKND  TLPL S  +  VA++GP  NAT  M GNYAG     ++P+
Sbjct: 483 QALAHDAATGGMTLLKND-GTLPL-SRTLSNVAIIGPWVNATTQMQGNYAGTAPFLVNPL 540

Query: 462 AGF-SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
             F   + NV Y  G   +  +  +   AA  AA ++D  + L G+D++VE E  DR  +
Sbjct: 541 DVFQQKWGNVKYAQGT-AINSQDTSGFSAALSAASSSDVIVYLGGIDITVENEGFDRGSI 599

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
             PG Q  LI+Q+A + K P+++V    G +D +   +N N+++ILWAGYPG++GG A+ 
Sbjct: 600 VWPGNQLDLISQLANLGK-PLVIVQFGGGQIDDSSLLSNPNVRSILWAGYPGQDGGNAVF 658

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
           DV+ G   P GRLPIT Y   Y+    +  M LRP  S G PGRTY +Y G  + PFGYG
Sbjct: 659 DVLTGANPPAGRLPITQYPASYINNNNIQDMNLRP--SNGIPGRTYAWYTGTPVLPFGYG 716

Query: 641 LSYTQFKYNLLSF 653
           L YT F  +  S 
Sbjct: 717 LHYTNFSVSFQSI 729


>gi|347832625|emb|CCD48322.1| glycoside hydrolase family 3 protein [Botryotinia fuckeliana]
          Length = 772

 Score =  434 bits (1117), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 259/603 (42%), Positives = 359/603 (59%), Gaps = 29/603 (4%)

Query: 55  CD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           CD SS PY+ R   L+S  TL EKV   G+ + GVPR+GLP YEWW+EALHG++   PGT
Sbjct: 34  CDTSSDPYT-RAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIAR-SPGT 91

Query: 114 HFDDVIPG---ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
            F         +TSFP  IL  A+F++ L  K+   VSTEARA  N+ R GL +W+PNIN
Sbjct: 92  TFAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNRFGLNFWTPNIN 151

Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS-SCCKHYA 229
             +DPRWGR  ETPGEDPF    Y    + GLQ          L+  P K   + CKH+A
Sbjct: 152 PYKDPRWGRGQETPGEDPFHTSSYVNALITGLQG--------GLDDLPYKKGVATCKHFA 203

Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
            YD+++  G  RY FDA +  QD+ + +L PF+ C ++ +  SVMCSYN +NG+P+CAD 
Sbjct: 204 GYDLESSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYNAMNGVPTCADD 263

Query: 290 KLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
            LL   +R  W   +   ++ +DCD+++ + D H +   + E + A  L AG DLDCG +
Sbjct: 264 WLLQTLLREHWGWTEEDQWVTSDCDAVKNIWDYHNYTL-TPEQSAADALNAGTDLDCGTF 322

Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIEL 404
           +  + G+A  QG    + +D+SL   Y  L+RLG+FD      Y  L   ++ +    +L
Sbjct: 323 WPTYLGSAYDQGLYDISTLDRSLARRYASLVRLGYFDPPSVQPYRQLNWDNVSTPAAQQL 382

Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP-IAG 463
           A +AA +GIVLLKND   LPL S+ +  VA++GP ANAT  M GNY G      SP IA 
Sbjct: 383 ALQAAEDGIVLLKND-GILPL-SSNITNVALIGPLANATKQMQGNYYGTAPYLRSPLIAA 440

Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
            +    VTY  G  D+  ++     AA  AA++AD  I + G+D S+EAE +DR  +  P
Sbjct: 441 QNAGFKVTYVQGA-DIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEAEEIDRTSISWP 499

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
             Q  LINQ+A ++  P+I+  M    +D +   +NT + A+LWAGYPG++GG AI +++
Sbjct: 500 SSQLSLINQLANLST-PLIISQMGC-MIDSSSLLSNTGVNALLWAGYPGQDGGTAIFNIL 557

Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
            GK  P GRLPIT Y  +YV  + +T M L+P  S   PGRTYK+YNG  ++ +GYGL Y
Sbjct: 558 TGKTAPAGRLPITQYPSNYVNQVTMTDMNLQP--SRFNPGRTYKWYNGEPVFEYGYGLQY 615

Query: 644 TQF 646
           T F
Sbjct: 616 TTF 618


>gi|164429277|ref|XP_958209.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
 gi|16945419|emb|CAB91343.2| related to xylan 1, 4-beta-xylosidase [Neurospora crassa]
 gi|157073010|gb|EAA28973.2| hypothetical protein NCU09923 [Neurospora crassa OR74A]
          Length = 774

 Score =  434 bits (1115), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 267/741 (36%), Positives = 401/741 (54%), Gaps = 45/741 (6%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           ++S   CD++L    R   LV+ MT +EK+Q L   + G PR+GLP Y WWSEALHGV+ 
Sbjct: 36  LASLKVCDATLSPPQRAAALVAAMTTEEKLQNLVSKSKGAPRIGLPAYNWWSEALHGVA- 94

Query: 109 VGPGTHF---DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
             PGT F   D     +TSFP  +L  A+F++ L +K+G+ + TE RA  N G +G  YW
Sbjct: 95  YAPGTQFRSGDGPFNSSTSFPMPLLMAATFDDELIEKVGEVIGTEGRAFGNAGFSGFDYW 154

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PN+N  +DPRWGR +ETPGED   + RYA + +RGLQ          L  R  +V + C
Sbjct: 155 TPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIRGLQG--------PLPER--RVVATC 204

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYAA D ++W G  R+ FDA+VT QD+ E +L PF+ C ++    S+MCSYN VNG+P+
Sbjct: 205 KHYAANDFEDWNGSTRHDFDAKVTLQDLAEYYLSPFQQCARDSKVGSIMCSYNAVNGVPA 264

Query: 286 CADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           CA+  L+   +R  W+      YI +DC+++  +  NH + A +  +  A   +AG D  
Sbjct: 265 CANTYLMQTILREHWNWTAPGNYITSDCEAVLDIFANHHY-AKTNAEGTALAFEAGTDSS 323

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDEN 401
           C    ++    A  QG ++++ +D++L  LY  L+R+G+FDG+  +Y SLG +D+ S ++
Sbjct: 324 CEYESSSDIPGAWTQGLLEQSTVDRALTRLYEGLVRVGYFDGNHSEYASLGWKDVNSPKS 383

Query: 402 IELAAEAAREGIVLLKNDQNTLP--LNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
            E+A + A EGIVLLKNDQ TLP  L +     +A++G  AN    + G Y+G P    S
Sbjct: 384 QEVALQTAVEGIVLLKNDQ-TLPLGLKTDPKSKLAMIGFWANDPKTLSGGYSGKPAFEHS 442

Query: 460 PIAGFSGYA-NVTYKTGCDDVACKSNNS-IFAASEAAKTADATIILAGLDLSVEAESLDR 517
           P+        NVT   G       SN++   AA EAA+ A+  +   GLD S   E+ DR
Sbjct: 443 PVYAAEAMGFNVTTAGGPVLQNSTSNDTWTQAALEAAQDANYILYFGGLDTSAAGETKDR 502

Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
             +  P  Q QLI  + ++ K P+++V M     +     T T + +ILWA +PG++GG 
Sbjct: 503 TTINWPEAQLQLIKTLTKLGK-PLVVVQMGDQLDNTPLLATKT-VNSILWANWPGQDGGT 560

Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT-LYP 636
           A+  ++ G  +P GRLP+T Y  +Y   +P+T M LRP D L  PGRTY++Y  PT + P
Sbjct: 561 AVMQILTGLKSPAGRLPVTQYPANYTAAVPMTDMNLRPSDRL--PGRTYRWY--PTAVQP 616

Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           FG+GL YT F+  + +    + +  + L  C   N  +       P              
Sbjct: 617 FGFGLHYTTFQAKIAAPLPRLAIQ-DLLSRCGGDNANAYPDTCALP-------------P 662

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
            KV+  N G+     VV+ +    A      IK ++ + R+   +  +K    +      
Sbjct: 663 LKVEVTNSGNRSSDYVVLAFLAGDAGPRPYPIKTLVSYTRLRDVSPGHKTTAHLEWTLGD 722

Query: 757 LNIVDYAANTLLPAGEHTIFV 777
           +   D   NT+L  G +T+ V
Sbjct: 723 IARYDEQGNTVLYPGTYTVTV 743


>gi|330934749|ref|XP_003304687.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
 gi|311318569|gb|EFQ87188.1| hypothetical protein PTT_17336 [Pyrenophora teres f. teres 0-1]
          Length = 798

 Score =  434 bits (1115), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 273/749 (36%), Positives = 393/749 (52%), Gaps = 49/749 (6%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           + +   CD S     R K LV+  TL+EK+      A GVPRLG+P Y+WW+E LHG++ 
Sbjct: 31  LKNVTICDPSASPLARAKSLVALYTLEEKINATSSGAPGVPRLGVPPYQWWNEGLHGIA- 89

Query: 109 VGPGTHFDDV---IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
            GP T+F         +TSFP  IL  A+F++ L  ++ + +STEARA  N  R GL +W
Sbjct: 90  -GPYTNFSHSGVEWSYSTSFPQPILMGAAFDDDLITEVAKVISTEARAFNNANRTGLDFW 148

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN  RDPRWGR  ETPGED + +  Y    + GLQ       ATD   R   V + C
Sbjct: 149 TPNINPFRDPRWGRGQETPGEDAYHLSSYVQALIHGLQG-----EATDPYKR---VVATC 200

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A YDV++W G  RY  D ++T+QD+ E +L PF+ CV + +  + MCSYN VNG P 
Sbjct: 201 KHFAGYDVEDWNGNLRYQNDVQITQQDLVEYYLAPFQACV-QANVGAFMCSYNAVNGAPP 259

Query: 286 CADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           CADP LL   +R  W  +    ++  DCD++Q +   H++ + ++  A A +L AG D+ 
Sbjct: 260 CADPYLLQTILREHWGWNKEEQWVTGDCDAVQNVYFPHQW-SSTRAGAAADSLVAGTDIT 318

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSD 399
           CG Y       A +Q  + E+ +D +L   Y+ L+RLG+FD +P+   Y  LG   + ++
Sbjct: 319 CGTYMQEHLPAAFRQKLLNESSLDLALIRQYSSLVRLGYFD-APENQPYRQLGFDAVATN 377

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
            +  LA  AA EGIVLLKND  TLPL+     TV + G  ANAT  ++GNYAG+     S
Sbjct: 378 ASQALARRAAAEGIVLLKND-GTLPLSLDSSMTVGLFGDWANATTQLLGNYAGVATYLHS 436

Query: 460 PIAGFSGYA-NVTYK----TGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
           P+         + Y      G  D      ++++ A     T+D  I + G+D  VE E 
Sbjct: 437 PLYALKQTGVKINYAGGKPGGQGDPTTNRWSNLYGAY---STSDVLIYVGGIDNGVEEEG 493

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
            DR  L   G Q  +I Q+AE  K PVI+V+   G +D +    N NI AI+WAGYPG++
Sbjct: 494 HDRGYLTWTGPQLDVIGQLAETGK-PVIVVVTGGGQIDSSPLVNNPNISAIMWAGYPGQD 552

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
           GG AI D++ GK  P GRLP T Y   Y   + + +M LRP ++   PGRTYK+YNG  +
Sbjct: 553 GGSAIIDIISGKTAPAGRLPQTQYPASYAAAVSMMNMNLRPGEN--NPGRTYKWYNGSAV 610

Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
           + FGYG+ YT F   +              Q  ++   +S AS     G  +   RC  +
Sbjct: 611 FEFGYGMHYTNFSAAI------------STQMQQSYAISSLASGCNSTGGFLE--RC-PF 655

Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
               V   N G      V + Y       A    K ++ ++R+   AG       +    
Sbjct: 656 ASVDVQVHNTGKVTSDYVTLGYMAGTFGPAPHPRKTLVSYKRLHNIAGGATSTAKLNLTL 715

Query: 755 KSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
            S+  VD   N +L  G +++ + N  ++
Sbjct: 716 ASVARVDEYGNKVLYPGHYSLQIDNNALA 744


>gi|115387056|ref|XP_001210069.1| predicted protein [Aspergillus terreus NIH2624]
 gi|114191067|gb|EAU32767.1| predicted protein [Aspergillus terreus NIH2624]
          Length = 908

 Score =  431 bits (1109), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 256/610 (41%), Positives = 353/610 (57%), Gaps = 31/610 (5%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           + S   CD+SL  + RV  LV  +TL+EK+  L D A G  RLGLP YEWW+EA HGV +
Sbjct: 157 LCSHRVCDTSLSIAERVNSLVKSLTLEEKILNLVDAAAGSTRLGLPFYEWWNEATHGVGS 216

Query: 109 VGPGTHFDDVIPG---ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
             PG  F         ATSFP  IL  ASF+ +L +KI + +  E RA  N G +G  +W
Sbjct: 217 A-PGVQFTSKPANFSYATSFPAPILIAASFDNALIRKIAEVIGKEGRAFANNGFSGFDFW 275

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN  RDPRWGR  ETPGED FV   Y  N++ GLQ  +          +  +V + C
Sbjct: 276 APNINGFRDPRWGRGQETPGEDTFVAQNYIRNFIPGLQGDD---------PKNKQVIATC 326

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA YD++      RY  +   T+QD+ + FL PF+ CV++ D  S+MCSYN V+GIP+
Sbjct: 327 KHYAVYDLE----TGRYGNNYNPTQQDLSDYFLAPFKTCVRDTDVGSIMCSYNSVSGIPA 382

Query: 286 CADPKLLNQTVRGEW----DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL 341
           CA+  LL++ +R  W    D H Y+V+DC+++  +   H F  D++E A A  L AG+DL
Sbjct: 383 CANEYLLDEVLRKHWGFNADYH-YVVSDCNAVTDIWQYHNF-TDTEEAAAAVALNAGVDL 440

Query: 342 DCGQYYTNFTGN-AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
           +CG  Y     + A  Q  VK   +D+SL  LY+ L  +GFFDG  +Y  L   D+    
Sbjct: 441 ECGSSYLKLNESLAANQTSVKA--MDQSLARLYSALFTIGFFDGG-KYDHLDFSDVSIPA 497

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSA-KVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
              LA EAA EG+ LLKND   LPL+S  K K+VAV+GP ANAT  M G Y+G     +S
Sbjct: 498 AQALAYEAAVEGMTLLKND-GLLPLHSQHKYKSVAVIGPFANATTQMQGGYSGNAPYLIS 556

Query: 460 PIAGFSGYANVTYKTGCDDVACKSNNSIFAAS-EAAKTADATIILAGLDLSVEAESLDRE 518
           P+  F                   N + F AS  AAK +D  + L G+D S+E+E++DR 
Sbjct: 557 PLVAFESDHRWKVNYAVGTAINDQNTTGFEASLAAAKKSDLIVYLGGIDNSIESETIDRT 616

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
            L  PG Q  LI  ++ ++K P+++V    G VD +    N +I+A++WAGYP + GG A
Sbjct: 617 SLAWPGNQLDLIKSLSNLSK-PMVVVQFGGGQVDDSALLENKDIQALIWAGYPSQSGGTA 675

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           + D++ GK +P GRLP+T Y   Y   + +  + LRP     +PGRTYK+Y G  + PFG
Sbjct: 676 LLDILVGKRSPAGRLPVTQYPASYADQINIFDINLRPNSKDSHPGRTYKWYTGKPVIPFG 735

Query: 639 YGLSYTQFKY 648
           +GL YT+FK+
Sbjct: 736 HGLHYTKFKF 745


>gi|380293100|gb|AFD50200.1| beta-xylosidase [Hypocrea orientalis]
          Length = 797

 Score =  431 bits (1108), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 274/733 (37%), Positives = 404/733 (55%), Gaps = 44/733 (6%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG-- 110
           L CDSS  Y  R + L+S  TL+E +    +   GVPRLGLP Y+ W+EALHG+      
Sbjct: 61  LVCDSSAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFA 120

Query: 111 -PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNI 169
             G  F+     ATSFP  ILTTA+ N +L  +I   +ST+ARA  N GR GL  ++PN+
Sbjct: 121 TKGGQFE----WATSFPMPILTTAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNV 176

Query: 170 NVARDPRWGRITETPGEDPFVVGR-YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           N  R P WGR  ETPGED F +   Y   Y+ G+Q          ++   LKV++  KH+
Sbjct: 177 NGFRSPLWGRGQETPGEDAFFLSSAYTYEYITGIQG--------GVDPEQLKVAATVKHF 228

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
           A YD++NW    R  FDA +T+QD+ E +   F    +   + S+MCSYN VNG+PSCA+
Sbjct: 229 AGYDLENWNNQSRLGFDAIITQQDLSEYYTPQFLAAARYAKSRSLMCSYNSVNGVPSCAN 288

Query: 289 PKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
              L   +R  W     GY+ +DCD++  + + H + ++    A A +L+AG D+DCGQ 
Sbjct: 289 SFFLQTLLRESWGFPEWGYVSSDCDAVYNVFNPHDYASNQSS-AAASSLRAGTDIDCGQT 347

Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAA 406
           Y      +   G+V   +I++S+  LY  L+RLG+FD   QY SLG +D+   +   ++ 
Sbjct: 348 YPWHLNESFVAGEVTRGEIERSVTRLYANLVRLGYFDKKNQYRSLGWKDVVKTDAWNISY 407

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI--AGF 464
           EAA EGIVLLKND  TLPL S KV+++A++GP ANAT  M GNY G     +SP+  A  
Sbjct: 408 EAAVEGIVLLKND-GTLPL-SKKVRSIALIGPWANATTQMQGNYFGPAPYLISPLEAAKK 465

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
           +GY +V ++ G  ++A  S      A  AAK +DA + L G+D ++E E  DR D+  PG
Sbjct: 466 AGY-HVNFELGT-EIAGNSTAGFAKAIAAAKKSDAIVYLGGIDNTIEQEGADRTDIAWPG 523

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q  LI Q++EV K P++++ M  G VD +  ++N  + +++W GYPG+ GG A+ D++ 
Sbjct: 524 NQLDLIKQLSEVGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILS 582

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           GK  P GRL  T Y  +YV   P   M LRP D    PG+TY +Y G  +Y FG GL YT
Sbjct: 583 GKRAPAGRLITTQYPAEYVHQFPQNDMNLRP-DGKSNPGQTYIWYTGKPVYEFGSGLFYT 641

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
            FK  L S  K ++ N + +    +  YT      + P            F F+ + +N 
Sbjct: 642 TFKETLASHPKCLKFNTSSILSAPHPGYTYSE---QIP-----------VFTFEANIKNS 687

Query: 705 GSTDGSDVVIVYSKPPAEIAATYI-KQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDY 762
           G T+     +++ +      A Y  K ++GF R+  ++ G + ++        +L  VD 
Sbjct: 688 GKTESPYTAMLFVRTSNAGPAPYPNKWLVGFDRLADIKPGHSSKLSIPI-PVSALARVDS 746

Query: 763 AANTLLPAGEHTI 775
             N ++  G++ +
Sbjct: 747 YGNRIVYPGKYEL 759


>gi|336471692|gb|EGO59853.1| hypothetical protein NEUTE1DRAFT_99999 [Neurospora tetrasperma FGSC
           2508]
 gi|350292807|gb|EGZ74002.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
          Length = 770

 Score =  429 bits (1102), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 260/741 (35%), Positives = 399/741 (53%), Gaps = 45/741 (6%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           ++S   CD +L    R   LV+ MT +EK+Q L   + G PR+GLP Y WWSEALHGV+ 
Sbjct: 36  LASLKVCDVTLSPPQRAAALVAAMTTEEKLQNLVSKSKGAPRIGLPAYNWWSEALHGVA- 94

Query: 109 VGPGTHF---DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
             PGT F   D     +TSFP  +L  A+F++ L +K+G+ + TE RA  N G +G  YW
Sbjct: 95  YAPGTQFWSGDGPFNASTSFPMPLLMAATFDDELIEKVGEVIGTEGRAFGNAGFSGFDYW 154

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PN+N  +DPRWGR +ETPGED   + RYA + +RGLQ            +R  +V + C
Sbjct: 155 TPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIRGLQG----------PARERRVVATC 204

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYAA D ++W G  R+ F+A+VT QD+ E +L PF+ C ++    S+MCSYN VNG+P+
Sbjct: 205 KHYAANDFEDWNGSTRHDFNAKVTLQDLAEYYLSPFQQCARDSKVGSIMCSYNAVNGVPA 264

Query: 286 CADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           CA+  L+   +R  W+      YI +DC+++  +  NH + A++  +  A   +AG+D  
Sbjct: 265 CANTYLMQTILREHWNWTAPGNYITSDCEAVLDISANHHY-AETNAEGTALAFEAGIDSS 323

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDEN 401
           C    ++    A  QG ++++ +D++LK +Y  L+R+G+FDG+  +Y SLG +D+ S ++
Sbjct: 324 CEYESSSDIPGAWTQGLLEQSTVDRALKRIYEGLVRVGYFDGNHSEYASLGWKDVNSPKS 383

Query: 402 IELAAEAAREGIVLLKNDQNTLPLN--SAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
            E+A +AA EGIVLLKND+ TLPL+  +     +A++G  AN    + G Y+G P    S
Sbjct: 384 QEVALQAAVEGIVLLKNDK-TLPLDLRTDPKSKLAMIGFWANDPKTLSGGYSGKPAFEHS 442

Query: 460 PIAGFSGYANVTYKTGCDDVACKSNNSIF--AASEAAKTADATIILAGLDLSVEAESLDR 517
           P+             G   +   ++N  +  AA EAAK A+  +   G D S   E+ DR
Sbjct: 443 PVYAAQAMGFSVTTAGGPVLQNSTSNDTWTQAALEAAKDANYILYFGGQDTSAAGETKDR 502

Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
             +  P  Q QLI  ++++ K P+++V M    +D         + AILWA + G++GG 
Sbjct: 503 TTINWPEAQLQLITTLSKLGK-PLVVVQM-GDQLDNTPLLAAKAVNAILWANWLGQDGGT 560

Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT-LYP 636
           A+  ++ G  NP GRLP+T Y  +Y   +P+T M LRP D L  PGRTY++Y  PT + P
Sbjct: 561 AVMQILTGLKNPAGRLPVTQYPANYTAAVPMTDMNLRPSDKL--PGRTYRWY--PTAVQP 616

Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           FG+GL YT F+  +      + +  + L  C   N  +       P              
Sbjct: 617 FGFGLHYTTFQTKIAVPLPRLAIQ-DLLSRCGGDNANAYPDTCALP-------------P 662

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
            KV+  N G+     VV+ +           IK ++ + R+   +  +K    +      
Sbjct: 663 LKVEVTNSGNRSSDYVVLAFLAGDVGPKPYPIKTLVSYTRLRDLSPGHKTTAHLKWTLGD 722

Query: 757 LNIVDYAANTLLPAGEHTIFV 777
           +   D   NT+L  G +T+ V
Sbjct: 723 IARYDEQGNTVLYPGTYTVTV 743


>gi|60729621|pir||JC7966 xylan 1,4-beta-xylosidase (EC 3.2.1.37) - Talaromyces emersonii
 gi|21326570|gb|AAL32053.2|AF439746_1 beta-xylosidase [Rasamsonia emersonii]
          Length = 796

 Score =  429 bits (1102), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 274/740 (37%), Positives = 401/740 (54%), Gaps = 43/740 (5%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +S+ L C++S     R + LVS  TL+E +    + A GVPRLGLPQY+ W+EALHG+  
Sbjct: 58  LSTNLVCNTSADPWARAEALVSLFTLEELINNTQNTAPGVPRLGLPQYQVWNEALHGLDR 117

Query: 109 VGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
                +F D      ATSFP  IL+ ASFN +L  +I   ++T+ARA  N GR GL  ++
Sbjct: 118 A----NFSDSGEYSWATSFPMPILSMASFNRTLINQIASIIATQARAFNNAGRYGLDSYA 173

Query: 167 PNINVARDPRWGRITETPGEDPFVVGR-YAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           PNIN  R P WGR  ETPGED F +   YA  Y+ GLQ          ++   +K+ +  
Sbjct: 174 PNINGFRSPLWGRGQETPGEDAFFLSSAYAYEYITGLQG--------GVDPEHVKIVATA 225

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A YD++NW  V R   +A +T+QD+ E +   F    +     S+MCSYN VNG+PS
Sbjct: 226 KHFAGYDLENWGNVSRLGSNAIITQQDLSEYYTPQFLASARYAKTRSLMCSYNAVNGVPS 285

Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           C++   L   +R  ++    GY+ +DCD++  + + H + A ++  A A +L AG D+DC
Sbjct: 286 CSNSFFLQTLLRESFNFVDDGYVSSDCDAVYNVFNPHGY-ALNQSGAAADSLLAGTDIDC 344

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENI 402
           GQ        +  +  V   DI+KSL  LY  L+RLG+FDG+   Y +L   D+ + +  
Sbjct: 345 GQTMPWHLNESFYERYVSRGDIEKSLTRLYANLVRLGYFDGNNSVYRNLNWNDVVTTDAW 404

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI- 461
            ++ EAA EGI LLKND  TLPL S KV+++A++GP ANATV M GNY G P   +SP+ 
Sbjct: 405 NISYEAAVEGITLLKND-GTLPL-SKKVRSIALIGPWANATVQMQGNYYGTPPYLISPLE 462

Query: 462 -AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
            A  SG+  V Y  G  +++  S      A  AAK +D  I   G+D ++EAE  DR DL
Sbjct: 463 AAKASGFT-VNYAFGT-NISTDSTQWFAEAISAAKKSDVIIYAGGIDNTIEAEGQDRTDL 520

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
             PG Q  LI Q+++V K P++++ M  G VD +  + N N+ A++W GYPG+ GG A+ 
Sbjct: 521 KWPGNQLDLIEQLSKVGK-PLVVLQMGGGQVDSSSLKANKNVNALVWGGYPGQSGGAALF 579

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
           D++ GK  P GRL  T Y  +Y    P   M LRP  S   PG+TY +Y G  +Y FG+G
Sbjct: 580 DILTGKRAPAGRLVSTQYPAEYATQFPANDMNLRPNGS--NPGQTYIWYTGTPVYEFGHG 637

Query: 641 LSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
           L YT+F+       ++     NK      L    D   T  PG    +L    +    VD
Sbjct: 638 LFYTEFQ-------ESAAAGTNKTSTLDIL----DLVPTPHPGYEYIELV--PFLNVTVD 684

Query: 701 FQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV-FVRAGRNKRIKFVFNACKSLNI 759
            +NVG T      ++++   A       K ++GF R+  +   +  ++ F      ++  
Sbjct: 685 VKNVGHTPSPYTGLLFANTTAGPKPYPNKWLVGFDRLATIHPAKTAQVTFPV-PLGAIAR 743

Query: 760 VDYAANTLLPAGEHTIFVGN 779
            D   N ++  GE+ + + N
Sbjct: 744 ADENGNKVIFPGEYELALNN 763


>gi|70996610|ref|XP_753060.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
 gi|74672055|sp|Q4WRB0.1|XYND_ASPFU RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|66850695|gb|EAL91022.1| beta-xylosidase XylA [Aspergillus fumigatus Af293]
          Length = 792

 Score =  429 bits (1102), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 275/794 (34%), Positives = 423/794 (53%), Gaps = 49/794 (6%)

Query: 1   MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQ--------MSSF 52
           +AK ++++L   L  AL   +T+ VD N  ++P     P   + + L         +S  
Sbjct: 3   VAKSIAAVLVALLPGALAQANTSYVDYNVEANPDLT--PQSVATIDLSFPDCENGPLSKT 60

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L CD+S     R   LVS  T +E V   G+ + GVPRLGLP Y+ WSEALHG+      
Sbjct: 61  LVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLDRA--- 117

Query: 113 THFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
            +F D      ATSFP  ILT ++ N +L  +I   ++T+ RA  N+GR GL  ++PNIN
Sbjct: 118 -NFTDEGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGLDVYAPNIN 176

Query: 171 VARDPRWGRITETPGEDPFVVGR-YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
             R   WGR  ETPGED + +   YA  Y+ G+Q          ++   LK+ +  KHYA
Sbjct: 177 AFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQG--------GVDPEHLKLVATAKHYA 228

Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
            YD++NW G  R   D  +T+Q++ E +   F +  ++    SVMCSYN VNG+PSCA+ 
Sbjct: 229 GYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVNGVPSCANS 288

Query: 290 KLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
             L   +R  +     GY+ +DCDS   + + H+F A+    A A +++AG D+DCG  Y
Sbjct: 289 FFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANIT-GAAADSIRAGTDIDCGTTY 347

Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAA 406
             + G A  + +V   +I++ +  LY+ L+RLG+FDG+   Y  L   D+ + +   ++ 
Sbjct: 348 QYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSVYRDLTWNDVVTTDAWNISY 407

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
           EAA EGIVLLKND  TLPL +  V++VA++GP  N T  + GNY G     +SP+  F  
Sbjct: 408 EAAVEGIVLLKND-GTLPL-AKSVRSVALIGPWMNVTTQLQGNYFGPAPYLISPLNAFQN 465

Query: 467 YA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
              +V Y  G + ++  S +    A  AAK +D  I   G+D ++EAE++DR ++  PG 
Sbjct: 466 SDFDVNYAFGTN-ISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDRMNITWPGN 524

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q QLI+Q++++ K P+I++ M  G VD +  ++N N+ +++W GYPG+ GG+A+ D++ G
Sbjct: 525 QLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQALLDIITG 583

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
           K  P GRL +T Y  +Y    P T M LRP  +   PG+TY +Y G  +Y FG+GL YT 
Sbjct: 584 KRAPAGRLVVTQYPAEYATQFPATDMSLRPHGN--NPGQTYMWYTGTPVYEFGHGLFYTT 641

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
           F  +L    K  + + N +Q      +   A+  + P  L+N         F V   N G
Sbjct: 642 FHASLPGTGKD-KTSFN-IQDLLTQPHPGFANVEQMP--LLN---------FTVTITNTG 688

Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
                   ++++   A  A    K ++GF R+        +   +     S+   D A N
Sbjct: 689 KVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPHRSQTMTIPVTIDSVARTDEAGN 748

Query: 766 TLLPAGEHTIFVGN 779
            +L  G++ + + N
Sbjct: 749 RVLYPGKYELALNN 762


>gi|292495282|sp|B0XP71.1|XYND_ASPFC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|159131796|gb|EDP56909.1| beta-xylosidase XylA [Aspergillus fumigatus A1163]
          Length = 792

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 275/794 (34%), Positives = 423/794 (53%), Gaps = 49/794 (6%)

Query: 1   MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQ--------MSSF 52
           +AK ++++L   L  AL   +T+ VD N  ++P     P   + + L         +S  
Sbjct: 3   VAKSIAAVLVALLPGALAQANTSYVDYNVEANPNLT--PQSVATIDLSFPDCENGPLSKT 60

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L CD+S     R   LVS  T +E V   G+ + GVPRLGLP Y+ WSEALHG+      
Sbjct: 61  LVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLDRA--- 117

Query: 113 THFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
            +F D      ATSFP  ILT ++ N +L  +I   ++T+ RA  N+GR GL  ++PNIN
Sbjct: 118 -NFTDEGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGLDVYAPNIN 176

Query: 171 VARDPRWGRITETPGEDPFVVGR-YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
             R   WGR  ETPGED + +   YA  Y+ G+Q          ++   LK+ +  KHYA
Sbjct: 177 AFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQG--------GVDPEHLKLVATAKHYA 228

Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
            YD++NW G  R   D  +T+Q++ E +   F +  ++    SVMCSYN VNG+PSCA+ 
Sbjct: 229 GYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVNGVPSCANS 288

Query: 290 KLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
             L   +R  +     GY+ +DCDS   + + H+F A+    A A +++AG D+DCG  Y
Sbjct: 289 FFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANIT-GAAADSIRAGTDIDCGTTY 347

Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAA 406
             + G A  + +V   +I++ +  LY+ L+RLG+FDG+   Y  L   D+ + +   ++ 
Sbjct: 348 QYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSVYRDLTWNDVVTTDAWNISY 407

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
           EAA EGIVLLKND  TLPL +  V++VA++GP  N T  + GNY G     +SP+  F  
Sbjct: 408 EAAVEGIVLLKND-GTLPL-AKSVRSVALIGPWMNVTTQLQGNYFGPAPYLISPLNAFQN 465

Query: 467 YA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
              +V Y  G + ++  S +    A  AAK +D  I   G+D ++EAE++DR ++  PG 
Sbjct: 466 SDFDVNYAFGTN-ISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDRMNITWPGN 524

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q QLI+Q++++ K P+I++ M  G VD +  ++N N+ +++W GYPG+ GG+A+ D++ G
Sbjct: 525 QLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQALLDIITG 583

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
           K  P GRL +T Y  +Y    P T M LRP  +   PG+TY +Y G  +Y FG+GL YT 
Sbjct: 584 KRAPAGRLVVTQYPAEYATQFPATDMSLRPHGN--NPGQTYMWYTGTPVYEFGHGLFYTT 641

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
           F  +L    K  + + N +Q      +   A+  + P  L+N         F V   N G
Sbjct: 642 FHASLPGTGKD-KTSFN-IQDLLTQPHPGFANVEQMP--LLN---------FTVTITNTG 688

Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
                   ++++   A  A    K ++GF R+        +   +     S+   D A N
Sbjct: 689 KVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPHRSQTMTIPVTIDSVARTDEAGN 748

Query: 766 TLLPAGEHTIFVGN 779
            +L  G++ + + N
Sbjct: 749 RVLYPGKYELALNN 762


>gi|76160898|gb|ABA40420.1| Xld [Aspergillus fumigatus]
          Length = 792

 Score =  428 bits (1100), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 275/794 (34%), Positives = 423/794 (53%), Gaps = 49/794 (6%)

Query: 1   MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQ--------MSSF 52
           +AK ++++L   L  AL   +T+ VD N  ++P     P   + + L         +S  
Sbjct: 3   VAKSIAAVLVALLPGALAQANTSYVDYNVEANPDLT--PQSVATIDLSFPDCENGPLSKT 60

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L CD+S     R   LVS  T +E V   G+ + GVPRLGLP Y+ WSEALHG+      
Sbjct: 61  LVCDTSARPHDRAAALVSMFTFEELVNNTGNTSPGVPRLGLPPYQVWSEALHGLDRA--- 117

Query: 113 THFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
            +F D      ATSFP  ILT ++ N +L  +I   ++T+ RA  N+GR GL  ++PNIN
Sbjct: 118 -NFTDEGEYSWATSFPMPILTMSALNRTLINQIATIIATQGRAFNNVGRYGLDVYAPNIN 176

Query: 171 VARDPRWGRITETPGEDPFVVGR-YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
             R   WGR  ETPGED + +   YA  Y+ G+Q          ++   LK+ +  KHYA
Sbjct: 177 AFRSAMWGRGQETPGEDAYCLASAYAYEYITGIQG--------GVDPEHLKLVATAKHYA 228

Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
            YD++NW G  R   D  +T+Q++ E +   F +  ++    SVMCSYN VNG+PSCA+ 
Sbjct: 229 GYDLENWDGHSRLGNDMNITQQELSEYYTPQFLVAARDAKVHSVMCSYNAVNGVPSCANS 288

Query: 290 KLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
             L   +R  +     GY+ +DCDS   + + H+F A+    A A +++AG D+DCG  Y
Sbjct: 289 FFLQTLLRDTFGFVEDGYVSSDCDSAYNVWNPHEFAANIT-GAAADSIRAGTDIDCGTTY 347

Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAA 406
             + G A  + +V   +I++ +  LY+ L+RLG+FDG+   Y  L   D+ + +   ++ 
Sbjct: 348 QYYFGEAFDEQEVTRAEIERGVIRLYSNLVRLGYFDGNGSVYRDLTWNDVVTTDAWNISY 407

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
           EAA EGIVLLKND  TLPL +  V++VA++GP  N T  + GNY G     +SP+  F  
Sbjct: 408 EAAVEGIVLLKND-GTLPL-AKSVRSVALIGPWMNVTTQLQGNYFGPAPYLISPLNAFQN 465

Query: 467 YA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
              +V Y  G + ++  S +    A  AAK +D  I   G+D ++EAE++DR ++  PG 
Sbjct: 466 SDFDVNYAFGTN-ISSHSTDGFSEALSAAKKSDVIIFAGGIDNTLEAEAMDRMNITWPGN 524

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q QLI+Q++++ K P+I++ M  G VD +  ++N N+ +++W GYPG+ GG+A+ D++ G
Sbjct: 525 QLQLIDQLSQLGK-PLIVLQMGGGQVDSSSLKSNKNVNSLIWGGYPGQSGGQALLDIITG 583

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
           K  P GRL +T Y  +Y    P T M LRP  +   PG+TY +Y G  +Y FG+GL YT 
Sbjct: 584 KRAPAGRLVVTQYPAEYATQFPATDMSLRPHGN--NPGQTYMWYTGTPVYEFGHGLFYTT 641

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
           F  +L    K  + + N +Q      +   A+  + P  L+N         F V   N G
Sbjct: 642 FHASLPGTGKD-KTSFN-IQDLLTQPHPGFANVEQMP--LLN---------FTVTITNTG 688

Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
                   ++++   A  A    K ++GF R+        +   +     S+   D A N
Sbjct: 689 KVASDYTAMLFANTTAGPAPYPNKWLVGFDRLASLEPHRSQTMTIPVTIDSVARTDEAGN 748

Query: 766 TLLPAGEHTIFVGN 779
            +L  G++ + + N
Sbjct: 749 RVLYPGKYELALNN 762


>gi|421077748|ref|ZP_15538711.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           JBW45]
 gi|392524151|gb|EIW47314.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           JBW45]
          Length = 750

 Score =  427 bits (1099), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 262/760 (34%), Positives = 394/760 (51%), Gaps = 97/760 (12%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           +M  F + D +L +  R KDLVSRMTL+EKV Q+   +  +PRLG+P Y WWSEALHGV+
Sbjct: 26  RMEIFDYQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVA 85

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR-------- 159
             G           AT FP  I   A+F+E L   + + +S E RA ++  +        
Sbjct: 86  RAGV----------ATVFPQAIGLAATFDEKLIHDVAEVISIEGRAKFHEFQRKGDHGIY 135

Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT+WSPN+N+ RDPRWGR  ET GEDP++ GR  V++++GLQ           + + L
Sbjct: 136 KGLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQG---------QDKKYL 186

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           + ++C KH+A   V +    +R+ FDA V+ +D+ ET+L  F+ CVKE +  +VM +YNR
Sbjct: 187 RAAACAKHFA---VHSGPESERHSFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNR 243

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
           VNG P C    LL +T+R EW   G++V+DC +I+   +NH+ +  S  ++VA  L  G 
Sbjct: 244 VNGEPCCGSNMLLKETLRQEWGFTGHVVSDCWAIKDFHENHR-VTSSAPESVALALNNGC 302

Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDIC 397
           DL+CG  Y N    A Q+G V E  I+ ++  L    M+LG FD +    Y ++G     
Sbjct: 303 DLNCGNMYLNLL-IAYQEGLVTEEAINTAVTRLMLTRMKLGLFDTAENVPYTNIGFHQND 361

Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
             E+ E A E +++ +VLLKN+ N LPL+   + ++AV+GP+AN+  A+ GNY G    Y
Sbjct: 362 CQEHREFALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNANSREALTGNYCGTASNY 421

Query: 458 MSPIAGFSGYAN----VTYKTGCDDVACKSNN------SIFAASEAAKTADATIILAGLD 507
           ++ + G          V+Y  GC     K+ N          A   A+ AD  ++  GLD
Sbjct: 422 ITVLEGIREAVGKDTIVSYAQGCHLYRDKAENLGEARDRFAEAVSTAERADIVVMCMGLD 481

Query: 508 LSVEAE---------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAET 558
            S+E E         S D+  L LPG Q +L+  + +  K P+ILV+++   + + +A  
Sbjct: 482 ASIEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYQTGK-PIILVLLAGSALAVTWAAE 540

Query: 559 NTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDS 618
              + AI+ A YPG EGG+A+A  +FG+++P G+LPIT+Y          T+  L     
Sbjct: 541 K--VPAIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYR---------TTEELPEFTD 589

Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
                RTY++     LYPFGYGL YT F Y         Q+ LN+ Q        S    
Sbjct: 590 YSMKNRTYRYMTKEALYPFGYGLGYTTFAYR--------QLQLNRTQ-------ISAGEN 634

Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
            +C  VLV               +N G+    + V +Y K         I ++ G Q+V 
Sbjct: 635 VQC-SVLV---------------KNTGNFASDETVQLYIKDVKASVEVPILELQGIQKVH 678

Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           +  G  + + F     + L +++   N +L  G   I+VG
Sbjct: 679 LLPGTEQEVFFTLTP-RQLALINEEGNCILEPGAFEIYVG 717


>gi|386347261|ref|YP_006045510.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
           6578]
 gi|339412228|gb|AEJ61793.1| glycoside hydrolase family 3 domain protein [Spirochaeta
           thermophila DSM 6578]
          Length = 693

 Score =  426 bits (1096), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 270/736 (36%), Positives = 395/736 (53%), Gaps = 92/736 (12%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R+  L+S+M+++EK   +   A G+PRLG+P Y WW+EALHGV+N G           AT
Sbjct: 6   RMTSLLSKMSIEEKAGLMLHRAKGIPRLGIPHYNWWNEALHGVANSGE----------AT 55

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYN-LGRA-------GLTYWSPNINVARDP 175
            FP  I   A+F+  L +++ +A+STEARA +N +G+        GLT+WSPNIN+ RDP
Sbjct: 56  VFPQAIGLAATFDPDLVRRVAEAISTEARAKFNAIGKERAAEYERGLTFWSPNINIYRDP 115

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDPF+  +  V++V+GLQ    +          ++V++C KHYA +    
Sbjct: 116 RWGRGQETYGEDPFLTSKIGVSFVKGLQGDHPYY---------MRVAACAKHYAVH--SG 164

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
            +G+ R+ FDARV+E+D+ ET+L  FE  VK G   +VM +YNRVNG P+C   +LL++ 
Sbjct: 165 PEGL-RHVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAYNRVNGEPACGSKRLLDEI 222

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
           +R  W   G++V+DC +I     +HK   D  E ++A  L+AG DL+CG  Y +   +AV
Sbjct: 223 LRKRWGFKGHVVSDCWAIADFHLHHKVTKDPIE-SIAMALEAGCDLNCGNTYEHLL-DAV 280

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
           + G V E  +D+S+  L + L RLG F     Y  L   DI  + +  LA EAA + +VL
Sbjct: 281 KAGVVSEELVDRSVARLLSTLDRLGLFTDDHPYARLSLSDIDWEAHRALAREAAEKSVVL 340

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA----NVT 471
           LKN+   LP +  K++ + V GP+A   VA++GNYAG+  R ++ + G +GYA     VT
Sbjct: 341 LKNN-GILPFDRQKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGITGYAGPGITVT 399

Query: 472 YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR---------EDLWL 522
           YK GC  +     N I  AS  A+ AD T+ + G D +VE E  D           DL L
Sbjct: 400 YKIGC-PLQGNKINPIDWASGVARYADVTVAVMGRDSTVEGEEGDAIFSDNYGDLSDLDL 458

Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
           P  Q + + ++ E+ K P+++V++S  G  +   E      AI++A YPGEEGG AIA V
Sbjct: 459 PREQIEYLRRIKEIGK-PLVVVLLS--GAPVCSPELEELADAIVYAWYPGEEGGNAIARV 515

Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
           +FG+ +P GRLPIT+  G  V  LP       P       GRTY++     LYPFG+GLS
Sbjct: 516 LFGEISPSGRLPITFPRG--VDQLP-------PFTDYSMEGRTYRYMREEPLYPFGFGLS 566

Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ 702
           Y  F Y                   R L  ++     R            +  E   + +
Sbjct: 567 YATFSY-------------------RGLQSSASRWDKR------------ETLELVCEVE 595

Query: 703 NVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDY 762
           N  S    +VV +Y +         +  + GF RV + AG  K+++FV +  + L+ +D 
Sbjct: 596 NTSSIPADEVVQLYVRWEDAPFRVPLWSLKGFTRVSLGAGERKQVRFVLSP-EELSFIDE 654

Query: 763 AANTLLPAGEHTIFVG 778
               +LP G     VG
Sbjct: 655 EGRKVLPEGRLHFHVG 670


>gi|410628680|ref|ZP_11339398.1| beta-glucosidase [Glaciecola mesophila KMM 241]
 gi|410151684|dbj|GAC26167.1| beta-glucosidase [Glaciecola mesophila KMM 241]
          Length = 732

 Score =  426 bits (1096), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 268/751 (35%), Positives = 394/751 (52%), Gaps = 93/751 (12%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           ++ +  L +  R + LV+ MT+DEK+ QL      +PRL +PQY WW+EALHG++  G  
Sbjct: 29  IWFNPELSFETRAQALVNAMTIDEKITQLSHSTPAIPRLEVPQYNWWNEALHGIARNGK- 87

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTY 164
                    AT FP  I   A+F+  L +++  A+S EARA Y + +        AGLT+
Sbjct: 88  ---------ATIFPQAIGLGATFDPELAQEVANAISDEARAKYAIAQSIGNQGQYAGLTF 138

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           W+PN+N+ RDPRWGR  ET GEDP +  +    +V+GLQ  +          + LK +  
Sbjct: 139 WTPNVNIFRDPRWGRGQETYGEDPLLTSQMGTAFVKGLQGDD---------PKYLKSAGV 189

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
            KH+A   V +     R+ FD   +++D+ ET+L  FE  V +   + VMC+YN V G P
Sbjct: 190 AKHFA---VHSGPESLRHQFDVEPSKKDLYETYLPAFEALVTQAKVAGVMCAYNGVYGQP 246

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
           SCA   LL + ++ +W  +GY+V+DC ++      HK   +  E A A  L+AG+DL+CG
Sbjct: 247 SCASEFLLGEMLKKKWQFNGYVVSDCGALHDFHSGHKVTHNRVESA-ALALRAGVDLNCG 305

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENI 402
             Y      A ++G + ++ ID+ LK L  +  RLG FD S    + ++G++ I S E+I
Sbjct: 306 FTYEKSLKAAFEEGLITQSLIDQRLKNLLMIRFRLGLFDPSELNPHNAIGQEVIHSLEHI 365

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
           ELA + A + IVLLKN++  LPL S  +K   V GP A ++  ++GNY GI    ++ + 
Sbjct: 366 ELARKVAAKSIVLLKNEKQVLPL-SKDIKVPYVTGPFAASSDMLMGNYYGISDSLVTVLE 424

Query: 463 GFSGY----ANVTYKTGCDDVACKSN-NSIFAASEAAKTADATIILAGLDLSVEAESLD- 516
           G +G     +++ Y+ G   +   SN N +  A E AKTADA I + G+   +E E +D 
Sbjct: 425 GIAGKVSLGSSLNYRAGA--LPFHSNINPLNWAPEVAKTADAVIAVVGISADMEGEEVDA 482

Query: 517 --------REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
                   R  + LP  Q   + Q+AE  KGP+ILV+ +   VDI+  E +    AILW 
Sbjct: 483 IASADRGDRVAITLPQNQVDYVKQLAENKKGPLILVVAAGSPVDIS--ELDPLADAILWI 540

Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
            YPGE+GG A+ADV+FG  NP G LP+T+           T   L P D     GRTYKF
Sbjct: 541 WYPGEQGGNAVADVIFGDTNPSGHLPLTFVK---------TIDDLPPFDDYTMTGRTYKF 591

Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
                LYPFG+GLSYTQFK+  LS +K         Q   N+N +               
Sbjct: 592 LKKLPLYPFGFGLSYTQFKFGKLSLSKRAP------QEGENINIS--------------- 630

Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
                     V+ +N  + DG  VV VY  P   +    I  +  F+RV + A   + I+
Sbjct: 631 ----------VEVENSTALDGETVVQVYLSPQVPLKNEAITNLKAFKRVHIGAYEKRLIE 680

Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           F     K+L  V+ A   + P+G +T+ VG+
Sbjct: 681 FTIEG-KNLYRVNDAGENVWPSGAYTLAVGD 710


>gi|224068498|ref|XP_002302758.1| predicted protein [Populus trichocarpa]
 gi|222844484|gb|EEE82031.1| predicted protein [Populus trichocarpa]
          Length = 462

 Score =  426 bits (1095), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 211/449 (46%), Positives = 295/449 (65%), Gaps = 13/449 (2%)

Query: 336 KAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLG 392
           +A LDLDCG +    T +AV++G + E +I+ +L    TV MRLG FDG P    Y +LG
Sbjct: 5   QASLDLDCGPFLGQHTEDAVRKGLLTEAEINNALLNTLTVQMRLGMFDGEPSSKPYGNLG 64

Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
             D+C+  + ELA EAAR+GIVLLKN    LPL++   ++VA++GP++N TV MIGNYAG
Sbjct: 65  PTDVCTPAHQELALEAARQGIVLLKNHGPPLPLSTRHHQSVAIIGPNSNVTVTMIGNYAG 124

Query: 453 IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
           + C Y +P+ G   YA   Y+ GC DVAC S+    AA +AA+ ADAT+++ GLD S+EA
Sbjct: 125 VACGYTTPLQGIGRYAKTIYQQGCADVACVSDQQFVAAMDAARQADATVLVMGLDQSIEA 184

Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
           ES DR +L LPG Q +LI++VA  +KGP ILV+MS G +D++FAE +  I  I+WAGYPG
Sbjct: 185 ESRDRTELLLPGRQQELISKVAAASKGPTILVLMSGGPIDVSFAENDPKIGGIVWAGYPG 244

Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
           + GG AI+DV+FG  NPGG+LP+TWY  DYV  LP+T+M +RP  S GYPGRTY+FY G 
Sbjct: 245 QAGGAAISDVLFGTTNPGGKLPMTWYPQDYVTNLPMTNMAMRPSKSNGYPGRTYRFYKGK 304

Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQVNLN-KLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
            +YPFG+G+SYT F + + S    + V L+   Q  RN   +  A       + V   RC
Sbjct: 305 VVYPFGHGISYTNFVHTIASAPTMVSVPLDGHRQASRNATISGKA-------IRVTHARC 357

Query: 692 DDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
           +   F  +VD +N GS DG+  ++VYSKPPA   A  +KQ++ F++V V AG  +R+   
Sbjct: 358 NRLSFGVQVDVKNTGSMDGTHTLLVYSKPPAGHWAP-LKQLVAFEKVHVAAGTQQRVGIN 416

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            + CK L++VD +    +P G H++ +G+
Sbjct: 417 VHVCKFLSVVDRSGIRRIPMGAHSLHIGD 445


>gi|367053033|ref|XP_003656895.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
 gi|347004160|gb|AEO70559.1| glycoside hydrolase family 3 protein [Thielavia terrestris NRRL
           8126]
          Length = 758

 Score =  426 bits (1094), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 283/748 (37%), Positives = 395/748 (52%), Gaps = 68/748 (9%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDF------AHGVPRLGLPQYEWWSEALHGVSN 108
           CD++     R   LV  M + EK+  L ++      + G PRLGLP YEWWSEALHGV+ 
Sbjct: 11  CDTTASPPKRAAALVEAMNITEKLANLVEYVMARSSSKGAPRLGLPPYEWWSEALHGVA- 69

Query: 109 VGPGTHFD---DVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
             PG  F+        ATSF   I  +A+F++ L +K+   +STEARA  N G AGL +W
Sbjct: 70  ASPGVSFNWSGGPFSYATSFANPITLSAAFDDELVQKVADVISTEARAFANAGSAGLDFW 129

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN  RDPRWGR +ETPGEDP  +  Y  + +RGL   EG E+         KV + C
Sbjct: 130 TPNINPWRDPRWGRGSETPGEDPVRIKGYVRSLLRGL---EGEESIK-------KVIATC 179

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYAAYD++ W  + RY FDA V+ QD+ E +L PF+ C ++    S+MCSYN +NG P+
Sbjct: 180 KHYAAYDLERWHNITRYEFDAIVSLQDLSEYYLPPFQQCARDSKVGSIMCSYNSLNGTPA 239

Query: 286 CADPKLLNQTVRGEW---DLHGYIVADCDSIQ-VMVDNHKFLADSKEDAVAQTLKAGL-- 339
           CA+  L++  +R  W   + + YI +DC++I+  + D H F   + E A A         
Sbjct: 240 CANTYLMDDILRKHWRWTEDNNYITSDCNAIKDFLPDEHNFTQTAAEAAAAAYTAGTDTV 299

Query: 340 -DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVSLGKQD 395
            ++     YT+  G A  Q  + E  ID++L+ LY  L+R G+FD    SP Y  +G  D
Sbjct: 300 CEVAGSPPYTDVVG-AYDQKLLSEEVIDRALRRLYEGLVRAGYFDPASASP-YRDIGWSD 357

Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
           + + E   LA ++A +G+VLLKND  TLP+   + KTVA++G  A+ T +M+G Y+GIP 
Sbjct: 358 VNTAEAQALALQSASDGLVLLKND-GTLPIK-LEGKTVALIGHWASGTRSMLGGYSGIPP 415

Query: 456 RYMSPIAGFSGYANVTYKTGCDDVACKS---NNSIFAASEAAKTADATIILAGLDLSVEA 512
            Y SP+   +G  N+TYK     VA  S   +     A  AA  +D  +   GLD SV +
Sbjct: 416 YYHSPVYA-AGQLNLTYKYASGPVAPASAARDTWTADALSAANKSDVILYFGGLDQSVAS 474

Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
           E  DR+ +  P  Q  LI  +A + K   ++VI     VD     TN N+ AILWAGYPG
Sbjct: 475 EDKDRDSIAWPPAQLTLIQTLAGLGK--PLVVIQLGDQVDDTPLLTNPNVSAILWAGYPG 532

Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFY-NG 631
           + GG A+ + + G   P GRLP+T Y   Y   LPLT M LRP  + G PGRTY++    
Sbjct: 533 QSGGTAVLNAITGVSPPAGRLPVTQYPSSYTSQLPLTDMSLRPDPASGRPGRTYRWLPRN 592

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
            T+ PFGYGL YT F               N  Q     N+T   S    P  L +   C
Sbjct: 593 ATVLPFGYGLHYTNFT-----------ARPNPAQ-----NFTLTPSALLAPCKLAHRDLC 636

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSK-----PPAEIAATYIKQVIGFQRVF-VRAGRNK 745
              +   V+  N G+     V +V++      PP       +K ++ + R+  +  GR  
Sbjct: 637 PLPYPVTVEVTNTGARTSDYVGLVFATTRDAGPPPHP----LKTLVAYARLRGIAPGRTA 692

Query: 746 RIKFVFNACKSLNIVDYAANTLLPAGEH 773
           R + V  A   L  VD A N +L  G +
Sbjct: 693 RAQ-VQVALGDLARVDAAGNRVLYPGRY 719


>gi|429850127|gb|ELA25427.1| glycoside hydrolase family 3 protein [Colletotrichum
           gloeosporioides Nara gc5]
          Length = 918

 Score =  425 bits (1093), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 267/733 (36%), Positives = 396/733 (54%), Gaps = 34/733 (4%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD SL    R   LV+ +T+ EK+  L + A G+PRL +P YEWWSE LHGV+   PGT 
Sbjct: 170 CDESLSDKQRAAALVAELTIWEKLDNLVNEAPGIPRLRVPPYEWWSEGLHGVAR-SPGTK 228

Query: 115 FDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           F        ATSFP  IL  ++F++ L + +G+ VS EARA  N GR+GL  +SPNIN  
Sbjct: 229 FTSKGNFSYATSFPQPILLGSAFDDELVRAVGEVVSREARAFSNAGRSGLDLYSPNINAF 288

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           +DPRWGR  ETPGED F + +Y    + GL+  +  +          K+ + CKHYAA D
Sbjct: 289 KDPRWGRGQETPGEDTFHLQKYVSAMLSGLEGDDPDK----------KLIATCKHYAAND 338

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
            +N+KGVDR  F+A ++ QD+ E +L PF+ C  E +  S MCSYN +NG P CA+  L+
Sbjct: 339 FENYKGVDRSGFNAVISTQDLSEYYLPPFKTCAVEKNVGSFMCSYNGINGTPLCANSYLI 398

Query: 293 NQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY-T 348
              +R  W  +G   Y+  DCD + +MV  H +  D    A A +++AG DL+C  +  +
Sbjct: 399 EDILRKHWGWNGDGQYVSTDCDCVALMVSYHHYAPDLGH-AAAWSMQAGTDLECNAFPGS 457

Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAA 406
               +A  Q  + E D+DK+L  +YT L+ +G FD   +    SLG  ++ + E  +LA 
Sbjct: 458 EALQSAWNQSLISEKDVDKALTRMYTSLVSVGLFDLDRKDPLRSLGWDEVNTKEAQDLAY 517

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
            AA EG VL+KND   LPL+    K  A++GP  +AT  M GNY G     +SP      
Sbjct: 518 RAAVEGAVLMKND-GILPLSPDSSKKYALIGPWVSATTQMQGNYFGPAPYLISPRKAAKD 576

Query: 467 YA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
              + TY  G      KS++S   A +AA+ AD  I + G+D ++E E+LDR  L  P  
Sbjct: 577 LGLDFTYFLGSR--TNKSDSSFAQAIKAAQAADVVIFMGGVDNTLEQETLDRNTLAWPEP 634

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q QL+  ++EV K P++++    G VD      N ++ AILW GYPG+ GG+AI D+VFG
Sbjct: 635 QLQLLRALSEVGK-PLVVLQFGGGQVDDTELLANDSVNAILWGGYPGQSGGKAILDIVFG 693

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
           +  P GRL +T Y   Y   +P T M LRP       GRTY++Y G T  P+G+GL YT+
Sbjct: 694 RAAPAGRLSVTQYPASYNDAVPATDMNLRPGPGNSGLGRTYRWYTGETPVPYGFGLHYTK 753

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
           F  ++   +    +++ ++    N     D + +  P       R        V  +N G
Sbjct: 754 FSVDMKPASNVHNIDIAQMAAEAN-----DDAASEIPSWQRGLER--RMVTVTVSAKNEG 806

Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDYAA 764
           +     V +V+ +  A       K ++G+ R+  ++ G  ++ + +    + L  VD   
Sbjct: 807 NVISDYVALVFLRSEAGPKPWPQKTLVGYTRLRNIKPGEERKEEIIIKM-EQLVRVDEVG 865

Query: 765 NTLLPAGEHTIFV 777
           N +L  G +++F+
Sbjct: 866 NRVLYEGLYSLFL 878


>gi|240146254|ref|ZP_04744855.1| beta-glucosidase [Roseburia intestinalis L1-82]
 gi|257201613|gb|EEU99897.1| beta-glucosidase [Roseburia intestinalis L1-82]
 gi|291539969|emb|CBL13080.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
           XB6B4]
          Length = 710

 Score =  425 bits (1092), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 267/747 (35%), Positives = 387/747 (51%), Gaps = 99/747 (13%)

Query: 61  YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
           Y  R  +LV +MTL+EKV Q    A  V RL +  Y WW+EALHGV+  G          
Sbjct: 13  YRKRAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAGT--------- 63

Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG--------LTYWSPNINVA 172
            AT FP  I   A+F+E L +++G AVSTEARA +N+ + G        LT+W+PN+N+ 
Sbjct: 64  -ATVFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWAPNVNIF 122

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           RDPRWGR  ET GEDP++  R  V Y+ GLQ   GH      +   LK ++C KH+A   
Sbjct: 123 RDPRWGRGHETFGEDPYLTSRLGVRYIEGLQ---GH------DENYLKAAACAKHFA--- 170

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
           V +     R+ FDA VTEQD+ ET+L  FE CVKEG   +VM +YNR NG+P C + +LL
Sbjct: 171 VHSGPEAVRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVPCCGNKRLL 230

Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
              +R EW   G++ +DC +I+   + H     + E +VA  +  G DL+CG  +  F  
Sbjct: 231 IDILRKEWGFSGHVTSDCWAIRDFHEGHHVTGTAIE-SVAMAMNNGCDLNCGTLF-GFLV 288

Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAR 410
            AV+QG VKE  +D+++  L+   M+LG FD   +  Y  +      S E  +L    AR
Sbjct: 289 QAVRQGLVKEERLDEAVTNLFMARMKLGVFDKKEENPYDKIPYLAADSREMKKLNEAVAR 348

Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY--- 467
             +VLLKN ++ LPL+  K+KTV V+GP+A++  A++GNY G   RY++ + G   Y   
Sbjct: 349 RTVVLLKNKEHILPLDKNKIKTVGVIGPNADSRRALVGNYEGTASRYITVLEGIEDYVGD 408

Query: 468 -ANVTYKTGC----DDVA--CKSNNSIFAASEAAKTADATIILAGLDLSVEAE------- 513
              V Y  GC    D  +   + N+ +       K +D  + + GLD  +E E       
Sbjct: 409 DVRVLYSEGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAVLGLDAGIEGEEGDAGNE 468

Query: 514 --SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
             S D+ DL LPG Q +++       K PVILV++S   + + +A  + ++ AI+   YP
Sbjct: 469 YGSGDKPDLNLPGLQEEILEAAVSCGK-PVILVLLSGSALAVNWA--DEHVDAIVQGWYP 525

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G  GG AIAD++FG+ NP G+LP+T+Y          T+  L   +     GRTY++   
Sbjct: 526 GARGGAAIADILFGEANPEGKLPVTFYR---------TTEELPDFEDYSMQGRTYRYMEQ 576

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
             LYPFGYGLSYT++ Y                Q+ R L      S+    G+ V     
Sbjct: 577 EALYPFGYGLSYTEYAY----------------QNVRFLEQEPVVSEGVTIGLSV----- 615

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
                     +N G  DG++ V VY K  AE +     Q+    ++ + AG  K I    
Sbjct: 616 ----------KNTGKMDGTETVQVYVK--AEHSKMPHGQLKKIVKLPLCAGEEKEINIRL 663

Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVG 778
            + ++  + D     +LP+G   IFVG
Sbjct: 664 ES-EAFMLYDENGEKILPSGHFEIFVG 689


>gi|220927661|ref|YP_002504570.1| glycoside hydrolase [Clostridium cellulolyticum H10]
 gi|219997989|gb|ACL74590.1| glycoside hydrolase family 3 domain protein [Clostridium
           cellulolyticum H10]
          Length = 712

 Score =  425 bits (1092), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 267/760 (35%), Positives = 394/760 (51%), Gaps = 102/760 (13%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           M+   + D SL +  R  DLVSRMTL+EK  QL   A  V RLG+P+Y WW+EALHGV+ 
Sbjct: 1   MNKPKYLDKSLSFKERAVDLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVAR 60

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-------- 160
            G           AT FP  I   A F++   +KI   ++TE RA YN            
Sbjct: 61  AGV----------ATVFPQAIGLAAIFDDEFLEKIADVIATEGRAKYNESSKKGDRDIYK 110

Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
           G+T+WSPN+N+ RDPRWGR  ET GEDP++  R  V +V+GLQ           + + LK
Sbjct: 111 GITFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQG----------DGKYLK 160

Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
            ++C KH+A   V +    DR+HF+A  +++DM ET+L  FE  VKE    SVM +YNR 
Sbjct: 161 SAACAKHFA---VHSGPEDDRHHFNAVASQKDMYETYLPAFEALVKEAKVESVMGAYNRT 217

Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD 340
           NG P      LL   +R +W   G++V+DC +I+   + H  +  +  ++VA  LK G D
Sbjct: 218 NGEPCNGSKTLLKDILRDDWGFDGHVVSDCWAIKDFHEGHG-VTKTPTESVALALKNGCD 276

Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
           L+CG  Y      A+++GK+ E DID++   L T  M+LG FD   ++  +  +   S E
Sbjct: 277 LNCGNMYL-LILLALKEGKITEEDIDRAAIRLMTTRMKLGMFDDDCEFDKIPYEVNDSIE 335

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
           + +L+ EAAR+ +VLLKN+   LPL+S K+K +AV+GP+A++++A+  NY+G P   ++ 
Sbjct: 336 HNKLSLEAARKSMVLLKNN-GLLPLDSKKIKNIAVIGPNADSSLALRANYSGTPSHNITI 394

Query: 461 IAGF----SGYANVTYKTGC-------DDVACKSNNSIFAASEAAKTADATIILAGLDLS 509
           + G     S    V Y  G        +D+A + ++ +  A   A+ +D  ++  GLD S
Sbjct: 395 LDGVRSRVSEDTRVWYSLGSHLFMNREEDLA-QPDDRLKEAVSMAERSDVVVLCLGLDAS 453

Query: 510 VEAESLD-----------REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAET 558
           VE E  D           + DL LP  Q  L+N V    K P I+ ++S   + I  A  
Sbjct: 454 VEGEQNDQGTVILDAGGDKADLNLPESQRNLLNAVLATGK-PTIVALLSGSALSIGDAAD 512

Query: 559 NTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDS 618
                AI+   YPG +GG A A+++FG ++P GRLP+T+Y          ++  L P + 
Sbjct: 513 KA--AAIVQCWYPGSKGGLAFAEMIFGDYSPAGRLPVTFYK---------STEELPPFED 561

Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
                RTYKF  G  LYPFG+GLSYT F+Y                            S 
Sbjct: 562 YSMENRTYKFMKGEALYPFGFGLSYTNFEY----------------------------SN 593

Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
             CP  + N     +     VD QN GS D  +VV VY K            + GF+R+F
Sbjct: 594 IVCPQAVNNG----ESLSVSVDVQNAGSVDSDEVVQVYIKDMEASVRVPNHSLCGFKRIF 649

Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           +++G  K + F  ++ +++ IVD      +  G+ T++VG
Sbjct: 650 LKSGEKKTVTFEIDS-RAMTIVDEEGKRYIENGDFTLYVG 688


>gi|376259588|ref|YP_005146308.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
 gi|373943582|gb|AEY64503.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp. BNL1100]
          Length = 712

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 269/760 (35%), Positives = 392/760 (51%), Gaps = 102/760 (13%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           M    + D SL +  R  DLVSRMTL+EK  QL   A  V RLG+P+Y WW+EALHGV+ 
Sbjct: 1   MEKPKYLDKSLSFKERAADLVSRMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVAR 60

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-------- 160
            G           AT FP  I   A F++   +KI   ++TE RA YN            
Sbjct: 61  AGV----------ATVFPQAIGMAAIFDDEFLEKIADVIATEGRAKYNENAKKGDRDIYK 110

Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
           G+T+WSPN+N+ RDPRWGR  ET GEDP++  R  V +V+GLQ           + + LK
Sbjct: 111 GITFWSPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQG----------DGKYLK 160

Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
            ++C KH+A   V +    DR+HFDA V+++D+ ET+L  FE  VKE    SVM +YNR 
Sbjct: 161 TAACAKHFA---VHSGPEDDRHHFDAVVSQKDLYETYLPAFEALVKEAKVESVMGAYNRT 217

Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD 340
           NG P      LL   +R  W   G++V+DC +I+   + H  +  +  ++VA  LK+G D
Sbjct: 218 NGEPCNGSKTLLKDILRDGWGFDGHVVSDCWAIKDFHEGHG-VTKTPTESVALALKSGCD 276

Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
           L+CG  Y      A+++G++ E DID++   L T  MRLG FD   ++  +  +   S E
Sbjct: 277 LNCGNMYL-LILLALKEGRITEEDIDRAAIRLMTTRMRLGMFDDDCEFDKIPYELNDSVE 335

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
           + +L+ EAA++ +VLLKND   LPL+S K+K +AV+GP+A++++A+  NY+G P + ++ 
Sbjct: 336 HNKLSLEAAKKSMVLLKND-GLLPLDSKKIKNIAVIGPNADSSLALRANYSGTPSQNITI 394

Query: 461 IAGF----SGYANVTYKTGC-------DDVACKSNNSIFAASEAAKTADATIILAGLDLS 509
           + G     S    V Y  G        +D+A + ++ +  A   A+ +D  ++  GLD S
Sbjct: 395 LDGIRKRVSEDTRVWYSVGSHLFMNREEDLA-QPDDRLKEAVSVAERSDVVVLCLGLDAS 453

Query: 510 VEAESLD-----------REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAET 558
           VE E  D           + DL LP  Q  L+N V    K P I+ ++S   + I  A  
Sbjct: 454 VEGEQNDQGTVILDAGGDKADLNLPESQRNLLNAVLATGK-PTIVALLSGSALSIGDAAD 512

Query: 559 NTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDS 618
                AI+   YPG  GG A A+++FG ++P GRLP+T+Y          ++  L P   
Sbjct: 513 KA--AAIVQCWYPGSRGGLAFAEMIFGDYSPAGRLPVTFYK---------STEELPPFAD 561

Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
                RTYKF  G  LYPFG+GLSYT F+Y                            S 
Sbjct: 562 YSMENRTYKFMKGEALYPFGFGLSYTNFEY----------------------------SN 593

Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
             CP  + N     +     VD QN GS D  +VV VY K            + GF+R+ 
Sbjct: 594 IVCPQNVNNG----ENLSVSVDVQNAGSVDSDEVVQVYIKDMDASVRVPKYSLCGFKRIH 649

Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           +++G  K + F  ++  ++ IVD A    +  GE T++VG
Sbjct: 650 LKSGEKKTVTFEIDS-NAMTIVDEAGKRYIENGEFTLYVG 688


>gi|242771939|ref|XP_002477942.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
 gi|218721561|gb|EED20979.1| beta-xylosidase XylA [Talaromyces stipitatus ATCC 10500]
          Length = 797

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 247/601 (41%), Positives = 354/601 (58%), Gaps = 22/601 (3%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           + C++S+ Y  R + L+S  TL+E +    + A GVPRLGLP Y+ WSE LHG+      
Sbjct: 62  IVCNTSVNYVERAEGLISLFTLEELINNTQNSAPGVPRLGLPPYQVWSEGLHGLDRANWA 121

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
              ++    ATSFP  IL+ A+ N +L  +I   ++T+ARA  N+GR GL  ++PNIN  
Sbjct: 122 KSGEE-WKWATSFPMPILSMAALNRTLINQIASIIATQARAFNNVGRYGLDAYAPNINGF 180

Query: 173 RDPRWGRITETPGEDP-FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
           R P WGR  ETPGED  F+   YA  Y+ GLQ          ++   LK+ +  KH+A Y
Sbjct: 181 RSPLWGRGQETPGEDAGFLSSSYAYEYITGLQG--------GVDPEHLKIVATAKHFAGY 232

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
           D++NW    R  FDA +T+QD+ E +   F    +   A S MCSYN VNG+PSC+   L
Sbjct: 233 DLENWNNNSRLGFDASITQQDLAEYYTPQFLAASRYAKARSFMCSYNSVNGVPSCSSSFL 292

Query: 292 LNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
           L   +R  WD   +GY+ +DCD+   + + H + A +   A A +L+AG D+DCGQ Y  
Sbjct: 293 LQTLLRENWDFPDYGYVSSDCDAAYNVFNPHGY-AINISAAAADSLRAGTDIDCGQTYPW 351

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEA 408
           +   +  +G V   +I++SL  LY+ L++LG+FDG+  +Y  LG  D+ + +   ++ EA
Sbjct: 352 YLNQSFIEGSVTRGEIERSLIRLYSNLVKLGYFDGNQSEYRQLGWNDVVATDAWNISYEA 411

Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI--AGFSG 466
           A EGIVLLKND   LPL S K+K+VAV+GP ANAT  + GNY G     ++P+  A  +G
Sbjct: 412 AVEGIVLLKND-GVLPL-SEKLKSVAVIGPWANATQQLQGNYFGPAPYLITPLQAARDAG 469

Query: 467 YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
           Y  V Y  G  ++   + +   AA  AAK +D  I L G+D ++EAE  DR ++  PG Q
Sbjct: 470 Y-KVNYAFGT-NILGNTTDGFAAALSAAKKSDVIIYLGGIDNTIEAEGTDRMNVTWPGNQ 527

Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
             LI Q+++  K P++++ M  G VD +  ++N N+ A++W GYPG+ GG+AI D++ GK
Sbjct: 528 LDLIQQLSQTGK-PLVVLQMGGGQVDSSSLKSNNNVNALVWGGYPGQSGGKAIFDILSGK 586

Query: 587 FNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
             P GRL  T Y  +Y    P T M LRP D    PG+TY +Y G  +Y FGY L YT F
Sbjct: 587 RAPAGRLVTTQYPAEYATQFPATDMNLRP-DGKSNPGQTYIWYTGKPVYEFGYALFYTTF 645

Query: 647 K 647
           K
Sbjct: 646 K 646


>gi|291537442|emb|CBL10554.1| Beta-glucosidase-related glycosidases [Roseburia intestinalis
           M50/1]
          Length = 710

 Score =  424 bits (1090), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 266/747 (35%), Positives = 387/747 (51%), Gaps = 99/747 (13%)

Query: 61  YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
           Y  R  +LV +MTL+EKV Q    A  V RL +  Y WW+EALHGV+  G          
Sbjct: 13  YRKRAAELVGKMTLEEKVAQTLYQAPAVERLNIKAYNWWNEALHGVARAGT--------- 63

Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG--------LTYWSPNINVA 172
            AT FP  I   A+F+E L +++G AVSTEARA +N+ + G        LT+W+PN+N+ 
Sbjct: 64  -ATVFPQAIGLAATFDEDLLEQVGDAVSTEARAKFNMQQEGKDTDIYKGLTFWAPNVNIF 122

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           RDPRWGR  ET GEDP++  R  V Y+ GLQ   GH      +   LK ++C KH+A   
Sbjct: 123 RDPRWGRGHETFGEDPYLTSRLGVRYIEGLQ---GH------DENYLKAAACAKHFA--- 170

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
           V +     R+ FDA VTEQD+ ET+L  FE CVKEG   +VM +YNR NG+P C + +LL
Sbjct: 171 VHSGPEAVRHEFDAEVTEQDLRETYLPAFEACVKEGKVEAVMGAYNRTNGVPCCGNKRLL 230

Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
              +R EW   G++ +DC +I+   + H     + E +VA  +  G DL+CG  +  F  
Sbjct: 231 IDILRKEWGFSGHVTSDCWAIRDFHEGHHVTGTAIE-SVAMAMNNGCDLNCGTLF-GFLV 288

Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAR 410
            AV+QG VKE  +D+++  L+   M+LG FD   +  Y  +      S E  +L    AR
Sbjct: 289 QAVRQGLVKEERLDEAVTNLFMARMKLGVFDKKEENPYDKIPYLAADSREMKKLNEAVAR 348

Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY--- 467
             +VLLKN ++ LPL+  K+KT+ V+GP+A++  A++GNY G   RY++ + G   Y   
Sbjct: 349 RTVVLLKNKEHILPLDKNKIKTIGVIGPNADSRRALVGNYEGTASRYITVLEGIEDYVGD 408

Query: 468 -ANVTYKTGC----DDVA--CKSNNSIFAASEAAKTADATIILAGLDLSVEAE------- 513
              V Y  GC    D  +   + N+ +       K +D  + + GLD  +E E       
Sbjct: 409 DVRVLYSEGCHLYKDRTSNLAQENDRMSEVLGVCKESDVVVAVLGLDAGIEGEEGDAGNE 468

Query: 514 --SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
             S D+ DL LPG Q +++       K PVILV++S   + + +A  + ++ AI+   YP
Sbjct: 469 YGSGDKPDLNLPGLQEEILEAAVSCGK-PVILVLLSGSALAVNWA--DEHVDAIVQGWYP 525

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G  GG AIAD++FG+ NP G+LP+T+Y          T+  L   +     GRTY++   
Sbjct: 526 GARGGAAIADILFGEANPEGKLPVTFYR---------TTEELPDFEDYSMQGRTYRYMEQ 576

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
             LYPFGYGLSYT++ Y                Q+ R L      S+    G+ V     
Sbjct: 577 EALYPFGYGLSYTEYAY----------------QNVRFLEQEPVVSEGVTIGLSV----- 615

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
                     +N G  DG++ V VY K  AE +     Q+    ++ + AG  K I    
Sbjct: 616 ----------KNTGKMDGTETVQVYVK--AEHSKMPHGQLKKIVKLPLCAGEEKEINIRL 663

Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVG 778
            + ++  + D     +LP+G   IFVG
Sbjct: 664 ES-EAFMLYDENGEKILPSGHFEIFVG 689


>gi|421060771|ref|ZP_15523202.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           B3]
 gi|421065248|ref|ZP_15527033.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A12]
 gi|421073214|ref|ZP_15534285.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A11]
 gi|392444242|gb|EIW21677.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A11]
 gi|392454445|gb|EIW31278.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           B3]
 gi|392459366|gb|EIW35779.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           A12]
          Length = 724

 Score =  424 bits (1089), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 259/759 (34%), Positives = 389/759 (51%), Gaps = 97/759 (12%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           M  F + D +L +  R KDLVSRMTL+EKV Q+   +  +PRLG+P Y WWSEALHGV+ 
Sbjct: 1   MEIFAYQDETLSFEQRAKDLVSRMTLEEKVTQMVYISPAIPRLGVPAYNWWSEALHGVAR 60

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------A 160
            G           AT FP  I   A+F+E L   + + +S E RA ++  +         
Sbjct: 61  AGV----------ATVFPQAIGLAATFDEKLIFNVAEVISIEGRAKFHEFQRKGDHGIYK 110

Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
           GLT+WSPN+N+ RDPRWGR  ET GEDP++ GR  V++++GLQ           + + L+
Sbjct: 111 GLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQG---------QDKKYLR 161

Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
            ++C KH+A   V +    +R+ FDA V+ +D+ ET+L  F+ CVKE +  +VM +YNRV
Sbjct: 162 AAACAKHFA---VHSGPESERHSFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRV 218

Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD 340
           NG P C    LL +T+R EW   G++V+DC +I+   +NH+ +  S  ++VA  L  G D
Sbjct: 219 NGEPCCGSNMLLKETLRREWGFTGHVVSDCWAIKDFHENHR-VTSSAPESVAMALNNGCD 277

Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICS 398
           L+CG  Y N    A Q+G V E  I+ ++  L    M+LG FD +    Y  +G      
Sbjct: 278 LNCGNMYLNLL-IAYQEGLVTEEAINTAVTRLMLTRMKLGLFDTAENVPYTKIGFHQNDC 336

Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM 458
            E+ E A E +++ +VLLKN+ N LPL+   + ++AV+GP+AN+  A+ GNY G    Y+
Sbjct: 337 QEHREFALEVSKKTLVLLKNENNLLPLDRNTISSIAVIGPNANSREALTGNYCGTASNYI 396

Query: 459 SPIAGFSGYAN----VTYKTGCDDVACKSNN------SIFAASEAAKTADATIILAGLDL 508
           + + G          V+Y  GC     K+ N          A   A+ AD  ++  GLD 
Sbjct: 397 TVLEGIREAVGKDTMVSYAQGCHLYRDKAENLGEARDRFAEAVSTAERADIVVMCMGLDA 456

Query: 509 SVEAE---------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
           S+E E         S D+  L LPG Q +L+  + +  K P+ILV+++   + + +A   
Sbjct: 457 SIEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYQTGK-PIILVLLAGSALAVTWAA-- 513

Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
             I AI+ A YPG EGG+A+A  +FG+++P G+LPIT+Y          T+  L      
Sbjct: 514 EKIPAIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYR---------TTEELPEFTDY 564

Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
               RTY++     LYPFGYGL YT F Y         Q+ LN+ Q              
Sbjct: 565 SMKNRTYRYMTKEALYPFGYGLGYTTFAYR--------QLQLNRTQ-------------- 602

Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
                    +   +  +  V  +N G+    + V +Y K         I  + G Q+V +
Sbjct: 603 ---------ISVGENVQGSVLVKNTGNFASDETVQLYIKDVKASVEVPIWALQGIQKVHL 653

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
             G  + + F     + L +++   N +L  G   I+VG
Sbjct: 654 LPGTEQEVFFTLTP-RQLALINEEGNCILEPGVFEIYVG 691


>gi|292495632|sp|Q0CMH8.2|XYND_ASPTN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
          Length = 793

 Score =  423 bits (1088), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 272/738 (36%), Positives = 395/738 (53%), Gaps = 39/738 (5%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +S  L CD S     R   LVS  TL+E V   G+   GVPRLGLP+Y+ WSE+LHGV  
Sbjct: 57  LSKTLVCDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVWSESLHGVYR 116

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
               +  D     ATSFP  ILT A+ N +L  +IG  +ST+ARA  N+GR GL  ++PN
Sbjct: 117 ANWASEGD--YSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGRYGLDTYAPN 174

Query: 169 INVARDPRWGRITETPGEDPF-VVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
           IN  R P WGR  ETPGED + +   YA  Y+ G+Q          ++   LK+ +  KH
Sbjct: 175 INSFRHPVWGRGQETPGEDAYYLASTYAYEYITGIQG--------GVDPETLKLVATAKH 226

Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
           YA YD++NW G  R   D ++T+QD+ E +   F +  ++    SVMCSYN VNG+PSC+
Sbjct: 227 YAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAVNGVPSCS 286

Query: 288 DPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           +   L   +R  +     GY+  DC ++    + H++ A+ +  A A +++AG D+DCG 
Sbjct: 287 NSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAAN-ESSASADSIRAGTDIDCGT 345

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIEL 404
            Y     NA  +G++   DI++ +  LYT L+RLG+FDG S QY  L   D+ + +   +
Sbjct: 346 SYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGNSSQYRDLTWSDVQTTDAWNI 405

Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM-SPIAG 463
           + EAA EG VLLKND  TLPL +  +++VA++GP ANAT  M GNY G P  Y+ SP+A 
Sbjct: 406 SHEAAVEGTVLLKND-GTLPL-ADSIRSVALIGPWANATTQMQGNYYG-PAPYLTSPLAA 462

Query: 464 FSGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
                 +V Y  G  +++  +      A  AA+ ADA I   G+D ++E E+LDR ++  
Sbjct: 463 LEASDLDVHYAFGT-NISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALDRMNITW 521

Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
           PG Q  LINQ++ + K P++++ M  G VD +  + NTN+ A+LW GYPG+ GG A+ D+
Sbjct: 522 PGNQLDLINQLSALGK-PLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGGTALLDI 580

Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
           + G   P GRL  T Y   Y    P   M LRP  +   PG+TY +Y G  +Y FG+GL 
Sbjct: 581 IRGVRAPAGRLVTTQYPAGYATQFPAIDMGLRPNGT--NPGQTYMWYTGTPVYEFGHGLF 638

Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ 702
           YT F+    S T T   + N            D      PG     LR   +  F     
Sbjct: 639 YTTFEAKRAS-TATNHSSFN----------IEDLLTAPHPGYAYPQLR--PFLNFTAHIT 685

Query: 703 NVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV-FVRAGRNKRIKFVFNACKSLNIVD 761
           N G T      ++++   A  A    K ++GF R+  +  G ++ + F      ++   D
Sbjct: 686 NTGRTTSDYTAMLFANTTAGPAPHPNKWLVGFDRLGALEPGASQTMTFPI-TIDNVARTD 744

Query: 762 YAANTLLPAGEHTIFVGN 779
              N +L  G + + + N
Sbjct: 745 ELGNRVLYPGRYELALNN 762


>gi|292495285|sp|B6EY09.1|XYND_ASPJA RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|211970990|dbj|BAG82824.1| 1,4-beta-D-xylosidase [Aspergillus japonicus]
          Length = 804

 Score =  423 bits (1088), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 266/748 (35%), Positives = 387/748 (51%), Gaps = 45/748 (6%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +S  L CDS+     R   LVS  TL+E +   G+ + GVPRLGLP Y+ WSEALHG++ 
Sbjct: 54  LSKNLVCDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGLAR 113

Query: 109 VGPGTHFDD--VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
                +F D      ATSFP+ IL+ A+FN +L  +I   +ST+ RA  N GR GL  +S
Sbjct: 114 A----NFTDNGAYSWATSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGRFGLDVYS 169

Query: 167 PNINVARDPRWGRITETPGEDPFVV-GRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           PNIN  R P WGR  ETPGED + +   YA  Y+ G+Q          +N   LK+++  
Sbjct: 170 PNINTFRHPVWGRGQETPGEDAYTLTAAYAYEYITGIQG--------GVNPEHLKLAATA 221

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A YD++NW    R   D  +T+QD+ E +   F +  ++    S MCSYN VNG+PS
Sbjct: 222 KHFAGYDIENWDNHSRLGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNGVPS 281

Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           C++   L   +R  +    HGY+  DC ++  + + H + A+ +  A A  + AG D+DC
Sbjct: 282 CSNTFFLQTLLRDTFSFVDHGYVSGDCGAVYGVFNPHGYAAN-EPSAAADAILAGTDIDC 340

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG----SPQYVSLGKQDICSD 399
           G  Y      ++  G V   DI++    LY  L+ LG+FDG    S  Y SLG  D+   
Sbjct: 341 GTSYQYHFNESITTGAVARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQKT 400

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNS---AKVKTVAVVGPHANATVAMIGNYAGIPCR 456
           +   ++ EAA EGIVLLKND  TLPL S    K K++A++GP ANAT  + GNY G    
Sbjct: 401 DAWNISYEAAVEGIVLLKND-GTLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDAPY 459

Query: 457 YMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLD 516
            +SP+  F+      +     +++  S  +  AA  AA+ AD  + L G+D ++EAE+ D
Sbjct: 460 LISPVDAFTAAGYTVHYAPGTEISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEAQD 519

Query: 517 REDLWLPGYQTQLINQVA--EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
           R  +  PG Q +LI+Q+A  +    P+++  M  G VD +  ++N  + A+LW GYPG+ 
Sbjct: 520 RSSIAWPGNQLELISQLAAQKSDDQPLVVYQMGGGQVDSSALKSNAKVNALLWGGYPGQS 579

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
           GG A+ D++ G   P GRL  T Y   Y +      M LRP ++   PG+TY +Y G  +
Sbjct: 580 GGLALRDILTGARAPAGRLTTTQYPAAYAESFSALDMNLRPNETTQNPGQTYMWYTGEPV 639

Query: 635 YPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
           Y FG+GL YT F  +     KT    N+  L    + + T+   +T              
Sbjct: 640 YAFGHGLFYTTFNASSAQAAKTKYTFNITDLTSAAHPDTTTVGQRT-------------- 685

Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI-KQVIGFQRVFVRAGRNKRIKF-VF 751
            F F     N G  D     +VY+       + Y  K ++GF R+   A      +  V 
Sbjct: 686 LFNFTASITNSGQRDSDYTALVYANTSTAGPSPYPNKWLVGFDRLAAVAKEGGTAELNVP 745

Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGN 779
            A   L  VD A NT+L  G + + + N
Sbjct: 746 VAVDRLARVDEAGNTVLFPGRYEVALNN 773


>gi|67523807|ref|XP_659963.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
 gi|74597492|sp|Q5BAS1.1|XYND_EMENI RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|40745314|gb|EAA64470.1| hypothetical protein AN2359.2 [Aspergillus nidulans FGSC A4]
 gi|259487761|tpe|CBF86686.1| TPA: Beta-xylosidase (EC 3.2.1.37)
           [Source:UniProtKB/TrEMBL;Acc:O42810] [Aspergillus
           nidulans FGSC A4]
          Length = 803

 Score =  423 bits (1087), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 267/731 (36%), Positives = 389/731 (53%), Gaps = 38/731 (5%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV--SNVGPG 112
           CD SL    R   LVS  T DE V   G+   GV RLGLP Y+ W EALHGV  +N    
Sbjct: 61  CDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWGEALHGVGRANFVES 120

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
            +F      ATSFP  I   A+ N++L  +IG  VST+ RA  N G  G+  +SPNIN  
Sbjct: 121 GNFS----WATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAGLGGVDVYSPNINTF 176

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           R P WGR  ETPGED F+   Y   Y+  LQ          ++   LK+ +  KHYA YD
Sbjct: 177 RHPVWGRGQETPGEDAFLTSVYGYEYITALQG--------GVDPETLKIIATAKHYAGYD 228

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
           +++W    R   D ++T+Q++ E +  PF +  ++    SVMCSYN VNG+PSCA+   L
Sbjct: 229 IESWNNHSRLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNGVPSCANKFFL 288

Query: 293 NQTVRG--EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
              +R   E+   GY+  DC ++  + + H + A ++  A A ++ AG D+DCG  Y   
Sbjct: 289 QTLLRDTFEFSEDGYVSGDCGAVYNVWNPHGY-ASNEAAASADSILAGTDIDCGTSYQWH 347

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAA 409
           + +A +   V  +DI++ +  LY+ L++ G+FDG    Y  +   D+ S +   +A EAA
Sbjct: 348 SEDAFEDSLVSRSDIERGVIRLYSNLVQAGYFDGEDAPYRDITWDDVLSTDAWNIAYEAA 407

Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA- 468
            EGIVLLKND+ TLPL S  +K+VAV+GP AN T  + GNY G     +SP+ GF     
Sbjct: 408 VEGIVLLKNDE-TLPL-SKDIKSVAVIGPWANVTEELQGNYFGPAPYLISPLTGFRDSGL 465

Query: 469 NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQ 528
           +V Y  G + +   S +    A  AAK ADA I   G+D ++EAE++DRE++  PG Q  
Sbjct: 466 DVHYALGTN-LTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDRENITWPGNQLD 524

Query: 529 LINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFN 588
           LI++++E+ K P++++ M  G VD +  + N N+ A++W GYPG+ GG A+AD++ GK  
Sbjct: 525 LISKLSELGK-PLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHALADIITGKRA 583

Query: 589 PGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKY 648
           P GRL  T Y  +Y ++ P   M LRP ++ G PG+TY +Y G  +Y FG+GL YT F+ 
Sbjct: 584 PAGRLVTTQYPAEYAEVFPAIDMNLRPNETSGNPGQTYMWYTGTPVYEFGHGLFYTTFE- 642

Query: 649 NLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTD 708
                T     N+  +    +  Y     KT     L+N         F    +N G  +
Sbjct: 643 ESTETTDAGSFNIQTVLTTPHSGYEHAQQKT-----LLN---------FTATVKNTGERE 688

Query: 709 GSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL 768
                +VY    A  A    K V+GF R+      + +   V    +S+   D   N +L
Sbjct: 689 SDYTALVYVNTTAGPAPYPKKWVVGFDRLGGLEPGDSQTLTVPVTVESVARTDEQGNRVL 748

Query: 769 PAGEHTIFVGN 779
             G + + + N
Sbjct: 749 YPGSYELALNN 759


>gi|115397385|ref|XP_001214284.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
 gi|114192475|gb|EAU34175.1| hypothetical protein ATEG_05106 [Aspergillus terreus NIH2624]
          Length = 776

 Score =  422 bits (1085), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 265/707 (37%), Positives = 381/707 (53%), Gaps = 36/707 (5%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +S  L CD S     R   LVS  TL+E V   G+   GVPRLGLP+Y+ WSE+LHGV  
Sbjct: 75  LSKTLVCDKSARPHDRAAALVSMFTLEELVNNTGNTGTGVPRLGLPKYQVWSESLHGVYR 134

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
               +  D     ATSFP  ILT A+ N +L  +IG  +ST+ARA  N+GR GL  ++PN
Sbjct: 135 ANWASEGD--YSWATSFPQPILTMAALNRTLIHQIGDILSTQARAFSNVGRYGLDTYAPN 192

Query: 169 INVARDPRWGRITETPGEDPF-VVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
           IN  R P WGR  ETPGED + +   YA  Y+ G+Q          ++   LK+ +  KH
Sbjct: 193 INSFRHPVWGRGQETPGEDAYYLASTYAYEYITGIQG--------GVDPETLKLVATAKH 244

Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
           YA YD++NW G  R   D ++T+QD+ E +   F +  ++    SVMCSYN VNG+PSC+
Sbjct: 245 YAGYDIENWDGHSRLGNDMQITQQDLSEYYTPQFLVSARDAKVHSVMCSYNAVNGVPSCS 304

Query: 288 DPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           +   L   +R  +     GY+  DC ++    + H++ A+ +  A A +++AG D+DCG 
Sbjct: 305 NSFFLQTLLRETFGFVEDGYVSGDCGAVYNAFNPHEYAAN-ESSASADSIRAGTDIDCGT 363

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIEL 404
            Y     NA  +G++   DI++ +  LYT L+RLG+FDG S QY  L   D+ + +   +
Sbjct: 364 SYQYHFTNAFDEGEISRQDIERGVIRLYTNLVRLGYFDGNSSQYRDLTWSDVQTTDAWNI 423

Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF 464
           + EAA EG VLLKND  TLPL +  +++VA++GP ANAT  M GNY G      SP+A  
Sbjct: 424 SHEAAVEGTVLLKND-GTLPL-ADSIRSVALIGPWANATTQMQGNYYGPAPYLTSPLAAL 481

Query: 465 SGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
                +V Y  G  +++  +      A  AA+ ADA I   G+D ++E E+LDR ++  P
Sbjct: 482 EASDLDVHYAFGT-NISSTTTAGFADALAAARKADAIIFAGGIDNTIEGEALDRMNITWP 540

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
           G Q  LINQ++ + K P++++ M  G VD +  + NTN+ A+LW GYPG+ GG A+ D++
Sbjct: 541 GNQLDLINQLSALGK-PLVVLQMGGGQVDSSALKHNTNVSALLWGGYPGQSGGTALLDII 599

Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
            G   P GRL  T Y   Y    P   M LRP  +   PG+TY +Y G  +Y FG+GL Y
Sbjct: 600 RGVRAPAGRLVTTQYPAGYATQFPAIDMGLRPNGT--NPGQTYMWYTGTPVYEFGHGLFY 657

Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
           T F+    S T T   + N            D      PG     LR   +  F     N
Sbjct: 658 TTFEAKRAS-TATNHSSFN----------IEDLLTAPHPGYAYPQLR--PFLNFTAHITN 704

Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV-FVRAGRNKRIKF 749
            G T      ++++   A  A    K ++GF R+  +  G ++ + F
Sbjct: 705 TGRTTSDYTAMLFANTTAGPAPHPNKWLVGFDRLGALEPGASQTMTF 751


>gi|310792973|gb|EFQ28434.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Glomerella graminicola M1.001]
          Length = 728

 Score =  422 bits (1085), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 234/585 (40%), Positives = 346/585 (59%), Gaps = 30/585 (5%)

Query: 72  MTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHF-DDVIPG--ATSFPTV 128
           M+++EKV+ L D + GV  LGLP + WW+E LHGV    PG  F  D  P   ATSFP  
Sbjct: 1   MSVEEKVRNLVDASAGVKSLGLPPHGWWNEGLHGVG-FSPGVLFAQDSEPFGYATSFPLP 59

Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDP 188
           ILT ASF++ L+  IGQ +  E RA  N G AG  +W+PN+N  RDPRWGR  ETPGED 
Sbjct: 60  ILTAASFDDDLFNAIGQVIGREGRAFSNYGYAGFNFWTPNMNAFRDPRWGRGQETPGEDV 119

Query: 189 FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARV 248
            VV  Y  +YV GLQ  +  +           + + CKH+AAYD++  +  + Y+     
Sbjct: 120 LVVSNYVQSYVTGLQGSDPTDKV---------IIAACKHFAAYDIETARRANNYN----P 166

Query: 249 TEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL---HGY 305
           T+QD+++ +L  F  CV++    +VMCSYN V+GIP+C+   LL + +R  W     + +
Sbjct: 167 TQQDLQDYYLPAFRRCVRDSHVGTVMCSYNSVDGIPACSSEYLLKEVLRDTWGFTNDYQF 226

Query: 306 IVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDI 365
           +V+DC ++  +   H F  ++++DA + ++ AG DL+CG  Y +  G+   + +V +  +
Sbjct: 227 VVSDCGAVTDVWLLHNF-TNTEQDAASVSMAAGTDLECGSSYLHLNGSLADK-QVTQERV 284

Query: 366 DKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
           D++L  LY  L  +G+FDGS  + SLG  D+ + +  ++A EAAR G+ LLKND   LPL
Sbjct: 285 DEALTRLYKALFTVGYFDGS-SHSSLGWSDVSTIDAQQIACEAARAGMTLLKND-GVLPL 342

Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYAN--VTYKTGCDDVACKS 483
              K K+VA++GP ANAT  M GNY G      SP+  F+  ++  V Y  G  D+   S
Sbjct: 343 ADGKYKSVALIGPFANATTQMQGNYFGRAPFVRSPLWAFTQQSSLQVNYAAGT-DINSTS 401

Query: 484 NNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVIL 543
           ++    A  AAK +D  I   G+D ++EAE+LDR  +  PG Q  LI+Q++ + K P+++
Sbjct: 402 DSGFADALAAAKNSDIVIFCGGIDTTIEAETLDRVSITWPGNQLDLISQLSMLGK-PLVV 460

Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYV 603
                G VD      N N+ A+ WAG PG+ GG A+ D+V GK +  GRLP T Y   Y 
Sbjct: 461 AQFGGGQVDDTALVDNANVNALFWAGLPGQAGGLAMYDLVVGKASFAGRLPTTQYPASYA 520

Query: 604 QMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKY 648
            ++ + ++ LRP  +  +PGRTYK+Y G  ++PFG+GL YT+F +
Sbjct: 521 DLVSIFNINLRPNGT--FPGRTYKWYIGEPVFPFGFGLHYTKFNF 563


>gi|67902828|ref|XP_681670.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
 gi|74592887|sp|Q5ATH9.1|BXLB_EMENI RecName: Full=Exo-1,4-beta-xylosidase bxlB; AltName:
           Full=1,4-beta-D-xylan xylohydrolase bxlB; AltName:
           Full=Beta-xylosidase bxlB; AltName: Full=Xylobiase bxlB;
           Flags: Precursor
 gi|40747867|gb|EAA67023.1| hypothetical protein AN8401.2 [Aspergillus nidulans FGSC A4]
 gi|259484335|tpe|CBF80465.1| TPA: beta-1,4-xylosidase (Eurofung) [Aspergillus nidulans FGSC A4]
          Length = 763

 Score =  422 bits (1085), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 281/743 (37%), Positives = 397/743 (53%), Gaps = 57/743 (7%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +S    CD+SL    R K LVS +TL+EK+   G  A G  RLGLP Y WW+EALHGV+ 
Sbjct: 33  LSELPICDTSLSPLERAKSLVSALTLEEKINNTGHEAAGSSRLGLPAYNWWNEALHGVAE 92

Query: 109 VGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
              G  F++      ATSFP  I+  A+FN++L +++ + +STEARA  N   AG+ YW+
Sbjct: 93  KH-GVSFEESGDFSYATSFPAPIVLGAAFNDALIRRVAEIISTEARAFSNSDHAGIDYWT 151

Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCK 226
           PN+N  +DPRWGR  ETPGEDP    RY   +V GLQ         D   +P KV + CK
Sbjct: 152 PNVNPFKDPRWGRGQETPGEDPLHCSRYVKEFVGGLQG--------DDPEKP-KVVATCK 202

Query: 227 HYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
           H AAYD++ W GV R+ FDA+V+  D+ E +L PF+ C  +    + MCSYN +NG+P+C
Sbjct: 203 HLAAYDLEEWGGVSRFEFDAKVSAVDLLEYYLPPFKTCAVDASVGAFMCSYNALNGVPAC 262

Query: 287 ADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           AD  LL   +R  W   G   ++  DC +++ +   H ++ +S  +A A  L AG+DLDC
Sbjct: 263 ADRYLLQTVLREHWGWEGPGHWVTGDCGAVERIQTYHHYV-ESGPEAAAAALNAGVDLDC 321

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVSLGKQDICSDE 400
           G +  ++ G A +QG +    +D +L  LYT L++LG+FD   G P   SLG  D+ + E
Sbjct: 322 GTWLPSYLGEAERQGLISNETLDAALTRLYTSLVQLGYFDPAEGQP-LRSLGWDDVATSE 380

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY--- 457
             ELA   A +G VLLKN   TLPL +    T+A++GP  N T  +  NYAG P ++   
Sbjct: 381 AEELAKTVAIQGTVLLKNIDWTLPLKAN--GTLALIGPFINFTTELQSNYAG-PAKHIPT 437

Query: 458 MSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR 517
           M   A   GY NV    G  +V   S +    A   A  ADA I   G+D +VE ESLDR
Sbjct: 438 MIEAAERLGY-NVLTAPGT-EVNSTSTDGFDDALAIAAEADALIFFGGIDNTVEEESLDR 495

Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
             +  PG Q +LI ++AE+ + P+ +V    G VD +    +  + AI+WAGYP + GG 
Sbjct: 496 TRIDWPGNQEELILELAELGR-PLTVVQFGGGQVDDSALLASAGVGAIVWAGYPSQAGGA 554

Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
            + DV+ GK  P GRLPIT Y   YV  +P+T M L+P      PGRTY++Y    L PF
Sbjct: 555 GVFDVLTGKAAPAGRLPITQYPKSYVDEVPMTDMNLQP--GTDNPGRTYRWYEDAVL-PF 611

Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
           G+GL YT F    +S+ K      +     R  N +S+   T                 F
Sbjct: 612 GFGLHYTTFN---VSWAKKAFGPYDAATLARGKNPSSNIVDT-----------------F 651

Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAA--TYIKQVIGFQRV-FVRAGRNKRIKFVFNAC 754
            +   N G      V +V++  P E+ A    IK ++G+ R   ++ G  +++       
Sbjct: 652 SLAVTNTGDVASDYVALVFASAP-ELGAQPAPIKTLVGYSRASLIKPGETRKVDVEVTVA 710

Query: 755 KSLNIVDYAANTLLPAGEHTIFV 777
                 +     L P GE+T+ V
Sbjct: 711 PLTRATEDGRVVLYP-GEYTLLV 732


>gi|334187562|ref|NP_196532.2| Glycosyl hydrolase family protein [Arabidopsis thaliana]
 gi|332004052|gb|AED91435.1| Glycosyl hydrolase family protein [Arabidopsis thaliana]
          Length = 526

 Score =  422 bits (1084), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 215/487 (44%), Positives = 314/487 (64%), Gaps = 12/487 (2%)

Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETD 364
           YIV+DCDS+ ++  +  +   + E+A A+++ AGLDL+CG +  N T NAV++G + E  
Sbjct: 45  YIVSDCDSLGILYGSQHY-TKTPEEAAAKSILAGLDLNCGSFLGNHTENAVKKGLIDEAA 103

Query: 365 IDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQN 421
           I+K++   +  LMRLGFFDG+P+   Y  LG +D+C+ EN ELA E AR+GIVLLKN   
Sbjct: 104 INKAISNNFATLMRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAG 163

Query: 422 TLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVA 480
           +LPL+ + +KT+AV+GP+AN T  MIGNY G+ C+Y +P+ G       T Y  GC +V 
Sbjct: 164 SLPLSPSAIKTLAVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVT 223

Query: 481 CKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGP 540
           C +   + +A   A +ADAT+++ G D ++E E+LDR DL LPG Q +L+ QVA+ A+GP
Sbjct: 224 C-TEADLDSAKTLAASADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGP 282

Query: 541 VILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNG 600
           V+LVIMS GG DI FA+ +  I +I+W GYPGE GG AIADV+FG+ NP G+LP+TWY  
Sbjct: 283 VVLVIMSGGGFDITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQ 342

Query: 601 DYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVN 660
            YV+ +P+T+M +RP  S GY GRTY+FY G T+Y FG GLSYT F + L+   K + +N
Sbjct: 343 SYVEKVPMTNMNMRPDKSNGYLGRTYRFYIGETVYAFGDGLSYTNFSHQLIKAPKFVSLN 402

Query: 661 LNKLQHCRNLNYTS-DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
           L++ Q CR+    S DA    C   +    R D  FE ++  +NVG  +G++ V +++ P
Sbjct: 403 LDESQSCRSPECQSLDAIGPHCEKAVGE--RSD--FEVQLKVRNVGDREGTETVFLFTTP 458

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           P E+  +  KQ++GF+++ +       ++F  + CK L +VD      L  G H + VG+
Sbjct: 459 P-EVHGSPRKQLLGFEKIRLGKKEETVVRFKVDVCKDLGVVDEIGKRKLALGHHLLHVGS 517

Query: 780 GGVSFPI 786
              SF I
Sbjct: 518 LKHSFNI 524


>gi|436410475|gb|AGB57183.1| beta-xylosidase [Aspergillus sp. BCC125]
          Length = 804

 Score =  422 bits (1084), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 269/719 (37%), Positives = 389/719 (54%), Gaps = 50/719 (6%)

Query: 49  MSSFLFCD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           + S L CD S+ PY  R   L+S  TLDE +   G+   GV RLGLP Y+ WSEALHG+ 
Sbjct: 63  LRSHLICDESATPYD-RAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD 121

Query: 108 NVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
                 +F D+     ATSFP  ILTTA+ N +L  +I   +ST+ RA  N GR GL  +
Sbjct: 122 RA----NFSDLGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVY 177

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN  R P WGR  ETPGED  +   YA  Y+ G+Q  +   N        LK+++  
Sbjct: 178 APNINTFRHPVWGRGQETPGEDVSLAAIYAYEYITGIQGPDPDSN--------LKLAATA 229

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA YD++NW    R   D  +T+QD+ E +   F +  ++    SVMC+YN VNG+P+
Sbjct: 230 KHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPA 289

Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           CAD   L   +R  +    HGY+ +DCD+   + + H + + S+  A A+ + AG D+DC
Sbjct: 290 CADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDC 348

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-----YVSLGKQDICS 398
           G  Y      ++  G +   DI+K +  LYT L++ G+FD +       Y  L   D+  
Sbjct: 349 GTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLE 408

Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKV----KTVAVVGPHANATVAMIGNYAGIP 454
            +   ++ +AA +GIVLLKN  N LPL          TVA++GP ANAT  ++GNY G  
Sbjct: 409 TDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNA 468

Query: 455 CRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
              +SP A F  +GY NV +  G   ++  S +   AA  AA++AD  I   G+D ++EA
Sbjct: 469 PYMISPRAAFEEAGY-NVNFAEGT-GISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEA 526

Query: 513 ESLDREDLWLPGYQTQLINQVAEVA-KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
           E+LDRE +  PG Q  LI ++A  A   P+I++ M  G VD +  + NTN+ A+LW GYP
Sbjct: 527 EALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYP 586

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+ GG A+ D++ GK NP GRL  T Y   Y +  P T M LRP      PG+TYK+Y G
Sbjct: 587 GQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTG 644

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
             +Y FG+GL YT F  +  S T T ++ LN +Q   +  +   AS T+ P         
Sbjct: 645 EAVYEFGHGLFYTTFAES-SSNTTTREIKLN-IQDILSQTHEDLASITQLP--------- 693

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIK 748
                F  + +N G  +     +V++       A Y +K ++G+ R+  V+ G  + ++
Sbjct: 694 --VLNFTANIKNTGKVESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELR 750


>gi|2920706|emb|CAA73902.1| beta-xylosidase [Emericella nidulans]
          Length = 802

 Score =  421 bits (1083), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 267/731 (36%), Positives = 388/731 (53%), Gaps = 38/731 (5%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV--SNVGPG 112
           CD SL    R   LVS  T DE V   G+   GV RLGLP Y+ W EALHGV  +N    
Sbjct: 60  CDRSLSPKDRATALVSLFTFDELVNNTGNTGLGVSRLGLPNYQVWGEALHGVGRANFVES 119

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
            +F      ATSFP  I   A+ N++L  +IG  VST+ RA  N G  G+  +SPNIN  
Sbjct: 120 GNFS----WATSFPMPITMMAALNKTLIHQIGTIVSTQLRAFSNAGLGGVDVYSPNINTF 175

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           R P WGR  ETPGED F+   Y   Y+  LQ     E +        K+ +  KHYA YD
Sbjct: 176 RHPVWGRGQETPGEDAFLTSVYGYEYITALQGAVDPETS--------KIIATAKHYAGYD 227

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
           +++W    R   D ++T+Q++ E +  PF +  ++    SVMCSYN VNG+PSCA+   L
Sbjct: 228 IESWNNHSRLGNDMQITQQELSEYYTPPFIVASRDAKVRSVMCSYNAVNGVPSCANKFFL 287

Query: 293 NQTVRG--EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
              +R   E+   GY+  DC ++  + + H + A ++  A A ++ AG D+DCG  Y   
Sbjct: 288 QTLLRDTFEFSEDGYVSGDCGAVYNVWNPHGY-ASNEAAASADSILAGTDIDCGTSYQWH 346

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAA 409
           + +A +   V  +DI++ +  LY+ L++ G+FDG    Y  +   D+ S +   +A EAA
Sbjct: 347 SEDAFEDSLVSRSDIERGVIRLYSNLVQAGYFDGEDAPYRDITWDDVLSTDAWNIAYEAA 406

Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA- 468
            EGIVLLKND+ TLPL S  +K+VAV+GP AN T  + GNY G     +SP+ GF     
Sbjct: 407 VEGIVLLKNDE-TLPL-SKDIKSVAVIGPWANVTEELQGNYFGPAPYLISPLTGFRDSGL 464

Query: 469 NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQ 528
           +V Y  G + +   S +    A  AAK ADA I   G+D ++EAE++DRE++  PG Q  
Sbjct: 465 DVHYALGTN-LTSHSTSGFEEALTAAKQADAIIFAGGIDNTIEAEAMDRENITWPGNQLD 523

Query: 529 LINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFN 588
           LI++++E+ K P++++ M  G VD +  + N N+ A++W GYPG+ GG A+AD++ GK  
Sbjct: 524 LISKLSELGK-PLVVLQMGGGQVDSSSLKDNDNVNALIWGGYPGQSGGHALADIITGKRA 582

Query: 589 PGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKY 648
           P GRL  T Y  +Y ++ P   M LRP ++ G PG+TY +Y G  +Y FG+GL YT F+ 
Sbjct: 583 PAGRLVTTQYPAEYAEVFPAIDMNLRPNETSGNPGQTYMWYTGTPVYEFGHGLFYTTFE- 641

Query: 649 NLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTD 708
                T     N+  +    +  Y     KT     L+N         F    +N G  +
Sbjct: 642 ESTETTDAGSFNIQTVLTTPHSGYEHAQQKT-----LLN---------FTATVKNTGERE 687

Query: 709 GSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL 768
                +VY    A  A    K V+GF R+      + +   V    +S+   D   N +L
Sbjct: 688 SDYTALVYVNTTAGPAPYPKKWVVGFDRLGGLEPGDSQTLTVPVTVESVARTDEQGNRVL 747

Query: 769 PAGEHTIFVGN 779
             G + + + N
Sbjct: 748 YPGSYDVALNN 758


>gi|442803736|ref|YP_007371885.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
           DSM 8532]
 gi|442739586|gb|AGC67275.1| beta-xylosidase BxlB [Clostridium stercorarium subsp. stercorarium
           DSM 8532]
          Length = 715

 Score =  421 bits (1083), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 265/755 (35%), Positives = 388/755 (51%), Gaps = 98/755 (12%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           ++ D S  +  R KDLVSRMT++EKV Q+   +  + RLG+P Y WW+EALHGV+  G  
Sbjct: 6   VYLDPSYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAGT- 64

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTY 164
                    AT FP  I   A+F+E L  K+   +STE RA Y+            GLT+
Sbjct: 65  ---------ATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIYKGLTF 115

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           WSPNIN+ RDPRWGR  ET GEDP++  R  V +V+GLQ           + + LK ++C
Sbjct: 116 WSPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQGN---------HPKYLKAAAC 166

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
            KH+A   V +     R+ F+A V+++D+ ET+L  F+  V+E    SVM +YNR NG P
Sbjct: 167 AKHFA---VHSGPESLRHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNRTNGEP 223

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
            C    LL+  +RGEW   G++V+DC +I+    +H   A + E A A  ++ G DL+CG
Sbjct: 224 CCGSKTLLSDILRGEWGFKGHVVSDCWAIRDFHMHHHVTATAPESA-ALAVRNGCDLNCG 282

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENI 402
             + N    A+++G + E +ID+++  L    M+LG FD   Q  Y S+    +   E+ 
Sbjct: 283 NMFGNLL-IALKEGLITEEEIDRAVTRLMITRMKLGMFDPEDQVPYASISYDFVDCKEHR 341

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
           ELA + A++ IVLLKND   LPL+  K++++AV+GP+A++  A+IGNY G    Y++ + 
Sbjct: 342 ELALDVAKKSIVLLKND-GLLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEYVTVLD 400

Query: 463 GFSGYA----NVTYKTGCDDVACKSNN------SIFAASEAAKTADATIILAGLDLSVEA 512
           G    A     + Y  GC     +  N       I  A   A+ AD  I+  GLD ++E 
Sbjct: 401 GIREMAGDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTCAEHADVVIMCLGLDSTIEG 460

Query: 513 ESL---------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
           E +         D+ DL LPG Q +L+  V    K P++LV+++   + + +A  + +I 
Sbjct: 461 EEMHESNIYGSGDKPDLNLPGQQQELLEAVYATGK-PIVLVLLTGSALAVTWA--DEHIP 517

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
           AIL A YPG  GGRAIA V+FG+ NP G+LP+T+Y          T+  L          
Sbjct: 518 AILNAWYPGALGGRAIASVLFGETNPSGKLPVTFYR---------TTEELPDFTDYSMEN 568

Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
           RTY+F     LYPFG+GLSYT F Y+ L  +K                            
Sbjct: 569 RTYRFMKNEALYPFGFGLSYTTFDYSDLKLSK---------------------------- 600

Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
              + +R  + F   V   N G   G +VV VY K           Q+ G +RV + +G 
Sbjct: 601 ---DTIRAGEGFNVSVKVTNTGKMAGEEVVQVYIKDLEASWRVPNWQLSGMKRVRLESGE 657

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
              I F     + L +V     +++  GE  I+VG
Sbjct: 658 TAEITFEIRP-EQLAVVTDEGKSVIEPGEFEIYVG 691


>gi|326202986|ref|ZP_08192853.1| glycoside hydrolase family 3 domain protein [Clostridium
           papyrosolvens DSM 2782]
 gi|325987063|gb|EGD47892.1| glycoside hydrolase family 3 domain protein [Clostridium
           papyrosolvens DSM 2782]
          Length = 712

 Score =  420 bits (1080), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 266/754 (35%), Positives = 388/754 (51%), Gaps = 100/754 (13%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D SL +  R  DLVS+MTL+EK  QL   A  V RLG+P+Y WW+EALHGV+  G   
Sbjct: 6   YLDKSLSFKERAADLVSKMTLEEKASQLRYDAQPVERLGIPRYNWWNEALHGVARAGV-- 63

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYW 165
                   AT FP  I   A F++   +KI   ++TE RA YN            G+T+W
Sbjct: 64  --------ATVFPQAIGMAAMFDDEFLEKIADVIATEGRAKYNESAKKGDRDIYKGITFW 115

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPN+N+ RDPRWGR  ET GEDP++  R  V +V+GLQ           + + LK ++C 
Sbjct: 116 SPNVNIFRDPRWGRGHETYGEDPYLTSRLGVAFVKGLQG----------DGKYLKTAACA 165

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA   V +    DR+ FDA V+++D+ ET+L  FE  VKE    S+M +YNR NG P 
Sbjct: 166 KHYA---VHSGPEDDRHFFDAIVSQKDLYETYLPAFEALVKEAKVESIMGAYNRTNGEPC 222

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
                LL   +R  W   G++V+DC +I+   + H  +  +  ++VA  LK+G DL+CG 
Sbjct: 223 NGSKTLLKDILRDGWGFDGHVVSDCWAIKDFHEGHG-VTKTPTESVALALKSGCDLNCGN 281

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
            Y      A+++G + E DID++   L T  M+LG FD   ++ ++  +   S E+ +++
Sbjct: 282 MYL-LILLALKEGLITEEDIDRAAIRLMTTRMKLGMFDDDCEFDNIPYELNDSAEHNKIS 340

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF- 464
            EAA++ +VLLKND   LPL+S K+K VAV+GP+A++++A+  NY+G P + ++ I G  
Sbjct: 341 LEAAKKSMVLLKND-GLLPLDSKKIKNVAVIGPNADSSLALRANYSGTPSQNVTIIEGIR 399

Query: 465 ---SGYANVTYKTGC------DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
              S    V Y  G       D+   + ++ +  A  AA+ +D  ++  GLD SVE E  
Sbjct: 400 KRVSENTRVWYAMGSHLFLNRDEDLAQPDDRLKEAVSAAERSDVVVLCLGLDASVEGEQN 459

Query: 516 -----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
                      D+ DL LP  Q  L+N V    K P I+ ++S   + I  A       A
Sbjct: 460 DQGTVILDAGGDKADLNLPESQRNLLNAVLATGK-PTIVALLSGSALSIGDAADKA--AA 516

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           I+   YPG  GG A A+++FG ++P GRLP+T+Y          ++  L P        R
Sbjct: 517 IVQCWYPGAIGGLAFAEMIFGDYSPAGRLPVTFYK---------STEELPPFADYSMENR 567

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           TYKF  G  LYPFG+GLSYT F+Y                            S   CP  
Sbjct: 568 TYKFMKGDALYPFGFGLSYTSFEY----------------------------SNMVCPQT 599

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
           + N     +     VD QN GS D  +VV VY K            + GF+R+ +++G  
Sbjct: 600 VNN----GENLSVSVDVQNTGSVDSDEVVQVYIKDMDASVRVPKYSLCGFKRIHLKSGEK 655

Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           K + F   A  +++IVD A    +  GE T++ G
Sbjct: 656 KTVTFEV-ASNAMSIVDEAGKRHIENGEFTLYAG 688


>gi|388857998|emb|CCF48443.1| related to Beta-xylosidase [Ustilago hordei]
          Length = 782

 Score =  420 bits (1080), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 232/615 (37%), Positives = 350/615 (56%), Gaps = 26/615 (4%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
            CD ++P+  R   LV++ T +E +    ++A GVPRLG+P Y+WW+EALHGV+   PG 
Sbjct: 36  ICDPTIPFYTRATSLVNQFTTEELLNNTINYAPGVPRLGIPNYQWWTEALHGVAK-SPGV 94

Query: 114 HFDDVIP-----GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSP- 167
           +FD   P      AT FP  I   A+F++ L+++I   +++E RA  N G+AGL  +SP 
Sbjct: 95  NFDLSDPHAEFTSATQFPQTINLGATFDDDLYQQIASVIASEVRAYNNAGKAGLNLYSPL 154

Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
           NIN  RDPRWGR  ET GEDP  + R+AV+ V GLQ   G     +     L V++ CKH
Sbjct: 155 NINCFRDPRWGRGQETVGEDPLHMSRFAVSIVHGLQ---GPHAQNEAEGNKLTVAATCKH 211

Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
           + AYD++ +   +RY FDA V++QD+ +  L  F  CV++G A+++M SYN VN +P  A
Sbjct: 212 FLAYDLEQYDRGERYQFDAIVSKQDLSDFHLPQFRACVRDGGATTLMTSYNAVNNVPPSA 271

Query: 288 DPKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
               L    R  W L   H Y+ +DCD++  + D H++ A +  +A A+++ AG DLDCG
Sbjct: 272 SKYYLQTLARQAWGLDKTHNYVTSDCDAVANVYDGHRY-AQNYVEAAAKSINAGTDLDCG 330

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENI 402
             Y+   G A++Q       I +++  +Y  L+RLG+FD   S     L  +D+ S  + 
Sbjct: 331 ATYSENLGAALKQKLTDIATIRRAVIRMYASLVRLGYFDDPASQPLRQLTWKDVNSPSSQ 390

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA  +A   I LLKN  +TLP+     K +A++GP+ N + +  GNYAG     M+ + 
Sbjct: 391 RLAYTSALSSITLLKNLDSTLPIKQKPTK-IAIIGPYTNVSTSFSGNYAGPAAFNMTMVH 449

Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRED 519
             S     A + +  G D       +    A +    AD+ +   G+D S+E ES DR+D
Sbjct: 450 AASQVFPDAKIVWVNGTDISGPYIPSDAQDAVKLTSDADSVVFAGGIDASIERESHDRKD 509

Query: 520 LWLPGYQTQLINQVAEV----AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
           +  P  Q +LI+++++      K  +++V    G +D A  +++  + A++WAGYPG+  
Sbjct: 510 IAWPPNQLRLIHELSQSRKKDKKSKLVVVQFGGGQLDGASLKSDDAVGALVWAGYPGQSA 569

Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLY 635
             A+ D++ GK  P GRLP+T Y   Y+  LP ++M LRP    GYPGRTYK+Y G   Y
Sbjct: 570 SLAVWDILAGKAVPAGRLPVTQYPASYIDGLPESAMSLRP--KAGYPGRTYKWYKGVPTY 627

Query: 636 PFGYGLSYTQFKYNL 650
           PFG+GL YT F  +L
Sbjct: 628 PFGHGLHYTTFSASL 642


>gi|392962219|ref|ZP_10327666.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           DSM 17108]
 gi|392452977|gb|EIW29882.1| glycoside hydrolase family 3 domain protein [Pelosinus fermentans
           DSM 17108]
          Length = 724

 Score =  420 bits (1079), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 259/759 (34%), Positives = 394/759 (51%), Gaps = 97/759 (12%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           M  F + D +L +  R KDLVSRMT++EKV Q+   +  + RLG+P Y WWSEALHGV+ 
Sbjct: 1   MEIFDYQDETLSFEQRAKDLVSRMTIEEKVTQMVYSSPAISRLGIPAYNWWSEALHGVAR 60

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------A 160
            G           AT FP  I   A+F+E L   + + +S EARA ++  +         
Sbjct: 61  AGV----------ATVFPQAIGLAATFDEKLIYDVAEIISIEARAKFHEFQRKGDHGIYK 110

Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
           GLT+WSPN+N+ RDPRWGR  ET GEDP++ GR  V++++GLQ           + + L+
Sbjct: 111 GLTFWSPNVNIFRDPRWGRGQETFGEDPYLTGRLGVSFIKGLQGQ---------DKKYLR 161

Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
            ++C KH+A   V +    +R+ FDA V+ +D+ ET+L  F+ CVKE +  +VM +YNRV
Sbjct: 162 AAACAKHFA---VHSGPESERHRFDAVVSPKDLRETYLPAFKECVKEANVEAVMGAYNRV 218

Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD 340
           NG P C    LL +T+R EW   G++V+DC +I+   +NH+ +  S  ++VA  L  G D
Sbjct: 219 NGEPCCGSNILLKETLRQEWGFTGHVVSDCWAIKDFHENHR-VTSSAPESVALALNNGCD 277

Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICS 398
           L+CG  Y N    A Q+G V E  I+ ++  L    M+LG FD +    Y ++G      
Sbjct: 278 LNCGNMYLNLL-IAYQEGLVTEEAINTAVTRLMLTRMKLGLFDAAENVPYTNIGFHQNDC 336

Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM 458
            E+ E A E +++ +VLLKN+ + LPL+   + ++AV+GP+AN+  A+ GNY G    Y+
Sbjct: 337 QEHREFALEVSKKTLVLLKNENHLLPLDRNTISSIAVIGPNANSREALTGNYFGTASNYI 396

Query: 459 SPIAGFSGYAN----VTYKTGCDDVACKSNN------SIFAASEAAKTADATIILAGLDL 508
           + + G          V+Y  GC     K+ N          A   A+ AD  ++  GLD 
Sbjct: 397 TVLEGIREAVGKDTMVSYAQGCHLYRDKAENLGEERDRFAEAVSTAERADLVVMCMGLDA 456

Query: 509 SVEAE---------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
           S+E E         S D+  L LPG Q +L+  + +  K P+ILV+++   + + +A   
Sbjct: 457 SIEGEEGDVSNEYASGDKLGLNLPGLQQELLEVIYKTGK-PIILVLLAGSALAVTWAA-- 513

Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
             + AI+ A YPG EGG+A+A  +FG+++P G+LPIT+Y          T+  L      
Sbjct: 514 EKVPAIIQAWYPGAEGGKALASAIFGEYSPVGKLPITFYR---------TTEELPEFTDY 564

Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
               RTY++     LYPFGYGL YT F Y         Q+ LN+ + C   N        
Sbjct: 565 SMKNRTYRYMTKEALYPFGYGLGYTTFAYR--------QLQLNRTKICAGEN-------V 609

Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
           +C  +LV               +N G+    + V +Y K         I  + G Q++ +
Sbjct: 610 QC-SILV---------------KNTGNFASDETVQLYIKDVKASVEVPIWALQGIQKIHL 653

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
             G  + I F   + + L +++   N +L  G   I+VG
Sbjct: 654 LPGAEQEISFTLTS-RQLALINEKGNCILEPGIFEIYVG 691


>gi|145230215|ref|XP_001389416.1| exo-1,4-beta-xylosidase xlnD [Aspergillus niger CBS 513.88]
 gi|74626559|sp|O00089.2|XYND_ASPNG RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|292495287|sp|A2QA27.1|XYND_ASPNC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|2181180|emb|CAB06417.1| xylosidase [Aspergillus niger]
 gi|134055533|emb|CAK37179.1| xylosidase xlnD-Aspergillus niger
 gi|350638468|gb|EHA26824.1| hypothetical protein ASPNIDRAFT_205670 [Aspergillus niger ATCC
           1015]
          Length = 804

 Score =  420 bits (1079), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 268/719 (37%), Positives = 388/719 (53%), Gaps = 50/719 (6%)

Query: 49  MSSFLFCD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           + S L CD ++ PY  R   L+S  TLDE +   G+   GV RLGLP Y+ WSEALHG+ 
Sbjct: 63  LRSHLICDETATPYD-RAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD 121

Query: 108 NVGPGTHFDD--VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
                 +F D      ATSFP  ILTTA+ N +L  +I   +ST+ RA  N GR GL  +
Sbjct: 122 RA----NFSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVY 177

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN  R P WGR  ETPGED  +   YA  Y+ G+Q  +   N        LK+++  
Sbjct: 178 APNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESN--------LKLAATA 229

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA YD++NW    R   D  +T+QD+ E +   F +  ++    SVMC+YN VNG+P+
Sbjct: 230 KHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPA 289

Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           CAD   L   +R  +    HGY+ +DCD+   + + H + + S+  A A+ + AG D+DC
Sbjct: 290 CADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDC 348

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-----YVSLGKQDICS 398
           G  Y      ++  G +   DI++ +  LYT L++ G+FD +       Y  L   D+  
Sbjct: 349 GTTYQWHLNESIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLE 408

Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKV----KTVAVVGPHANATVAMIGNYAGIP 454
            +   ++ +AA +GIVLLKN  N LPL          TVA++GP ANAT  ++GNY G  
Sbjct: 409 TDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNA 468

Query: 455 CRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
              +SP A F  +GY  V +  G   ++  S +   AA  AA++AD  I   G+D ++EA
Sbjct: 469 PYMISPRAAFEEAGY-KVNFAEGT-GISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEA 526

Query: 513 ESLDREDLWLPGYQTQLINQVAEVA-KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
           E+LDRE +  PG Q  LI ++A  A K P+I++ M  G VD +  + NTN+ A+LW GYP
Sbjct: 527 EALDRESIAWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYP 586

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+ GG A+ D++ GK NP GRL  T Y   Y +  P T M LRP      PG+TYK+Y G
Sbjct: 587 GQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTG 644

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
             +Y FG+GL YT F  +  S T T +V LN +Q   +  +   AS T+ P         
Sbjct: 645 EAVYEFGHGLFYTTFAES-SSNTTTKEVKLN-IQDILSQTHEDLASITQLP--------- 693

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ-VIGFQRV-FVRAGRNKRIK 748
                F  + +N G  +     +V++       A Y K+ ++G+ R+  V+ G  + ++
Sbjct: 694 --VLNFTANIRNTGKLESDYTAMVFANTSDAGPAPYPKKWLVGWDRLGEVKVGETRELR 750


>gi|290889355|gb|ADD69953.1| xylosidase HistTag [synthetic construct]
          Length = 810

 Score =  420 bits (1079), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 268/719 (37%), Positives = 388/719 (53%), Gaps = 50/719 (6%)

Query: 49  MSSFLFCD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           + S L CD ++ PY  R   L+S  TLDE +   G+   GV RLGLP Y+ WSEALHG+ 
Sbjct: 63  LRSHLICDETATPYD-RAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD 121

Query: 108 NVGPGTHFDD--VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
                 +F D      ATSFP  ILTTA+ N +L  +I   +ST+ RA  N GR GL  +
Sbjct: 122 RA----NFSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVY 177

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN  R P WGR  ETPGED  +   YA  Y+ G+Q  +   N        LK+++  
Sbjct: 178 APNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESN--------LKLAATA 229

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA YD++NW    R   D  +T+QD+ E +   F +  ++    SVMC+YN VNG+P+
Sbjct: 230 KHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPA 289

Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           CAD   L   +R  +    HGY+ +DCD+   + + H + + S+  A A+ + AG D+DC
Sbjct: 290 CADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDC 348

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-----YVSLGKQDICS 398
           G  Y      ++  G +   DI++ +  LYT L++ G+FD +       Y  L   D+  
Sbjct: 349 GTTYQWHLNESIAAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLE 408

Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKV----KTVAVVGPHANATVAMIGNYAGIP 454
            +   ++ +AA +GIVLLKN  N LPL          TVA++GP ANAT  ++GNY G  
Sbjct: 409 TDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNA 468

Query: 455 CRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
              +SP A F  +GY  V +  G   ++  S +   AA  AA++AD  I   G+D ++EA
Sbjct: 469 PYMISPRAAFEEAGY-KVNFAEGT-GISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEA 526

Query: 513 ESLDREDLWLPGYQTQLINQVAEVA-KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
           E+LDRE +  PG Q  LI ++A  A K P+I++ M  G VD +  + NTN+ A+LW GYP
Sbjct: 527 EALDRESIAWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGYP 586

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+ GG A+ D++ GK NP GRL  T Y   Y +  P T M LRP      PG+TYK+Y G
Sbjct: 587 GQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTG 644

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
             +Y FG+GL YT F  +  S T T +V LN +Q   +  +   AS T+ P         
Sbjct: 645 EAVYEFGHGLFYTTFAES-SSNTTTKEVKLN-IQDILSQTHEDLASITQLP--------- 693

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ-VIGFQRV-FVRAGRNKRIK 748
                F  + +N G  +     +V++       A Y K+ ++G+ R+  V+ G  + ++
Sbjct: 694 --VLNFTANIRNTGKLESDYTAMVFANTSDAGPAPYPKKWLVGWDRLGEVKVGETRELR 750


>gi|375150455|ref|YP_005012896.1| Beta-glucosidase [Niastella koreensis GR20-10]
 gi|361064501|gb|AEW03493.1| Beta-glucosidase [Niastella koreensis GR20-10]
          Length = 711

 Score =  419 bits (1078), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 266/758 (35%), Positives = 380/758 (50%), Gaps = 90/758 (11%)

Query: 42  FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE 101
           F  +  Q  + +F +   P   RV DL+ ++TL EK+  LG  +  V RLG+P Y WW+E
Sbjct: 5   FIVINTQAQTSVFRNPQQPMEARVNDLLHQLTLPEKISLLGYRSKEVERLGIPAYNWWNE 64

Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA- 160
           ALHGV+  G           AT FP  I   A+FN+ L K+    +STEARA YNL  A 
Sbjct: 65  ALHGVARAGV----------ATVFPQAIGMAATFNDDLLKEAATVISTEARAKYNLSLAQ 114

Query: 161 -------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
                  GLT+WSPNIN+ RDPRWGR  ET GEDPF+       +V+GLQ  +       
Sbjct: 115 GRHLQYMGLTFWSPNINIFRDPRWGRGQETYGEDPFLTAHMGTAFVKGLQGND------- 167

Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
              R LK S+C KH+A   V +     R+ F+A V E+D+ ET+L  F   V  G   SV
Sbjct: 168 --PRYLKASACAKHFA---VHSGPENGRHTFNAIVDEKDLRETYLYAFHALVDAG-VESV 221

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           MC+YNRVN  P C+   LLN  +R EW   G++V DC ++  +   HK +    E A A 
Sbjct: 222 MCAYNRVNDQPCCSGNFLLNSILRNEWKFKGHVVTDCGALDDIFMRHKVMPSGVEVAAA- 280

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVS 390
            +KAG++LDC          AV+Q  + E DID SL +L    ++LGF+D    +P Y  
Sbjct: 281 AIKAGVNLDCSNVLQKDVEKAVEQKLLNEKDIDSSLAHLLRTQIKLGFYDDPTANPFY-K 339

Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
            G   + +  +  LA   A++ +VLLKN    LPL+  K   + VVG ++ +  A++GNY
Sbjct: 340 YGADSVANTAHATLARAMAQQSMVLLKNSNQLLPLDKKKYPAIMVVGTNSASMDALLGNY 399

Query: 451 AGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL---- 506
            G+  R +S + G +   +   +   D  +  ++ + F    AA  AD T+ + GL    
Sbjct: 400 HGVSNRAVSFVEGITNAVDAGTRVEYDQGSDYNDTTHFGGIWAAGNADITVAVIGLTPVY 459

Query: 507 -----DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
                D  + A+  D+ D+ LP      +  + +  K P+I VI +   VDI+  E   +
Sbjct: 460 EGEEGDAFLAAKGGDKPDMSLPAAHIAFMKALRKANKKPIIAVITAGSAVDISAIEPYAD 519

Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
             AIL A YPGE+GG A+AD++FGK +P GRLP+T+Y            +P    D+   
Sbjct: 520 --AILLAWYPGEQGGNALADILFGKVSPAGRLPVTFYQS-------FADVP--AYDNYAM 568

Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRC 681
            GRTY+++NG   YPFGYGLSYT F Y        I+                       
Sbjct: 569 KGRTYRYFNGKVQYPFGYGLSYTSFAYEWQQMPANIRT---------------------- 606

Query: 682 PGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRA 741
                      D   F +  +N GS DG +VV VY + PA +    +K++  F+RV V+A
Sbjct: 607 ---------AKDSVSFSIKVKNTGSMDGDEVVQVYVEYPA-VERMPLKELKAFKRVHVKA 656

Query: 742 GRNKRIKFVFNACKSLNIVDYAANTL-LPAGEHTIFVG 778
           G  + ++    A   L   D A ++  L  G + IF G
Sbjct: 657 GGEETVQLTIPAS-DLQKWDLATSSWKLYPGSYNIFAG 693


>gi|292495281|sp|C0STH4.1|XYND_ASPAC RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|225878711|dbj|BAH30675.1| beta-xylosidase [Aspergillus aculeatus]
          Length = 805

 Score =  419 bits (1078), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 265/749 (35%), Positives = 385/749 (51%), Gaps = 46/749 (6%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +S  L CDS+     R   LVS  TL+E +   G+ + GVPRLGLP Y+ WSEALHG+  
Sbjct: 54  LSKNLVCDSTASPYDRAAALVSLFTLEELIANTGNTSPGVPRLGLPPYQVWSEALHGLGR 113

Query: 109 VGPGTHFDD---VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
                +F D   +  G  SFP+ IL+ A+FN +L  +I   +ST+ RA  N GR GL  +
Sbjct: 114 A----NFTDNGALHAGRPSFPSPILSAAAFNRTLINQIASIISTQGRAFNNAGRFGLDVY 169

Query: 166 SPNINVARDPRWGRITETPGEDPFVV-GRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           SPNIN  R P WGR  ETPGED + +   YA  Y+ G+Q          +N   LK+++ 
Sbjct: 170 SPNINTFRHPVWGRGQETPGEDAYTLTAAYAYEYITGIQG--------GVNPEHLKLAAT 221

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
            KH+A YD++NW    R   D  +T+QD+ E +   F +  ++    S MCSYN VNG+P
Sbjct: 222 AKHFAGYDIENWDNHSRLGNDVNITQQDLAEYYTPQFLVAARDAHVHSFMCSYNAVNGVP 281

Query: 285 SCADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           SC++   L   +R  +    HGY+  DC ++  + + H + A+ +  A A  + AG D+D
Sbjct: 282 SCSNTFFLQTLLRDTFSFVDHGYVSGDCGAVYGVFNPHGYAAN-EPSAAADAILAGTDID 340

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG----SPQYVSLGKQDICS 398
           CG  Y      ++  G V   DI++    LY  L+ LG+FDG    S  Y SLG  D+  
Sbjct: 341 CGTSYQYHFNESITTGAVARDDIERGFIRLYANLVELGYFDGNSSSSNPYRSLGWPDVQK 400

Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNS---AKVKTVAVVGPHANATVAMIGNYAGIPC 455
            +   ++ EAA EGIVLLKND  TLPL S    K K++A++GP ANAT  + GNY G   
Sbjct: 401 TDAWNISYEAAVEGIVLLKND-GTLPLASPSEGKNKSIALIGPWANATTQLQGNYYGDAP 459

Query: 456 RYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
             +SP+  F+      +     +++  S  +  AA  AA+ AD  + L G+D ++EAE+ 
Sbjct: 460 YLISPVDAFTAAGYTVHYAPGTEISTNSTANFSAALSAARAADTIVFLGGIDNTIEAEAQ 519

Query: 516 DREDLWLPGYQTQLINQVA--EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
           DR  +  PG Q +LI+Q+A  +    P+++  M  G VD +  + N  + A+LW GYPG+
Sbjct: 520 DRSSIAWPGNQLELISQLAAQKSDDQPLVVYQMGGGQVDSSSLKFNAKVNALLWGGYPGQ 579

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
            GG A+ D++ G   P GRL  T Y   Y +      M LRP ++   PG+TY +Y G  
Sbjct: 580 SGGLALRDILTGARAPAGRLTTTQYPAAYAESFSALDMNLRPNETTQNPGQTYMWYTGEP 639

Query: 634 LYPFGYGLSYTQFKYNLLSFTKT-IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD 692
           +Y FG+GL YT F  +     KT    N+  L    + + T+   +T             
Sbjct: 640 VYAFGHGLFYTTFNASSAQAAKTKYTFNITDLTSAAHPDTTTVGQRT------------- 686

Query: 693 DYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI-KQVIGFQRVFVRAGRNKRIKF-V 750
             F F     N G  D     +VY+       + Y  K ++GF R+   A      +  V
Sbjct: 687 -LFNFTASITNSGQRDSDYTALVYANTSTAGPSPYPNKWLVGFDRLAAVAKEGGTAELNV 745

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             A   L  VD A NT+L  G + + + N
Sbjct: 746 PVAVDRLARVDEAGNTVLFPGRYEVALNN 774


>gi|310797011|gb|EFQ32472.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Glomerella graminicola M1.001]
          Length = 767

 Score =  419 bits (1078), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 271/735 (36%), Positives = 396/735 (53%), Gaps = 49/735 (6%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD +L    R   LV  +T++EK+Q L   A G PR+GLP Y WWSEALHGV+   PGT+
Sbjct: 43  CDRTLSPPERAAALVKALTVEEKLQNLVSKAQGAPRIGLPAYNWWSEALHGVA-YAPGTY 101

Query: 115 F---DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
           F   D     +TS+P  +L  A+F++ L ++IG A+  EARA  N G AGL YW+PN+N 
Sbjct: 102 FPEGDVEFNSSTSYPMPLLMAAAFDDELIEQIGAAIGIEARAWGNAGWAGLDYWTPNVNP 161

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ-DVEGHENATDLNSRPLKVSSCCKHYAA 230
            +DPRWGR +ETPGED   V RYA    RGL   V G +          +V S CKHYA 
Sbjct: 162 FKDPRWGRGSETPGEDVLRVKRYAEYITRGLDGPVPGEQR---------RVISTCKHYAG 212

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
            D ++W G  R+ FDA++T QD+ E +L PF+ C ++    S+MC+YN VNG+PSCA+  
Sbjct: 213 NDFEDWNGTSRHDFDAKITAQDLAEYYLMPFQQCARDSKVGSIMCAYNAVNGVPSCANEY 272

Query: 291 LLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
           LL   +R  W+    + Y+ +DC+++  +  NHK+ A +     A   +AG+D  C    
Sbjct: 273 LLQNILREHWNWTEHNNYVTSDCEAVLDVSANHKY-APTNAAGTAICFEAGMDTSCEYTG 331

Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAA 406
           ++    A  QG +KE  +D++L  LY  L+R G+FDG    Y  LG +D+ S E   LA 
Sbjct: 332 SSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGHEAIYAKLGWKDVNSAEAQSLAL 391

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI-AGFS 465
           +AA EGIVLLKN+  TLPL+      VA++G  A+A   + G Y+G      +P  A   
Sbjct: 392 QAAVEGIVLLKNN-GTLPLDLKPSHKVAMIGFWADAPDKLQGGYSGRAAHLHTPAYAARQ 450

Query: 466 GYANVTYKTG-CDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
              ++T  +G        S+N   AA EAA+ AD  +   GLD S   E+LDR DL  P 
Sbjct: 451 LGLDITLASGPVLQRNNASDNWTAAALEAAEGADYILYFGGLDTSAAGETLDRTDLEWPE 510

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q  LI +++ + K P+++ ++     D    + +  + +ILWA +PG++GG AI  ++ 
Sbjct: 511 AQLMLIKKLSALGK-PLVVNLLGDQLDDTPLLQLD-EVSSILWANWPGQDGGVAIMKLIT 568

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G+ +P GRLP+T Y  +Y  ++P+TSM LRP     YPGRTY++Y+ P +  FG+GL YT
Sbjct: 569 GEKSPAGRLPVTQYPSNYTDLIPMTSMDLRPTSQ--YPGRTYRWYDKP-IKRFGFGLHYT 625

Query: 645 QFKYNL-LSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
            FK  +  +F KT+++    L  C N +  +      CP                V   N
Sbjct: 626 TFKAEVGGAFPKTLRI--ADLVGCGNEHPDT------CPAP-----------PLPVSITN 666

Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDY 762
            G+     V + Y           IK +  ++R+  V  G    +   +     +   D 
Sbjct: 667 TGNRTSDYVALAYLSGEYGPRPYPIKTLSAYKRLRDVAPGETATVDLAWT-LGDIARHDE 725

Query: 763 AANTLLPAGEHTIFV 777
             NT+L  GE+TI +
Sbjct: 726 QGNTVLYPGEYTITI 740


>gi|358385386|gb|EHK22983.1| glycoside hydrolase family 3 protein [Trichoderma virens Gv29-8]
          Length = 795

 Score =  419 bits (1077), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 270/730 (36%), Positives = 397/730 (54%), Gaps = 38/730 (5%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L CDSS  Y+ R + L+S  TL+E +    +   GVPRLGLP Y+ W+EALHG+      
Sbjct: 61  LVCDSSAGYAERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFA 120

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           T        ATSFP  IL+ A+ N +L  +I   +ST+ARA  N GR GL  ++PNIN  
Sbjct: 121 TK-GGQFQWATSFPMPILSMAALNRTLIHQIADIISTQARAFSNSGRYGLDVYAPNINGF 179

Query: 173 RDPRWGRITETPGEDPFVV-GRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
           R P WGR  ETPGED  V+   Y   Y+ G+Q     EN        LK+++  KH+A Y
Sbjct: 180 RSPLWGRGQETPGEDANVLTSAYTYEYITGMQGGVDPEN--------LKIAATAKHFAGY 231

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
           D++NW    R  FDA +T+QD+ E +   F    +   + S MC+YN VNG+PSCA+   
Sbjct: 232 DLENWNNQSRLGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNGVPSCANSFF 291

Query: 292 LNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
           L   +R  W     GY+ +DCD++  + + H + ++    A A +L+AG D+DCGQ Y  
Sbjct: 292 LQTLLRESWGFPEWGYVSSDCDAVYNVWNPHDYASNQSS-AAASSLRAGTDIDCGQTYPW 350

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAA 409
               +   G+V   +I++S+  LY  L+RLG+FD   +Y SLG +D+   +   ++ EAA
Sbjct: 351 HLNESFVAGEVSRGEIERSVTRLYANLVRLGYFDKKNEYRSLGWKDVVKTDAWNISYEAA 410

Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI--AGFSGY 467
            EGIVLLKND  TLPL S KV+++A++GP ANAT  M GNY G     +SP+  A  +GY
Sbjct: 411 VEGIVLLKND-GTLPL-SKKVRSIALIGPWANATTQMQGNYFGAAPYLISPLEAAKKAGY 468

Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
             V ++ G  + A  S      A  AAK +DA I   G+D +VE E  DR D+  PG Q 
Sbjct: 469 -QVNFELGT-ETASTSTAGFAKAIAAAKKSDAIIFAGGIDNTVEQEGADRTDIAWPGNQL 526

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
            LI Q++E+ K P++++ M  G VD +  ++N  + +++W GYPG+ GG A+ D++ GK 
Sbjct: 527 DLIKQLSELGK-PLVVLQMGGGQVDSSSLKSNKKVNSLVWGGYPGQSGGVALFDILSGKR 585

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
            P GRL  T Y  DYV   P   M LRP D    PG+TY +Y G  +Y FG G+ YT FK
Sbjct: 586 APAGRLVSTQYPADYVHQFPQNDMNLRP-DGKSNPGQTYIWYTGKPVYQFGDGIFYTTFK 644

Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
             L   +K ++ N++ +    +  YT      + P              F  + +N G T
Sbjct: 645 ETLSGSSKGLKFNVSSVLAAPHPGYT---YSEQTP-----------VLTFTANIENSGKT 690

Query: 708 DGSDVVIVYSKPPAEIAATYI-KQVIGFQRV-FVRAGRNKRIKFVFNACKSLNIVDYAAN 765
           D     +++ +      A Y  K ++GF R+  ++ G + ++        +L  VD   N
Sbjct: 691 DSPYSAMLFVRTANAGPAPYPNKWLVGFDRLATIKPGHSSKLSIPI-PVSALARVDSLGN 749

Query: 766 TLLPAGEHTI 775
            ++  G++ +
Sbjct: 750 RIVYPGKYEL 759


>gi|194400335|gb|ACF61038.1| beta-xylosidase [Aspergillus awamori]
          Length = 804

 Score =  419 bits (1076), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 269/719 (37%), Positives = 387/719 (53%), Gaps = 50/719 (6%)

Query: 49  MSSFLFCD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           + S L CD S+ PY  R   L+S  TLDE +   G+   GV RLGLP Y+ WSEALHG+ 
Sbjct: 63  LRSHLICDESATPYD-RAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD 121

Query: 108 NVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
                 +F D      ATSFP  ILTTA+ N +L  +I   +ST+ RA  N GR GL  +
Sbjct: 122 RA----NFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVY 177

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN  R P WGR  ETPGED  +   YA  Y+ G+Q  +   N        LK+++  
Sbjct: 178 APNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSN--------LKLAATA 229

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA YD++NW    R   D  +T+QD+ E +   F +  ++    SVMC+YN VNG+P+
Sbjct: 230 KHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPA 289

Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           CAD   L   +R  +    HGY+ +DCD+   + + H + + S+  A A+ + AG D+DC
Sbjct: 290 CADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDC 348

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-----YVSLGKQDICS 398
           G  Y      ++  G +   DI+K +  LYT L++ G+FD +       Y  L   D+  
Sbjct: 349 GTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLE 408

Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKV----KTVAVVGPHANATVAMIGNYAGIP 454
            +   ++ +AA +GIVLLKN  N LPL          TVA++GP ANAT  ++GNY G  
Sbjct: 409 TDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNA 468

Query: 455 CRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
              +SP A F  +GY  V +  G   ++  S +   AA  AA++AD  I   G+D ++EA
Sbjct: 469 PYMISPRAAFEEAGY-KVNFAEGT-GISSTSTSGFAAALSAARSADVIIYAGGIDNTLEA 526

Query: 513 ESLDREDLWLPGYQTQLINQVAEVA-KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
           E+LDRE +  PG Q  LI ++A  A   P+I++ M  G VD +  + NTN+ A+LW GYP
Sbjct: 527 EALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYP 586

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+ GG A+ D++ GK NP GRL  T Y   Y +  P T M LRP      PG+TYK+Y G
Sbjct: 587 GQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTG 644

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
             +Y FG+GL YT F  +  S T T +V LN +Q   +  +   AS T+ P         
Sbjct: 645 EAVYEFGHGLFYTTFAES-SSNTTTKEVKLN-IQDILSQTHEELASITQLP--------- 693

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIK 748
                F  + +N G  +     +V++       A Y +K ++G+ R+  V+ G  + ++
Sbjct: 694 --VLNFTANIKNTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELR 750


>gi|333379783|ref|ZP_08471502.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
           22836]
 gi|332884929|gb|EGK05184.1| hypothetical protein HMPREF9456_03097 [Dysgonomonas mossii DSM
           22836]
          Length = 737

 Score =  418 bits (1074), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 273/772 (35%), Positives = 400/772 (51%), Gaps = 99/772 (12%)

Query: 42  FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE 101
           FS LG    ++ F +++L    RV DLVS++TL+EKV Q+ +    + RL +P Y WW+E
Sbjct: 18  FSLLG---QNYPFQNTNLSIDERVNDLVSKLTLEEKVAQMLNNTPAIERLNIPAYNWWNE 74

Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA- 160
            LHG+      T +       T FP  I   A++N+ L K++  A+S E RA+YN   + 
Sbjct: 75  CLHGIGR----TDYK-----VTVFPQAIGMAAAWNKELMKEVASAISDEGRAIYNDATSK 125

Query: 161 -------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
                  GLTYW+PNIN+ RDPRWGR  ET GEDPF+ G    ++V GLQ  +       
Sbjct: 126 GNREIYYGLTYWTPNINIFRDPRWGRGQETYGEDPFLTGVLGKSFVAGLQGDD------- 178

Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
             ++ LK ++C KHYA   V +     R+ F+  VT+ D+ +T+L  F   V E   + V
Sbjct: 179 --TKYLKAAACAKHYA---VHSGPENTRHTFNTFVTDYDLWDTYLPAFRNLVVEAKVAGV 233

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           MC+YN  NG P C +  L+ + +R +W+  GY+ +DC +I     +HK   D+K  A A 
Sbjct: 234 MCAYNAYNGEPCCGNNFLMQEILREKWNFTGYVTSDCGAIDDFYQHHKTHPDAKY-AAAD 292

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSL 391
            +  G D+DCG        +AV+ G + E  ID SLK L+T+  RLG FD +   +Y  +
Sbjct: 293 AVYNGTDIDCGNEAYKALVDAVKTGIITEKQIDISLKRLFTIRFRLGMFDPAENVKYSQI 352

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
               + S ++ +LA +  RE IVLLKN+ NTLPL S K+K VAVVGP+AN  V+++GNY 
Sbjct: 353 STSVLESQKHKDLALKITRESIVLLKNENNTLPL-SKKLKKVAVVGPNANNEVSVLGNYN 411

Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNN--SIFAASEAAKTADATIILAGL 506
           G P   ++P          A V Y+ G D V   +N+   + A  +  K  D  I + G+
Sbjct: 412 GFPTEIVTPYEAVKQKLKGAEVIYEKGIDFVTPSTNSKEEVSALVKRLKDVDVVIFVGGI 471

Query: 507 DLSVEAESL----------DREDLWLPGYQTQLINQ-VAEVAKGPVILVIMSAGGVDIAF 555
              +E E +          DR  + LP  QT  +   VAE  K P + V+M+  G  IA 
Sbjct: 472 SPELEGEEMPVKIEGFTGGDRTSIKLPKIQTDFMKALVAE--KIPTVFVMMT--GSAIAT 527

Query: 556 AETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP 615
              + NI AI+ A Y G++ G AIADV+FG +NP G+LP+T+Y  D       + +P   
Sbjct: 528 EWESQNIPAIVNAWYGGQDAGTAIADVLFGDYNPSGKLPVTFYAKD-------SDLP--A 578

Query: 616 VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
            +S     RTY+++NG  LYPFGYGLSYT+F+Y+ +    TI    N             
Sbjct: 579 FNSYEMKNRTYRYFNGEVLYPFGYGLSYTKFEYSPIQVPSTIDTGNNA------------ 626

Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
                               +  V  +N G  +G +VV +Y   P       +  + GF 
Sbjct: 627 --------------------KVSVSIKNTGKVEGEEVVQLYISYPDTKGQKPLYALKGFN 666

Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIH 787
           RV ++AG +K ++F  +  + L +VD A    + AG+  IF+G G    P H
Sbjct: 667 RVSLKAGESKTVEFNLSP-RELGLVDDAGILKVSAGKRKIFIG-GSSPTPTH 716


>gi|121809149|sp|Q4AEG8.1|XYND_ASPAW RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|73486695|dbj|BAE19756.1| beta-xylosidase [Aspergillus awamori]
          Length = 804

 Score =  417 bits (1073), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 267/719 (37%), Positives = 387/719 (53%), Gaps = 50/719 (6%)

Query: 49  MSSFLFCD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           + S L CD ++ PY  R   L+S  TLDE +   G+   GV RLGLP Y+ WSEALHG+ 
Sbjct: 63  LRSHLICDETATPYD-RAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD 121

Query: 108 NVGPGTHFDD--VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
                 +F D      ATSFP  ILTTA+ N +L  +I   +ST+ RA  N GR GL  +
Sbjct: 122 RA----NFSDSGAYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVY 177

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN  R P WGR  ETPGED  +   YA  Y+ G+Q  +   N        LK+++  
Sbjct: 178 APNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPESN--------LKLAATA 229

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA YD++NW    R   D  +T+QD+ E +   F +  ++    SVMC+YN VNG+P+
Sbjct: 230 KHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVQSVMCAYNAVNGVPA 289

Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           CAD   L   +R  +    HGY+ +DCD+   + + H + + S+  A A+ + AG D+DC
Sbjct: 290 CADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDC 348

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-----YVSLGKQDICS 398
           G  Y      ++  G +   DI++ +  LYT L++ G+FD +       Y  L   D+  
Sbjct: 349 GTTYQWHLNESITAGDLSRDDIEQGVIRLYTTLVQAGYFDSNTTKANNPYRDLSWSDVLE 408

Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKV----KTVAVVGPHANATVAMIGNYAGIP 454
            +   ++ +AA +GIVLLKN  N LPL          TVA++GP ANAT  ++GNY G  
Sbjct: 409 TDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNA 468

Query: 455 CRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
              +SP A F  +GY  V +  G   ++  S +   AA  AA++AD  I   G+D ++EA
Sbjct: 469 PYMISPRAAFEEAGY-KVNFAEGT-GISSTSTSGFAAALSAAQSADVIIYAGGIDNTLEA 526

Query: 513 ESLDREDLWLPGYQTQLINQVAEVA-KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
           E+LDRE +  PG Q  LI ++A  A K P+I++ M  G VD +  + NT + A+LW GYP
Sbjct: 527 EALDRESIAWPGNQLDLIQKLASAAGKKPLIVLQMGGGQVDSSSLKNNTKVSALLWGGYP 586

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+ GG A+ D++ GK NP GRL  T Y   Y +  P T M LRP      PG+TYK+Y G
Sbjct: 587 GQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTG 644

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
             +Y FG+GL YT F  +  S T T +V LN +Q   +  +   AS T+ P         
Sbjct: 645 EAVYEFGHGLFYTTFAES-SSNTTTKEVKLN-IQDILSRTHEELASITQLP--------- 693

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ-VIGFQRV-FVRAGRNKRIK 748
                F  + +N G  +     +V++       A Y K+ ++G+ R+  V+ G  + ++
Sbjct: 694 --VLNFTANIRNTGKLESDYTAMVFANTSDAGPAPYPKKWLVGWDRLGEVKVGETRELR 750


>gi|358393086|gb|EHK42487.1| glycoside hydrolase family 3 protein [Trichoderma atroviride IMI
           206040]
          Length = 794

 Score =  417 bits (1071), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 267/732 (36%), Positives = 396/732 (54%), Gaps = 39/732 (5%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L CDS+  Y  R + L+S  TL+E +    +   GVPRLGLP Y+ W+EALHG+      
Sbjct: 62  LVCDSTAGYVERAQALISLFTLEELILNTQNSGPGVPRLGLPNYQVWNEALHGLDRANFA 121

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           T   +   G TSFP  IL+ A+ N +L  +I   +ST+ARA  N GR GL  ++PNIN  
Sbjct: 122 TKGGEFEWG-TSFPMPILSMAALNRTLIHQIADIISTQARAFSNNGRYGLDVYAPNINGF 180

Query: 173 RDPRWGRITETPGEDPFVV-GRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
           R P WGR  ETPGED  V+   Y   Y+ G+Q     EN        LK+++  KH+A Y
Sbjct: 181 RSPLWGRGQETPGEDANVLTSAYTYEYITGMQGGVDPEN--------LKIAATAKHFAGY 232

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
           D++N+    R  FDA +T+QD+ E +   F    +   + S MC+YN VNG+PSC++   
Sbjct: 233 DLENYNNQSRLGFDAIITQQDLSEYYTPQFLAASRYAKSHSFMCAYNSVNGVPSCSNSFF 292

Query: 292 LNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
           L   +R  W    +GY+ +DCD+I  + + H + A+S+  A A +LKAG D+DCGQ Y  
Sbjct: 293 LQTLLRESWGFPEYGYVSSDCDAIYNVWNPHNY-ANSQSSAAADSLKAGTDIDCGQTYPW 351

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAA 409
               +   G V   +I++S+  LY  L+RLG+FD   +Y SLG +D+   +   ++ EAA
Sbjct: 352 HLNESFVAGTVSRGEIERSVTRLYANLVRLGYFDKKNEYRSLGWKDVVKTDAWNISYEAA 411

Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI--AGFSGY 467
            EGIVLLKND  TLPL S KV+++A++GP  NAT  + GNY G     +SP+  A  +GY
Sbjct: 412 VEGIVLLKND-GTLPL-SKKVRSIALIGPWVNATEQLQGNYFGTAPYLISPLQAAKKAGY 469

Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
             V Y+ G   +  ++      A  AAK +DA I + G+D ++E E  DR D+  PG Q 
Sbjct: 470 -EVNYELGT-GINNQTTAGFAKAIAAAKKSDAIIFIGGIDNTIEQEGADRTDIAWPGNQL 527

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
            LI Q++EV K P++++ M  G VD +  ++N  + +++W GYPG+ GG A+ D++ GK 
Sbjct: 528 DLIKQLSEVGK-PLVVLQMGGGQVDSSSIKSNKKVNSLVWGGYPGQSGGYALFDILSGKR 586

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
            P GRL  T Y  +YV       M LRP D    PG+TY +Y G  +Y FG GL YT FK
Sbjct: 587 APAGRLVSTQYPAEYVHQFAQNDMNLRP-DGKKNPGQTYIWYTGKPVYQFGDGLFYTTFK 645

Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
              L    T++ N +++    +  YT      + P            F F  + QN G T
Sbjct: 646 -ETLGKQSTLKFNASQILGAGHPGYTYSE---QTP-----------VFTFTANIQNSGKT 690

Query: 708 DGSDVVIVYSKPPAEIAATYI-KQVIGFQRV-FVRAGRNKRIKFVFNACKSLNIVDYAAN 765
                 + + +        Y  K ++GF R+  ++ G +  +        +L+ VD   N
Sbjct: 691 ASPYSAMAFVRTSNAGPKPYPNKWLVGFDRLATIKPGHSSTLSIPI-PLNALSRVDSNGN 749

Query: 766 TLLPAGEHTIFV 777
            ++  G++ + +
Sbjct: 750 KIVYPGKYELVL 761


>gi|354508473|gb|AER26905.1| beta-xylosidase 3 [synthetic construct]
          Length = 778

 Score =  417 bits (1071), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 268/719 (37%), Positives = 387/719 (53%), Gaps = 50/719 (6%)

Query: 49  MSSFLFCD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           + S L CD S+ PY  R   L+S  TLDE +   G+   GV RLGLP Y+ WSEALHG+ 
Sbjct: 37  LRSHLICDESATPYD-RAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD 95

Query: 108 NVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
                 +F D      ATSFP  ILTTA+ N +L  +I   +ST+ RA  N GR GL  +
Sbjct: 96  RA----NFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVY 151

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN  R P WGR  ETPGED  +   YA  Y+ G+Q  +   N        LK+++  
Sbjct: 152 APNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSN--------LKLAATA 203

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA YD++NW    R   D  +T+QD+ E +   F +  ++    SVMC+YN V+G+P+
Sbjct: 204 KHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVDGVPA 263

Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           CAD   L   +R  +    HGY+ +DCD+   + + H + + S+  A A+ + AG D+DC
Sbjct: 264 CADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDC 322

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-----YVSLGKQDICS 398
           G  Y      ++  G +   DI+K +  LYT L++ G+FD +       Y  L   D+  
Sbjct: 323 GTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLE 382

Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKV----KTVAVVGPHANATVAMIGNYAGIP 454
            +   ++ +AA +GIVLLKN  N LPL          TVA++GP ANAT  ++GNY G  
Sbjct: 383 TDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNA 442

Query: 455 CRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
              +SP A F  +GY  V +  G   ++  S +   AA  AA++AD  I   G+D ++EA
Sbjct: 443 PYMISPRAAFEEAGY-KVNFAEGT-GISSTSTSGFAAALSAARSADVIIYAGGIDNTLEA 500

Query: 513 ESLDREDLWLPGYQTQLINQVAEVA-KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
           E+LDRE +  PG Q  LI ++A  A   P+I++ M  G VD +  + NTN+ A+LW GYP
Sbjct: 501 EALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYP 560

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+ GG A+ D++ GK NP GRL  T Y   Y +  P T M LRP      PG+TYK+Y G
Sbjct: 561 GQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTG 618

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
             +Y FG+GL YT F  +  S T T +V LN +Q   +  +   AS T+ P         
Sbjct: 619 EAVYEFGHGLFYTTFAES-SSNTTTKEVKLN-IQDILSQTHEELASITQLP--------- 667

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIK 748
                F  + +N G  +     +V++       A Y +K ++G+ R+  V+ G  + ++
Sbjct: 668 --VLNFTANIKNTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELR 724


>gi|4235093|gb|AAD13106.1| beta-xylosidase [Aspergillus niger]
          Length = 804

 Score =  416 bits (1070), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 268/719 (37%), Positives = 387/719 (53%), Gaps = 50/719 (6%)

Query: 49  MSSFLFCD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           + S L CD S+ PY  R   L+S  TLDE +   G+   GV RLGLP Y+ WSEALHG+ 
Sbjct: 63  LRSHLICDESATPYD-RAASLISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLD 121

Query: 108 NVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
                 +F D      ATSFP  ILTTA+ N +L  +I   +ST+ RA  N GR GL  +
Sbjct: 122 RA----NFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVY 177

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN  R P WGR  ETPGED  +   YA  Y+ G+Q  +   N        LK+++  
Sbjct: 178 APNINTFRHPVWGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSN--------LKLAATA 229

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA YD++NW    R   D  +T+QD+ E +   F +  ++    SVMC+YN V+G+P+
Sbjct: 230 KHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVDGVPA 289

Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           CAD   L   +R  +    HGY+ +DCD+   + + H + + S+  A A+ + AG D+DC
Sbjct: 290 CADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDC 348

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-----YVSLGKQDICS 398
           G  Y      ++  G +   DI+K +  LYT L++ G+FD +       Y  L   D+  
Sbjct: 349 GTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLE 408

Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKV----KTVAVVGPHANATVAMIGNYAGIP 454
            +   ++ +AA +GIVLLKN  N LPL          TVA++GP ANAT  ++GNY G  
Sbjct: 409 TDAWNISYQAATQGIVLLKNSNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNA 468

Query: 455 CRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
              +SP A F  +GY  V +  G   ++  S +   AA  AA++AD  I   G+D ++EA
Sbjct: 469 PYMISPRAAFEEAGY-KVNFAEGT-GISSTSTSGFAAALSAARSADVIIYAGGIDNTLEA 526

Query: 513 ESLDREDLWLPGYQTQLINQVAEVA-KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
           E+LDRE +  PG Q  LI ++A  A   P+I++ M  G VD +  + NTN+ A+LW GYP
Sbjct: 527 EALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYP 586

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+ GG A+ D++ GK NP GRL  T Y   Y +  P T M LRP      PG+TYK+Y G
Sbjct: 587 GQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTG 644

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
             +Y FG+GL YT F  +  S T T +V LN +Q   +  +   AS T+ P         
Sbjct: 645 EAVYEFGHGLFYTTFAES-SSNTTTKEVKLN-IQDILSQTHEELASITQLP--------- 693

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIK 748
                F  + +N G  +     +V++       A Y +K ++G+ R+  V+ G  + ++
Sbjct: 694 --VLNFTANIKNTGKLESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELR 750


>gi|307719075|ref|YP_003874607.1| glycoside hydrolase family protein [Spirochaeta thermophila DSM
           6192]
 gi|306532800|gb|ADN02334.1| glycoside hydrolase family 3 [Spirochaeta thermophila DSM 6192]
          Length = 693

 Score =  416 bits (1070), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 273/736 (37%), Positives = 392/736 (53%), Gaps = 92/736 (12%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R+  L+SRM+++EK   +   A GVPRLG+P Y WW+EALHGV+N G           AT
Sbjct: 6   RMTSLLSRMSIEEKAGLMVHRAKGVPRLGIPNYNWWNEALHGVANSGE----------AT 55

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYN-LGRA-------GLTYWSPNINVARDP 175
            FP  I   A+F+  L +++  A+S EARA +N +G+        GLT+WSPNIN+ RDP
Sbjct: 56  VFPQAIGLAATFDPDLVRRVADAISREARAKFNAVGKERAAEYERGLTFWSPNINIYRDP 115

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDPF+  +  V +V+GLQ    +          L+V++C KHYA +    
Sbjct: 116 RWGRGQETYGEDPFLTSKIGVAFVKGLQGDHPYY---------LRVAACAKHYAVH--SG 164

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
            +G+ R+ FDARV+E+D+ ET+L  FE  VK G   +VM +YNRVNG P+C   +LL + 
Sbjct: 165 PEGL-RHVFDARVSEKDLWETYLPAFEALVKAG-VEAVMGAYNRVNGEPACGSKRLLEEI 222

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
           +R +W   G++V+DC +I     +HK   D  E ++A  L+AG DL+CG  Y +   +AV
Sbjct: 223 LRKKWGFKGHVVSDCWAIADFHLHHKVTKDPIE-SIAMALEAGCDLNCGNTYEHLL-DAV 280

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
           + G V E  +D+S+  L + L RLG F     YV L   DI  + +  LA EAA + +VL
Sbjct: 281 KAGAVSEELVDRSVARLLSTLDRLGLFTDDHPYVRLSLADIDWEAHRALAREAAEKSVVL 340

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA----NVT 471
           LKN+   LPL+  K++ + V GP+A   VA++GNYAG+  R ++ + G +GYA     VT
Sbjct: 341 LKNN-GILPLDRRKLRYIYVTGPNAANPVALLGNYAGVSSRLVTVLEGITGYAGPGITVT 399

Query: 472 YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR---------EDLWL 522
           YK GC  +     N I  AS  A+ AD T+ + G D +VE E  D           DL L
Sbjct: 400 YKIGC-PLQGNKINPIDWASGVARYADVTVAVMGRDSAVEGEEGDAIFSDNYGDLSDLNL 458

Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
              Q   + ++ E+ K P+++V++S  G  +   E      AI++A YPGEEGG AIA V
Sbjct: 459 SREQIDYLRRIKEIGK-PLVVVLLS--GAPVCSPELEELADAIVYAWYPGEEGGNAIARV 515

Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
           +FG+ +P GRLPIT+  G  V  LP       P       GRTY++     LYPFG+GLS
Sbjct: 516 LFGEVSPSGRLPITFPKG--VDQLP-------PFTDYSMEGRTYRYMKEEPLYPFGFGLS 566

Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ 702
           Y  F Y                      +  S AS+         D R  +  E   + +
Sbjct: 567 YATFSYR---------------------DPKSSASRW--------DKR--ETLEVVCEVE 595

Query: 703 NVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDY 762
           N  S    +VV +Y +         +  + GF RV +  G   +++FV +  + L+ +D 
Sbjct: 596 NTSSIPADEVVQLYVRWEDAPFRVPLWSLKGFTRVSLGTGERIQVRFVLSP-EDLSFIDE 654

Query: 763 AANTLLPAGEHTIFVG 778
               +LP G     VG
Sbjct: 655 KGRKVLPEGRLRFHVG 670


>gi|310795958|gb|EFQ31419.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Glomerella graminicola M1.001]
          Length = 824

 Score =  416 bits (1068), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 270/755 (35%), Positives = 388/755 (51%), Gaps = 56/755 (7%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD +L    R   LV+ +T+ EK+  L + A GVPRL +P YEWWSE LHGV++  PGT 
Sbjct: 65  CDETLSPKERAAALVAELTIWEKLDNLVNEAPGVPRLAIPPYEWWSEGLHGVAS-SPGTK 123

Query: 115 FDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW------- 165
           F        ATSFP  I+  ++F++ L K IG+ VS EARA  N GR+GL  +       
Sbjct: 124 FAKSGNFSYATSFPQPIVLGSAFDDDLVKAIGEVVSKEARAFSNRGRSGLDLYVSSISRH 183

Query: 166 --------------SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
                         SPNIN  +DPRWGR  ETPGEDPF +  Y    + GL   EG + +
Sbjct: 184 IEPEVRDDMLTEPESPNINAFKDPRWGRGQETPGEDPFHLQNYVAAMLTGL---EGGDPS 240

Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
                   K+ + CKHYAA D +N+KGVDR  FDA +T QD+ E +L PF+ C  +    
Sbjct: 241 K-------KLIATCKHYAANDFENYKGVDRAGFDANITTQDLSEYYLPPFKTCAVDKKVG 293

Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKE 328
           S MCSYN +NG P CA+P LL   +R  W  +G   Y+  DCD + +MV +H +  D   
Sbjct: 294 SFMCSYNAINGEPLCANPYLLEDILRQHWGWNGDGQYVSTDCDCVALMVSHHHYAPDLGH 353

Query: 329 DAVAQTLKAGLDLDCGQYY-TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---G 384
            A A  +KAG DL+C  +  +     A  Q  + E ++DKSL  +YT L+ +G FD   G
Sbjct: 354 -AAAWAMKAGTDLECNAFPGSEALQLAWNQSLISEKEVDKSLTRMYTALVSVGQFDSARG 412

Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSA-KVKTVAVVGPHANAT 443
            P   SL   D+ + E  +LA +A  EG VLLKND   LPL++A + K  A++GP  NAT
Sbjct: 413 QP-LRSLSWDDVNTKEAQKLAYQAVIEGAVLLKND-GILPLSAAWREKKYALIGPWINAT 470

Query: 444 VAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
             M GNY G P  Y+  +   +    + +          +++S   A ++A  A   +  
Sbjct: 471 TQMQGNYFG-PAPYLISLYQAAKEFGLDFTYSLGSRINSTDDSFKQALDSAHAAALIVFA 529

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
            G+D ++EAE+ DR+ L  P  Q  L+  V+ + K PVI++    G VD      N +I 
Sbjct: 530 GGVDNTLEAETRDRKTLAWPESQLDLLRAVSALGK-PVIVLQFGGGQVDDTELLANHSIN 588

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
           A+LW GYPG+ GG+A+ D++FG+  P GRL +T Y   Y + +P T M LRP       G
Sbjct: 589 ALLWGGYPGQSGGKAVIDLLFGRAAPAGRLSVTQYPASYNEDVPSTDMNLRPGPGNSGLG 648

Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
           RTY +YNG  + P+G+GL YT F   L +   +  +   ++    + +Y S        G
Sbjct: 649 RTYMWYNGDAVVPYGFGLHYTTFDAKLKARQASALIKTEEVSSLLSNDYVS--------G 700

Query: 684 VLV-NDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
            LV   +         +   N G+     V +++ +  A       K + G+ R      
Sbjct: 701 TLVWQQILTKPVVSVLITVSNTGNVASDYVALLFLRSNAGPTPQPTKTLAGYHRFRNIQP 760

Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
            ++  + V    + L  VD   N +L  G + +FV
Sbjct: 761 GDRSEREVSITIERLVRVDELGNRVLHPGSYELFV 795


>gi|322512556|gb|ADX05682.1| putative carbohydrate-active enzyme [uncultured organism]
          Length = 717

 Score =  415 bits (1067), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 262/761 (34%), Positives = 399/761 (52%), Gaps = 106/761 (13%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           M+   + D +  +  R + LV  MTL+EKV Q    A  + RLG+P Y +W+EALHGV+ 
Sbjct: 1   MTDKAWLDETKTFEERAQALVCEMTLEEKVFQTLFNAPAIERLGVPAYNYWNEALHGVAR 60

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-------- 160
            G           AT FP  I   ASF+E L  ++   +STEARA +N+ +         
Sbjct: 61  AGV----------ATVFPQAIGLAASFDEELLGQVADTISTEARAKFNMQQKFGDRDIYK 110

Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
           GLT+WSPN+N+ RDPRWGR  ET GEDPF+ GR  V+++RG+Q           + R +K
Sbjct: 111 GLTFWSPNVNIFRDPRWGRGHETFGEDPFLSGRLGVSFIRGMQGD---------DERYMK 161

Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
           V++C KH+A   V +     R+ F+A V+EQD+ ET+L  F  CV E    +VM +YNR 
Sbjct: 162 VAACAKHFA---VHSGPEDQRHSFNAVVSEQDLRETYLPAFHACVTEAGVEAVMGAYNRT 218

Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF--LADSKEDAVAQTLKAG 338
           NG   C   KLL   +RGEW   G++ +DC +++   D H+F  +  ++E+ VA  + +G
Sbjct: 219 NGEACCGSKKLLVDILRGEWGFRGHVTSDCWALK---DFHEFHMVTKNQEETVALAMNSG 275

Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDI 396
            DL+CG  Y +    AV+ G V+E+ ID+++  L+T  M+LG FD S +  Y  +G   +
Sbjct: 276 CDLNCGNLYVHLL-QAVRDGLVEESVIDRAVTRLFTTRMKLGLFDRSEEVPYNGIGYDRV 334

Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
            ++ N +L  EA+R  + LLKN    LPL+ +K++T+ VVGP+A+   A++GNY G    
Sbjct: 335 DTEANRKLNREASRRTVCLLKNADGLLPLDISKLRTIGVVGPNADNRKALVGNYEGTASE 394

Query: 457 YMSPIAGFSGYA----NVTYKTGC----DDVA--CKSNNSIFAASEAAKTADATIILAGL 506
           Y++ + G    A     V Y  GC    D V    + N+ I  A   A+ +D  I + GL
Sbjct: 395 YVTVLDGIRELAGDDVRVVYSEGCHLFRDRVQGLGQPNDRIAEARAVAELSDVVIAVMGL 454

Query: 507 DLSVEAE---------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
           D  +E E         S D+ +L LPG Q +++  + E  K PV+LV++    + I +AE
Sbjct: 455 DPGLEGEEGDQGNEFASGDKPNLELPGLQGEVLKALVESGK-PVVLVLLGGSALAIPWAE 513

Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD 617
              ++ AIL A YPG +GGRA+ADV+FG+  P G+LP+T+Y          TS  L    
Sbjct: 514 --EHVPAILDAWYPGAQGGRAVADVLFGRACPEGKLPVTFYR---------TSEELPAFT 562

Query: 618 SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDAS 677
                 RTY++   P LYPFGYGLSYT ++                       N T++ S
Sbjct: 563 DYSMKNRTYRYMKQPALYPFGYGLSYTSWELT---------------------NTTAEGS 601

Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
                         DD    +   +N G+  G+  V VY K P  +A     Q+ G +++
Sbjct: 602 -------------VDDGVVCRAVLRNTGAMAGAQTVQVYVKAP--LATGPNAQLKGLRKI 646

Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
            ++ G +  +    +  ++  + +     +L  GE+ I++G
Sbjct: 647 RLQPGESAEVAISLDK-EAFGVYNEKGLRVLLPGEYKIYIG 686


>gi|380696433|ref|ZP_09861292.1| glycoside hydrolase [Bacteroides faecis MAJ27]
          Length = 739

 Score =  414 bits (1064), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 266/757 (35%), Positives = 389/757 (51%), Gaps = 93/757 (12%)

Query: 47  LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV 106
           L    F F D  LP   RV+DLVSR+TL+EKV+Q+ +    V RLG+P Y WW+E LHG+
Sbjct: 20  LAQEKFPFRDPQLPVEQRVEDLVSRLTLEEKVKQMLNSTPPVERLGIPAYNWWNECLHGI 79

Query: 107 SNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------ 160
                 T +       T FP  I   A++N++L K++  +++ E RA+YN  +       
Sbjct: 80  GR----TKYH-----VTVFPQAIGMAAAWNDALIKEVASSIADEGRAIYNDTQRKEDYSQ 130

Query: 161 --GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
              LTYW+PNIN+ RDPRWGR  ET GEDP++  R    +V+GLQ           N R 
Sbjct: 131 YHALTYWTPNINIFRDPRWGRGQETYGEDPYLTARIGEAFVQGLQGD---------NPRY 181

Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
           LK S+C KHYA   V +    +R+ F++ V+  D+ +T+L  F   V +   S VMC+YN
Sbjct: 182 LKASACAKHYA---VHSGPEKNRHSFNSDVSTYDLWDTYLPAFRTLVVDAKVSGVMCAYN 238

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
              G P C +  L+   +R +W+  GY+ +DC +I  + ++HK   D+   A       G
Sbjct: 239 AFQGQPCCGNDLLMQSILRDKWNFTGYVTSDCGAIDDIFNHHKTHPDAATAAADAVFH-G 297

Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDI 396
            DLDCG         AV+ G + E  +D S+K L+T+  RLG FD      Y  +    +
Sbjct: 298 TDLDCGHSAYLALVKAVKDGIITEKQLDVSVKRLFTIRFRLGLFDPVELVDYARIPISIL 357

Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
              ++ +LA + ARE +VLLKNDQ  LPL   K+K V V+GP+A++  +++GNY G P R
Sbjct: 358 ECRKHQDLAKQLARESMVLLKNDQ-LLPLQKNKLKKVVVMGPNADSRESLLGNYNGNPSR 416

Query: 457 YMSPIAG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
            ++P+        G+  V Y  G D V   S + +      AK ADA I + G+   +E 
Sbjct: 417 MLTPLQAIRERLGGWTEVEYIEGVDHVNTISADDLKQYVNRAKGADAVIFIGGISPRLEG 476

Query: 513 ESL----------DREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAETNTN 561
           E +          DR  + LP  QTQ++   A VA+  P + V+M+   + I +   N  
Sbjct: 477 EEMPVSKDGFDGGDRTTIALPAVQTQMMK--AWVAEHIPTVFVMMTGSALAIPWEAQN-- 532

Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
           + AIL A Y G+ GG AIADV+FG +NP G+LP+T+Y  D       + +P    +S   
Sbjct: 533 VPAILNAWYGGQYGGEAIADVLFGDYNPSGKLPVTFYAKD-------SDLP--DFESYDM 583

Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRC 681
            GRTY+++NG  LYPFGYGLSYT F Y+ L   K           CR             
Sbjct: 584 QGRTYRYFNGKALYPFGYGLSYTSFAYSSLKLPKV----------CRT------------ 621

Query: 682 PGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRA 741
                     D   E  V  +N G T+G +VV +Y   P +     +  + GF+R+ ++A
Sbjct: 622 ---------TDKEIEVTVTVKNTGHTEGEEVVQLYVSHPDKKILVPLTALKGFKRIQLKA 672

Query: 742 GRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           G  +R+ F  ++ + L+ VD      + AG   I VG
Sbjct: 673 GEAQRVTFSLSS-EDLSCVDENGIRKVWAGTVKIQVG 708


>gi|398406144|ref|XP_003854538.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
 gi|339474421|gb|EGP89514.1| hypothetical protein MYCGRDRAFT_38178 [Zymoseptoria tritici IPO323]
          Length = 884

 Score =  413 bits (1061), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 271/730 (37%), Positives = 386/730 (52%), Gaps = 55/730 (7%)

Query: 32  SPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL 91
           SPV + DP   +K          CD+SL    R+  L+S+MT++EK   L D A G+PR+
Sbjct: 132 SPVCLTDPFCANKA---------CDTSLSQDDRIAALISQMTVEEKATNLVDGALGLPRI 182

Query: 92  GLPQYEWWSEALHGVSNVGPGTHFDDV----IPGATSFPTVILTTASFNESLWKKIGQAV 147
           GLP YEWW+EALHGV+    G  FD         ATSFP  IL  A+F++ L   +   +
Sbjct: 183 GLPPYEWWNEALHGVAG-SRGVSFDSPNGSDFSYATSFPLPILMGAAFDDPLIYDVASII 241

Query: 148 STEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG 207
             EARA  N   +G  +W+PN+N   DPRWGR  E P ED F   RY  + V GLQ   G
Sbjct: 242 GKEARAFANYAHSGYDFWTPNMNTFLDPRWGRGLEVPTEDSFHAQRYVASLVPGLQ---G 298

Query: 208 HENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKE 267
            +  TD      ++ + CKH+A YDV+     +R+  +   T QD+ E +L  F+ CV++
Sbjct: 299 GKEKTDHK----QIIATCKHFAVYDVE----TNRHAQNYEPTPQDLGEYYLPAFKTCVRD 350

Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLA 324
            +  S+MCSYN V G+P+CA    L   +R +W+    + Y+ +DC++++ +   H F  
Sbjct: 351 VNVGSIMCSYNAVYGVPACASEYFLQDVLRDQWNFNEPYHYVTSDCEAVKDIWTPHNF-T 409

Query: 325 DSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
           D++  A A  L AG D +CG  Y      +V      E  +D SL  LY  L  +G+FDG
Sbjct: 410 DTEPAAAAVALNAGTDTNCGTSYLQLN-TSVANNWTTEAQMDISLTRLYNALFTVGYFDG 468

Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
            P+Y  L   D+ +      A  AA EGI LLKND   LPL  +   +VA++GP ANAT 
Sbjct: 469 QPEYDGLSFADVSTPFAQATAYRAASEGITLLKND-GLLPLKKS-YNSVALIGPWANATT 526

Query: 445 AMIGNYAGIPCRYMSPIAGFSG-YANVTYKTGCDDVACKSNNSIFAAS--EAAKTADATI 501
            M G Y GI    +SP+A     + ++++  G    A  S N+   AS   AA+ AD  I
Sbjct: 527 QMQGIYQGIAPYLVSPLAAAQAQWGHISFTNG---TAINSTNTTGFASALSAARDADVII 583

Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
              G+D S+E ES DR  +  PG Q  L+ Q++E+ K P+++V    G VD +    N N
Sbjct: 584 YAGGIDSSIEKESRDRTSISWPGNQLDLVQQLSELGK-PLVVVQFGGGQVDDSALLRNKN 642

Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
           + +++WAGYPG++GG A+ DV+ GK +P GRL IT Y  DY+  + L    LRP DS   
Sbjct: 643 VNSLVWAGYPGQDGGSALIDVLVGKQSPAGRLTITQYPADYINQISLFDPNLRPSDSS-- 700

Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRC 681
           PGRTYK+YN   + PFGYGL YT F+++   + K  Q + +           S AS T  
Sbjct: 701 PGRTYKWYNKEPVLPFGYGLHYTTFEFD---WAKAPQASYDIASLVD-----STASYTTS 752

Query: 682 PGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI-KQVIGFQRVF-V 739
           P    ND     + E  +   N GS     V +V+ + P    A Y  K +  + R+  +
Sbjct: 753 PK--KND--ASPWTELSIKVHNSGSLGSDYVGLVFLRTPNAGPAPYPNKWLASYARLHGL 808

Query: 740 RAGRNKRIKF 749
            AG +  + F
Sbjct: 809 SAGASAELSF 818


>gi|23304843|emb|CAD48309.1| beta-xylosidase B [Clostridium stercorarium]
          Length = 715

 Score =  412 bits (1058), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 265/758 (34%), Positives = 389/758 (51%), Gaps = 104/758 (13%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           ++ D S  +  R KDLVSRMT++EKV Q+   +  + RLG+P Y WW+EALHGV+  G  
Sbjct: 6   VYLDPSYSFEERAKDLVSRMTIEEKVSQMLYNSPAIERLGIPAYNWWNEALHGVARAGT- 64

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTY 164
                    AT FP  I   A+F+E L  K+   +STE RA Y+            GLT+
Sbjct: 65  ---------ATMFPQAIGMAATFDEELIYKVADVISTEGRAKYHASSKKGDRGIYKGLTF 115

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           WSPNIN+ RDPRWGR  ET GEDP++  R  V +V+GLQ           + + LK    
Sbjct: 116 WSPNINIFRDPRWGRGQETYGEDPYLTARLGVAFVKGLQGN---------HPKYLKAGGM 166

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           CK+   + V       R+ F+A V+++D+ ET+L  F+  V+E    SVM +YNR NG P
Sbjct: 167 CKNILPFTV--VPESLRHEFNAVVSKKDLYETYLPAFKALVQEAKVESVMGAYNRTNGEP 224

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
            C    LL+  +RGEW   G++V+DC +I+    +H   A + E A A  ++ G DL+CG
Sbjct: 225 CCGSKTLLSDILRGEWGFKGHVVSDCWAIRDFHMHHHVTATAPESA-ALAVRNGCDLNCG 283

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENI 402
             + N    A+++G + E +ID+++  L    M+LG FD   Q  Y S+     C  E+ 
Sbjct: 284 NMFGNLL-IALKEGLITEEEIDRAVTRLMITRMKLGMFDPEDQVPYASISSFVDCK-EHR 341

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
           ELA + A++ IVLLKND   LPL+  K++++AV+GP+A++  A+IGNY G    Y++ + 
Sbjct: 342 ELALDVAKKSIVLLKND-GLLPLDRKKIRSIAVIGPNADSRQALIGNYEGTASEYVTVLD 400

Query: 463 GFSGYA----NVTYKTGCDDVACKSNN------SIFAASEAAKTADATIILAGLDLSVEA 512
           G    A     + Y  GC     +  N       I  A   A+ AD  I+  GLD ++E 
Sbjct: 401 GIREMAGDDVRIYYSVGCHLYKDRVENLGEPGDRIAEAVTCAEHADVVIMCLGLDSTIEG 460

Query: 513 ESL---------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
           E +         D+ DL LPG Q +L+  V    K P++LV+++   + + +A  + +I 
Sbjct: 461 EEMHESNIYGSGDKPDLNLPGQQQELLEAVYATGK-PIVLVLLTGSALAVTWA--DEHIP 517

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
           AIL A YPG  GGRAIA V+FG+ NP G+LP+T+Y          T+  L          
Sbjct: 518 AILNAWYPGALGGRAIASVLFGETNPSGKLPVTFYR---------TTEELPDFTDYSMEN 568

Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
           RTY+F     LYPFG+GLSYT F Y+ L  +K                            
Sbjct: 569 RTYRFMKNEALYPFGFGLSYTTFDYSDLKLSK---------------------------- 600

Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK---QVIGFQRVFVR 740
              + +R  + F   V   N G   G +VV VY K   ++ A++     Q+ G +RV + 
Sbjct: 601 ---DTIRAGEGFNVSVKVTNTGKMAGEEVVQVYIK---DLEASWRVPNWQLSGMKRVRLE 654

Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           +G    I F     + L +V     +++  GE  I+VG
Sbjct: 655 SGETAEITFEIRP-EQLAVVTDEGKSVIEPGEFEIYVG 691


>gi|154313073|ref|XP_001555863.1| hypothetical protein BC1G_05538 [Botryotinia fuckeliana B05.10]
          Length = 755

 Score =  412 bits (1058), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 252/603 (41%), Positives = 350/603 (58%), Gaps = 46/603 (7%)

Query: 55  CD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           CD SS PY+ R   L+S  TL EKV   G+ + GVPR+GLP YEWW+EALHG++   PGT
Sbjct: 34  CDTSSDPYT-RAAALISLFTLAEKVNNTGNTSPGVPRIGLPSYEWWNEALHGIAR-SPGT 91

Query: 114 HFDDVIPG---ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
            F         +TSFP  IL  A+F++ L  K+   VSTEARA  N+ R GL +W+PNIN
Sbjct: 92  TFAATGSNYSYSTSFPQPILMGATFDDELIHKVATQVSTEARAFNNVNRFGLNFWTPNIN 151

Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS-SCCKHYA 229
             +DPRWGR  ETPGEDPF    Y    + GLQ          L+  P K   + CKH+A
Sbjct: 152 PYKDPRWGRGQETPGEDPFHTSSYVNALITGLQG--------GLDDLPYKKGVATCKHFA 203

Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
            YD++N  G  RY FDA +  QD+ + +L PF+ C ++ +  SVMCSYN +NG+P+CAD 
Sbjct: 204 GYDLENSDGAIRYGFDAIIKSQDLRDYYLPPFQQCARDSNVQSVMCSYNAMNGVPTCADD 263

Query: 290 KLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
            LL   +R  W   +   ++ +DCD+++ + D H +   + E + A  L AG DLDCG +
Sbjct: 264 WLLQTLLREHWGWTEEDQWVTSDCDAVKNIWDYHNYTL-TPEQSAADALNAGTDLDCGTF 322

Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIEL 404
           +  + G+A  QG    + +D+SL   Y  L+RLG+FD      Y  L   ++ +    +L
Sbjct: 323 WPTYLGSAYDQGLYDISTLDRSLARRYASLVRLGYFDPPSVQPYRQLNWDNVSTPAAQQL 382

Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP-IAG 463
           A +AA +GIVLLKND   LPL S+ +  VA++GP ANAT  M GNY G      SP IA 
Sbjct: 383 ALQAAEDGIVLLKND-GILPL-SSNITNVALIGPLANATKQMQGNYYGTAPYLRSPLIAA 440

Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
            +    VTY  G  D+  ++     AA  AA++AD  I + G+D S+EAE +        
Sbjct: 441 QNAGFKVTYVQGA-DIDSQNTTDFSAAISAAQSADLVIYVGGIDNSIEAEEI-------- 491

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
                    +A ++  P+I+  M    +D +   +NT + A+LWAGYPG++GG AI +++
Sbjct: 492 ---------LANLST-PLIISQMGC-MIDSSSLLSNTGVNALLWAGYPGQDGGTAIFNIL 540

Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
            GK  P GRLPIT Y  +YV  + +T M L+P  S   PGRTYK+YNG  ++ +GYGL Y
Sbjct: 541 TGKTAPAGRLPITQYPSNYVNQVTMTDMNLQP--SRFNPGRTYKWYNGEPVFEYGYGLQY 598

Query: 644 TQF 646
           T F
Sbjct: 599 TTF 601


>gi|291518645|emb|CBK73866.1| Beta-glucosidase-related glycosidases [Butyrivibrio fibrisolvens
           16/4]
          Length = 713

 Score =  411 bits (1057), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 252/746 (33%), Positives = 393/746 (52%), Gaps = 103/746 (13%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R K+LVS+MT++EK  Q+   A  + RLG+P+Y WW+EALHGV+  G           AT
Sbjct: 8   RAKELVSQMTIEEKCSQMLHHAEAIDRLGIPKYCWWNEALHGVARAGD----------AT 57

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
            FP  I   A+F+E L +K+    STE RA YN            GLTYW+PN+N+ RDP
Sbjct: 58  VFPQAIGLGATFDEELVEKVADVTSTEGRAKYNEFTKHGDRDIYKGLTYWAPNVNIFRDP 117

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDP++ G+  + YVRGLQ         DL++   K ++C KH+A   V +
Sbjct: 118 RWGRGHETYGEDPYLTGQLGMAYVRGLQ-------GDDLDNP--KSAACAKHFA---VHS 165

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
               +R+HFDA+V +QD+ +T+L  F+  VK+    +VM +YNRVNG P+C   +LL   
Sbjct: 166 GPEAERHHFDAKVNDQDLYDTYLYAFKRLVKDAKVEAVMGAYNRVNGEPACGSKRLLKDI 225

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
           +RG+W   G++V+DC +I+   +NHK      E A A  +  G DL+CG  Y      A 
Sbjct: 226 LRGDWGFEGHVVSDCWAIRDFHENHKVTGCEVESA-ALAVNNGCDLNCGCVYEKLL-YAY 283

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFF-DGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
           +   V E  I +S++ L  + +RLG   +   +Y  +  + +   E+ ELA EAA+  +V
Sbjct: 284 KANLVTEETITESVERLIELRLRLGTLPERRSKYDDIPYEVVECKEHKELAIEAAKRSMV 343

Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKT 474
           LLKND   LPL   ++KT+ V+GP++N+ +A++GNY GI   Y++ + G   Y     + 
Sbjct: 344 LLKND-GLLPLKKDEIKTIGVIGPNSNSRMALVGNYEGISSEYITVLEGIQQYVGDDVRV 402

Query: 475 GCDD----------VACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SL 515
              D          V  ++ ++   A   A+ +D  ++  GLD ++E E         S 
Sbjct: 403 FHSDGTPLWKDRMHVLSEARDTFAEAMAVAEHSDVVVLAMGLDSTIEGEEGDAGNEFGSG 462

Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
           D++ L LPG Q +L+ ++  + K PV+L++++   +D+++A  N N+ AI+   YPG  G
Sbjct: 463 DKKGLKLPGLQQELLEKITAIGK-PVVLLVLAGSAMDLSWA--NENVNAIMHCWYPGARG 519

Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLY 635
           G+AIA V+FG+ +P G+LP+T+Y  D           L P +     GRTY+++ G  LY
Sbjct: 520 GKAIAQVLFGEDSPSGKLPLTFYKSD---------ADLPPFEDYSMEGRTYRYFKGTPLY 570

Query: 636 PFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYF 695
           PFGYGLSY+  +Y                         S+A   +  G +       D F
Sbjct: 571 PFGYGLSYSDIQY-------------------------SNAGIDKTEGAI------GDKF 599

Query: 696 EFKVDFQNVGSTDGSDVVIVYSK---PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFN 752
             KV  +N G     + V VY K       +A   ++++    +V +  G +K +    +
Sbjct: 600 TVKVTVKNAGDYKAHETVQVYVKDVEASTRVANCSLRKI---AKVELLPGESKEVSLELS 656

Query: 753 ACKSLNIVDYAANTLLPAGEHTIFVG 778
           A +   I+D   + ++  G+  +FVG
Sbjct: 657 A-RDFAIIDEKGHCIVEPGKFKVFVG 681


>gi|410617070|ref|ZP_11328046.1| beta-glucosidase [Glaciecola polaris LMG 21857]
 gi|410163339|dbj|GAC32184.1| beta-glucosidase [Glaciecola polaris LMG 21857]
          Length = 731

 Score =  411 bits (1056), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 259/751 (34%), Positives = 384/751 (51%), Gaps = 93/751 (12%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           ++ D  + ++ R   LV+ MT+DEK+ QL      + RL +PQY WW+EALHG++  G  
Sbjct: 28  VWFDPDISFAQRANLLVNAMTVDEKIAQLSHATPAIARLNVPQYNWWNEALHGIARNG-- 85

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTY 164
                    AT FP  I   A+F+  L  ++  A+S EARA Y + +        AGLT+
Sbjct: 86  --------KATIFPQAIGLAATFDPDLAHQVASAISDEARAKYAIAQSIGNQGQYAGLTF 137

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           W+PN+N+ RDPRWGR  ET GEDPF+  +    +V+GLQ  +          + LK +  
Sbjct: 138 WTPNVNIFRDPRWGRGQETYGEDPFLTAQMGTAFVKGLQGDD---------PKYLKSAGV 188

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
            KH+A   V +     R+HFD   +++D+ ET+L  FE  V +   + VMC+YN VNG P
Sbjct: 189 AKHFA---VHSGPESLRHHFDVEPSQKDLYETYLPAFEALVTQAKVAGVMCAYNAVNGEP 245

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
           +CA  +LL+  ++ +W  HGYIV+DC ++      HK      E A A  L++G++L+CG
Sbjct: 246 ACASAQLLDGILKKQWGFHGYIVSDCGALNDFQAGHKVTKSGPESA-ALALQSGVNLNCG 304

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENI 402
             Y +F   A++Q  V    ID+ L  L  +  +LGFFD  G   Y  +    I S E+I
Sbjct: 305 STYEHFLKAALEQNLVPLELIDQRLTQLLMIRFQLGFFDPAGLNPYNEVTPDVIHSPEHI 364

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            L+ + AR+ IVLLKND + LPL S  +K   V GP A ++  +IGNY GI    +S + 
Sbjct: 365 NLSRDVARKSIVLLKNDNHVLPL-SKDIKVPYVTGPFAASSDMLIGNYYGISDSLVSVLE 423

Query: 463 GFSGY----ANVTYKTGCDDVACKSN-NSIFAASEAAKTADATIILAGLDLSVEAESL-- 515
           G +G     +++ Y++G   +   +N N +  A + AKTADA I + G+   +E E +  
Sbjct: 424 GIAGKVSLGSSLNYRSGS--LPFHNNINPLNWAPQVAKTADAVIAVVGVSADMEGEEVDA 481

Query: 516 -------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
                  DR  + LP  Q   + Q+A   KGP+ILV+ +   VDI+  E   +  AILW 
Sbjct: 482 IASADRGDRVAITLPQNQVDYVKQLAAHKKGPLILVVAAGSPVDISDLEPLAD--AILWI 539

Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
            YPGE+GG A+ADV+FG  NP G LP+T+     +  LP       P D     GRTYKF
Sbjct: 540 WYPGEQGGNAVADVLFGDTNPSGHLPLTFVKS--IDDLP-------PFDDYAMTGRTYKF 590

Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
                LYPFG+G SYT+F +N L+ ++   +    L                        
Sbjct: 591 LEKAPLYPFGFGRSYTEFSFNDLTVSQGKAIEGEAL------------------------ 626

Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
                     V+ +N G   G  VV  Y  P A +    I  +  F+R+ +     + ++
Sbjct: 627 -------TLSVEVENRGDIAGETVVQAYLSPIARMNNEAISSLKSFKRIHLAPKETRWVE 679

Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
                 K L  V+ A  T+ P G +++ VG+
Sbjct: 680 LTIQG-KDLYQVNNAGETVWPQGRYSLAVGD 709


>gi|255957137|ref|XP_002569321.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211591032|emb|CAP97251.1| Pc21g23540 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 791

 Score =  410 bits (1055), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 260/746 (34%), Positives = 393/746 (52%), Gaps = 49/746 (6%)

Query: 7   SLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQ--------MSSFLFCDSS 58
           +LL F+   AL   +T+  D N  + P     PG  +K+           +S  + CD++
Sbjct: 8   ALLAFA-PTALSQANTSYADYNTQAQPDLY--PGTTAKVDFSFPDCSNGPLSKTMVCDTT 64

Query: 59  LPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV 118
                R   L++  T +E V   G+    +PRLGLP Y+ W+EALHG+      T F D 
Sbjct: 65  AKPHDRAAALIAMFTFEELVNSTGNVMPAIPRLGLPPYQVWNEALHGLDRANL-TEFGD- 122

Query: 119 IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWG 178
              ATSFP+ ILT A+ N +L  +IG  VST+ RA  N GR GL  +SPNIN  R P WG
Sbjct: 123 YSWATSFPSPILTMAALNRTLINQIGGIVSTQGRAFNNGGRYGLDVYSPNINSFRHPVWG 182

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  ETPGED  +   Y + Y+ GLQ          L+ + LK+++  KH+A YD++NW  
Sbjct: 183 RGQETPGEDIQLCSVYGLEYITGLQG--------GLDPKELKLAATAKHFAGYDIENWGN 234

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
             R   D  ++  D    +   F   V++    SVM SYN VNG+P+ A+  LL   +R 
Sbjct: 235 HSRLGNDMSISAFDFASYYAPQFVTAVRDARVHSVMASYNAVNGVPASANSFLLQTLLRD 294

Query: 299 EWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQ 356
            W+    GY+ +DCDS+  + + H + + S   A A++++AG D+DCG  Y  +   +  
Sbjct: 295 TWNFVEDGYVSSDCDSVYNVFNPHGYAS-SASLAAAKSIQAGTDIDCGATYQLYLNQSFT 353

Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIELAAEAAREGIVL 415
           QG++  ++I+++    Y+ L+ LG+FDG + +Y  L   D+ + +   ++ EAA EGIVL
Sbjct: 354 QGEISRSEIERAATRFYSNLVSLGYFDGDNSKYRDLDWSDVVATDAWNISYEAAVEGIVL 413

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY-ANVTYKT 474
           LKND  TLPL S    +VA++GP AN T  M GNY G       P+A       +V Y  
Sbjct: 414 LKND-GTLPL-SKDTHSVALIGPWANVTTTMQGNYYGAAPYLTGPLAALQASDLDVNYAF 471

Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVA 534
           G  +++ ++ +   AA  AA+ +D  I   G+D SVEAE +DRE +  PG Q QLI Q++
Sbjct: 472 GT-NISSETTSGFEAALSAARKSDVVIFAGGIDNSVEAEGVDRETITWPGNQLQLIEQLS 530

Query: 535 EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
           E+ K P++++ M  G VD +  + N N+ +++W GYPG+ GG AI D++ GK  P GRL 
Sbjct: 531 ELGK-PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYPGQSGGPAILDILTGKRAPAGRLT 589

Query: 595 ITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS-- 652
           +T Y  +Y    P T M LRP  S   PG+TY +Y G  +Y FG+GL YT F+ +L +  
Sbjct: 590 VTQYPAEYALQFPATDMSLRPKGS--NPGQTYMWYTGKPVYEFGHGLFYTTFETSLANSH 647

Query: 653 -FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSD 711
                   ++ KL    N  Y              N +    +  + ++ +N G+     
Sbjct: 648 GANNGASFDIVKLLSRSNAGY--------------NVIEQVPFMNYTIEVENTGTVTSDY 693

Query: 712 VVIVYSKPPAEIAATYIKQVIGFQRV 737
             + +    A  +    K ++GF R+
Sbjct: 694 TAMAFVNTKAGPSPHPNKWLVGFDRL 719


>gi|358365439|dbj|GAA82061.1| beta-xylosidase [Aspergillus kawachii IFO 4308]
          Length = 788

 Score =  410 bits (1055), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 261/699 (37%), Positives = 377/699 (53%), Gaps = 48/699 (6%)

Query: 68  LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV--IPGATSF 125
           L+S  TLDE +   G+   GV RLGLP Y+ WSEALHG+       +F D      ATSF
Sbjct: 66  LISLFTLDELIANTGNTGLGVSRLGLPAYQVWSEALHGLDRA----NFSDSGSYNWATSF 121

Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPG 185
           P  ILTTA+ N +L  +I   +ST+ RA  N GR GL  ++PNIN  R P WGR  ETPG
Sbjct: 122 PQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVYAPNINTFRHPVWGRGQETPG 181

Query: 186 EDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFD 245
           ED  +   YA  Y+ G+Q  +   N        LK+++  KHYA YD++NW    R   D
Sbjct: 182 EDVSLAAVYAYEYITGIQGPDPDSN--------LKLAATAKHYAGYDIENWHNHSRLGND 233

Query: 246 ARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL--H 303
             +T+QD+ E +   F +  ++    SVMC+YN VNG+P+CAD   L   +R  +    H
Sbjct: 234 MNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPACADSYFLQTLLRDTFGFVDH 293

Query: 304 GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKET 363
           GY+ +DCD+   + + H + + S+  A A+ + AG D+DCG  Y      ++  G +   
Sbjct: 294 GYVSSDCDAAYNIYNPHGYAS-SQAAAAAEAILAGTDIDCGTTYQWHLNESITAGDLSRD 352

Query: 364 DIDKSLKYLYTVLMRLGFFDGSPQ-----YVSLGKQDICSDENIELAAEAAREGIVLLKN 418
           DI+K +  LYT L++ G+FD +       Y  L   D+   +   ++ +AA +GIVLLKN
Sbjct: 353 DIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLETDAWNISYQAATQGIVLLKN 412

Query: 419 DQNTLPLNSAKV----KTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF--SGYANVTY 472
             N LPL          TVA++GP ANAT  ++GNY G     +SP A F  +GY  V +
Sbjct: 413 SNNVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNAPYMISPRAAFEEAGY-KVNF 471

Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQ 532
             G   ++  S +   AA  AA++AD  I   G+D ++EAE+LDRE +  PG Q  LI +
Sbjct: 472 AEGT-GISSTSTSGFAAALSAARSADVIIYAGGIDNTLEAEALDRESIAWPGNQLDLIQK 530

Query: 533 VAEVA-KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGG 591
           +A  A   P+I++ M  G VD +  + NTN+ A+LW GYPG+ GG A+ D++ GK NP G
Sbjct: 531 LASSAGSKPLIVLQMGGGQVDSSSLKNNTNVTALLWGGYPGQSGGFALRDIITGKKNPAG 590

Query: 592 RLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLL 651
           RL  T Y   Y +  P T M LRP      PG+TYK+Y G  +Y FG+GL YT F  +  
Sbjct: 591 RLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYTGEAVYEFGHGLFYTTFAES-S 647

Query: 652 SFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSD 711
           S T T +V LN +Q   +  +   AS T+ P              F  + +N G  +   
Sbjct: 648 SNTTTKEVKLN-IQDILSQTHEELASITQLP-----------VLNFTANIKNTGKLESDY 695

Query: 712 VVIVYSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIK 748
             +V++       A Y +K ++G+ R+  V+ G  + ++
Sbjct: 696 TAMVFANTSDAGPAPYPVKWLVGWDRLGDVKVGETRELR 734


>gi|238483831|ref|XP_002373154.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
 gi|292495283|sp|B8MYV0.1|XYND_ASPFN RecName: Full=Probable exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|220701204|gb|EED57542.1| beta-xylosidase XylA [Aspergillus flavus NRRL3357]
          Length = 797

 Score =  410 bits (1053), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 262/764 (34%), Positives = 393/764 (51%), Gaps = 47/764 (6%)

Query: 25  VDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF 84
           ++  G+S P   C+ G  SK        L CD+S     R   LVS +T +E V    + 
Sbjct: 42  LETGGTSFPD--CESGPLSKT-------LVCDTSAKPHDRAAALVSLLTFEELVNNTANT 92

Query: 85  AHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKK 142
            HG PR+GLP Y+ W+EALHGV++      F D      +TSFP  I T A+ N +L  +
Sbjct: 93  GHGAPRIGLPAYQVWNEALHGVAHA----DFSDAGDFSWSTSFPQPISTMAALNRTLIHQ 148

Query: 143 IGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPF-VVGRYAVNYVRG 201
           I   +ST+ RA  N GR GL  +SPNIN  R P WGR  ETPGED + +   YA  Y+ G
Sbjct: 149 IATIISTQGRAFMNAGRYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITG 208

Query: 202 LQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
           +Q          +++ PLK+ +  KHYA YD++NW    R   D ++T+QD+ E +   F
Sbjct: 209 IQG--------GVDANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQF 260

Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDN 319
            +  ++    SVMCSYN VNG+PSC++   L   +R  +D    GY+  DC ++  + + 
Sbjct: 261 LVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNP 320

Query: 320 HKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRL 379
           H + A ++  A A +++AG D+DCG  Y      +    +V   D+++ +  LY  L+R 
Sbjct: 321 HGY-ATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRA 379

Query: 380 GFFDG-SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL-NSAKVKTVAVVG 437
           G+FDG +  Y ++   D+ S     L+ EAA + IVLLKND   LPL  S+  KT+A++G
Sbjct: 380 GYFDGKTSPYRNITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTTSSSTKTIALIG 438

Query: 438 PHANATVAMIGNYAGIPCRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAK 495
           P ANAT  M+GNY G     +SP+  F  S Y  +TY  G +      + S   A   AK
Sbjct: 439 PWANATTQMLGNYYGPAPYLISPLQAFQDSEY-KITYTIGTNTTTDPDSTSQSTALTTAK 497

Query: 496 TADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAF 555
            AD  I   G+D ++E E+ DR ++  P  Q  LI ++A++ K P+I++ M  G VD + 
Sbjct: 498 EADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSSA 556

Query: 556 AETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP 615
            + N N+ A++W GYPG+ GG+A+AD++ GK  P  RL  T Y  +Y ++ P   M LRP
Sbjct: 557 LKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLRP 616

Query: 616 VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
             S   PG+TY +Y G  +Y FG+GL YT F  +  + + T        ++  + N    
Sbjct: 617 NGS--NPGQTYMWYTGTPVYEFGHGLFYTNFTASASAGSGT--------KNRTSFNIDEV 666

Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
             +      LV  +       F VD +N G        + +    A  A    K ++GF 
Sbjct: 667 LGRPHPGYKLVEQMPL---LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGFD 723

Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           R+      + +   +     SL   D   N +L  G + + + N
Sbjct: 724 RLSAVEPGSAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNN 767


>gi|171678585|ref|XP_001904242.1| hypothetical protein [Podospora anserina S mat+]
 gi|170937362|emb|CAP62020.1| unnamed protein product [Podospora anserina S mat+]
          Length = 800

 Score =  409 bits (1052), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 262/744 (35%), Positives = 399/744 (53%), Gaps = 50/744 (6%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +S+   C+++L    R   LV+ +T +EK+Q +   + G PR+GLP Y WWSEALHGV+ 
Sbjct: 34  LSTNQVCNTTLSPPERAAALVAALTPEEKLQNIVSKSLGAPRIGLPAYNWWSEALHGVA- 92

Query: 109 VGPGTHF---DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
             PGT F   D     +TSFP  +L  A+F++ L +KI + +  E RA  N G +GL YW
Sbjct: 93  YAPGTQFWQGDGPFNSSTSFPMPLLMAATFDDELLEKIAEVIGIEGRAFGNAGFSGLDYW 152

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PN+N  +DPRWGR +ETPGED  +V RYA   ++GL+          +  +  +V + C
Sbjct: 153 TPNVNPFKDPRWGRGSETPGEDVLLVKRYAAAMIKGLEG--------PVPEKERRVVATC 204

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYAA D ++W G  R++F+A+++ QDM E +  PF+ CV++    S+MC+YN VNG+PS
Sbjct: 205 KHYAANDFEDWNGATRHNFNAKISLQDMAEYYFMPFQQCVRDSRVGSIMCAYNAVNGVPS 264

Query: 286 CADPKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           CA P LL   +R  W+    + YI +DC+++  +  NHK+ A + E   A + +AG+D  
Sbjct: 265 CASPYLLQTILREHWNWTEHNNYITSDCEAVLDVSLNHKYAATNAE-GTAISFEAGMDTS 323

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDEN 401
           C    ++    A  QG +KE+ +D++L  LY  ++R G+FDG    Y SLG  D+     
Sbjct: 324 CEYEGSSDIPGAWSQGLLKESTVDRALLRLYEGIVRAGYFDGKQSLYSSLGWADVNKPSA 383

Query: 402 IELAAEAAREGIVLLKNDQNTLP----LNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
            +L+ +AA +G VLLKND  TLP    L+ ++ K VA++G  ++A   + G Y+G    Y
Sbjct: 384 QKLSLQAAVDGTVLLKND-GTLPLSDLLDKSRPKKVAMIGFWSDAKDKLRGGYSGT-AAY 441

Query: 458 MSPIAGFSGYANVTYKTGCDDVA---CKSNNSIF-AASEAAKTADATIILAGLDLSVEAE 513
           +   A  +    + + T    +      SN S    A  AAK AD  +   G+D S   E
Sbjct: 442 LHTPAYAASQLGIPFSTASGPILHSDLASNQSWTDNAMAAAKDADYILYFGGIDTSAAGE 501

Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
           + DR DL  PG Q  LIN +  ++K P+I++ M    +D     +N  I AILWA +PG+
Sbjct: 502 TKDRYDLDWPGAQLSLINLLTTLSK-PLIVLQM-GDQLDNTPLLSNPKINAILWANWPGQ 559

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
           +GG A+ ++V G  +P GRLP+T Y  ++ +++P+T M LRP       GRTY++Y  P 
Sbjct: 560 DGGTAVMELVTGLKSPAGRLPVTQYPSNFTELVPMTDMALRPSAGNSQLGRTYRWYKTP- 618

Query: 634 LYPFGYGLSYTQFKYNL-LSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD 692
           +  FG+GL YT F       F   I V+   L+ C       D     CP   + DL   
Sbjct: 619 VQAFGFGLHYTTFSPKFGKKFPAVIDVD-EVLEGC------DDKYLDTCP---LPDL--- 665

Query: 693 DYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVF 751
                 V  +N G+     V + +   P      + IK +  F R+    G  KR   + 
Sbjct: 666 -----PVVVENRGNRTSDYVALAFVSAPGVGPGPWPIKTLGAFTRLRGVKGGEKREGGLK 720

Query: 752 NACKSLNIVDYAANTLLPAGEHTI 775
               +L   D   NT++  G++ +
Sbjct: 721 WNLGNLARHDEEGNTVVYPGKYEV 744


>gi|253579611|ref|ZP_04856880.1| glycoside hydrolase, family 3 domain-containing protein
           [Ruminococcus sp. 5_1_39B_FAA]
 gi|251849112|gb|EES77073.1| glycoside hydrolase, family 3 domain-containing protein
           [Ruminococcus sp. 5_1_39BFAA]
          Length = 706

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 273/749 (36%), Positives = 382/749 (51%), Gaps = 102/749 (13%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           + + LVS+MTL EK  QL   A  V RLG+P Y +W+EALHGV+  G           AT
Sbjct: 14  KAEKLVSQMTLLEKASQLKYDAAPVKRLGVPAYNYWNEALHGVARAGV----------AT 63

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
            FP  I   A F++   KK+G  ++TE RA YN   A        GLT+WSPN+N+ RDP
Sbjct: 64  MFPQAIAMAAVFDDEEMKKVGDIIATEGRAKYNAYSAKEDRDIYKGLTFWSPNVNIFRDP 123

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDP++  R  V +V G+Q           +   +K ++C KHYA   V +
Sbjct: 124 RWGRGHETYGEDPYLTSRLGVKFVEGIQG----------DGPVMKAAACAKHYA---VHS 170

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
                R+ FDA+ + +DM ET+L  FE  V E D  +VM +YNR NG P CA   L+   
Sbjct: 171 GPESLRHEFDAQASMKDMWETYLPAFEALVTEADVEAVMGAYNRTNGEPCCAHKYLMEDV 230

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
           +RG+W   G+  +DC +I+   ++H   +  ++ A A  L AG DL+CG  Y +  G A 
Sbjct: 231 LRGKWKFEGHYTSDCWAIRDFHEHHMVTSTPRQSA-AMALNAGCDLNCGNTYLHMMG-AY 288

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
           Q G V E  I +S   L T    LG FDGS +Y  +    +   E+I+ A + AR+  VL
Sbjct: 289 QDGLVTEEKITESAVRLLTTRYLLGLFDGS-EYDKIPYSVVECKEHIDEALKMARKSCVL 347

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA----NVT 471
           LKND   LP++  KV T+ V+GP+A++  A+IGNY G    Y++ + G    A     + 
Sbjct: 348 LKND-GVLPIDKTKVNTIGVIGPNADSRAALIGNYHGTSSEYITVLEGIREEAGDDVRIL 406

Query: 472 YKTGCDDVACKSNN------SIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
           Y  GCD    K  N       I  A   A+ +D  I+  GL+ ++E E         S D
Sbjct: 407 YSQGCDLYKDKVENLAWDQDRISEAVITAENSDVVILCVGLNETLEGEEGDTGNSDASGD 466

Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
           + DL LP  Q +LI +V  V K P I+V+M+   +D+ +A+ N N   IL A YPG  GG
Sbjct: 467 KVDLHLPKVQEELIEKVTAVGK-PTIVVLMAGSAIDLNYAQDNCN--GILLAWYPGARGG 523

Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
           RAIAD++FGK +P G+LPIT+Y  D   M   T   ++         RTY++     LYP
Sbjct: 524 RAIADLLFGKESPSGKLPITFYK-DLEGMPEFTDYSMK--------NRTYRYMEKEALYP 574

Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           FGYGL+Y                              SD   T     +V ++  +    
Sbjct: 575 FGYGLTY------------------------------SDTCVTEAE--VVGEVSAESDIV 602

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
            K   +N G+ D  +VV VY K      A     + GF+RV ++AG  K ++F  +  K+
Sbjct: 603 LKATVKNNGTVDTDEVVQVYIKDLDSPLAVRNYSLCGFKRVSLKAGEEKSVEFTISN-KA 661

Query: 757 LNIVDYAANTLLPAGEHTIFVGNGGVSFP 785
           +NIVD   N  + AG+H  F    GVS P
Sbjct: 662 MNIVDEDGNRYI-AGKH--FRLFAGVSQP 687


>gi|261368518|ref|ZP_05981401.1| beta-glucosidase [Subdoligranulum variabile DSM 15176]
 gi|282569400|gb|EFB74935.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Subdoligranulum variabile DSM 15176]
          Length = 717

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 263/748 (35%), Positives = 380/748 (50%), Gaps = 102/748 (13%)

Query: 61  YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
           Y  R + LV++MTL EK+ Q+  +A  +PRLG+P Y WW+E +HGV   G          
Sbjct: 11  YRERARALVAQMTLKEKISQMLSWAPAIPRLGIPAYNWWNEGIHGVGRAGT--------- 61

Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVA 172
            AT FP  I   ASF+E L  ++G+AV  EAR  YN+ R+        GLT W+PN+N+ 
Sbjct: 62  -ATVFPQAIGLAASFDEDLLGQVGEAVGVEARGKYNMYRSYQDRDIYKGLTIWAPNVNIF 120

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           RDPRWGR  ET GEDP++  R  V +V G+Q           +   L+ ++C KH+A   
Sbjct: 121 RDPRWGRGHETYGEDPYLTSRLGVRFVEGMQGD---------DPDYLRAAACAKHFA--- 168

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
           V +     R++FDA+V++QD+ ET+L  F   VKE    +VM +YNR NG P C    LL
Sbjct: 169 VHSGPEDQRHYFDAKVSQQDLWETYLPAFRALVKEAGVEAVMGAYNRTNGEPCCGSKTLL 228

Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
              +RG+W+  G++ +DC +I+   + H  +     D+VA  +  G DL+CG  Y  +  
Sbjct: 229 VDILRGKWNFQGHVTSDCWAIKDFHEGH-MVTSGPVDSVALAVNNGCDLNCGDLYA-YLE 286

Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAR 410
            AV +GKVKE  ID+SL  L+T  M+LG FD   +  Y  +G   + S E   L  E A 
Sbjct: 287 EAVAEGKVKEETIDRSLVRLFTTRMKLGMFDAEEKVPYNKIGYDAVDSREMQALNLEVAE 346

Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY--- 467
           + +VLLKN+ +TLPL+ +K+  VAVVGP+A+   A++GNY G   RY++ + G   Y   
Sbjct: 347 KILVLLKNENHTLPLDKSKLHRVAVVGPNADNRKALVGNYEGTASRYVTVLDGIQEYLGE 406

Query: 468 -ANVTYKTGCDDVA------CKSNNSIFAASEAAKTADATIILAGLDLSVEAE------- 513
              V Y  GC   A       KSN  I          D  I   GLD  +E E       
Sbjct: 407 DVQVRYSEGCHLYADKIQGLAKSNELISEVRGVCAECDVVICCLGLDAGLEGEEGDQGNQ 466

Query: 514 --SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
             S D++ L LPG Q  ++    E  K PV++V++S  G  +A         A+L A YP
Sbjct: 467 FASGDKQSLSLPGNQESVLKACIESGK-PVVVVVLS--GSALALGTAQEGAAAVLQAWYP 523

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP-LTSMPLRPVDSLGYPGRTYKFYN 630
           G +GGRA+A  +FG+ NP G+LP+T+Y+ D  + LP  T   ++        GRTY++  
Sbjct: 524 GAQGGRAVARALFGECNPQGKLPVTFYHSD--EDLPAFTDYAMK--------GRTYRYME 573

Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
              LYPFGYGLSY+ F +                      +  +DA++    GV      
Sbjct: 574 KEPLYPFGYGLSYSHFTFR---------------------DAKADAAQIGPDGV------ 606

Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
                + +V   N G   G + V VY K  AE   T   Q+    +V +  G  K +   
Sbjct: 607 -----DVRVTVVNDGQYRGRETVEVYVK--AERPGTPNAQLKALAKVDLMPGEEKCVTLH 659

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVG 778
              C      +   + +LP GE+T+++G
Sbjct: 660 LPQCAFALCNEEGISEVLP-GEYTVWLG 686


>gi|329745495|gb|AEB98984.1| xylosidase precursor [synthetic construct]
          Length = 804

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 264/720 (36%), Positives = 383/720 (53%), Gaps = 52/720 (7%)

Query: 49  MSSFLFCD-SSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           + S L CD S+ PY  R   L+S  TLDE +   G+   GV RLGLP Y+ WSEALHG+ 
Sbjct: 63  LRSHLICDESATPYD-RAASLISLFTLDELIANTGNTGLGVSRLGLPVYQVWSEALHGLD 121

Query: 108 NVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
                 +F D      ATSFP  ILTTA+ N +L  +I   +ST+ RA  N GR GL  +
Sbjct: 122 RA----NFSDSGSYNWATSFPQPILTTAALNRTLIHQIASIISTQGRAFNNAGRYGLDVY 177

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN  R P  GR  ETPGED  +   YA  Y+ G+Q  +   N        LK+++  
Sbjct: 178 APNINTFRHPVRGRGQETPGEDVSLAAVYAYEYITGIQGPDPDSN--------LKLAATA 229

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA YD++NW    R   D  +T+QD+ E +   F +  ++    SVMC+YN VNG+P+
Sbjct: 230 KHYAGYDIENWHNHSRLGNDMNITQQDLSEYYTPQFHVAARDAKVHSVMCAYNAVNGVPA 289

Query: 286 CADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           CAD   L   +R  +    HGY+ +DCD+   + + H + +     A A+ + AG D+DC
Sbjct: 290 CADSYFLQTLLRDTFGFVDHGYVSSDCDAAYNIYNPHGYASSQAA-AAAEAILAGTDIDC 348

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-----YVSLGKQDICS 398
           G  Y      ++  G +   DI+K +  LYT L++ G+FD +       Y  L   D+  
Sbjct: 349 GTTYQWHLNESITAGDLSRDDIEKGVIRLYTTLVQAGYFDSNTTKANNPYRDLTWSDVLE 408

Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKV----KTVAVVGPHANATVAMIGNYAGIP 454
            +   ++ +AA +GIVLLKN    LPL          TVA++GP ANAT  ++GNY G  
Sbjct: 409 TDAWNISYQAATQGIVLLKNSNKVLPLTEKAYPPSNTTVALIGPWANATTQLLGNYYGNA 468

Query: 455 CRYMSPIAGF--SGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
              +SP   F  +GY  N   +TG   ++  + +   AA  AA++AD  I   G+D ++E
Sbjct: 469 PYMISPRVAFEEAGYNVNFAERTG---ISSTNTSGFAAALSAAQSADVIIYAGGIDNTLE 525

Query: 512 AESLDREDLWLPGYQTQLINQVAEVA-KGPVILVIMSAGGVDIAFAETNTNIKAILWAGY 570
           AE+LDRE +  PG Q  LI ++A  A   P+I++ M  G VD +  + NTN+ A+LW GY
Sbjct: 526 AEALDRESIAWPGNQLDLIQKLASSAGSKPLIVLQMGGGQVDSSSLKNNTNVSALLWGGY 585

Query: 571 PGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYN 630
           PG+ GG A+ D++ GK NP GRL  T Y   Y +  P T M LRP      PG+TYK+Y 
Sbjct: 586 PGQSGGFALRDIITGKKNPAGRLVTTQYPASYAEEFPATDMNLRPEGD--NPGQTYKWYT 643

Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
           G  +Y FG+GL YT F  +  S T T ++ LN +Q   +  +   AS T+ P        
Sbjct: 644 GEAVYEFGHGLFYTTFAES-SSNTTTREIKLN-IQDILSQTHEDLASITQLP-------- 693

Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRV-FVRAGRNKRIK 748
                 F  + +N G  +     +V++       A Y +K ++G+ R+  V+ G  + ++
Sbjct: 694 ---VLNFTANIKNTGKVESDYTAMVFANTSDAGPAPYPVKWLVGWDRLGEVKVGETRELR 750


>gi|367032987|ref|XP_003665776.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
 gi|347013048|gb|AEO60531.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
          Length = 835

 Score =  408 bits (1049), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 250/623 (40%), Positives = 353/623 (56%), Gaps = 33/623 (5%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +S    CD +LP + R   LV+ +T +EK+Q L   A G PR+GLP Y WWSEALHGV++
Sbjct: 23  LSDIKVCDRTLPEAERAAALVAALTDEEKLQNLVSKAPGAPRIGLPAYNWWSEALHGVAH 82

Query: 109 VGPGTHFDDVIPG----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
             PGT F D  PG    +TSFP  +L  A+F++ L + +G  + TEARA  N G +GL Y
Sbjct: 83  A-PGTQFRDG-PGDFNSSTSFPMPLLMAAAFDDELIEAVGDVIGTEARAFGNAGWSGLDY 140

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS--RPLKVS 222
           W+PN+N  RDPRWGR +ETPGED   + RYA + +RGL+      ++    S   P +V 
Sbjct: 141 WTPNVNPFRDPRWGRGSETPGEDVVRLKRYAASMIRGLEGRSSSSSSCSFGSGGEPPRVI 200

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S CKHYA  D ++W G  R+ FDA ++ QD+ E +L PF+ C ++    SVMC+YN VNG
Sbjct: 201 STCKHYAGNDFEDWNGTTRHDFDAVISAQDLAEYYLAPFQQCARDSRVGSVMCAYNAVNG 260

Query: 283 IPSCADPKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
           +PSCA+  L+N  +RG W+      Y+ +DC+++ + V  H   AD+  +      +AG+
Sbjct: 261 VPSCANSYLMNTILRGHWNWTEHDNYVTSDCEAV-LDVSAHHHYADTNAEGTGLCFEAGM 319

Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDIC 397
           D  C    ++    A   G +    +D++L  LY  L+R+G+FDG  SP + SLG  D+ 
Sbjct: 320 DTSCEYEGSSDIPGASAGGFLTWPAVDRALTRLYRSLVRVGYFDGPESP-HASLGWADVN 378

Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPL---------NSAKVKTVAVVGPHANATVAMIG 448
             E  ELA  AA EGIVLLKND +TLPL              + VA++G  A+A   + G
Sbjct: 379 RPEAQELALRAAVEGIVLLKNDNDTLPLPLPDDVVVTADGGRRRVAMIGFWADAPDKLFG 438

Query: 449 NYAGIPCRYMSPIAGFSGYA-NVTYKTGC---DDVACKSNNSIFAASEAAKTADATIILA 504
            Y+G P    SP +       NVT   G     D   + +     A EAA  AD  +   
Sbjct: 439 GYSGAPPFARSPASAARQLGWNVTVAGGPVLEGDSDEEEDTWTAPAVEAAADADYIVYFG 498

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           GLD S   E+ DR  +  P  Q  LI+++A + K PV++V M     D    E +  + A
Sbjct: 499 GLDTSAAGETKDRMTIGWPAAQLALISELARLGK-PVVVVQMGDQLDDTPLFELD-GVGA 556

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           +LWA +PG++GG A+  ++ G  +P GRLP+T Y  +Y   +PLT M LRP  S   PGR
Sbjct: 557 VLWANWPGQDGGTAVVRLLSGAESPAGRLPVTQYPANYTDAVPLTDMTLRP--SATNPGR 614

Query: 625 TYKFYNGPTLYPFGYGLSYTQFK 647
           TY++Y  P + PFG+GL YT F+
Sbjct: 615 TYRWYPTP-VRPFGFGLHYTTFR 636


>gi|116197206|ref|XP_001224415.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
 gi|88181114|gb|EAQ88582.1| hypothetical protein CHGG_05201 [Chaetomium globosum CBS 148.51]
          Length = 735

 Score =  407 bits (1045), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 260/705 (36%), Positives = 381/705 (54%), Gaps = 55/705 (7%)

Query: 87  GVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQA 146
           GV RLGL  Y+WW+EALHGV++   G  +      AT FP  I ++A+F++ L ++IG  
Sbjct: 47  GVSRLGLSAYQWWNEALHGVAH-NRGITWGGQFSAATQFPQAITSSAAFDDHLIERIGVI 105

Query: 147 VSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
           +STEARA  N GRA L +W+PN+N  RDPRWGR  ETPGED F   ++A  +V+G+Q  E
Sbjct: 106 ISTEARAFANNGRAHLDFWTPNVNPFRDPRWGRGHETPGEDAFRNKKWAEAFVQGMQGTE 165

Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
                        +V + CKHYAAYD++N     R++FDA+V+ QD+ E +L PF+ C +
Sbjct: 166 STH----------RVIATCKHYAAYDLENSGSTTRFNFDAKVSTQDLAEYYLPPFQQCAR 215

Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW---DLHGYIVADCDSIQVMVD---NH 320
           +    S+MCSYN VNG+P+CA P L++  +R  W   D + Y+V+DCD++  + +    H
Sbjct: 216 DSKVGSIMCSYNAVNGVPACASPYLMDTILRKHWNWTDQNQYVVSDCDAVYYLGNANGGH 275

Query: 321 KFLADSKEDAVAQTLKAGLDLDCGQYYTNFT----GNAVQQGKVKETDIDKSLKYLYTVL 376
           ++   S   A+  +L+AG D  C  + T  T     +A    +  +  +DK++      L
Sbjct: 276 RY-KSSYAAAIGASLEAGCDNMC--WATGGTTPDPASAFNSRQFTQATLDKAMLRQMQGL 332

Query: 377 MRLGFFDG-SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKT-VA 434
           ++ G+FDG +  Y +L   D+ +    + A +AA EGIVLLKND N LPL      T VA
Sbjct: 333 VKAGYFDGPNSLYRNLTAADVNTQVARDTALKAAEEGIVLLKND-NILPLTLGGSNTQVA 391

Query: 435 VVGPHANATVAMIGNYAGIPCRYMSPI-AGFSGYANVTYKTGCDDVACKSNNSIFAASEA 493
           ++G  ANA   M+G Y+G P     P+ A  S    V Y  G      ++N    AA  A
Sbjct: 392 MIGFWANAADKMLGGYSGSPPFSHDPVTAARSMGITVNYVNGP---LTQTNADTSAAVNA 448

Query: 494 AKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDI 553
           A+ +   I   G+D +VE ES DR  +  P  Q  +I ++A+  K PVI+V M    VD 
Sbjct: 449 AQKSSVVIFFGGIDNTVEKESQDRTSIAWPSGQLTMIQRLAQTGK-PVIVVRMGT-HVDD 506

Query: 554 AFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPL 613
               +  N+KAILWAGYPG++GG A+ +++ G  +P GRLP+T Y   Y    P T+M L
Sbjct: 507 TPLLSIPNVKAILWAGYPGQDGGTAVMNLITGLASPAGRLPVTVYPSSYTNQAPYTNMAL 566

Query: 614 RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT 673
           RP  S  YPGRTY++Y  P ++PFG+GL YT F    L F  T  +  + L  C+ + Y 
Sbjct: 567 RPSSS--YPGRTYRWYKDP-VFPFGHGLHYTNFSVAPLDFPATFSI-ADLLASCKGVTYL 622

Query: 674 SDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIG 733
                  CP           +    V   N GS     VV+ +           IK +  
Sbjct: 623 E-----LCP-----------FPSVSVSVTNTGSRASDYVVLGFLAGDFGPTPRPIKSLAT 666

Query: 734 FQRVF-VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
           ++RVF V+ G+ +  +  +   +SL  VD   N +L  G +T+ +
Sbjct: 667 YKRVFDVQPGKTQSAELDWK-LESLARVDGKGNRVLYPGTYTLLL 710


>gi|297745533|emb|CBI40698.3| unnamed protein product [Vitis vinifera]
          Length = 461

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 199/382 (52%), Positives = 270/382 (70%), Gaps = 11/382 (2%)

Query: 154 MYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
           MYN+G AGLT+WSPN+N+ RDPRWGR  ETPGEDP +  +YA  YVRGLQ       + D
Sbjct: 1   MYNVGLAGLTFWSPNVNIFRDPRWGRGQETPGEDPLLSSKYASGYVRGLQ------QSDD 54

Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
            +   LK+++CCKHY AYD+DNWKGVDR+HF+A VT+QDM++TF  PF+ CV +G+ +SV
Sbjct: 55  GSPDRLKIAACCKHYTAYDLDNWKGVDRFHFNAVVTKQDMDDTFQPPFKSCVIDGNVASV 114

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           MCSYN+VNG P+CADP LL+  VRGEW L+GYIV+DCDS+ V  ++  +   + E+A A+
Sbjct: 115 MCSYNQVNGKPACADPDLLSGIVRGEWKLNGYIVSDCDSVDVFYNSQHY-TKTPEEAAAK 173

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVS 390
            + AGLDL+CG +    T  AV+ G V E+ +DK++   +  LMRLGFFDG+P    Y  
Sbjct: 174 AILAGLDLNCGSFLGQHTEAAVKGGLVDESAVDKAVSNNFATLMRLGFFDGNPSKAIYGK 233

Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
           LG +D+C+ E+ ELA EAAR+GI+LLKN + +LPL+   +KT+A++GP+AN T  MIGNY
Sbjct: 234 LGPKDVCTLEHQELAREAARQGIMLLKNSKGSLPLSPTAIKTLAIIGPNANVTKTMIGNY 293

Query: 451 AGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSV 510
            G PC+Y +P+ G       TY +GC +VAC S   I  A + A  ADAT+++ G+D S+
Sbjct: 294 EGTPCKYTTPLQGLMALVATTYLSGCSNVAC-STAQIDEAKKIAAAADATVLIVGIDQSI 352

Query: 511 EAESLDREDLWLPGYQTQLINQ 532
           EAE  DR ++ LPG Q  LI +
Sbjct: 353 EAEGRDRVNIQLPGQQPLLITE 374


>gi|169767016|ref|XP_001817979.1| exo-1,4-beta-xylosidase xlnD [Aspergillus oryzae RIB40]
 gi|121805502|sp|Q2UR38.1|XYND_ASPOR RecName: Full=Exo-1,4-beta-xylosidase xlnD; AltName:
           Full=1,4-beta-D-xylan xylohydrolase xlnD; AltName:
           Full=Beta-xylosidase A; AltName: Full=Beta-xylosidase
           xlnD; AltName: Full=Xylobiase xlnD; Flags: Precursor
 gi|83765834|dbj|BAE55977.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 798

 Score =  405 bits (1040), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 264/768 (34%), Positives = 396/768 (51%), Gaps = 54/768 (7%)

Query: 25  VDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF 84
           ++  G+S P   C+ G  SK        L CD+S     R   LVS +T +E V    + 
Sbjct: 42  LETGGTSFPD--CESGPLSKT-------LVCDTSAKPHDRAAALVSLLTFEELVNNTANT 92

Query: 85  AHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKK 142
            HG PR+GLP Y+ W+EALHGV++      F D      +TSFP  I T A+ N +L  +
Sbjct: 93  GHGAPRIGLPAYQVWNEALHGVAHA----DFSDAGGFSWSTSFPQPISTMAALNRTLIHQ 148

Query: 143 IGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPF-VVGRYAVNYVRG 201
           I   +ST+ RA  N GR GL  +SPNIN  R P WGR  ETPGED + +   YA  Y+ G
Sbjct: 149 IATIISTQGRAFMNAGRYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITG 208

Query: 202 LQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
           +Q          +++ PLK+ +  KHYA YD++NW    R   D ++T+QD+ E +   F
Sbjct: 209 IQG--------GVDANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQF 260

Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDN 319
            +  ++    SVMCSYN VNG+PSC++   L   +R  +D    GY+  DC ++  + + 
Sbjct: 261 LVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNP 320

Query: 320 HKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRL 379
           H + A ++  A A +++AG D+DCG  Y      +    +V   D+++ +  LY  L+R 
Sbjct: 321 HGY-ATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRA 379

Query: 380 GFFDG-SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL--NSAKVKTVAVV 436
           G+FDG +  Y ++   D+ S     L+ EAA + IVLLKND   LPL   S+  KT+A++
Sbjct: 380 GYFDGKTSPYRNITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALI 438

Query: 437 GPHANATVAMIGNYAGIPCRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAA 494
           GP ANAT  M+GNY G     +SP+  F  S Y  +TY  G +      + S   A   A
Sbjct: 439 GPWANATTQMLGNYYGPAPYLISPLQAFQDSEY-KITYTIGTNTTTDPDSTSQSTALTTA 497

Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           K AD  I   G+D ++E E+ DR ++  P  Q  LI ++A++ K P+I++ M  G VD +
Sbjct: 498 KEADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSS 556

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
             + N N+ A++W GYPG+ GG+A+AD++ GK  P  RL  T Y  +Y ++ P   M LR
Sbjct: 557 ALKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLR 616

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT---IQVNLNKLQHCRNLN 671
           P  S   PG+TY +Y G  +Y FG+GL YT F  +  + + T      N++++    +L 
Sbjct: 617 PNGS--NPGQTYMWYTGTPVYEFGHGLFYTNFTASASASSGTKNRTSFNIDEVLGRPHLG 674

Query: 672 YTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQV 731
           Y            LV  +       F VD +N G        + +    A  A    K +
Sbjct: 675 YK-----------LVEQMPL---LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWL 720

Query: 732 IGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           +GF R+      + +   +     SL   D   N +L  G + + + N
Sbjct: 721 VGFDRLSAVEPGSAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNN 768


>gi|449303062|gb|EMC99070.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
           10762]
          Length = 786

 Score =  405 bits (1040), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 237/604 (39%), Positives = 341/604 (56%), Gaps = 25/604 (4%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +S+   C+ SL    R   LV   TL+E     G+ A GVPRLGLP YE W+EALHG+S+
Sbjct: 54  LSTTPVCNRSLSAWDRAHALVQLFTLEELANNTGNTAPGVPRLGLPAYEVWNEALHGISH 113

Query: 109 VGPGTHF--DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
                HF  +     ATSFP+ IL+ AS N +L  +IG  +ST+ RA  N GR GL  ++
Sbjct: 114 ----GHFATNGTWSWATSFPSPILSMASMNRTLINQIGDIISTQGRAFSNAGRYGLDSYA 169

Query: 167 PNINVARDPRWGRITETPGEDPFVVGR-YAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           PNIN  R P WGR  ETPGED F +   YA  Y+ G+Q   G   A        K+ +  
Sbjct: 170 PNINGFRSPVWGRGQETPGEDAFFLSSLYAYEYITGMQG--GKAPAVP------KLVAVP 221

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A YD++NW    R   D  +T+QD+   +   F   ++   A  +MCSYN VNG+PS
Sbjct: 222 KHFAGYDIENWNNNSRLGLDVNITQQDLAGYYTPQFRSAIQNAKALGLMCSYNAVNGVPS 281

Query: 286 CADPKLLNQTVRGEWDL-HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
           C++   L    R  W   +G++ +DCD++  + + H + A++   AVA +L+AG D+DCG
Sbjct: 282 CSNSFFLQTLARDTWGFGNGFVSSDCDAVYNVYNPHGYAANTT-GAVADSLRAGTDIDCG 340

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIE 403
             Y  +   A   G V   DI+ +L   Y+ L+  G+FDG S  Y +LG  D+ + +   
Sbjct: 341 TSYPFYLVPAFNAGLVSRNDIELALTRYYSGLVMQGYFDGNSSLYRNLGWNDVLTTDAWN 400

Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
           ++ EAA EGI LLKND  TLPL S   ++VA++GP ANAT+ + GNY       +SP+  
Sbjct: 401 ISYEAAVEGITLLKND-GTLPL-SKSTRSVALIGPWANATLQLQGNYYAAAPYLISPLQA 458

Query: 464 FSGYANVTYKTGCDDVACKSNNSIFA-ASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
           F   + +T           +N S FA A   A+ +D  I   G+D S+EAE LDR+++  
Sbjct: 459 FRA-SGMTVNFVNGTTISSTNTSGFAEAITLAQQSDVIIYAGGIDNSIEAEGLDRQNITW 517

Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
           PG Q  LI Q+++V K P++++ M  G VD +  + N+ + A++W GYPG+ GG+A+ D+
Sbjct: 518 PGNQLDLIYQLSQVGK-PLVVLQMGGGQVDSSALKNNSKVNALVWGGYPGQSGGQALFDI 576

Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
           + G   P GRL  T Y   Y       +M + PV+  G  G+TY +Y G  +YPFG+GL 
Sbjct: 577 IMGNRAPAGRLVTTQYPASYATSFNQLNMNMAPVN--GSLGQTYMWYTGTPVYPFGHGLF 634

Query: 643 YTQF 646
           YT F
Sbjct: 635 YTNF 638


>gi|367028614|ref|XP_003663591.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
 gi|347010860|gb|AEO58346.1| glycoside hydrolase family 3 protein [Myceliophthora thermophila
           ATCC 42464]
          Length = 760

 Score =  404 bits (1038), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 268/744 (36%), Positives = 397/744 (53%), Gaps = 56/744 (7%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           + S   CD+S     R   LVS M  +EK+  L + + GV RLGL  Y+WW+EALHGV++
Sbjct: 33  LKSNTVCDTSASPGARAAALVSVMNNNEKLANLVNNSPGVSRLGLSAYQWWNEALHGVAH 92

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
              G  +      AT FP  I T+A+F+++L ++IG  +STEARA  N GRA L +W+PN
Sbjct: 93  -NRGITWGGEFSAATQFPQAITTSATFDDALIEQIGTIISTEARAFANNGRAHLDFWTPN 151

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           +N  RDPRWGR  ETPGED F   ++A  +V+G+Q                +V + CKHY
Sbjct: 152 VNPFRDPRWGRGHETPGEDAFKNKKWAEAFVKGMQGPGPTH----------RVIATCKHY 201

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
           AAYD++N     R++FDA+V+ QD+ E +L PF+ C ++    S+MCSYN VN IP+CA+
Sbjct: 202 AAYDLENSGSTTRFNFDAKVSTQDLAEYYLPPFQQCARDSKVGSIMCSYNAVNEIPACAN 261

Query: 289 PKLLNQTVRGEW---DLHGYIVADCDSIQVMVD---NHKFLADSKEDAVAQTLKAGLDLD 342
           P L++  +R  W   D H YIV+DCD++  + +    H++   S   A+  +L+AG D  
Sbjct: 262 PYLMDTILRKHWNWTDEHQYIVSDCDAVYYLGNANGGHRY-KPSYAAAIGASLEAGCDNM 320

Query: 343 CGQYYTNFT----GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDIC 397
           C  + T  T     +A   G+  +T +D ++      L+  G+FDG    Y +L   D+ 
Sbjct: 321 C--WATGGTAPDPASAFNSGQFSQTTLDTAILRQMQGLVLAGYFDGPGGMYRNLSVADVN 378

Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPL--NSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
           +    + A +AA  GIVLLKND   LPL  N +  + VA++G  ANA   M+G Y+G P 
Sbjct: 379 TQTAQDTALKAAEGGIVLLKND-GILPLSVNGSNFQ-VAMIGFWANAADKMLGGYSGSPP 436

Query: 456 RYMSPI-AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
               P+ A  S    V Y  G      + N    AA  AA+ ++A +   G+D +VE ES
Sbjct: 437 FNHDPVTAARSMGITVNYVNG---PLTQPNGDTSAALNAAQKSNAVVFFGGIDNTVEKES 493

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
            DR  +  P  Q  LI ++AE  K PVI+V +    VD     +  N++AILWAGYPG++
Sbjct: 494 QDRTSIEWPSGQLALIRRLAETGK-PVIVVRLGT-HVDDTPLLSIPNVRAILWAGYPGQD 551

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
           GG A+  ++ G  +P GRLP T Y   Y    P T+M LRP  S  YPGRTY++Y+   +
Sbjct: 552 GGTAVVKIITGLASPAGRLPATVYPSSYTSQAPFTNMALRPSSS--YPGRTYRWYSN-AV 608

Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
           +PFG+GL YT F  ++  F  +  +  + L  C +    S A    CP           +
Sbjct: 609 FPFGHGLHYTNFSVSVRDFPASFAI-ADLLASCGD----SVAYLDLCP-----------F 652

Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNA 753
               ++  N G+     V + +       +   IK +  ++RVF +  G  +  +  +  
Sbjct: 653 PSVSLNVTNTGTRVSDYVALGFLSGDFGPSPHPIKTLATYKRVFNIEPGETQVAELDWK- 711

Query: 754 CKSLNIVDYAANTLLPAGEHTIFV 777
            +SL  VD   N +L  G +T+ V
Sbjct: 712 LESLVRVDEKGNRVLYPGTYTLLV 735


>gi|288870210|ref|ZP_06113312.2| beta-glucosidase [Clostridium hathewayi DSM 13479]
 gi|288868024|gb|EFD00323.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
          Length = 730

 Score =  404 bits (1037), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 245/651 (37%), Positives = 361/651 (55%), Gaps = 72/651 (11%)

Query: 44  KLGLQMSSFLFCDSSLPYSIRVKD--LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE 101
           K G QM      +++L    R K   LV +MTL+EKV Q  + A  + RLG+  Y WW+E
Sbjct: 2   KRGFQMKETSEKETALDRQRREKAEYLVKQMTLEEKVFQTMNQAPAIERLGIKAYNWWNE 61

Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR-- 159
            LHGV+  G           AT FP  I   A+F+E L + +G+AVSTEARA Y++ +  
Sbjct: 62  GLHGVARAGV----------ATIFPQAIGLAATFDEDLIETVGEAVSTEARAKYHMQQRY 111

Query: 160 ------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
                  GLT W+PNIN+ RDPRWGR  ET GEDP++  R  + Y+RGLQ    HE    
Sbjct: 112 GDTDIYKGLTLWAPNINIFRDPRWGRGHETYGEDPWLTSRLGIRYIRGLQG--SHE---- 165

Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
              + LK ++C KH+A   V +     R+ FDA V+E+D+ ET+L  FE CVK+GD  +V
Sbjct: 166 ---KYLKTAACVKHFA---VHSGPEELRHSFDAEVSEKDLRETYLPAFEACVKDGDVEAV 219

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           M +YNRVNG+P C +  LL   +R EW  HG++V+DC +I+   + H  + DS  ++V+ 
Sbjct: 220 MGAYNRVNGVPCCGNEYLLETILRKEWGFHGHVVSDCWAIKDFHEGHG-VTDSPVESVSM 278

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVS 390
            +  G DL+CG  +T     AV++GKVKE  +D+++  L+T  ++LG      +   Y  
Sbjct: 279 AMNHGCDLNCGNLFTYLI-QAVKEGKVKEERLDEAVIRLFTTRLKLGALGKMEEDDPYAG 337

Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
           +   ++ S    +L   AA + +VLLKN +  LP+++ + KT+ V+GP+A++  A++GNY
Sbjct: 338 ISYLEVDSPAMKKLNRSAAGKSVVLLKNTEGLLPIDTKRYKTIGVIGPNADSRRALVGNY 397

Query: 451 AGIPCRYMSPIAGFSG----YANVTYKTGCDDVACKSNNSIFAA-----SEA---AKTAD 498
            G    Y++ + G        A V Y  GC     KSN S   A     SE     + +D
Sbjct: 398 EGTASEYVTVLEGIREAAEPEARVLYSEGCH--LYKSNVSGLGARNDRLSEVKGICRESD 455

Query: 499 ATIILAGLDLSVEAES---------LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
             I   GLD ++E E           D+ DL LPG Q +++    +  K PV+LV+++  
Sbjct: 456 IVIACMGLDSTLEGEQGDTGNIYAGGDKPDLMLPGLQQKILETAYDSGK-PVVLVLLAGS 514

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
            + + +A  + ++ AIL A YPG EGGR +ADV+FG  NP GRLP+T+Y          T
Sbjct: 515 AMAVTWA--DEHLPAILTAWYPGAEGGRGVADVLFGTVNPEGRLPVTFYR---------T 563

Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVN 660
           +  L    +    GRTY+F     LYPFG+GLSYT+F  + L  ++   V+
Sbjct: 564 TEELPDFTNYSMEGRTYRFMKQKALYPFGFGLSYTEFSCSGLEVSERDSVD 614


>gi|218186207|gb|EEC68634.1| hypothetical protein OsI_37026 [Oryza sativa Indica Group]
          Length = 1241

 Score =  404 bits (1037), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 198/325 (60%), Positives = 241/325 (74%), Gaps = 17/325 (5%)

Query: 145  QAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD 204
            QAVSTEARAMYN+G+ GLTYWSPNINV RDPRWGR  ETPGEDP+VVGRYAVN+VRG+QD
Sbjct: 916  QAVSTEARAMYNMGKGGLTYWSPNINVVRDPRWGRALETPGEDPYVVGRYAVNFVRGMQD 975

Query: 205  VEGHENAT---DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
            + GHE      D N+RPLK S+CCKHYAAYD+D+W    R+ FDARV E+DM ETF RPF
Sbjct: 976  IPGHEAVAAGGDPNTRPLKTSACCKHYAAYDLDDWHNHTRFEFDARVDERDMVETFQRPF 1035

Query: 262  EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHK 321
            EMCV++GD SSVMCSYNRVNGIP+CAD +LL+QT+R +W LHGYIV+DCD+++VM DN  
Sbjct: 1036 EMCVRDGDVSSVMCSYNRVNGIPACADARLLSQTIRRDWGLHGYIVSDCDAVRVMTDNAT 1095

Query: 322  FLADSKEDAVAQTLKAGLDLDCGQYYTNFTG-------------NAVQQGKVKETDIDKS 368
            +L  +  +A A  LKAGLDLDCG+ + N T               AV +GK++E+DID +
Sbjct: 1096 WLGYTGAEASAAALKAGLDLDCGESWKNETDGHPLMDFLTTYGMEAVNKGKMRESDIDNA 1155

Query: 369  LKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSA 428
            L   Y  LMRLG+FD   QY SLG+QDIC+D++  LA + AR+GIVLLKND   LPL++ 
Sbjct: 1156 LTNQYMTLMRLGYFDDIAQYSSLGRQDICTDQHKTLALDGARQGIVLLKNDNKLLPLDAN 1215

Query: 429  KVKTVAVVGPHANA-TVAMIGNYAG 452
            KV  V V GPH  A    M G+Y G
Sbjct: 1216 KVGFVNVRGPHVQAPEKIMDGDYTG 1240


>gi|336435507|ref|ZP_08615222.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
           1_4_56FAA]
 gi|336000960|gb|EGN31106.1| hypothetical protein HMPREF0988_00807 [Lachnospiraceae bacterium
           1_4_56FAA]
          Length = 717

 Score =  402 bits (1033), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 254/742 (34%), Positives = 390/742 (52%), Gaps = 89/742 (11%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           + ++LV +MTL EK  QL   A  +PRL +P Y WW+E+LHGV+  G           AT
Sbjct: 13  QAEELVDQMTLMEKASQLRYDAPAIPRLHIPAYNWWNESLHGVARGGT----------AT 62

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG--------RAGLTYWSPNINVARDP 175
            FP  I   ASF+  + ++IG+A++ E RA YN            GLT+W+PN+N+ RDP
Sbjct: 63  VFPQAIGLAASFDREMLEEIGEAIALEGRAKYNAAVKLDDRDIYKGLTFWAPNVNIFRDP 122

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDP++  R  V+Y+RGLQ           +   +K ++C KH+A   V +
Sbjct: 123 RWGRGHETYGEDPYLSSRLGVSYIRGLQG----------DGETMKAAACAKHFA---VHS 169

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
                R+ FDA V+E+D+ ET+L  F+ CV+EG   +VM +YN VNG P C    LL + 
Sbjct: 170 GPEALRHEFDAEVSEKDLRETYLPAFQACVQEGHVEAVMGAYNCVNGEPCCGSETLLKKI 229

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
           +R EW   G++V+DC +I+   +NH  +  +   + A  ++AG DL+CG  Y +   +A 
Sbjct: 230 LREEWGFDGHVVSDCWAIKDFHENH-LVTGTPVQSAALAMEAGCDLNCGVTYLHLV-HAC 287

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
           Q+G V E  I ++   L+T    LG FDGS +Y S+    +   E+ +L+  AARE IVL
Sbjct: 288 QEGLVTEAQITEAAIRLFTTRFLLGMFDGS-EYDSVPYTVVECKEHRDLSERAARESIVL 346

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA----NVT 471
           LKN+   LPL+  K+KT+ ++GP+A++  A+IGNY G    Y++ + G          + 
Sbjct: 347 LKNN-GILPLDREKLKTIGIIGPNADSRKALIGNYHGTSSEYITVLEGVRRLVGDEVRIL 405

Query: 472 YKTGCDDVACKSNN------SIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
           Y  GC     K+ N       +  A   A+ +D  I+  GLD ++E E         S D
Sbjct: 406 YSDGCHLYENKTENLAREQDRLSEARIVARESDVVILCLGLDETLEGEEGDTGNSYASGD 465

Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
           + DL LP  Q  L+  VA + K P +L +M+   +D++FAE + +    LW  YPG  GG
Sbjct: 466 KVDLRLPKSQRMLMEAVA-MEKKPTVLCLMAGSDIDLSFAEKHFDAIVDLW--YPGAYGG 522

Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
            A AD++FGK +P G+LPIT+Y    +++LP         +     GRTY++      YP
Sbjct: 523 AAAADILFGKCSPSGKLPITFYES--LEVLP-------SFEDYSMRGRTYRYLEQKAQYP 573

Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           FGYGL+YT+ K   + + +  + ++ ++    N    ++A+   C  V            
Sbjct: 574 FGYGLTYTKMKIRNV-WLENAEKDMKEVTDGEN----AEAAVIVCAEV------------ 616

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
                +N G  D  +V+ +Y +       T    + GF+R+FV  G  K +K   N   +
Sbjct: 617 -----ENCGGMDSQEVLQIYIRDTESEHETPHPHLAGFERIFVEKGVKKLVKIPVNR-SA 670

Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
             +VD +      +G++ IF G
Sbjct: 671 FTVVDESGRRFTDSGKYEIFAG 692


>gi|223945397|gb|ACN26782.1| unknown [Zea mays]
          Length = 516

 Score =  402 bits (1033), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 217/520 (41%), Positives = 313/520 (60%), Gaps = 29/520 (5%)

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           MCSYNRVNG+P+CAD  LL+ T R +W  +GYI +DCD++ ++ D   + A + EDAVA 
Sbjct: 1   MCSYNRVNGVPTCADYNLLSTTARQDWGFYGYITSDCDAVAIIHDAQGY-AKTAEDAVAD 59

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVS 390
            LKAG+D++CG Y  +   +A+QQGK+ E DI+++L  L+ V MRLG F+G P+   Y  
Sbjct: 60  VLKAGMDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGD 119

Query: 391 LGKQDICSDENIELAAEAAREGIVLLKND--QNTLPLNSAKVKTVAVVGPHANATVAMIG 448
           +G   +C+ E+ +LA EAA++GIVLLKND     LPL+   V ++AV+G +AN  + + G
Sbjct: 120 IGPDQVCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRG 179

Query: 449 NYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
           NY G PC  ++P+    GY  + ++  GC+  AC    +I  A +AA +AD+ ++  GLD
Sbjct: 180 NYFGPPCVTVTPLQVLQGYVKDTSFVAGCNSAACNVT-TIPEAVQAASSADSVVLFMGLD 238

Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
              E E +DR DL LPG Q  LI  VA  AK PVILV++  G VD++FA+TN  I AILW
Sbjct: 239 QDQEREEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILW 298

Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
           AGYPGE GG AIA V+FG+ NPGGRLP+TWY  D+ + +P+T M +R   + GYPGRTY+
Sbjct: 299 AGYPGEAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTR-VPMTDMRMRADPATGYPGRTYR 357

Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
           FY GPT++ FGYGLSY+++ +   +       N+  L+          A +    G+   
Sbjct: 358 FYRGPTVFNFGYGLSYSKYSHRFATKPPPTS-NVAGLK----------AVEATAGGMASY 406

Query: 688 DLR------CDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP--AEIAATYIKQVIGFQRVF 738
           D+       CD   F   V  QN G  DG   V+V+ + P   + +     Q+IGFQ + 
Sbjct: 407 DVEAIGSETCDRLKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLH 466

Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           +RA +   ++F  + CK  +        ++  G H + VG
Sbjct: 467 LRATQTAHVEFEVSPCKHFSRATEDGRKVIDQGSHFVMVG 506


>gi|425780840|gb|EKV18836.1| Beta-xylosidase XylA [Penicillium digitatum PHI26]
 gi|425783077|gb|EKV20946.1| Beta-xylosidase XylA [Penicillium digitatum Pd1]
          Length = 792

 Score =  402 bits (1033), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 232/609 (38%), Positives = 346/609 (56%), Gaps = 21/609 (3%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +S  + CD++     R   L S  TL+E V   G+    VPRLGLP Y+ WSEALHG+  
Sbjct: 56  LSKTIVCDTTAKPHDRAAALTSMFTLEELVNSTGNVIPAVPRLGLPPYQVWSEALHGLDR 115

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
                  D     ATSFP+ IL  A+ N +L  +IG+ +ST+ RA  N GR GL  ++PN
Sbjct: 116 ANLTESGD--YSWATSFPSPILIMAALNRTLINQIGEIISTQGRAFNNGGRYGLDVYAPN 173

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           IN  R P WGR  ETPGED  +   Y V Y+ G+Q          LN R LK+++  KH+
Sbjct: 174 INSFRHPVWGRGQETPGEDVQLCSIYGVEYITGIQG--------GLNPRDLKLAATAKHF 225

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
           A YD++NW    R   +  ++  D+   +   F   V++    SVM SYN VNG+PS A+
Sbjct: 226 AGYDLENWGNHSRLGNNVAISSFDLASYYTPQFITAVRDARVHSVMSSYNAVNGVPSSAN 285

Query: 289 PKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
             LL   +R  W+    GY+ +DCD++  + + H + + S   A A++++AG D+DCG  
Sbjct: 286 SFLLQTLLRETWNFVEDGYVSSDCDAVFNVFNPHGYAS-SASLAAAKSIQAGTDIDCGAT 344

Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIELA 405
           Y  +   ++   ++  ++I++++   Y+ L+ LG+FDG + +Y  L   D+ + +   ++
Sbjct: 345 YQLYLNESLSHDEISRSEIERAVTRFYSTLVSLGYFDGDNSKYRHLHWPDVVATDAWNIS 404

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF- 464
            EAA EGIVLLKND  TLPL S   ++VA++GP AN T  + GNY G       P+A   
Sbjct: 405 YEAAVEGIVLLKND-GTLPL-SNNTRSVALIGPWANVTTTLQGNYYGAAPYLTGPLAALQ 462

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
           +   +V Y  G  +++  S +   AA  AA  ++  I   G+D +VEAE +DRE +  PG
Sbjct: 463 ASNLDVNYAFGT-NISSDSTSGFEAALSAAGKSEVIIFAGGIDNTVEAEGVDRESITWPG 521

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q QLI Q++++ K P++++ M  G VD +  + N N+ +++W GYPG+ GG AI D++ 
Sbjct: 522 NQLQLIEQLSKLGK-PLVVLQMGGGQVDSSSLKANKNVNSLVWGGYPGQSGGPAILDILT 580

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           GK  P GRL +T Y  +Y    P T M LRP  +   PG+TY +Y G  +Y FG+GL YT
Sbjct: 581 GKRAPAGRLTVTQYPAEYALQFPATDMSLRPKGN--NPGQTYMWYTGKPVYEFGHGLFYT 638

Query: 645 QFKYNLLSF 653
            FK +L  F
Sbjct: 639 TFKVSLAHF 647


>gi|391872736|gb|EIT81831.1| beta-glucosidase-related glycosidase [Aspergillus oryzae 3.042]
          Length = 798

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 262/765 (34%), Positives = 393/765 (51%), Gaps = 48/765 (6%)

Query: 25  VDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF 84
           ++  G+S P   C+ G  SK        L CD+S     R   LVS +T +E V    + 
Sbjct: 42  LETGGTSFPD--CESGPLSKT-------LVCDTSAKPHDRAAALVSLLTFEELVNNTANT 92

Query: 85  AHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKK 142
            HG PR+GLP Y+ W+EALHGV++      F D      +TSFP  I T A+ N +L  +
Sbjct: 93  GHGAPRIGLPAYQVWNEALHGVAHA----DFSDAGDFSWSTSFPQPISTMAALNRTLIHQ 148

Query: 143 IGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPF-VVGRYAVNYVRG 201
           I   +ST+ RA  N GR GL  +SPNIN  R P WGR  ETPGED + +   YA  Y+ G
Sbjct: 149 IATIISTQGRAFMNAGRYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITG 208

Query: 202 LQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
           +Q          +++ PLK+ +  KHYA YD++NW    R   D ++T+QD+ E +   F
Sbjct: 209 IQG--------GVDANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQF 260

Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDN 319
            +  ++    SVMCSYN VNG+PSC++   L   +R  +D    GY+  DC ++  + + 
Sbjct: 261 LVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNP 320

Query: 320 HKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRL 379
           H + A ++  A A +++AG D+DCG  Y      +    +V   D+++ +  LY  L+R 
Sbjct: 321 HGY-ATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVTRLYASLIRA 379

Query: 380 GFFDG-SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL--NSAKVKTVAVV 436
           G+FDG +  Y ++   D+ S     L+ EAA + IVLLKND   LPL   S+  KT+A++
Sbjct: 380 GYFDGKTSPYRNITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALI 438

Query: 437 GPHANATVAMIGNYAGIPCRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAA 494
           GP ANAT  M+GNY G     +SP+  F  S Y  +TY  G +      + S   A   A
Sbjct: 439 GPWANATTQMLGNYYGPAPYLISPLQAFQDSEY-KITYTIGTNTTTDPDSTSQSTALTTA 497

Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           K AD  I   G+D ++E E+ DR ++  P  Q  LI ++A++ K P+I++ M  G VD +
Sbjct: 498 KEADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSS 556

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
             + N N+ A++W GYPG+ GG+A+AD++ GK  P  RL  T Y  +Y ++ P   M LR
Sbjct: 557 ALKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLR 616

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
           P  S   PG+TY +Y G  +Y FG+GL YT F  +  + + T        ++  + N   
Sbjct: 617 PNGS--NPGQTYMWYTGTPVYEFGHGLFYTNFTASASAGSGT--------KNRTSFNIDE 666

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
              +      LV  +       F VD +N G        + +    A  A    K ++GF
Sbjct: 667 VLGRPHPGYKLVEQMPL---LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGF 723

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            R+      + +   +     SL   D   N +L  G + + + N
Sbjct: 724 DRLSAVEPGSAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNN 768


>gi|302669556|ref|YP_003829516.1| beta-xylosidase [Butyrivibrio proteoclasticus B316]
 gi|302394029|gb|ADL32934.1| beta-xylosidase Xyl3A [Butyrivibrio proteoclasticus B316]
          Length = 709

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 258/745 (34%), Positives = 386/745 (51%), Gaps = 104/745 (13%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R K+LV++MT++EK  QL   A  + RLG+P Y WW+EALHGV+  G           AT
Sbjct: 9   RAKELVAKMTVEEKASQLRYDAPAIDRLGIPAYNWWNEALHGVARAGT----------AT 58

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
            FP  I   A+F+E L  ++G+ ++ EARA YN            GLT+W+PN+N+ RDP
Sbjct: 59  MFPQAIGLAAAFDEELMSEVGEVIAEEARAKYNEQSKREDRDIYKGLTFWAPNVNIFRDP 118

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDPF+  R AV +V+ +Q           +   +K ++C KH+A   V +
Sbjct: 119 RWGRGHETYGEDPFLTSRLAVPFVKAMQG----------DGEYMKAAACAKHFA---VHS 165

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
               +R+ FDA+ +++D+EET+L  FE  VKE +  +VM +YNR NG P CA+  L+  T
Sbjct: 166 GPEGERHFFDAKASKKDLEETYLPAFEALVKEAEVEAVMGAYNRTNGEPCCANKPLMVDT 225

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
           +RG+W   G+ V+DC +I+   +NHK +  S E++    L+ G DL+CG  Y +   N V
Sbjct: 226 LRGKWGFQGHFVSDCWAIKDFHENHK-VTSSPEESAKLALEMGCDLNCGCTYQSIM-NGV 283

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
           + G + E  I +S + L+T    LG FD + ++  +  + +   E++ +A  AARE +VL
Sbjct: 284 RAGLIDEKLITESCERLFTTRFLLGMFDKT-EFDEIPYEKVECKEHLAVAKRAARESVVL 342

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
           LKND   LPLN   +KT+ VVGP+AN+ +++IGNY G   RY++ + G          V 
Sbjct: 343 LKND-GLLPLNKDSIKTIGVVGPNANSRLSLIGNYHGTSSRYITVLEGIQDKVGDDVRVL 401

Query: 472 YKTGCDDVACKSNN--------SIFAASEAAKTADATIILAGLDLSVEAE---------S 514
           Y  GCD      +N         +  A   A  +D  +++ GLD ++E E         S
Sbjct: 402 YSEGCDIFQNNISNLADPNLPDRLSEAQAVADHSDVVVVVVGLDENLEGEEGDAGNQFAS 461

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
            D+ +L LP  Q QL+N V +  K P I++ M+   +D++ A+   N  A+L A YPG  
Sbjct: 462 GDKINLNLPLSQRQLLNAVLDCGK-PTIVIDMAGSAIDLSKAQDEAN--AVLQAFYPGAR 518

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
           GG  +AD++FG  +P G+LP+T+Y          ++  L          RTYK++ G  L
Sbjct: 519 GGADVADILFGDVSPSGKLPVTFYK---------SADDLPDFKDYSMKNRTYKYFTGTPL 569

Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
           YPFGYGL+Y                   K  +  N+ Y +DA K                
Sbjct: 570 YPFGYGLTYGDCYV--------------KPDYDFNVKY-ADADKVSGA------------ 602

Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
            E  V   N G  D  +VV +Y K      AT    ++GF+RV V AG   R+       
Sbjct: 603 -EITVTVVNDGKLDTDEVVQLYIKDMDSYFATTNPSLVGFKRVHVPAGGETRV------- 654

Query: 755 KSLNIVDYAANTLLPAGEHTIFVGN 779
            +L + + A  ++   GE  +F  N
Sbjct: 655 -TLTVSEKAFTSVNEEGERAVFGKN 678


>gi|2723496|dbj|BAA24107.1| beta-1,4-xylosidase [Aspergillus oryzae]
          Length = 798

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 262/765 (34%), Positives = 393/765 (51%), Gaps = 48/765 (6%)

Query: 25  VDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF 84
           ++  G+S P   C+ G  SK        L CD+S     R   LVS +T +E V    + 
Sbjct: 42  LETGGTSFPD--CESGPLSKT-------LVCDTSAKPHDRAAALVSLLTFEELVNNTANT 92

Query: 85  AHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKK 142
            HG PR+GLP Y+ W+EALHGV++      F D      +TSFP  I T A+ N +L  +
Sbjct: 93  GHGAPRIGLPAYQVWNEALHGVAHA----DFSDAGDFSWSTSFPQPISTMAALNRTLIHQ 148

Query: 143 IGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPF-VVGRYAVNYVRG 201
           I   +ST+ RA  N GR GL  +SPNIN  R P WGR  ETPGED + +   YA  Y+ G
Sbjct: 149 IATIISTQGRAFMNAGRYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITG 208

Query: 202 LQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
           +Q          +++ PLK+ +  KHYA YD++NW    R   D ++T+QD+ E +   F
Sbjct: 209 IQG--------GVDANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQF 260

Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDN 319
            +  ++    SVMCSYN VNG+PSC++   L   +R  +D    GY+  DC ++  + + 
Sbjct: 261 LVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNP 320

Query: 320 HKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRL 379
           H + A ++  A A +++AG D+DCG  Y      +    +V   D+++ +  LY  L+R 
Sbjct: 321 HGY-ATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRA 379

Query: 380 GFFDG-SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL--NSAKVKTVAVV 436
           G+FDG +  Y ++   D+ S     L+ EAA + IVLLKND   LPL   S+  KT+A++
Sbjct: 380 GYFDGKTSPYRNITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALI 438

Query: 437 GPHANATVAMIGNYAGIPCRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAA 494
           GP ANAT  M+GNY G     +SP+  F  S Y  +TY  G +      + S   A   A
Sbjct: 439 GPWANATTQMLGNYYGPAPYLISPLQAFQDSEY-KITYTIGTNTTTDPDSTSQSTALTTA 497

Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           K AD  I   G+D ++E E+ DR ++  P  Q  LI ++A++ K P+I++ M  G VD +
Sbjct: 498 KEADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSS 556

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
             + N N+ A++W GYPG+ GG+A+AD++ GK  P  RL  T Y  +Y ++ P   M LR
Sbjct: 557 ALKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLR 616

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
           P  S   PG+TY +Y G  +Y FG+GL YT F  +  + + T        ++  + N   
Sbjct: 617 PNGS--NPGQTYMWYTGTPVYEFGHGLFYTNFTASASAGSGT--------KNRTSFNIDE 666

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
              +      LV  +       F VD +N G        + +    A  A    K ++GF
Sbjct: 667 VLGRPHPGYKLVEQMPL---LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGF 723

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            R+      + +   +     SL   D   N +L  G + + + N
Sbjct: 724 DRLSAVEPGSAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNN 768


>gi|336425135|ref|ZP_08605165.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336013044|gb|EGN42933.1| hypothetical protein HMPREF0994_01171 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 705

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 250/711 (35%), Positives = 360/711 (50%), Gaps = 96/711 (13%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           +  +LVS+MTL+EK  QL   A  +PRLG+P Y WW+EALHGV+  G           AT
Sbjct: 10  KAHELVSQMTLEEKASQLRYDAPAIPRLGVPTYNWWNEALHGVARAGV----------AT 59

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYN-LGR-------AGLTYWSPNINVARDP 175
           SFP  I   A+F++ L K +G AV+ E RA YN   R        GLT+WSPN+N+ RDP
Sbjct: 60  SFPQAIAMAAAFDDELLKTVGDAVAAEGRAKYNEYSRHDDRDIYKGLTFWSPNVNIFRDP 119

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDP++  R  V YV GLQ  +        +   +K ++C KH+A   V +
Sbjct: 120 RWGRGHETYGEDPYLTSRLGVAYVEGLQGSQ--------DDDFMKTAACAKHFA---VHS 168

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
                R+ FDA+ +++DM ET+L  FE CVKE    +VM +YNR NG P C  P L+   
Sbjct: 169 GPESVRHEFDAQASKKDMYETYLPAFEACVKEAGVEAVMGAYNRTNGEPCCGSPTLIQNI 228

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
           +R EWD  G+ V+DC +I      H  +  + E++ A  LK+G D++CG  Y +    A 
Sbjct: 229 LREEWDFQGHYVSDCWAI-ADFHMHHMVTKTPEESAALALKSGCDVNCGVTYLHLL-KAY 286

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
           QQG V E +I ++ + L+T    LG FD + +Y  +  + +   E++ELA + A+E +VL
Sbjct: 287 QQGLVTEEEITQAAERLFTTRFLLGCFDKN-EYDDIPYEVVECKEHLELAQKMAKESMVL 345

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
           LKND   LPLN   +KT+ V+GP+A++   ++GNY G   RY++ + G   +      V 
Sbjct: 346 LKND-GILPLNKDGLKTIGVIGPNADSRTPLVGNYHGTSSRYITLLEGIQDFVGEDVRVY 404

Query: 472 YKTGCDDVACK------SNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
           Y  GC     +        + I  A   A+ +D  ++  GLD ++E E         S D
Sbjct: 405 YSEGCHIYKDRVEGLGWKQDRISEALTVAEHSDVVVLCLGLDENLEGEEGDTGNSYASGD 464

Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
           ++DL LP  Q +L+  VA   K PV+L +MS   +D+ FA  + N  AIL   YPG  GG
Sbjct: 465 KKDLELPESQRELLEAVAGCGK-PVVLCMMSGSAIDMQFAAEHVN--AILQVWYPGARGG 521

Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
           +A A+++FG  +P G+LP+T+Y         L   P    +     GRTY++     LYP
Sbjct: 522 KAAAEILFGACSPSGKLPVTFYK-------DLEGFP--AFEDYSMKGRTYRYLEKEPLYP 572

Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           FGYGL+Y Q        T  ++                                      
Sbjct: 573 FGYGLTYGQVCVKAAELTGAVEEGKE--------------------------------LT 600

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
            K   +N G  D  DV+ VY K      A     +  F+RV ++ G    I
Sbjct: 601 IKAMVENSGKYDTDDVIQVYIKDLDSKNAVPNHSLCAFKRVSLKKGEKAEI 651


>gi|238923424|ref|YP_002936940.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
 gi|238875099|gb|ACR74806.1| beta-glucosidase [Eubacterium rectale ATCC 33656]
          Length = 714

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 234/619 (37%), Positives = 348/619 (56%), Gaps = 66/619 (10%)

Query: 66  KDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
           K LVS+MT+DEK+ Q+   +  + RLG+P+Y WW+EALHGV+  G           AT F
Sbjct: 10  KKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------ATVF 59

Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDPRW 177
           P  I   A+F+  L +KIG  VSTE R  +N            GLT+W+PN+N+ RDPRW
Sbjct: 60  PQAIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPRW 119

Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQ-DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNW 236
           GR  ET GEDP++ G+    Y+RGLQ D   H          LK ++C KH+A   V + 
Sbjct: 120 GRGHETYGEDPYLTGKLGCAYIRGLQGDDPDH----------LKSAACAKHFA---VHSG 166

Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
               R+ FDA+ ++ DM +T+L  F+ CVK+    +VM +YNRVNG P+C    LL   +
Sbjct: 167 PEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDIL 226

Query: 297 RGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQ 356
           R E+   G++V+DC +I +    H  + D+ E++ A  +  G DL+CG  + +   +A  
Sbjct: 227 RDEFGFEGHVVSDCWAI-LDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAFLHLK-DAYD 284

Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAAEAAREGIVL 415
           +G V +  I  +++ L  V +RLG     P  Y  +  + +   E++EL+ EAAR  +VL
Sbjct: 285 KGMVSDEAITAAVERLMEVRIRLGMMKDYPSPYEDISYEVVECKEHVELSVEAARRSLVL 344

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
           LKN  N LPL+   VKT+AV+GP+AN+  A+IGNY G   RY++P+ G   Y      V 
Sbjct: 345 LKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGEDTRVL 404

Query: 472 YKTGC----DDVA--CKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
           Y  GC    D V    +  +    A   A+ +D  ++  GLD ++E E         S D
Sbjct: 405 YAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASGD 464

Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
           +  L LPG Q +L+  VA V K PVILV+ +   +D+++AE   ++ AI+ + YPG  GG
Sbjct: 465 KLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAE--EHVDAIIDSWYPGARGG 521

Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
           +A+A+ +FG+++P G+LP+T+Y G         ++P     S+ +  RTY++ N   LYP
Sbjct: 522 KAVAEAIFGEYSPNGKLPVTFYQG-------TENLPEFTDYSMAH--RTYRYTNENVLYP 572

Query: 637 FGYGLSYTQFKYNLLSFTK 655
           FGYGL Y +  Y+ LS  K
Sbjct: 573 FGYGLHYGETNYDGLSVDK 591


>gi|347531439|ref|YP_004838202.1| beta-glucosidase [Roseburia hominis A2-183]
 gi|345501587|gb|AEN96270.1| beta-glucosidase [Roseburia hominis A2-183]
          Length = 716

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 228/620 (36%), Positives = 346/620 (55%), Gaps = 62/620 (10%)

Query: 58  SLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDD 117
           SL      K LV +MTL+EK+ Q+   +  + RL +P Y WW+EALHGV+  G       
Sbjct: 2   SLETKEYAKRLVEQMTLEEKISQMRYESPAIERLHIPAYNWWNEALHGVARSGV------ 55

Query: 118 VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL--GRA------GLTYWSPNI 169
               AT FP  I   A+F+E L +KIG  VSTE RA +    GR       GLT+W+PNI
Sbjct: 56  ----ATMFPQAIALAATFDEELIEKIGDVVSTEGRAKFEAYSGRGDRGIYKGLTFWAPNI 111

Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
           N+ RDPRWGR  ET GEDP +  +    Y+RG+Q           +   LK ++C KH+A
Sbjct: 112 NIFRDPRWGRGHETYGEDPCLTAKLGCAYIRGIQGK---------DPDHLKAAACAKHFA 162

Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
              V +     R+ FDA+V+  D+ +T+L  F+ CVK+    +VM +YNRVNG P+C   
Sbjct: 163 ---VHSGPEALRHEFDAKVSLHDLYDTYLYAFKRCVKDAGVEAVMGAYNRVNGEPACGSK 219

Query: 290 KLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
            LL   +R ++   G++V+DC +I +    H  +  + E++ A  +  G DL+CG+ +  
Sbjct: 220 TLLQDILREQFGFEGHVVSDCWAI-LDFHEHHHVTKTVEESAAMAVNHGCDLNCGKAFL- 277

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAAEA 408
           +   A +QG V+E  I ++++ L  V +RLG  +  P  Y ++    +   E+I L+ EA
Sbjct: 278 YLSRACEQGLVEEKTITEAVERLMDVRIRLGMMEDYPSPYANIPYDVVECPEHIALSLEA 337

Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY- 467
           ++  +VLLKND + LPL   +V T+AV+GP+AN+  A++GNY G   RY++P+ G   Y 
Sbjct: 338 SKRSMVLLKNDNHFLPLKQEQVHTIAVIGPNANSRAALVGNYEGTSSRYITPLEGIQEYT 397

Query: 468 ---ANVTYKTGCD------DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE----- 513
                V Y  GC       +   +  +    A  AA+ AD  ++  GLD  +E E     
Sbjct: 398 GEKTRVLYAQGCHLYKDQVEFLGEPKDRFKEALIAAERADVIVMCLGLDAGIEGEEGDAG 457

Query: 514 ----SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAG 569
               S D+  L LPG Q +L+  VA V K P++L +++   +D+++A+ +  I+AIL   
Sbjct: 458 NEYASGDKLGLKLPGLQQELLEAVAAVGK-PIVLTVLAGSALDLSWAQEHAQIRAILDCW 516

Query: 570 YPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFY 629
           YPG  GG+AIA+ +FG+F+P G+LP+T+Y G   + LP               GRTY++ 
Sbjct: 517 YPGARGGKAIAEALFGEFSPCGKLPVTFYEG--TEFLP-------DFTDYSMAGRTYRYT 567

Query: 630 NGPTLYPFGYGLSYTQFKYN 649
           +   LYPFGYGL+Y+Q +Y+
Sbjct: 568 DRHVLYPFGYGLTYSQIRYS 587


>gi|3135209|dbj|BAA28267.1| beta-xylosidase A [Aspergillus oryzae]
          Length = 798

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 262/765 (34%), Positives = 393/765 (51%), Gaps = 48/765 (6%)

Query: 25  VDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF 84
           ++  G+S P   C+ G  SK        L CD+S     R   LVS +T +E V    + 
Sbjct: 42  LETGGTSFPD--CESGPLSKT-------LVCDTSAKPHDRAAALVSLLTFEELVNNTANT 92

Query: 85  AHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKK 142
            HG PR+GLP Y+ W+EALHGV++      F D      +TSFP  I T A+ N +L  +
Sbjct: 93  GHGAPRIGLPAYQVWNEALHGVAHA----DFSDAGDFSWSTSFPQPISTMAALNRTLIHQ 148

Query: 143 IGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPF-VVGRYAVNYVRG 201
           I   +ST+ RA  N GR GL  +SPNIN  R P WGR  ETPGED + +   YA  Y+ G
Sbjct: 149 IATIISTQGRAFMNAGRYGLDVYSPNINTFRHPVWGRGQETPGEDAYCLASTYAYEYITG 208

Query: 202 LQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
           +Q          +++ PLK+ +  KHYA YD++NW    R   D ++T+QD+ E +   F
Sbjct: 209 IQG--------GVDANPLKLIATAKHYAGYDIENWDNHSRLGNDMQITQQDLAEYYTPQF 260

Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDN 319
            +  ++    SVMCSYN VNG+PSC++   L   +R  +D    GY+  DC ++  + + 
Sbjct: 261 LVASRDAKVHSVMCSYNAVNGVPSCSNSFFLQTLLRDTFDFVEDGYVSGDCGAVYNVFNP 320

Query: 320 HKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRL 379
           H + A ++  A A +++AG D+DCG  Y      +    +V   D+++ +  LY  L+R 
Sbjct: 321 HGY-ATNESSAAADSIRAGTDIDCGVSYPRHFQESFHDQEVSRQDLERGVIRLYASLIRA 379

Query: 380 GFFDG-SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL--NSAKVKTVAVV 436
           G+FDG +  Y ++   D+ S     L+ EAA + IVLLKND   LPL   S+  KT+A++
Sbjct: 380 GYFDGKTSPYRNITWSDVVSTNAQNLSYEAAAQSIVLLKND-GILPLTSTSSSTKTIALI 438

Query: 437 GPHANATVAMIGNYAGIPCRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAA 494
           GP ANAT  M+GNY G     +SP+  F  S Y  +TY  G +      + S   A   A
Sbjct: 439 GPWANATTQMLGNYYGPAPYLISPLQAFQDSEY-KITYTIGTNTTTDPDSTSQSTALTTA 497

Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           K AD  I   G+D ++E E+ DR ++  P  Q  LI ++A++ K P+I++ M  G VD +
Sbjct: 498 KEADLIIFAGGIDNTLETEAQDRSNITWPSNQLSLITKLADLGK-PLIVLQMGGGQVDSS 556

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
             + N N+ A++W GYPG+ GG+A+AD++ GK  P  RL  T Y  +Y ++ P   M LR
Sbjct: 557 ALKNNKNVNALIWGGYPGQSGGQALADIITGKRAPAARLVTTQYPAEYAEVFPAIDMNLR 616

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
           P  S   PG+TY +Y G  +Y FG+GL YT F  +  + + T        ++  + N   
Sbjct: 617 PNGS--NPGQTYMWYTGTPVYEFGHGLFYTNFTASASAGSGT--------KNRTSFNIDE 666

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
              +      LV  +       F VD +N G        + +    A  A    K ++GF
Sbjct: 667 VLGRPHPGYKLVEQMPL---LNFTVDVKNTGDRVSDYTAMAFVNTTAGPAPHPNKWLVGF 723

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            R+      + +   +     SL   D   N +L  G + + + N
Sbjct: 724 DRLSAVEPGSAKTMVIPVTVDSLARTDEEGNRVLYPGRYEVALNN 768


>gi|330836687|ref|YP_004411328.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
 gi|329748590|gb|AEC01946.1| Beta-glucosidase [Sphaerochaeta coccoides DSM 17374]
          Length = 709

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 248/723 (34%), Positives = 376/723 (52%), Gaps = 97/723 (13%)

Query: 66  KDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
           + +VSRMTLDEK+ Q+   A  +PRL +P+Y WW+EALHGV+  G           AT F
Sbjct: 15  RRIVSRMTLDEKISQIDYRASAIPRLDIPEYNWWNEALHGVARAGI----------ATVF 64

Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLG--------RAGLTYWSPNINVARDPRW 177
           P  I   A F+  + ++IG  +STE RA YN            GLT+WSPN+N+ RDPRW
Sbjct: 65  PQAIGLAAMFDSDMMERIGAVISTEGRAKYNEAVRHGDRDIYKGLTFWSPNVNIFRDPRW 124

Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWK 237
           GR  ET GEDP++  R AV ++RG+Q           + + LK ++C KH+A   V +  
Sbjct: 125 GRGQETYGEDPYLTARLAVAFIRGIQG----------DGKYLKAAACAKHFA---VHSGP 171

Query: 238 GVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVR 297
              R+ FDARV+++D+ ET+L  F+  VKE     VM +YNRVNG+P+CA  +LL+  +R
Sbjct: 172 EALRHEFDARVSQKDLHETYLSAFKAAVKEAQVEIVMGAYNRVNGVPACASHELLSDILR 231

Query: 298 GEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQ 357
            EW   G++V+D ++++ +  +H ++AD     +A  LKAG +L  G+   +   ++V +
Sbjct: 232 SEWGFEGHVVSDYEALEDIFKHHHYVADEAH-TMAVALKAGCNLCAGKIARHLR-SSVDE 289

Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
           G + E +I ++++ L+T  + +G       Y S+G ++  + E+ +LA EAA    VLLK
Sbjct: 290 GLISEDEITEAVERLFTTRIMMGMMADDCPYDSIGYEENDTPEHHQLAVEAASRSFVLLK 349

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYK 473
           ND   LPL   K+ ++AV+GP+AN+   + GNY G   RY++ + G          V Y 
Sbjct: 350 ND-GLLPLEMEKISSIAVIGPNANSRKMLEGNYNGTASRYVTVLEGIQDLVGDSVRVWYS 408

Query: 474 TGC------DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLDRE 518
            GC             N+ +  A  AA+ AD  ++  GLD ++E E         S D+ 
Sbjct: 409 EGCHLYKNFHSSLSGRNDRLAEAVSAAQHADVVVLCLGLDATLEGEEGDVEVGFGSGDKP 468

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
           +L LPG Q  L++ +  V K PVIL++ S   + +   E + N+KAIL   YPG  GG+A
Sbjct: 469 NLSLPGRQQLLLDTMLTVGK-PVILLLASGSALTLGGRENDENLKAILQIWYPGAMGGKA 527

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           +ADV+FG+  P G+LP+T+Y          ++  L   +     GRTY++  G  LYPFG
Sbjct: 528 VADVLFGRRAPAGKLPVTFYA---------SADELPAFEDYSMAGRTYRYMKGNALYPFG 578

Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
           YGL+Y+                      C ++     + KT   GV           E  
Sbjct: 579 YGLTYSP---------------------C-SIVSAGISGKTADGGV-----------EIT 605

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
           VD +N G     +VV VY K      A     + GF+R+ +  G       V    ++  
Sbjct: 606 VDIRNDGGRTTEEVVQVYVKDMDSPLAVINHALAGFRRITLAPGEKTSRTIVIEP-EAFT 664

Query: 759 IVD 761
           +VD
Sbjct: 665 VVD 667


>gi|255690205|ref|ZP_05413880.1| xylosidase/arabinosidase [Bacteroides finegoldii DSM 17565]
 gi|260624224|gb|EEX47095.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
            finegoldii DSM 17565]
          Length = 1425

 Score =  400 bits (1029), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 260/757 (34%), Positives = 388/757 (51%), Gaps = 95/757 (12%)

Query: 52   FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
            + F +  L    RV DLVSR+TL+EKV+Q+ + A  + RLG+P Y WW+E LHGV     
Sbjct: 712  YPFRNPQLSIEQRVDDLVSRLTLEEKVRQMLNNAPAIKRLGIPAYNWWNECLHGVGR--- 768

Query: 112  GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLT 163
             T +       T FP  I   AS+N+ L K++  +++ E RA+YN  +          LT
Sbjct: 769  -TKYH-----VTVFPQAIGMAASWNDVLMKEVASSIADEGRAIYNDAQKRGDYSQYHALT 822

Query: 164  YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
            YW+PNIN+ RDPRWGR  ET GEDP++  +    +V GLQ  +          R LK S+
Sbjct: 823  YWTPNINIFRDPRWGRGQETYGEDPYLTSKIGKAFVLGLQGDD---------PRYLKASA 873

Query: 224  CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
            C KHYA   V +    +R+ F++ V+  D+ +T+L  F   V + + S VMC+YN   G 
Sbjct: 874  CAKHYA---VHSGPEKNRHSFNSDVSTYDLWDTYLPAFRTLVVDANVSGVMCAYNAFKGQ 930

Query: 284  PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
            P C +  L+   +R +W+  GY+ +DC +I  + ++HK   D+   A       G DLDC
Sbjct: 931  PCCGNDLLMQSILRDKWNFKGYVTSDCGAIDDIFNHHKAHPDAATAAADAVFH-GTDLDC 989

Query: 344  GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDEN 401
            GQ        AV+ G + E  +D S+K L+T+  RLG FD + Q  Y  +    +   ++
Sbjct: 990  GQSAYLALVKAVKNGIITEKQLDVSVKRLFTIRFRLGLFDPAEQVDYAHIPISVLECKKH 1049

Query: 402  IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
             +LA + ARE +VLLKND+  LPL   K+K V V+GP+A+   A++GNY G P R ++P+
Sbjct: 1050 QDLAKQLARESMVLLKNDR-LLPLQKNKLKKVVVMGPNADCKDALLGNYNGHPSRMLTPL 1108

Query: 462  AG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-- 515
                    G A V Y +G D +   S + +      AK ADA I + G+   +E E +  
Sbjct: 1109 QAIRERLKGVAEVVYVSGIDYINTVSEDELKRYVNQAKGADAVIFIGGISPRLEGEEMSV 1168

Query: 516  --------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
                    DR  + LP  QTQL+  +    + P + V+M+   + I +   +  + AIL 
Sbjct: 1169 NKDGFDGGDRTSIALPTVQTQLMKALV-AGRIPTVFVMMTGSALAIPWEAKH--VPAILN 1225

Query: 568  AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
            A Y G+ GG AIADV+FG +NP G+LP+T+Y  D       + +P    +S    GRTY+
Sbjct: 1226 AWYGGQYGGEAIADVLFGDYNPSGKLPVTFYAKD-------SDLP--DFESYDMQGRTYR 1276

Query: 628  FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
            ++ G  LYPFGYGLSYT F+Y+ L                           T C      
Sbjct: 1277 YFKGKALYPFGYGLSYTDFRYSSLKMP------------------------TACN----- 1307

Query: 688  DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
                D      V  +N G  DG +VV +Y   P +     +  + GF+R++++AG  K+I
Sbjct: 1308 --TTDKEIPVTVTVKNTGKMDGEEVVQLYVSHPDKKILVPVTALKGFKRIYLKAGEAKQI 1365

Query: 748  KFVFNACKSLNIVDY-AANTLLPAGEHTIFVGNGGVS 783
             F  ++ + L+ VD      +LP    T+ +  GG S
Sbjct: 1366 TFSLSS-EDLSCVDENGIRKVLPG---TVKIQVGGCS 1398


>gi|291528382|emb|CBK93968.1| Beta-glucosidase-related glycosidases [Eubacterium rectale M104/1]
          Length = 714

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 234/619 (37%), Positives = 348/619 (56%), Gaps = 66/619 (10%)

Query: 66  KDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
           K LVS+MT+DEK+ Q+   +  + RLG+P+Y WW+EALHGV+  G           AT F
Sbjct: 10  KKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------ATVF 59

Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDPRW 177
           P  I   A+F+  L +KIG  VSTE R  +N            GLT+W+PN+N+ RDPRW
Sbjct: 60  PQAIGLAAAFDADLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPRW 119

Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQ-DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNW 236
           GR  ET GEDP++ G+    Y+RGLQ D   H          LK ++C KH+A   V + 
Sbjct: 120 GRGHETYGEDPYLTGKLGCAYIRGLQGDDPDH----------LKSAACAKHFA---VHSG 166

Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
               R+ FDA+ ++ DM +T+L  F+ CVK+    +VM +YNRVNG P+C    LL   +
Sbjct: 167 PEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDIL 226

Query: 297 RGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQ 356
           R E+   G++V+DC +I +    H  + D+ E++ A  +  G DL+CG  + +   +A  
Sbjct: 227 RDEFGFEGHVVSDCWAI-LDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAFLHLK-DAYD 284

Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAAEAAREGIVL 415
           +G V +  I  +++ L  V +RLG     P  Y  +  + +   E++EL+ EAAR  +VL
Sbjct: 285 KGLVSDEAITAAVERLMEVRIRLGMMKDYPSPYEDISYEVVECKEHVELSVEAARRSLVL 344

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
           LKN  N LPL+   VKT+AV+GP+AN+  A+IGNY G   RY++P+ G   Y      V 
Sbjct: 345 LKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGDDTRVL 404

Query: 472 YKTGC----DDVA--CKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
           Y  GC    D V    +  +    A   A+ +D  ++  GLD ++E E         S D
Sbjct: 405 YAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASGD 464

Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
           +  L LPG Q +L+  VA V K PVILV+ +   +D+++AE   ++ AI+ + YPG  GG
Sbjct: 465 KLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAE--EHVDAIIDSWYPGARGG 521

Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
           +A+A+ +FG+++P G+LP+T+Y G         ++P     S+ +  RTY++ N   LYP
Sbjct: 522 KAVAEAIFGEYSPSGKLPVTFYQG-------TENLPEFTDYSMAH--RTYRYTNENVLYP 572

Query: 637 FGYGLSYTQFKYNLLSFTK 655
           FGYGL Y +  Y+ LS  K
Sbjct: 573 FGYGLHYGETNYDGLSVDK 591


>gi|358380569|gb|EHK18247.1| glycoside hydrolase family 3 protein, partial [Trichoderma virens
           Gv29-8]
          Length = 722

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 268/724 (37%), Positives = 385/724 (53%), Gaps = 58/724 (8%)

Query: 72  MTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG-------THFDDVIPGATS 124
           +TLDEK   L + A GV RLGLP YEW +EALHG++ V PG       T  +     +T 
Sbjct: 12  LTLDEKAANLVNNAPGVKRLGLPPYEWRNEALHGLAGVSPGQGINSTFTQGNVAFNSSTQ 71

Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
           FP+ I+  A+F++ L   I  AVSTEARA  N  +AGL YW+PNIN  RDPRWGR  ETP
Sbjct: 72  FPSPIVLGAAFDDHLVHDIATAVSTEARAFSNHLKAGLDYWAPNINPYRDPRWGRGQETP 131

Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
           GEDP+ V +YA NYV GL+   G   +        KV S CKH+A YD+++  GV R  +
Sbjct: 132 GEDPYHVAQYAYNYVVGLKGGVGPAKS--------KVVSTCKHFAGYDIEDSDGVVRGSY 183

Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
           +A ++ QD+ E +L  F  C ++    +VMCSYN VNG PSCA+  +L+  +R  W    
Sbjct: 184 NAIISTQDLAEYYLPSFRSCFRDAKTGAVMCSYNAVNGHPSCANSYMLDTVLRDHWGWGS 243

Query: 305 ---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVK 361
              ++  DC ++  + + H  +  S    VA  +  G DLDCG  Y +   +AVQ     
Sbjct: 244 SAHWVTGDCGAVDGVFNQHH-VGQSAAQGVAFAINNGTDLDCGTAYASNIASAVQNNYTT 302

Query: 362 ETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKND 419
           E  +D++L  LY+ L+ LG+FD     +Y +LG  D+ +    +LA  A  EGI      
Sbjct: 303 EAQLDQALSRLYSSLIVLGYFDPPEGQEYRTLGVSDVNTPSTQKLAYTALVEGI------ 356

Query: 420 QNTLPLNSAKVKTVAVVGPHA-NATVAMIGNYAGI-PCRYMS-PIAGFSGYA-NVTYKTG 475
            N LP+     +TV  VGP A NA+V+M GNY G+ P + +  P A  S Y  NVTY  G
Sbjct: 357 -NILPIRPMG-QTVLFVGPWANNASVSMFGNYNGVAPYKTIPVPTANSSAYNWNVTYSQG 414

Query: 476 CDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAE 535
              V     +   AA  AA+ AD  + + G+D  VEAE+ DR  +  PG Q  LI Q+A 
Sbjct: 415 LQYVLSNDTSQFAAAVSAAQEADVVVYIGGIDEQVEAEAHDRTSIDWPGAQLNLIKQLAA 474

Query: 536 VAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPI 595
           V   PV++V +  G VD +    N N+K +LW GYPG+E G  + D++ G   P GRLP+
Sbjct: 475 VK--PVVVVQVGGGQVDDSSLLQNKNVKGLLWMGYPGQEFGSGLIDILSGASAPAGRLPV 532

Query: 596 TWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTK 655
           T Y  +Y+  +P+T   LRP  S   PGRTY++YNG ++ PFG G+ YT+F         
Sbjct: 533 TQYPANYITQVPMTDQSLRPSSS--NPGRTYRWYNG-SVIPFGTGIHYTKFN-------- 581

Query: 656 TIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIV 715
              ++       R    T+D      P  L       ++  F+++ +NVGST    V ++
Sbjct: 582 ---ISWKTGGSGRGTYDTADFINAEDPKDLA------EFDVFQINVENVGSTTSDYVALL 632

Query: 716 YSKPPAEIAATY-IKQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEH 773
           + K        Y +K ++ + R    + G   +I    N  + +   D + N +L  G +
Sbjct: 633 FVKSSDSGPQPYPLKTLVSYARAHGTQPGETTKIDLRVNVGQ-IARNDSSGNLVLYPGAY 691

Query: 774 TIFV 777
           T+ +
Sbjct: 692 TLEI 695


>gi|359409694|ref|ZP_09202159.1| Beta-glucosidase [Clostridium sp. DL-VIII]
 gi|357168578|gb|EHI96752.1| Beta-glucosidase [Clostridium sp. DL-VIII]
          Length = 723

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 257/743 (34%), Positives = 389/743 (52%), Gaps = 103/743 (13%)

Query: 66  KDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
           K+LV++MTL EK +QL   +  V RL +P+Y WW+E LHGV+  G           AT F
Sbjct: 30  KELVAKMTLQEKAEQLTYNSPAVKRLNIPEYNWWNEGLHGVARAGT----------ATVF 79

Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDPRW 177
           P  I   A F+E    K+   ++TE RA YN            GLTYWSPN+N+ RDPRW
Sbjct: 80  PQAIGLAAMFDEEFLGKVAGIIATEGRAKYNENSKKEDRDIYKGLTYWSPNVNIFRDPRW 139

Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWK 237
           GR  ET GEDP++  R  V +V+GLQ           + + LK+S+C KH+A   V +  
Sbjct: 140 GRGHETYGEDPYLTSRLGVAFVKGLQG----------DGKYLKLSACAKHFA---VHSGP 186

Query: 238 GVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVR 297
              R+ F+A V+++D+ ET+L  FE CVKE +  SVM +YNR NG P C    LL   +R
Sbjct: 187 ESLRHEFNAVVSQKDLHETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKALLKDILR 246

Query: 298 GEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQ 357
           G+W   G++V+DC ++     +HK  + + E +VA  ++ G DL+CG  Y N    A ++
Sbjct: 247 GKWGFKGHVVSDCWALADFHMHHKVTSTATE-SVALAIENGCDLNCGNMYLNLL-LAYKE 304

Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
           G V E  I  + + L T   +LG FD   +Y  +  +     E+ +++ EA+R+ +VLLK
Sbjct: 305 GLVTEEQITTAAERLMTTRFKLGMFDEDCEYNQIPYEVNDCKEHNQVSLEASRKSMVLLK 364

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYAN----VTYK 473
           N+   LPL+ +K+K VAV+GP+AN+ + + GNY+G   +Y + + G     +    V Y 
Sbjct: 365 NN-GILPLDKSKLKAVAVIGPNANSEIMLKGNYSGTASKYTTILDGIHDVLDDDVRVYYS 423

Query: 474 TGC-------DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLDR 517
            GC       +D+A + ++ +  A   A+ AD  I+  GLD ++E E         + D+
Sbjct: 424 EGCHLYKEKVEDLA-RRDDRLAEAVSVAERADVVILCLGLDSTIEGEQGDAGNGYGAGDK 482

Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
            DL LPG Q +L+ +V E  K PV++V+ +  G+ +  AE      AIL A YPG  GG 
Sbjct: 483 LDLNLPGIQQELLEKVLETGK-PVVVVLGTGSGLTLNGAEERC--AAILNAWYPGSHGGT 539

Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT-LYP 636
           A AD++FGK +P G+LP+T+Y  D  ++   T   ++        GRTY++ +    LYP
Sbjct: 540 AAADILFGKCSPSGKLPVTFYK-DTDKLPEFTDYAMK--------GRTYRYMDESNCLYP 590

Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD-DYF 695
           FGYGL+Y+                            T + S  + P V     R + D  
Sbjct: 591 FGYGLTYS----------------------------TVELSNLQVPAV-----RGEFDGI 617

Query: 696 EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACK 755
           +  V+ +N GS D  +VV  Y K      A     + GF+RV ++ G +K +    N  +
Sbjct: 618 DISVEIENTGSYDIEEVVQCYIKDLESKYAVLNHSLAGFKRVSLKKGESKTVTMKLNR-R 676

Query: 756 SLNIVDYAANTLLPAGEHTIFVG 778
           +   VD A   +L + +  +FVG
Sbjct: 677 AFEAVDDAGERILDSKKFKLFVG 699


>gi|291525508|emb|CBK91095.1| Beta-glucosidase-related glycosidases [Eubacterium rectale DSM
           17629]
          Length = 714

 Score =  400 bits (1027), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 233/619 (37%), Positives = 348/619 (56%), Gaps = 66/619 (10%)

Query: 66  KDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
           K LVS+MT+DEK+ Q+   +  + RLG+P+Y WW+EALHGV+  G           AT F
Sbjct: 10  KKLVSQMTIDEKISQMLYESPAIERLGIPEYNWWNEALHGVARAGV----------ATVF 59

Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDPRW 177
           P  I   A+F+  L +KIG  VSTE R  +N            GLT+W+PN+N+ RDPRW
Sbjct: 60  PQAIGLAATFDTDLIEKIGDVVSTEGRGKFNEFSKKGDHGIYKGLTFWAPNVNIFRDPRW 119

Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQ-DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNW 236
           GR  ET GEDP++ G+    Y+RGLQ D   H          LK ++C KH+A   V + 
Sbjct: 120 GRGHETYGEDPYLTGKLGCAYIRGLQGDDPDH----------LKSAACAKHFA---VHSG 166

Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
               R+ FDA+ ++ DM +T+L  F+ CVK+    +VM +YNRVNG P+C    LL   +
Sbjct: 167 PEAIRHEFDAKASKHDMYDTYLYAFKRCVKDAKVEAVMGAYNRVNGEPACGSRTLLKDIL 226

Query: 297 RGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQ 356
           R E+   G++V+DC +I +    H  + D+ E++ A  +  G DL+CG  + +   +A  
Sbjct: 227 RDEFGFEGHVVSDCWAI-LDFHEHHHVTDTVEESAAMAVNNGCDLNCGSAFLHLK-DAYD 284

Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAAEAAREGIVL 415
           +G V +  I  +++ L  V +RLG     P  Y  +  + +   E++EL+ EAAR  +VL
Sbjct: 285 KGLVSDEAITAAVERLMEVRIRLGMMKDYPSPYEDISYEVVECKEHVELSVEAARRSLVL 344

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
           LKN  N LPL+   VKT+AV+GP+AN+  A+IGNY G   RY++P+ G   Y      V 
Sbjct: 345 LKNKDNFLPLDRKNVKTIAVIGPNANSRDALIGNYYGTSSRYITPLEGLQQYLGEDTRVL 404

Query: 472 YKTGC----DDVA--CKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
           Y  GC    D V    +  +    A   A+ +D  ++  GLD ++E E         S D
Sbjct: 405 YAEGCHLYKDKVQGLAEEKDRFKEALIMAEQSDVVVMCLGLDATIEGEEGDAGNEYASGD 464

Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
           +  L LPG Q +L+  VA V K PVILV+ +   +D+++AE   ++ AI+ + YPG  GG
Sbjct: 465 KLGLMLPGLQEELLEAVAAVGK-PVILVLSAGSAIDLSWAE--EHVDAIIDSWYPGARGG 521

Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
           +A+A+ +FG+++P G+LP+T+Y G         ++P     S+ +  RTY++ N   LYP
Sbjct: 522 KAVAEAIFGEYSPSGKLPVTFYQG-------TENLPEFTDYSMAH--RTYRYTNENVLYP 572

Query: 637 FGYGLSYTQFKYNLLSFTK 655
           FGYGL Y +  Y+ +S  K
Sbjct: 573 FGYGLHYGETNYDGMSVDK 591


>gi|346225847|ref|ZP_08846989.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
 gi|346227016|ref|ZP_08848158.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
          Length = 718

 Score =  399 bits (1026), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 268/767 (34%), Positives = 395/767 (51%), Gaps = 99/767 (12%)

Query: 42  FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE 101
           FS    +  SF   D SL    R + +V ++T++EK+ QL + A  V RL +P+Y+WW+E
Sbjct: 7   FSLKAQEDCSFRNPDISL--DERAECIVKQLTVEEKINQLMNAAPAVDRLEIPEYDWWNE 64

Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL---- 157
            LHGV+  G           AT FP  I   A+++ +L  ++G A+STEARA YN+    
Sbjct: 65  CLHGVARAGR----------ATVFPQAIGMAATWDTTLVYRVGDAISTEARAKYNVFSKH 114

Query: 158 ----GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
                  GLT+W+PN+N+ RDPRWGR  ET GEDPF+  R  V++V+GLQ   G+     
Sbjct: 115 GYRGQYKGLTFWTPNVNIFRDPRWGRGQETYGEDPFLTSRIGVSFVKGLQ---GN----- 166

Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
            + + LKV++  KHYA   V N     R+ FDA+V+ +D+ ET+L  FE  VKE     V
Sbjct: 167 -HPKYLKVAALAKHYA---VHNGPEALRHEFDAKVSMKDLWETYLPAFEALVKEAGVEGV 222

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           M +YNR NG P CA P L+ + +R +W   GY V+DC +I      HK + D+ E+A A 
Sbjct: 223 MGAYNRTNGDPCCAHPYLMQEVLREKWGFDGYYVSDCGAIMDFYTGHK-IVDTPEEAAAM 281

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF--DGSPQYVSL 391
            L AG +L+CG  Y +    ++++G   E +ID+S+K L+   +RLG F  +G+  Y ++
Sbjct: 282 ALNAGCNLNCGDTYASLL-KSLEKGLTTEEEIDRSVKQLFKTRLRLGLFAPEGAVPYDTI 340

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
               I S E+ +LA EAAR+ +VLLKN+ NTLP+ +  VK V V GP A    A++ NY 
Sbjct: 341 STDVIRSKEHQKLALEAARKSVVLLKNEANTLPV-ARDVKKVYVTGPTATHVQALLANYY 399

Query: 452 GIPCRYMSPIAGFSG----YANVTYKTGCDDVACKSN-NSIFAASEAAKTADATIILAGL 506
           G+     + + G  G      +V Y+ G   +  ++N N++   S AA +AD T+   G+
Sbjct: 400 GVSEDMTTILEGIVGKVSPQTSVQYRQGA--LLYEANRNTMDWFSGAAASADVTVACLGI 457

Query: 507 DLSVEAES---------LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
              +E E           DRE   LP  Q   + ++   AK    LV++   G  I+  E
Sbjct: 458 SQLIEGEEGEAIASEHRGDRERTRLPQNQIDFLKRIRASAKK---LVVVITSGSAISLPE 514

Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD 617
                 A+L+  YPGE+GG+A+ADV+FG   P GRLP+T      V  LP       P +
Sbjct: 515 IYDMADALLYVWYPGEQGGKAVADVLFGDAVPSGRLPVTVVKS--VDDLP-------PYE 565

Query: 618 SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDAS 677
           +    GRTY++      +PFG+GLSYT F Y+ L+                         
Sbjct: 566 NYDMKGRTYRYMEVSPQFPFGFGLSYTDFTYSNLTLES---------------------- 603

Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ-VIGFQR 736
                    N ++  +      D  N G  D  +VV  Y     E +    KQ +IGF+R
Sbjct: 604 ---------NKVKSGESVRLSFDLTNEGEYDADEVVQFYIT-DVEASVNVPKQSLIGFKR 653

Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
           V + AG + +I+F       + IVD     +L +GE  I++G    S
Sbjct: 654 VGLAAGESTKIEFTVTP-DMMKIVDNNGEKILESGEFKIYIGGSSYS 699


>gi|372208556|ref|ZP_09496358.1| beta-glucosidase [Flavobacteriaceae bacterium S85]
          Length = 729

 Score =  399 bits (1025), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 260/758 (34%), Positives = 387/758 (51%), Gaps = 95/758 (12%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           Q  +  + D+SL +  R+  LV  MTL EK+ QL   +  V RL +P+Y WW+EALHGV+
Sbjct: 20  QKKAQKWLDTSLTFEERIHHLVKAMTLKEKIAQLDSGSPEVKRLDIPEYNWWNEALHGVA 79

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR-------- 159
             G           +T FP  I   A+F+  L K++  A+S EARA +N+ +        
Sbjct: 80  RNGK----------STVFPQAIGLAATFDPVLAKQVASAISDEARAKFNISQSIGNRGQY 129

Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
           AGLT+W+PN+N+ RDPRWGR  ET GEDP++  +  V +V+GLQ   G+      + + L
Sbjct: 130 AGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSQMGVAFVKGLQ---GN------HPKYL 180

Query: 220 KVSSCCKHYAAYDVDNWKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
           K ++C KH+A +      G +  R+HF+A  +++D+ ET+L  FE  VK+ +   VM +Y
Sbjct: 181 KSAACAKHFAVHS-----GPEELRHHFNANPSKKDLYETYLPAFEALVKQANVEGVMSAY 235

Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
           N V G+P+ +   LL +T+R  W   GYIV+DC ++  +   HK +    E A A  LKA
Sbjct: 236 NAVYGVPAGSSEFLLKETLRKSWGFDGYIVSDCGALGDIFKGHKQVKTMPE-AAAVALKA 294

Query: 338 GLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQD 395
           G++L+CG  Y      AVQQG V E  ID  LK L     +LGFFD      Y ++    
Sbjct: 295 GVNLNCGYVYNGALEKAVQQGLVSEELIDTRLKQLLKTRFKLGFFDPKEANPYNAIPTSV 354

Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
           I SD++I LA + A++ IVLLKN  +TLPL+   +K   V GP A+++  ++ NY G+  
Sbjct: 355 IHSDDHIALARKTAQKSIVLLKNKNHTLPLDK-NIKVPYVTGPFASSSDVLLANYYGMTT 413

Query: 456 RYMSPIAGFSGY----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
             +S + G +       ++ Y+ G      K+ N    A   AKTADA I + GL    E
Sbjct: 414 NLVSVLEGIADKVSLGTSLNYRMGALPF-NKNLNPKNWAPNVAKTADAVIAVVGLSADFE 472

Query: 512 AESL---------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
            E +         D++DL LP  Q   + ++A   KGP+ILV+  A G  +A  E     
Sbjct: 473 GEEVDAIASPNKGDKKDLKLPQNQIDYVKEMAAKKKGPLILVV--ASGSAVALGELYDLA 530

Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
            AI+   YPGE+GG A+ADV+FG  +P G LP+T+         P +   L P +     
Sbjct: 531 DAIVLMWYPGEQGGNAVADVLFGDVSPSGHLPVTF---------PKSVAQLPPFEDYSMQ 581

Query: 623 GRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP 682
           GRTYK+     L+PFG+GLSYT FK++       +Q++  K                   
Sbjct: 582 GRTYKYMEEEPLFPFGFGLSYTDFKFS------NVQISEEK------------------- 616

Query: 683 GVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
                 ++  D F       N G  DG +VV +Y  P          Q++ F+R+ ++  
Sbjct: 617 ------IKKKDSFTVSCSVANNGKVDGEEVVQLYLVPLNSNKDLPKYQLLKFKRIEIQKN 670

Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
            +K + F   A K L  V+         G++ + V N 
Sbjct: 671 TSKTVSFNLEA-KDLFQVNKEGKKTWIKGKYKLVVANA 707


>gi|366163035|ref|ZP_09462790.1| glycoside hydrolase family 3 [Acetivibrio cellulolyticus CD2]
          Length = 705

 Score =  399 bits (1025), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 250/773 (32%), Positives = 386/773 (49%), Gaps = 116/773 (15%)

Query: 61  YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
           Y  + ++LV++MTL+EK  QL   +  + RLG+P Y WW+EALHGV+  G          
Sbjct: 7   YKKKAEELVAQMTLEEKASQLTYNSPAIERLGIPAYNWWNEALHGVARAGT--------- 57

Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVA 172
            AT FP  I   A F++    KI  A++ EARA YN            GLT WSPNIN+ 
Sbjct: 58  -ATVFPQAIGLAAMFDDEFLMKIANAIAIEARAKYNESSKHGDRDIYKGLTIWSPNINIF 116

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           RDPRWGR  ET GEDPF+ G+  V +++GLQ           +   +  ++C KH+AAY 
Sbjct: 117 RDPRWGRGHETYGEDPFLSGKLGVAFIKGLQG----------DKDVMMTAACVKHFAAYS 166

Query: 233 VDNWKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
                G +  R+ F+A VT++D+ ET+L  FE CVK+    +VM  YNR NG P C    
Sbjct: 167 -----GPEDLRHGFNAEVTKKDLWETYLPAFETCVKDAKVEAVMGGYNRTNGEPCCGSYT 221

Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
           LL   +R +W   G++V+DC +I+    +H  +  + E++VA  + AG DL+CG  Y   
Sbjct: 222 LLRDILREKWGFEGHVVSDCWAIKDFHTDH-MVTKTPEESVALAIDAGCDLNCGNMYLML 280

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAR 410
              A+Q+G + E  I ++   ++T   +LG F+GS ++ ++  + +   E+ E+A EAAR
Sbjct: 281 L-IALQEGLITEEHITRAAVRIFTTRFKLGLFEGS-EFDNIPYEVVECSEHKEMAIEAAR 338

Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS----G 466
           +  VLLKND   LP+N   +KT+ V+GP+AN+ +A+ GNY G   RY++ + G       
Sbjct: 339 KSAVLLKND-GILPINKGAIKTIGVIGPNANSRIALKGNYHGTSSRYITLLEGIQDEVGD 397

Query: 467 YANVTYKTGCD------DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE------- 513
              V Y  GC+      +V   +N+ +  A   A+ +D  ++  GLD ++E E       
Sbjct: 398 EVRVLYSNGCELVKDRTEVLAYANDRLAEAVTVAEHSDLVVLCLGLDETIEGEQSDEGNN 457

Query: 514 --SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
             S D++DL LP  Q  L+ ++    K P +L +M+   +++++A  + N   IL   YP
Sbjct: 458 GGSGDKKDLDLPEVQKSLLEKIVATGK-PTVLCLMAGSAINLSYAHEHCN--GILLTWYP 514

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G  GG+A+AD++FG  +P G+LP+T+Y         L ++P  P+       RTY++   
Sbjct: 515 GARGGKAVADILFGNASPSGKLPVTFYRS-------LDNLP--PITDYSMKNRTYRYIEE 565

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
             LYPFGYGL+Y   +   +    T+++  +                             
Sbjct: 566 APLYPFGYGLTYGDVELKHVEIKGTVEIEKD----------------------------- 596

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
                  V  QN GS    +VV  Y K    + A     +  F RV + A   K++    
Sbjct: 597 ---IYITVTLQNRGSVAVEEVVQAYIKDEQSMYAVTNTSLCAFMRVGLGANEEKQVSMRI 653

Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGNGG-------------VSFPIHLNFN 791
               SL +V+     +L + + T+F G  G             +S  I L FN
Sbjct: 654 -PFDSLKVVNLDGEKVLDSKKFTLFAGLCGPDKRSVELTGKEPISILIELEFN 705


>gi|225878709|dbj|BAH30674.1| beta-xylosidase [Aspergillus aculeatus]
          Length = 785

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 258/734 (35%), Positives = 380/734 (51%), Gaps = 40/734 (5%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L CD +     R   L S MTL+E +   G+    +PRLGLP Y+ W+EALHG+      
Sbjct: 60  LVCDRTASAHDRAAALTSMMTLEELMNSTGNRIPAIPRLGLPPYQIWNEALHGLYLA--- 116

Query: 113 THFDDVIP--GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
            +F +  P   +TSFP+ ILT A+ N +L  +I Q ++T+ RA  N GR GL  +SPNIN
Sbjct: 117 -NFTESGPFSWSTSFPSPILTMATLNRTLIHQIAQIIATQGRAFNNAGRYGLNAFSPNIN 175

Query: 171 VARDPRWGRITETPGEDP-FVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
             R P WGR  ETPGED   +   YA  Y+ GLQ      NAT+      K+ +  KHYA
Sbjct: 176 AFRHPVWGRGQETPGEDANCLCSAYAYEYITGLQG-----NATNP-----KIIATAKHYA 225

Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
            YD++NW+   R+  D  +T+QD+ E F   F + V++    SVM SYN VNG+PS A+ 
Sbjct: 226 GYDIENWRQRSRFGNDLNITQQDLAEYFTPQFVVAVRDAQVRSVMPSYNAVNGVPSSANT 285

Query: 290 KLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
            LL   VR  W     GY+ +DCD++  + + H + A+    A A +L+AG D+DCG  Y
Sbjct: 286 FLLQTLVRDSWGFIQDGYMASDCDAVYNVFNPHGYAAN-LSSASAMSLRAGTDIDCGISY 344

Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIELAA 406
                 ++ QG++  ++I++++   Y+ L+  G+FDG    Y  L   D+       +A 
Sbjct: 345 LTTLNESLTQGQISRSEIERAVTRFYSNLVSAGYFDGPDAPYRDLSWSDVVRTNRWNVAY 404

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
           EAA  G+VLLKND   LPL S  V+ VA++GP ANAT  M GNY G+     SP+A    
Sbjct: 405 EAAVAGVVLLKND-GVLPL-SKSVQRVALIGPWANATEQMQGNYHGVAPYLTSPLAAVQA 462

Query: 467 YA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
               V Y  G  ++     N   AA  AA+ +D  I   G+D ++EAE LDR ++  PG 
Sbjct: 463 SGLEVNYAFGT-NITSNVTNCFAAALAAAEKSDIIIFAGGIDNTLEAEELDRANITWPGN 521

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q +LI+++ E+ K P++++ M  G VD +  + +  + A+LW GYPG+ GG+A+ D++ G
Sbjct: 522 QLELIHRLGELGK-PLVVLQMGGGQVDSSALKASEKVGALLWGGYPGQAGGQALWDILTG 580

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
           +  P GRL  T Y  +Y    P T M LRP      PG+TY +Y G  +Y FG+GL YT 
Sbjct: 581 QRAPAGRLTTTQYPAEYALQFPATDMSLRPRGD--NPGQTYMWYTGEPVYAFGHGLFYTT 638

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
           F   L    +  +         R+ +  +  ++      LV  L    +  F V   N G
Sbjct: 639 FATALAGPGQEPE---------RSFDIGALLARPHAGYNLVEQL---PFLNFTVKVTNTG 686

Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
                   + ++   A       K ++GF R+     R      V  +  SL   D   N
Sbjct: 687 EVISDYTAMAFANTTAGPRPHPNKWLVGFDRIGPLDPRVSARMSVPVSLDSLARTDAQGN 746

Query: 766 TLLPAGEHTIFVGN 779
            ++  G + + + N
Sbjct: 747 RVIYPGPYELALNN 760


>gi|182415033|ref|YP_001820099.1| Beta-glucosidase [Opitutus terrae PB90-1]
 gi|177842247|gb|ACB76499.1| Beta-glucosidase [Opitutus terrae PB90-1]
          Length = 905

 Score =  397 bits (1021), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 269/764 (35%), Positives = 385/764 (50%), Gaps = 114/764 (14%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           ++ DSS P  +R  DL+ RM+L EKV QL + A G+PRLGLP Y++W+EA HG++N G  
Sbjct: 204 IWRDSSKPLRVRADDLIRRMSLAEKVSQLKNAAPGIPRLGLPAYDYWNEAAHGIANNGI- 262

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGR--------AGL 162
                    AT FP  I   A++N +L  + G  +  E RA +N    R         GL
Sbjct: 263 ---------ATVFPQAIGAAAAWNPALLHQEGTVIGIEGRAKFNDYANRHNGDSKWWTGL 313

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           TYW+PNIN+ RDPRWGR  ET GEDPF+     + +V+G+Q  +          R +   
Sbjct: 314 TYWAPNINLFRDPRWGRGQETYGEDPFLTAEIGIEFVKGVQGDD---------PRYMLAM 364

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           +C KHYA   V +     R+ F+A + E+D+ +T+L  FE  V+EG  + VM +YN VNG
Sbjct: 365 ACAKHYA---VHSGPERTRHSFNAEIPERDLFDTYLPHFERVVREGKVAGVMSAYNAVNG 421

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ-VMVDNHKFLADSKEDAVAQTLKAGLDL 341
           +P+ A+  LL + +R  W   GY+ +DCD+I+ +  +       + E+A A  +KAG +L
Sbjct: 422 VPASANSFLLTELLRKRWGFEGYVPSDCDAIRDIYGEKQHHYVKTAEEAAALAVKAGCNL 481

Query: 342 DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY----VSLGKQDIC 397
            CG  Y N    AVQQG V E D+D +L +      RLG FD + Q      +L   D+ 
Sbjct: 482 CCGGDY-NALVRAVQQGLVTEKDLDGALYHTLWTRFRLGLFDPAEQVPFSGYTLKDNDLP 540

Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
           +   + L  E AR+ IVLLKND  TLPL+  K+K +AV+GP+A +   + GNY G   R 
Sbjct: 541 AHSQVAL--ELARQAIVLLKND-GTLPLDRTKLKQIAVIGPNAASKSMLEGNYHGSASRS 597

Query: 458 MSPIAGFSGYAN------------VTYKTGCDDVACKSNNS-------IFAASEAAKTAD 498
           +S +                    VT K G    + + N +          A + A  AD
Sbjct: 598 ISILDDIRNLVGSEIKITHAMGSPVTTKPGTAPWSGQDNTTDRPVAELKAEALKLAAEAD 657

Query: 499 ATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAET 558
           A I + G+  + E ES DRE + LP  Q  LI  +    K PV++V  S  G  +A    
Sbjct: 658 AIIYVGGITPAQEGESFDRESIELPSEQEDLIRALHATGK-PVVMVNCS--GSAMALTWQ 714

Query: 559 NTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDS 618
           + N+ AI+ A YPG+EGGRA+A+V+FG+ NP G LPIT+Y          ++  L     
Sbjct: 715 DENLPAIVQAWYPGQEGGRAVAEVLFGETNPSGHLPITFYR---------STADLPDFSD 765

Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
                RTY+++ G  LY FG+GLSY+ F+Y                    NL     A+ 
Sbjct: 766 YSMKNRTYRYFTGRPLYAFGHGLSYSTFEYA-------------------NLRVAPAAN- 805

Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
               G L   L          D  N G  DG DVV +Y+ PPA      ++ + GF+R  
Sbjct: 806 ----GALTVTL----------DLTNSGKRDGDDVVQLYATPPASSQPQELRALCGFRRTH 851

Query: 739 VRAGRNKRIKFVFNAC--KSLNIV--DYAANTLLPAGEHTIFVG 778
           V+AG  + +     A   +  +I   DYA    +P+G+ TI  G
Sbjct: 852 VKAGETRTVTVTVPAVALRRWDIAKKDYA----IPSGDWTIAAG 891


>gi|440472411|gb|ELQ41274.1| beta-xylosidase [Magnaporthe oryzae Y34]
 gi|440484691|gb|ELQ64724.1| beta-xylosidase [Magnaporthe oryzae P131]
          Length = 792

 Score =  397 bits (1020), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 237/628 (37%), Positives = 349/628 (55%), Gaps = 45/628 (7%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +S+ + CD +   + R   LV  M LDEK++ L + + G PR+GLP YEWWSEALHGV+ 
Sbjct: 35  LSTNIVCDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEALHGVAK 94

Query: 109 VGPGTHFDD----VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
             PG  F+         ATSF   I+ +A+F++ L + +   +STEARA  N G AGL +
Sbjct: 95  -SPGVTFNKSSGAAFSSATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAGLAGLDW 153

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           W+PNIN  +DPRWGR  ETPGED   + +Y    +RGL+        +D  +R  K+ + 
Sbjct: 154 WTPNINPYKDPRWGRGMETPGEDALRISKYVKALLRGLE-------GSDPTTR--KMVAN 204

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV---- 280
           CKHYAA D++ W GV RY+FDA VT QD+ E +L  F+ C ++ +  S MC+YN +    
Sbjct: 205 CKHYAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAMSIKG 264

Query: 281 -----NGIPSCADPKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
                NG P CA   L+N  +R  W   + + +I +DC+++  M + H + +D++E+A  
Sbjct: 265 KDLSWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHW-SDTREEAAG 323

Query: 333 QTLKAGLDLDC--GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYV 389
               AG D  C    Y       A  +G + E  +D++LK LY  L+R G+FDG    Y 
Sbjct: 324 SAYTAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDGPDAPYR 383

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLN----SAKVKTVAVVGPHANATVA 445
           ++   D+ + E  +LA  +A EG+VL KN+   LP+       K KTVA++G   +    
Sbjct: 384 NITWADVNTPEARKLAHRSAVEGMVLTKNN-GVLPIKLEELQKKGKTVALIGNWVDNGEQ 442

Query: 446 MIGNYAGIPCRYMSPIAGFSG--YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
           M+G Y+GI     +P+A         VT     +      ++    A  AA  AD  +  
Sbjct: 443 MLGTYSGIAPFRNTPLAAAKALNLKMVTAGGPVNQSTGSRDSWTRPALNAAIQADVVLYF 502

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
            G+DLSVEAE  DR  L  P  Q +L++ ++ + K P ++V +     D A  + N NI 
Sbjct: 503 GGIDLSVEAEDRDRYSLAWPSAQAKLLSDISALGK-PTVVVQLGTMLDDTALLD-NKNIS 560

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPV-DSLG-- 620
           AI+WAGYPG++GG A  D++ GK  P GRLP+T Y   Y   +P+T M +RP  D+ G  
Sbjct: 561 AIIWAGYPGQDGGTAAFDIITGKTAPSGRLPVTQYPAKYANQVPMTDMEVRPSKDTKGGA 620

Query: 621 --YPGRTYKFYNGPTLYPFGYGLSYTQF 646
              PGRTY++Y+   ++PFG+GL +T F
Sbjct: 621 ASNPGRTYRWYD-EAVHPFGFGLHFTNF 647


>gi|389632743|ref|XP_003714024.1| beta-xylosidase [Magnaporthe oryzae 70-15]
 gi|351646357|gb|EHA54217.1| beta-xylosidase [Magnaporthe oryzae 70-15]
          Length = 847

 Score =  397 bits (1020), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 237/628 (37%), Positives = 349/628 (55%), Gaps = 45/628 (7%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +S+ + CD +   + R   LV  M LDEK++ L + + G PR+GLP YEWWSEALHGV+ 
Sbjct: 90  LSTNIVCDQAATPAERAAGLVDIMELDEKLENLVNKSPGAPRIGLPAYEWWSEALHGVAK 149

Query: 109 VGPGTHFDD----VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
             PG  F+         ATSF   I+ +A+F++ L + +   +STEARA  N G AGL +
Sbjct: 150 -SPGVTFNKSSGAAFSSATSFSNPIVLSAAFDDELVEAVATQISTEARAFSNAGLAGLDW 208

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           W+PNIN  +DPRWGR  ETPGED   + +Y    +RGL+        +D  +R  K+ + 
Sbjct: 209 WTPNINPYKDPRWGRGMETPGEDALRISKYVKALLRGLE-------GSDPTTR--KMVAN 259

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV---- 280
           CKHYAA D++ W GV RY+FDA VT QD+ E +L  F+ C ++ +  S MC+YN +    
Sbjct: 260 CKHYAANDLERWNGVTRYNFDAPVTLQDLSEYYLPAFKQCARDSNVGSFMCAYNAMSIKG 319

Query: 281 -----NGIPSCADPKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
                NG P CA   L+N  +R  W   + + +I +DC+++  M + H + +D++E+A  
Sbjct: 320 KDLSWNGTPVCASKYLMNDILREHWGWKEHNNWITSDCNAVLHMWNQHHW-SDTREEAAG 378

Query: 333 QTLKAGLDLDC--GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYV 389
               AG D  C    Y       A  +G + E  +D++LK LY  L+R G+FDG    Y 
Sbjct: 379 SAYTAGTDTVCEVSNYDKTAVKGAFDRGLLDEDVVDRALKRLYEGLVRAGYFDGPDAPYR 438

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLN----SAKVKTVAVVGPHANATVA 445
           ++   D+ + E  +LA  +A EG+VL KN+   LP+       K KTVA++G   +    
Sbjct: 439 NITWADVNTPEARKLAHRSAVEGMVLTKNN-GVLPIKLEELQKKGKTVALIGNWVDNGEQ 497

Query: 446 MIGNYAGIPCRYMSPIAGFSG--YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
           M+G Y+GI     +P+A         VT     +      ++    A  AA  AD  +  
Sbjct: 498 MLGTYSGIAPFRNTPLAAAKALNLKMVTAGGPVNQSTGSRDSWTRPALNAAIQADVVLYF 557

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
            G+DLSVEAE  DR  L  P  Q +L++ ++ + K P ++V +     D A  + N NI 
Sbjct: 558 GGIDLSVEAEDRDRYSLAWPSAQAKLLSDISALGK-PTVVVQLGTMLDDTALLD-NKNIS 615

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPV-DSLG-- 620
           AI+WAGYPG++GG A  D++ GK  P GRLP+T Y   Y   +P+T M +RP  D+ G  
Sbjct: 616 AIIWAGYPGQDGGTAAFDIITGKTAPSGRLPVTQYPAKYANQVPMTDMEVRPSKDTKGGA 675

Query: 621 --YPGRTYKFYNGPTLYPFGYGLSYTQF 646
              PGRTY++Y+   ++PFG+GL +T F
Sbjct: 676 ASNPGRTYRWYD-EAVHPFGFGLHFTNF 702


>gi|150019484|ref|YP_001311738.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
           8052]
 gi|149905949|gb|ABR36782.1| glycoside hydrolase, family 3 domain protein [Clostridium
           beijerinckii NCIMB 8052]
          Length = 709

 Score =  397 bits (1019), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 256/744 (34%), Positives = 379/744 (50%), Gaps = 102/744 (13%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           + K+LV +MTL+EK +QL   +  V RL +P+Y WW+E LHGV+  G           AT
Sbjct: 15  KAKELVGKMTLEEKAEQLTYKSSAVKRLNVPRYNWWNEGLHGVARAGT----------AT 64

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
            FP  I   A F++ L   I + +STE RA YN            G+T+WSPN+N+ RDP
Sbjct: 65  VFPQAIGLAAMFDDELLNYIAKVISTEGRAKYNENSKKDDRDIYKGITFWSPNVNIFRDP 124

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDP++  R  V +V+GLQ  EG         + LK ++C KH+A +    
Sbjct: 125 RWGRGHETYGEDPYLTSRLGVAFVKGLQG-EG---------KYLKAAACAKHFAVHS--G 172

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
            +G+ R+ FDA V+++D+ ET+L  FE CVKEGD  +VM +YNR NG P C    LL   
Sbjct: 173 PEGL-RHEFDAVVSKKDLYETYLPAFEACVKEGDVEAVMGAYNRTNGEPCCGSKTLLRDI 231

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
           +RG+W+  G++V+DC +I     +H+  + + E A A  +K G DL+CG  Y      A 
Sbjct: 232 LRGKWNFKGHVVSDCWAIADFHLHHRVTSTATESA-ALAMKNGCDLNCGNVYLQLL-LAY 289

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQ-DICSDENIELAAEAAREGIV 414
           ++G V E DI  + + L    +RLG FD   +Y  +  + + C + N EL+ +AAR  +V
Sbjct: 290 KEGLVTEEDITTAAERLMATRIRLGMFDEECEYNKIPYELNDCKEHN-ELSLKAARNSMV 348

Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANV 470
           LLKN+   LPLN   +K++AV+GP+A++ + + GNY+G   RY++ + G          V
Sbjct: 349 LLKNN-GILPLNKNNLKSIAVIGPNADSQIMLKGNYSGTASRYITVLEGIHEAVGEDVRV 407

Query: 471 TYKTGCD------DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SL 515
            Y  GC       +   + N+ +  A   A+ +D  I+  GLD ++E E         + 
Sbjct: 408 YYSEGCHLFRDRVEELAEPNDRLKEAISIAERSDVAILCLGLDSTIEGEQGDAGNSEGAG 467

Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
           D+  L LPG Q +L+ ++ E    PVILVI    G  + F        AIL A YPG  G
Sbjct: 468 DKASLNLPGRQQELLEKIIETGT-PVILVI--GAGSALTFNNAEDKCSAILDAWYPGSRG 524

Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLY 635
           GRA+AD++FGK +P G+LPIT+Y           +  L          RTY++ +  +LY
Sbjct: 525 GRAVADLIFGKCSPSGKLPITFYR---------NTKDLPEFIDYSMKDRTYRYMSCESLY 575

Query: 636 PFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD-DY 694
           PFGYGL+Y+  K + L                                  V D++ D + 
Sbjct: 576 PFGYGLTYSTVKLSELH---------------------------------VPDVKSDFED 602

Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
            E  V   N G+ D  +V+  Y K      A     + GF+RV ++ G +K  K      
Sbjct: 603 VEVSVKITNTGNFDIEEVIQCYIKDLESKYAVRNHSLAGFKRVRLKIGESKIAKMKIKK- 661

Query: 755 KSLNIVDYAANTLLPAGEHTIFVG 778
            S  +V+     +L +    +FVG
Sbjct: 662 SSFEVVNDDGERILDSKRFKLFVG 685


>gi|30316196|sp|P83344.1|XYNB_PRUPE RecName: Full=Putative beta-D-xylosidase; AltName: Full=PpAz152
 gi|19879972|gb|AAM00218.1|AF362990_1 beta-D-xylosidase, partial [Prunus persica]
          Length = 461

 Score =  396 bits (1018), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 199/451 (44%), Positives = 286/451 (63%), Gaps = 9/451 (1%)

Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP---QY 388
           A  +KAGLDLDCG +    T  AV++G V + +I+ +L    TV MRLG FDG P   QY
Sbjct: 1   ADAIKAGLDLDCGPFLAIHTEAAVRRGLVSQLEINWALANTMTVQMRLGMFDGEPSAHQY 60

Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
            +LG +D+C+  + +LA EAAR+GIVLL+N   +LPL++ + +TVAV+GP+++ TV MIG
Sbjct: 61  GNLGPRDVCTPAHQQLALEAARQGIVLLENRGRSLPLSTRRHRTVAVIGPNSDVTVTMIG 120

Query: 449 NYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDL 508
           NYAG+ C Y +P+ G   Y    ++ GC DV C  N    AA  AA+ ADAT+++ GLD 
Sbjct: 121 NYAGVACGYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADATVLVMGLDQ 180

Query: 509 SVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
           S+EAE +DR  L LPG+Q +L+++VA  ++GP ILV+MS G +D+ FA+ +  I AI+W 
Sbjct: 181 SIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDPRISAIIWV 240

Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
           GYPG+ GG AIA+V+FG  NPGG+LP+TWY  +YV  LP+T M +R   + GYPGRTY+F
Sbjct: 241 GYPGQAGGTAIANVLFGTANPGGKLPMTWYPQNYVTHLPMTDMAMRADPARGYPGRTYRF 300

Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
           Y GP ++PFG GLSYT F +NL      + V L  L+   N    S   +   P     D
Sbjct: 301 YIGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKTVRVSHP-----D 355

Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
                  +  VD +N GS DG+  ++V++ PP    A+  KQ++GF ++ +  G  KR++
Sbjct: 356 CNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPDGKWASS-KQLMGFHKIHIATGSEKRVR 414

Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
              + CK L++VD      +P GEH + +G+
Sbjct: 415 IAVHVCKHLSVVDRFGIRRIPLGEHKLQIGD 445


>gi|336261464|ref|XP_003345521.1| hypothetical protein SMAC_07509 [Sordaria macrospora k-hell]
 gi|380088197|emb|CCC13872.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 762

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 263/790 (33%), Positives = 400/790 (50%), Gaps = 96/790 (12%)

Query: 11  FSLSI-ALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLV 69
           FSLS  A LV+   A+D    + P  V  P         ++S   CD++L    R   LV
Sbjct: 16  FSLSCSAALVY---AIDLPFQTYPDCVNGP---------LASLKVCDATLSPPQRAAALV 63

Query: 70  SRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHF---DDVIPGATSFP 126
           + MT +EK+Q L   + G PR+GLP Y WWSEALHGV+   PGT F   +     +TSFP
Sbjct: 64  AAMTTEEKLQNLVSKSKGAPRIGLPAYNWWSEALHGVA-YAPGTQFRSGNGTFNSSTSFP 122

Query: 127 TVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGE 186
             +L  A+F++ L +++G+ +  E RA  N G +G  YW+PN+N  +DPRWGR +ETPGE
Sbjct: 123 MPLLMAATFDDELIERVGEVIGIEGRAFGNAGFSGFDYWTPNVNPFKDPRWGRGSETPGE 182

Query: 187 DPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDA 246
           D   + RYA + +RGL+          +  R  ++ + CKHYAA D ++W G  R+ F+A
Sbjct: 183 DILRIKRYAASMIRGLEG--------PVRERERRIVATCKHYAANDFEDWNGSTRHDFNA 234

Query: 247 RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG-- 304
           +VT QD+ E +L PF+ C ++    S+MCSYN VNG+P+CA+  L+   +R  W+     
Sbjct: 235 KVTLQDLAEYYLSPFQQCARDSKVGSIMCSYNAVNGVPACANTYLMQTILRDHWNWTAPG 294

Query: 305 -YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKET 363
            YI +DC+++  +  NH + A +  +  A   +AG+D  C    ++    A  QG +K++
Sbjct: 295 NYITSDCEAVLDISANHHY-AKTNAEGTALAFEAGIDSSCEYEGSSDILGAWTQGLLKQS 353

Query: 364 DIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNT 422
            +D++L+ LY  L+++G+FDG+  +Y SLG   +   ++ E+A +AA EGIVLLKND+ T
Sbjct: 354 TVDRALRRLYEGLVQVGYFDGNRSEYASLGWNHVNRPKSQEVALQAAVEGIVLLKNDK-T 412

Query: 423 LPL----NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDD 478
           LPL    N  K+K +A++G  AN    + G Y+G P    SP+             G   
Sbjct: 413 LPLGVKKNGPKLK-LAMIGFWANDPKTLSGGYSGTPAFEHSPVYATQAMGFKVTTAGGPV 471

Query: 479 VACKSNNSIFAASEAAKTADATIIL--AGLDLSVEAESLDREDLWLPGYQTQLINQVAEV 536
           +   ++   +  +  A   DA  IL   G D S   E+ DR  +  P  Q QLI  ++++
Sbjct: 472 LQNSTSKDTWTQAALAAAKDANYILYFGGQDTSAAGETKDRTTINWPEAQLQLITDLSKL 531

Query: 537 AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT 596
            K P+++V M    +D      +  I +ILWA +P                 P GRLP+T
Sbjct: 532 GK-PLVVVQM-GDQLDNTPLLASKAINSILWANWP----------------VPAGRLPVT 573

Query: 597 WYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT 656
            Y+ +Y   +P+T M LRP D L  PGRTY++Y  P + PFG+GL YT FK  ++     
Sbjct: 574 QYHANYTAAVPMTDMTLRPSDKL--PGRTYRWYPTP-VQPFGFGLHYTTFKTKIV----- 625

Query: 657 IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL--RCDDYF-------EFKVDFQNVGST 707
                                  R P   + DL  RC + +         KV+  N G  
Sbjct: 626 -----------------------RLPRFAIKDLLSRCGNAYPDTCGLPPLKVEVTNTGKR 662

Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
               VV+ + K         IK ++ + R+   +   K    +      +   D   NT+
Sbjct: 663 SSDYVVLAFLKGDVGPKPYPIKTLVSYTRLRDLSPGRKTTAHLDWTLGDIARYDEQGNTV 722

Query: 768 LPAGEHTIFV 777
           L  G +T+ V
Sbjct: 723 LYPGTYTVIV 732


>gi|150019782|ref|YP_001312036.1| glycoside hydrolase family protein [Clostridium beijerinckii NCIMB
           8052]
 gi|149906247|gb|ABR37080.1| glycoside hydrolase, family 3 domain protein [Clostridium
           beijerinckii NCIMB 8052]
          Length = 709

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 252/741 (34%), Positives = 387/741 (52%), Gaps = 100/741 (13%)

Query: 66  KDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
           K+LVS+MTL EK +QL   +  +  L +P+Y WW+E LHGV+  G           AT F
Sbjct: 17  KELVSKMTLQEKAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAGT----------ATVF 66

Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDPRW 177
           P  I   A F++    K+   ++TE RA YN            GLTYWSPNIN+ RDPRW
Sbjct: 67  PQAIGLAAIFDDEFLGKVANIIATEGRAKYNEYSKKDDRGIYKGLTYWSPNINIFRDPRW 126

Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWK 237
           GR  ET GEDP++  R  V +++GLQ  EG         + LK+++C KH+A +     +
Sbjct: 127 GRGHETYGEDPYLTSRLGVAFIKGLQG-EG---------KYLKLAACAKHFAVHS--GPE 174

Query: 238 GVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVR 297
           G+ R+ F+A V ++D+ ET+L  FE CVKE +  SVM +YNR NG P C    LL   +R
Sbjct: 175 GL-RHEFNAVVNKKDLYETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKTLLKDILR 233

Query: 298 GEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQ 357
           G+W   G++V+DC ++      H  +  +  ++VA  ++ G DL+CG  Y N    A ++
Sbjct: 234 GKWGFKGHVVSDCWAL-ADFHLHHMVTSTATESVALAIENGCDLNCGNMYLNLL-LAYKE 291

Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
           G V E  I  + + L T   +LG FD   +Y  +  +   S E+ E+A  A+R+ +VLLK
Sbjct: 292 GLVTEEQITTAAERLMTTRFKLGMFDEECEYNKIPYEVNDSREHNEVALIASRKSMVLLK 351

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYK 473
           N+  TLPL+ + +K++AV+GP+AN+ + + GNY+G   +Y + + G          V Y 
Sbjct: 352 NN-GTLPLDKSNLKSIAVIGPNANSEIMLKGNYSGTASKYTTILEGIHDAVGNDVRVYYS 410

Query: 474 TGC-------DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLDR 517
            GC       +D+A + ++ +  A   A+ +D  ++  GLD ++E E         + D+
Sbjct: 411 EGCHLFKDKVEDLA-RPDDRLSEAISVAERSDVVVLCLGLDSTIEGEQGDAGNSYGAGDK 469

Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
           E+L LPG Q  L+ +V EV K PVI+V+ +   + +  AE      AIL A YPG  GG 
Sbjct: 470 ENLNLPGRQQNLLEKVLEVGK-PVIVVLGAGSALTLNGAEEKC--AAILNAWYPGSHGGT 526

Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
           A+AD++FGK +P G+LP+T+Y  D  ++   T   ++        GRTY++    +LYPF
Sbjct: 527 AVADILFGKCSPSGKLPVTFYK-DTAKLPDFTDYSMK--------GRTYRYLGHESLYPF 577

Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
           GYGL+Y+                            T + S  + P V     +    F+ 
Sbjct: 578 GYGLTYS----------------------------TVELSNLQVPSV----KQGFGSFDI 605

Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
            ++ +N G  D  +VV  Y K      A     + GF+RV ++ G +K +    N  KS 
Sbjct: 606 SIEIKNTGEYDIEEVVQCYVKDIESKYAVLNHSLAGFKRVSLKKGESKIVTIKLNK-KSF 664

Query: 758 NIVDYAANTLLPAGEHTIFVG 778
            +V+     LL + +  +FVG
Sbjct: 665 EVVNDDGERLLDSKKFKLFVG 685


>gi|449299051|gb|EMC95065.1| glycoside hydrolase family 3 protein [Baudoinia compniacensis UAMH
           10762]
          Length = 849

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 243/622 (39%), Positives = 346/622 (55%), Gaps = 44/622 (7%)

Query: 49  MSSFLFCDS-SLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           ++S L CD+ + PY  R   +++ M + EK+  L D ++G  RLGLP YEWWSEALHGV+
Sbjct: 37  LTSNLVCDTNATPYQ-RASAIINAMNITEKLANLLDVSYGSARLGLPPYEWWSEALHGVA 95

Query: 108 NVGPGTHFDDV--IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
              PG +F        ATSFP  I  +++F++   + I   +STEARA  N  R GL Y+
Sbjct: 96  G-SPGVNFTSSGNYSYATSFPMPITFSSAFDDPSVQNIASVISTEARAYSNAARGGLDYF 154

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE-GHENATDLNSRPLKVSSC 224
           +PNIN  +DPRWGR +ETPGEDP  +  Y  N + GL+  + G+ N +    +  K+ + 
Sbjct: 155 TPNINPFKDPRWGRGSETPGEDPLRIQGYVKNLLIGLEGTDDGYFNTSHSGYK--KMIAT 212

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           CKH+A YD+++W G  RY +DA +T QD+ E +L PF+ C ++ + +S+MCSYN VN +P
Sbjct: 213 CKHFAGYDLEDWDGYIRYGYDAEITTQDLAEYYLPPFQTCARDQNVASIMCSYNSVNSVP 272

Query: 285 SCADPKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL 341
           +CA+  L    +R  W     + YI +DC++I  +  NH +  ++   A   +L  G+D 
Sbjct: 273 ACANSYLQETILREHWGWTIDNNYITSDCNAISDIYYNHNYSVNNAA-AAGLSLSNGMDT 331

Query: 342 DC------------GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQ 387
            C            G YY          G V E  I  +L   Y  L+  G+FD   S  
Sbjct: 332 ACIVANTGVMTDVNGSYY---------GGYVTEATITTALIRQYEALVIAGYFDPASSNP 382

Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
           Y S+G   + +     LA +AA EG  LLKN    LP        VA++G  AN T  M 
Sbjct: 383 YRSIGWSSVNTPAAQTLARQAATEGTTLLKN-TGLLPYKFTSQTKVAMIGMWANGTSQMQ 441

Query: 448 GNYAGIPCRYM-SPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG 505
           G Y+G P  Y+ SP+   S    +  Y  G  +    ++N    A+ AA+ AD  +   G
Sbjct: 442 GGYSG-PAPYLHSPLYAASQLGLSYNYANGPINQTTLTSNYSQNATAAAQNADVILFFGG 500

Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
           +D SVEAE++DR  +  PG Q  LI Q+A + K P+I++ M +  +D     +N NI A+
Sbjct: 501 IDWSVEAEAMDRYQIAWPGAQQALIAQLAALGK-PMIVLQMGS-MLDATPILSNNNISAL 558

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           +W GYPG++GG A  D++ G   P GRLP+T Y  DYV  +P+T+M LRP    G PGRT
Sbjct: 559 VWVGYPGQDGGVAAFDILTGAVAPAGRLPVTMYPADYVNQVPMTNMSLRP--GPGNPGRT 616

Query: 626 YKFYNGPTLYPFGYGLSYTQFK 647
           YK+YN   L PF YGL YT FK
Sbjct: 617 YKWYNNAVL-PFAYGLHYTTFK 637


>gi|326791674|ref|YP_004309495.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
 gi|326542438|gb|ADZ84297.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
          Length = 696

 Score =  394 bits (1011), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 254/724 (35%), Positives = 376/724 (51%), Gaps = 108/724 (14%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           + K LV+ MTL+E+  QL   +  + RLG+P Y WW+EALHGV+  G           AT
Sbjct: 9   KAKALVAEMTLEERASQLKYDSPAIKRLGVPAYNWWNEALHGVARAGV----------AT 58

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYWSPNINVARDP 175
           SFP  I   A+F++ L K++ + ++ E RA YN            GLT+WSPN+N+ RDP
Sbjct: 59  SFPQAIGMAATFDDELLKRVAEVIAEEGRAKYNAYSQEGDRDIYKGLTFWSPNVNIFRDP 118

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDP++  R  V +V+GLQ  EG           LK ++C KH+A   V +
Sbjct: 119 RWGRGHETYGEDPYLTSRLGVAFVKGLQGEEG-----------LKTAACAKHFA---VHS 164

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
               DR+HFDARV+++D+ ET+L  FE  VKE +  SVM +YNR NG P C  P L+   
Sbjct: 165 GPEADRHHFDARVSQKDLWETYLPAFEALVKEAEVESVMGAYNRTNGEPCCGSPTLMKDI 224

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
           +R +W   G+ V+DC +I+   ++H   + ++E A A  LK+G DL+CG  Y +    A 
Sbjct: 225 LREKWGFQGHYVSDCWAIKDFHEHHMVTSTAQESA-ALALKSGCDLNCGNTYLHIL-MAY 282

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
           Q G V E +I  + + L+T    LG FDGS  Y ++  + + S  ++ +A EA  + IVL
Sbjct: 283 QNGLVTEEEITTAAERLFTTRYLLGLFDGS-TYDAIPYEVVESKPHLSVADEATAKSIVL 341

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG----YANVT 471
           LKN+   LPLN   +KT+ V+GP+AN+  A+IGNY G   +Y++ + G          + 
Sbjct: 342 LKNN-GLLPLNKESIKTIGVIGPNANSRKALIGNYHGTSSQYITILEGLQKEVGDEVRIL 400

Query: 472 YKTGCDDVACK------SNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
           Y  G    A +        + +  A   AK +D  I+  GLD ++E E         S D
Sbjct: 401 YSEGSHLYADRVEPLAYQRDRLSEAKIVAKHSDVVIVCVGLDETLEGEEGDTGNAYASGD 460

Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
           + DL LP  Q +L+  +A++ K PVIL + +   +D+ +A+ + +  A+L A YPG  GG
Sbjct: 461 KRDLALPEPQQELVEAMAKMGK-PVILCLSAGSAIDLQYADAHYD--AVLQAWYPGARGG 517

Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
           + IA  + G+  P G+LP+T+Y         L+ +P    +     GRTY++     LYP
Sbjct: 518 QVIAKALLGEIVPSGKLPVTFYR-------DLSGLP--AFEDYSMQGRTYRYMQEEALYP 568

Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           FGYGL+Y +                     CR    + D    R   VLV++        
Sbjct: 569 FGYGLTYGK---------------------CRIEEASYDQGSLR---VLVHN-------- 596

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF--NAC 754
            +VDF+        +VV +Y K      A     + GF+RV + AG  K I+     NA 
Sbjct: 597 -EVDFKL------EEVVQLYIKNLDSEFAVPNHSLCGFKRVSLEAGETKEIQINVSPNAF 649

Query: 755 KSLN 758
           K +N
Sbjct: 650 KVVN 653


>gi|373952439|ref|ZP_09612399.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373889039|gb|EHQ24936.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 721

 Score =  393 bits (1010), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 245/725 (33%), Positives = 366/725 (50%), Gaps = 91/725 (12%)

Query: 42  FSKLGLQMSSF-----LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQY 96
            + LGL  ++F     ++ +     S RV+DL+SR+TL EKV  LG  +  VPRL +P Y
Sbjct: 13  LTSLGLIKTAFCQQIPIYRNPDKKLSTRVQDLISRLTLAEKVSLLGYRSQAVPRLNIPAY 72

Query: 97  EWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN 156
            WW+E LHGV+  G           AT FP  I   A+F+++L K++   VSTEARA YN
Sbjct: 73  NWWNEGLHGVARAGE----------ATIFPQAIAMAATFDDNLVKQVANVVSTEARAKYN 122

Query: 157 LGRA--------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
           L  A        GLT+WSPNIN+ RDPRWGR  ET GEDPF+  +    YV GLQ  +  
Sbjct: 123 LSTAMGRHLQYMGLTFWSPNINIFRDPRWGRGQETYGEDPFLTSKMGNAYVHGLQGTDPL 182

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
                     LK S+  KH+ A+        +R +FDA V E+D+ +T+L  F+  V +G
Sbjct: 183 H---------LKTSATAKHFVAHSGPEG---ERDYFDALVDEKDLRDTYLYAFKSLV-DG 229

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
              S+M +YNRVNG+P+  +  L+N  V  EW   G++V DC ++  +   HK L +  E
Sbjct: 230 GVESIMTAYNRVNGVPNSINKTLVNDIVIKEWGFKGHVVTDCGALDDVYKTHKVLPNRME 289

Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SP 386
            A A  +KAG+DLDC   +     NA+    + E  +D +L  + +   +LGFFD   S 
Sbjct: 290 VAAA-AIKAGVDLDCSSIFQTDIINAINNKLLTEKQVDAALAAVLSTQFKLGFFDAPSSS 348

Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
            + S G   I +D ++ LA + A++ +VLLKND+  LPL      ++ VVGP+A +  A+
Sbjct: 349 PFYSFGADSIHNDSHVMLARQMAQKSMVLLKNDKQILPLKMQNYSSIMVVGPNAASLDAL 408

Query: 447 IGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
           + +Y G+  + ++ + G +   +   +   D  A   + + F     A  AD T+ + GL
Sbjct: 409 VASYHGVSSKAVNFVEGITAAVDKGTRVEYDLGADYRDTTHFGGIWGAGNADVTVAVIGL 468

Query: 507 DLSVEAES---------LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
              +E E+          D++DL LP      +  + +  K P+I V+ S   VDIA   
Sbjct: 469 TPVLEGEAGDAFLSQTGGDKKDLSLPAGDIAFMKALRKSVKKPIIAVVTSGSDVDIAAIA 528

Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD 617
              +  A++ A YPGE+GG A+AD++FGK +P G LP+T+YN   V  LP         +
Sbjct: 529 PYAD--AVILAWYPGEQGGNALADILFGKISPSGHLPLTFYNS--VNDLP-------AYN 577

Query: 618 SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDAS 677
           +    GRTY+++ G   YPFG+GLSYT F Y      KT                     
Sbjct: 578 NYSMKGRTYRYFAGAVQYPFGFGLSYTTFNYQWQQQPKT--------------------- 616

Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
                          D  +  V  +N G+    +VV  Y   P  +    +K++ GF+R+
Sbjct: 617 ----------SYSAKDTIQLSVVVKNTGNISADEVVQAYIGYPT-LNRMPLKELKGFKRI 665

Query: 738 FVRAG 742
            +  G
Sbjct: 666 TLNKG 670


>gi|402074909|gb|EJT70380.1| hypothetical protein GGTG_11406 [Gaeumannomyces graminis var.
           tritici R3-111a-1]
          Length = 793

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 234/626 (37%), Positives = 350/626 (55%), Gaps = 38/626 (6%)

Query: 45  LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
           +G  ++S   CD SL  S R   LV+ + + EK+  L   A+G  R+GLP+Y WWSEALH
Sbjct: 34  VGGLLASNKVCDRSLSPSERAAALVAALNVTEKMANLVSNANGSARIGLPKYNWWSEALH 93

Query: 105 GVSNVGPGTHFDDVIPG----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA 160
           GV+   PGT F    PG    +TSFP  +L  ASF++SL +KIG  + TE+RA  N   +
Sbjct: 94  GVA-YAPGTQFRRG-PGDFNSSTSFPMPLLLAASFDDSLIEKIGDVIGTESRAFGNGRWS 151

Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
           GL YW+PN+N  +DPRWGR +ETPGED   + RYA + ++GL+     +          +
Sbjct: 152 GLDYWTPNVNPFKDPRWGRGSETPGEDILRIKRYAASMIKGLEGPHPEKER--------R 203

Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
           V S CKHYAA D ++W G  R+ FDAR++ QD+ E +L PF+ C ++    S+MC+YN V
Sbjct: 204 VVSTCKHYAANDFEDWNGTSRHDFDARISAQDLAEYYLMPFQQCARDSRVGSIMCAYNAV 263

Query: 281 NGIPSCADPKLLNQTVRGEWDLHG---YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
           NG+PSCA+  LL+  +R  W   G   Y+ +DC+++  +   HK+ A +  +  A   +A
Sbjct: 264 NGVPSCANSYLLDTVLRKHWGWTGHNNYVTSDCEAVLDVSAGHKY-ARTNAEGTAMCFEA 322

Query: 338 GLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDI 396
           G D  C    ++    A  QG ++E  +D++L  LY  L+R+G+FDG S  +  +   D+
Sbjct: 323 GTDTSCEYTPSSDIRGAYAQGLLREETMDRALLRLYEGLVRVGYFDGNSSAFSDISWADV 382

Query: 397 CSDENIELAAEAAREGIVLLKNDQN-TLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
            +    +L+ ++A EGIV+LKND    LPL +                +AMIG +A  P 
Sbjct: 383 NAPAAQDLSLQSAVEGIVMLKNDGTLPLPLGAKCSSKSKKRSSSGGPKLAMIGFWADAPE 442

Query: 456 RYMSPIAGFSGY----ANVTYKTGCDDVAC-----------KSNNSIFAASEAAKTADAT 500
           +     +G + Y    A    + G D V              ++N    A  AA+ AD  
Sbjct: 443 KLRGGYSGTAAYLRTPAYAARQMGLDVVTAGGPVLQGAAAAAADNWTAPALAAAEGADYI 502

Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
           +   GLD +   E+ DR D+  PG Q  L+ ++A + K P+++V M    +D      N 
Sbjct: 503 VYFGGLDETAAGENKDRWDVEWPGAQLALVKRLAALGK-PLVVVQM-GDQLDGTPLLANA 560

Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
            + A+LWA +PG++GG A+  ++ G  +P GRLP+T Y  +Y +++P+T M LRP  S  
Sbjct: 561 GVGAVLWASWPGQDGGPAVMRLLSGAASPAGRLPVTQYPANYTRLVPMTEMALRPSASGS 620

Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQF 646
            PGRTY++Y+ P L PFG+GL YT F
Sbjct: 621 RPGRTYRWYSTPVL-PFGFGLHYTNF 645


>gi|330947691|ref|XP_003306937.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
 gi|311315273|gb|EFQ84970.1| hypothetical protein PTT_20252 [Pyrenophora teres f. teres 0-1]
          Length = 756

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 256/736 (34%), Positives = 387/736 (52%), Gaps = 44/736 (5%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           + S   CD +   + R   LV+ M   EK++ L   + GV RLGLP Y WW EALHGV+ 
Sbjct: 29  LKSNAICDVTASPAKRAAALVAAMQTQEKLENLVSKSKGVARLGLPAYNWWGEALHGVAG 88

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
             PG +F      ATSFP  +L +A+F++ L  +I   +  EARA  N G A + +W+P+
Sbjct: 89  A-PGINFTGSYRTATSFPMPLLMSAAFDDDLIHQIAIVIGNEARAFGNGGIAPVDFWTPD 147

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           IN  RDPRWGR +ETPGED   +  Y  + + GL+  +             K+ + CKHY
Sbjct: 148 INPFRDPRWGRGSETPGEDILRIKGYTKSLLSGLEGDKAQR----------KIIATCKHY 197

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
             YDV+NW G DR+HFDA++T QD+ E F+ PF+ C ++    S MCSYN VNG+P+CAD
Sbjct: 198 VGYDVENWNGTDRHHFDAKITTQDLAEYFMPPFQQCARDSKVGSFMCSYNAVNGVPTCAD 257

Query: 289 PKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
             +L   +R  W   D + YI +DC++++ +   HK++A  +E A A     G+DL C  
Sbjct: 258 TYVLEDILRKHWNWTDSNNYITSDCEAVKDISLRHKYVATLQE-ATAIAFNNGMDLSCEY 316

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIEL 404
             T+    A  QG +  + ID++L   Y  L+  G+FDG +  Y  LG QDI + E  +L
Sbjct: 317 SGTSDIPGAFSQGLLNVSVIDRALTRQYEGLVHAGYFDGAAATYAHLGVQDINTPEAQKL 376

Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM-SPI-A 462
             + A EG+ LLKND +TLPL+      VA+VG  AN T  + G Y+G P  Y+ +P+ A
Sbjct: 377 VLQVAAEGLTLLKND-DTLPLSLKSGSKVAMVGFWANTTSKLSGIYSG-PAPYLHTPVYA 434

Query: 463 GFSGYANVTYKTG-CDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLW 521
           G     ++   TG     +  ++N    A  AAK +D  +   GLD S  AE  DR D+ 
Sbjct: 435 GNKLGLDMAVATGPILQTSGAADNWTTTALNAAKKSDFILYFGGLDPSAAAEGSDRTDIS 494

Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
            P  Q  LI ++A  A G  ++VI     VD         + +++WA +PG++GG A+  
Sbjct: 495 WPSAQIDLITKLA--ALGKPLVVIALGDMVDHTPILKMKGVNSLIWANWPGQDGGTAVMQ 552

Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
           V+ G+    GRLPIT Y  +Y Q L +  M +RP  +   PGRTY++YN  ++ PFG+GL
Sbjct: 553 VITGEHAIAGRLPITQYPAEYTQ-LSMLDMNMRPGGN--NPGRTYRWYN-ESVQPFGFGL 608

Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
            YT+F     S +  + VN+  +      ++               DL C D    +V  
Sbjct: 609 HYTKFAAKFGS-SSGLTVNIQDIMKSCTKDHP--------------DL-C-DVPPIEVAV 651

Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
            N G+     + + + K         +K ++ + R+   +G   ++  +     +L+ VD
Sbjct: 652 TNEGNRTSDFIALAFIKGEVGPKPYPLKTLVSYARLRDISGSQTKMASLALTLGALSRVD 711

Query: 762 YAANTLLPAGEHTIFV 777
            + N +   GE+T+ +
Sbjct: 712 QSGNLVAYPGEYTLLL 727


>gi|339499234|ref|YP_004697269.1| beta-glucosidase [Spirochaeta caldaria DSM 7334]
 gi|338833583|gb|AEJ18761.1| Beta-glucosidase [Spirochaeta caldaria DSM 7334]
          Length = 699

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 263/738 (35%), Positives = 382/738 (51%), Gaps = 97/738 (13%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           +++ L+S M+L+EK+  +   A G+PRLG+P Y WW+EALHGV+N G           AT
Sbjct: 11  QIETLISNMSLEEKIGLMIHRAKGIPRLGIPDYNWWNEALHGVANNGE----------AT 60

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYN-LGRA-------GLTYWSPNINVARDP 175
            FP  I   A+F+E L  ++ +A+S EARA +N +G+        GLT+W+PNIN+ RDP
Sbjct: 61  VFPQAIALGATFDEDLVHRVAEAISIEARAKFNAVGKEKAEQYHRGLTFWAPNINIFRDP 120

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAAYDV 233
           RWGR  ET GEDP +  R    YVRGLQ            S P  L+ ++C KH+A +  
Sbjct: 121 RWGRGQETYGEDPVLTSRLGTAYVRGLQ-----------GSDPYYLRAAACAKHFAVH-- 167

Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
              +G+ R+ F+A V+++D+EET+L  F+  VK G   SVM +YNRVNG P+C    LL 
Sbjct: 168 SGPEGL-RHTFNAEVSQKDLEETYLPAFKALVKSG-VESVMGAYNRVNGEPACGSTYLLK 225

Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN 353
           Q +R EW   G++V+DC +I     NHK   D  E ++A  L++G DL+CG  Y N+   
Sbjct: 226 QKLREEWQFQGHVVSDCWAICDFHKNHKVTNDILE-SIALALRSGCDLNCGDAY-NYLAE 283

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGI 413
           AV +G V E DI++++  L   L +LG       Y  +    I   ++  LA EAA + I
Sbjct: 284 AVLKGYVTEDDINRAVVRLLITLDKLGLIHDDGPYQGITIHQIDWKKHDSLALEAAEKSI 343

Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA----N 469
           VLLKN+   LPL   K+  + V GP+A  + A++GNYAG+  R ++ +      A     
Sbjct: 344 VLLKNN-GVLPLKKDKISYIYVTGPNATNSDALLGNYAGVSSRLLTVLEAIVEEAGPEIT 402

Query: 470 VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR---------EDL 520
           VTYK GC  +A +  N    AS   K AD TI + G D SVE E  D          EDL
Sbjct: 403 VTYKKGC-PLAERRVNPNDWASGVTKYADVTIAVMGRDTSVEGEEGDAILSSTYGDFEDL 461

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
            L   Q   ++++ E  K P+I+V+M  GG  I   E +    AIL A YPG+ GG A++
Sbjct: 462 NLNDEQLSYLHKLKESGK-PLIVVLM--GGAPICSPELHEIADAILVAWYPGQAGGTAVS 518

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
           ++VFGK NP G+LP+T          P +   L   ++    GRTY++     LYPFG+G
Sbjct: 519 NIVFGKTNPSGKLPVT---------FPKSVRQLPEFENYSMQGRTYRYMTEEPLYPFGFG 569

Query: 641 LSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
           LSYT+ ++  ++                         + + P          D      +
Sbjct: 570 LSYTKMEFKHVT------------------------GRWKSPE--------KDELIVSTE 597

Query: 701 FQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIV 760
             N G+ DG +VV +Y        A     +I F+RV V AG +   +F     + L  +
Sbjct: 598 LYNQGTIDGEEVVQLYYHWKDAPFAVPNWSLIDFKRVLVAAGASCICEFKI-PLEKLQCI 656

Query: 761 DYAANTLLPAGEHTIFVG 778
           D +   ++P G    +VG
Sbjct: 657 DPSGKGVIPTGTLQFYVG 674


>gi|121700633|ref|XP_001268581.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
 gi|119396724|gb|EAW07155.1| beta-xylosidase XylA [Aspergillus clavatus NRRL 1]
          Length = 743

 Score =  392 bits (1008), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 258/805 (32%), Positives = 398/805 (49%), Gaps = 107/805 (13%)

Query: 1   MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFV---------CDPGRFSKLGLQMSS 51
           +A V++++L   L+ A   ++    +AN   +P  V         CD G  SK       
Sbjct: 7   IATVLAAILPSVLAQANTSYADYNTEANPDLTPQSVATIDLSFPDCDNGPLSKT------ 60

Query: 52  FLFCDS-SLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
            + CD+ + PY  R   L+S  TL+E V   G+ + GVPRLGLP Y+ W+EALHG+    
Sbjct: 61  -IVCDTLTSPYD-RAAALISLFTLEELVNATGNTSPGVPRLGLPPYQVWNEALHGLDRA- 117

Query: 111 PGTHFDD--VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
              +F D      +TSFP  ILT ++ N +L  ++   +ST+ RA  N GR GL  +SPN
Sbjct: 118 ---YFTDEGQFSWSTSFPMPILTMSALNRTLINQVASIISTQGRAFSNAGRYGLDVYSPN 174

Query: 169 INVARDPRWGRITETPGEDPFVVGR-YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
           IN  R P WGR  ETPGED + +   YA  Y+ G+Q          ++ + LK+ +  KH
Sbjct: 175 INSFRHPVWGRGQETPGEDAYCLSSAYAYEYITGIQG--------GVDPKSLKLVATAKH 226

Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
           YA YD++NW G  R   D  +T+QD+ E +   F +  ++    SVMCSYN VNG+PSCA
Sbjct: 227 YAGYDIENWDGHSRLGNDMNITQQDLSEYYTPQFLVAARDAKVRSVMCSYNAVNGVPSCA 286

Query: 288 DPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           +   L   +R  +     GYI +DCDS   + + H++ A+    A A +++AG D+DCG 
Sbjct: 287 NSFFLQTLLRDTFGFVEDGYISSDCDSAYNVFNPHEYAANVSS-AAADSIRAGTDIDCGT 345

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
            Y  +   AV Q  +   DI++ +  LY+ LMRLG+FD                      
Sbjct: 346 TYQYYFDEAVDQNLLSRADIERGVIRLYSNLMRLGYFD---------------------- 383

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF- 464
                                         VGP  N +  + GNY G     +SP+  F 
Sbjct: 384 ------------------------------VGPWMNVSTQLQGNYFGPAPYLISPLDAFR 413

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
             + +V Y  G + ++  S +    A  AAK +DA I   G+D S+EAE+LDR ++  PG
Sbjct: 414 DSHLDVNYAFGTN-ISSNSTDGFSKALSAAKKSDAIIFAGGIDNSLEAETLDRMNITWPG 472

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q +LI+Q++++ K P+I++ M  G VD +  ++N N+ +++W GYPG+ GG+A+ D++ 
Sbjct: 473 KQLELIDQLSQLGK-PLIVLQMGGGQVDSSLLKSNKNVNSLIWGGYPGQSGGQALLDIIT 531

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           GK  P GRL +T Y  +Y    P T M LRP  +   PG+TY +Y G  +Y FG+GL YT
Sbjct: 532 GKRAPAGRLVVTQYPAEYATQFPATDMSLRPHGN--NPGQTYMWYTGTPVYEFGHGLFYT 589

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
            F+   +S  + +     K++   N+    D      PG +   +    +  F VD  N 
Sbjct: 590 TFR---VSHARAV-----KIKPTYNIQ---DLLAQPHPGYI--HVEQMPFLNFTVDITNT 636

Query: 705 GSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAA 764
           G        ++++   A  A    K ++GF R+        ++  +     S+   D   
Sbjct: 637 GKASSDYTAMLFANTTAGPAPYPKKWLVGFDRLPTLGPSTSKLMTIPVTINSMARTDELG 696

Query: 765 NTLLPAGEHTIFVGNG-GVSFPIHL 788
           N +L  G++ + + N   V  P+ L
Sbjct: 697 NRVLYPGKYELALNNERSVVLPLSL 721


>gi|410723195|ref|ZP_11362440.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
           Maddingley MBC34-26]
 gi|410603399|gb|EKQ57833.1| beta-glucosidase-like glycosyl hydrolase [Clostridium sp.
           Maddingley MBC34-26]
          Length = 709

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 253/742 (34%), Positives = 383/742 (51%), Gaps = 102/742 (13%)

Query: 66  KDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
           K+LVS+MTL E+ +QL   +  +  L +P+Y WW+E LHGV+  G           AT F
Sbjct: 17  KELVSKMTLQERAEQLTYQSPAIKHLNVPEYNWWNEGLHGVARAGT----------ATVF 66

Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDPRW 177
           P  I   A F+E    +I   +STE RA YN            GLTYWSPN+N+ RDPRW
Sbjct: 67  PQAIGLAAIFDEEFLGEIADIISTEGRAKYNEYSKKDDRGIYKGLTYWSPNVNIFRDPRW 126

Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWK 237
           GR  ET GEDP++  R  V +++GLQ  EG         + LK+++C KH+A +     +
Sbjct: 127 GRGHETYGEDPYLTSRLGVAFIKGLQG-EG---------KYLKLAACAKHFAVHS--GPE 174

Query: 238 GVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVR 297
           G+ R+ F+A V ++D+ ET+L  FE CVKE +  SVM +YNR NG P C    LL   +R
Sbjct: 175 GL-RHEFNAVVEKKDLYETYLPAFEACVKEANVESVMGAYNRTNGEPCCGSKTLLKDILR 233

Query: 298 GEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQ 357
           G+W   G++V+DC ++      H  +  +  ++VA  ++ G DL+CG  Y N    A ++
Sbjct: 234 GKWGFKGHVVSDCWAL-ADFHLHHMITSTATESVALAIENGCDLNCGNMYLNLL-LAYKE 291

Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
           G V E  I  + + L T   +LG FD   +Y  +  +     E+ E+A  A+R+ +VLLK
Sbjct: 292 GLVTEEQITTAAERLMTTRFKLGMFDEDCEYNRIPYEVNDCKEHNEIALIASRKSMVLLK 351

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA----NVTYK 473
           ND  TLPL+ + +K++AV+GP+AN+ + + GNY+G   +Y + + G          V Y 
Sbjct: 352 ND-GTLPLDKSSLKSIAVIGPNANSEIMLKGNYSGTASKYTTILEGIHNAVGDNIRVYYS 410

Query: 474 TGC-------DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLDR 517
            GC       +D+A   ++ +  A   A+ +D  I+  GLD ++E E         + D+
Sbjct: 411 EGCHLFKDKVEDLA-GPDDRLSEAISVAERSDVVILCLGLDSTIEGEQGDAGNSYGAGDK 469

Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
           E L LPG Q  L+ +V EV K PVI+V+    G  + F        AIL A YPG  GG 
Sbjct: 470 ESLNLPGRQQNLLEKVLEVGK-PVIVVL--GAGSALTFNGAEEKCAAILNAWYPGSHGGT 526

Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
           A+AD++FGK +P G+LP+T+Y  D   +   T   ++        GRTY++    +LYPF
Sbjct: 527 AVADILFGKCSPSGKLPVTFYK-DTANLPEFTDYSMK--------GRTYRYLEHESLYPF 577

Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD-DYFE 696
           GYGL+Y+             +V L+ LQ                    V  ++ D + F+
Sbjct: 578 GYGLTYS-------------KVELSNLQ--------------------VPFVKADFESFD 604

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
             +D +N G+    +VV  Y K      A     + GF+RV ++ G +K +    +  +S
Sbjct: 605 ISIDIRNTGNYGIEEVVQCYVKDLKSKYAVLNHSLAGFKRVSLKKGESKTVTIELSK-RS 663

Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
              V+     LL +    +FVG
Sbjct: 664 FEAVNNDGERLLDSKSFKLFVG 685


>gi|345519864|ref|ZP_08799275.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
 gi|254836262|gb|EET16571.1| beta-glucosidase [Bacteroides sp. 4_3_47FAA]
          Length = 736

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 247/766 (32%), Positives = 386/766 (50%), Gaps = 104/766 (13%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           Q+ +  F ++ LP  +RVKDLV+R+TL+EKV  +   +  +PRLG+P Y+WW+EALHGV+
Sbjct: 20  QVENLPFRNADLPLEVRVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGVA 79

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN----LGRA--- 160
                      +   T FP  I   A+F+    +K+G   STE RA++N     G+    
Sbjct: 80  R---------TLEKVTVFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDWKAGKTGTR 130

Query: 161 --GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
             GLTYW+PNIN+ RDPRWGR  ET GEDP++  +     VRGL+  + H          
Sbjct: 131 YRGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGAAIVRGLEGEDPHY--------- 181

Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
           LK  +C KHYA +    +   +R+ FDAR +  D+ +T++  F   V +     VMC+YN
Sbjct: 182 LKSVACAKHYAVHSGPEY---NRHSFDARPSVFDLWDTYMPAFRELVTKAKVHGVMCAYN 238

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
           R+NG P C +  LL   +R +W   GY+ +DC +++   + HK   +    A++  L AG
Sbjct: 239 RLNGQPCCGNDPLLVDILRNQWHFDGYVTSDCWALKDFAEFHKTHPEHT-IAMSDALLAG 297

Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDI 396
            DL+CG  Y +     V++G   E DI+ SL  L+T+L ++G FD + +  Y S+G++ +
Sbjct: 298 TDLECGNLY-HLLAEGVKKGLHSERDINVSLSRLFTILFKIGMFDPAERVPYSSIGREVL 356

Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
             + + + A   A+E IVLL+N  + LPL+++K+K++A++GP+A+     + NY G P  
Sbjct: 357 ECEAHKQHAERMAKESIVLLENKNHILPLDASKIKSIALIGPNADNGQTQLANYFGTPSE 416

Query: 457 ----YMSPIAGFSGYANVTYKTGCDDV-ACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
               YMS          + Y  G   V   K   S    +  A  +D  + ++G+    E
Sbjct: 417 IVTPYMSLKRRLGDKIKINYLPGVGIVDKLKDAPSFVQVAHKAAQSDVIVFVSGISADYE 476

Query: 512 -------------AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAET 558
                          S DR  + LP  Q +L+ ++ +  + P+I+V MS  G  ++F   
Sbjct: 477 GEAGDAGAAGYGGFASGDRTTMQLPLVQIELLKKLKKTGR-PLIIVNMS--GSVMSFEWE 533

Query: 559 NTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDS 618
           + N  A+L A Y G+  G AI DV+FG  NP GR+P+T Y  D         +P  P ++
Sbjct: 534 SQNADALLQAWYGGQAAGDAIVDVLFGHCNPAGRMPLTTYKSD-------NDLP--PFEN 584

Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
               GRTY+++ G   YPFGYGLSYT F Y+ +               C +  +T D ++
Sbjct: 585 YSMLGRTYRYFKGEPRYPFGYGLSYTTFAYSDV--------------QCVDETHTGDTAR 630

Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP----AEIAATYIKQVIGF 734
                               V   N G  DG +VV +Y   P     +I    +K   GF
Sbjct: 631 V------------------TVTVSNTGDCDGDEVVQLYVVHPQDGRKQIPLCALK---GF 669

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           +R+ ++ G +  + F     + L + +   N +   G+ T+FVG G
Sbjct: 670 KRIHLKRGESTSVSFTLTP-EELALTETDGNLVEKNGQVTLFVGGG 714


>gi|374372635|ref|ZP_09630297.1| Beta-glucosidase [Niabella soli DSM 19437]
 gi|373235166|gb|EHP54957.1| Beta-glucosidase [Niabella soli DSM 19437]
          Length = 734

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 256/774 (33%), Positives = 381/774 (49%), Gaps = 102/774 (13%)

Query: 40  GRFSKLGLQM----SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
           G F  L +Q     S   F +  L +  RV DLVSR+TL+EKV+Q+ + A  +PRLG+P 
Sbjct: 10  GLFFSLAVQAQADKSQLPFWNYKLSFEARVNDLVSRLTLEEKVKQMLNHAPAIPRLGIPA 69

Query: 96  YEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY 155
           Y+WWSE LHGV+     T         T +P  I   A+++      +    + E RA++
Sbjct: 70  YDWWSEVLHGVARTPYHT---------TVYPQAIAMAATWDTVALYTMADQSAREGRAIH 120

Query: 156 NLGR---------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
           N             GLTYW+PNIN+ RDPRWGR  ET GEDPF+       +VRGLQ  +
Sbjct: 121 NKATEEGKNGDRYVGLTYWTPNINIFRDPRWGRGQETYGEDPFLTAMLGRAFVRGLQGED 180

Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
                     + LK ++C KHYA   + +     R+ FD  V++ D+  T+L  F+  V 
Sbjct: 181 ---------PKYLKAAACAKHYA---IHSGPEAVRHSFDVDVSDYDLWNTYLPAFKELVT 228

Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
               + VMC+YN     P C    L+   +R +W   GY+ +DC +I    + HK   ++
Sbjct: 229 HAKVAGVMCAYNAFRKKPCCGSDLLMTDILRRQWGFTGYVTSDCGAIDDFFNYHKTHPNA 288

Query: 327 KEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD-- 383
            E A    +  G D++CG + Y   T +AV+ G++ E +ID+S+K L+ + MRLG FD  
Sbjct: 289 -EAAAIDAVTNGTDVECGNRAYLTLT-DAVKTGRIAEKEIDRSVKRLFMIRMRLGMFDPV 346

Query: 384 GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
               Y       + S  +   A + A+E IVLLKN+ + LPL S  +K +AVVGP+A+ +
Sbjct: 347 SMVSYAQTSPAVLESAPHKAQALKMAQESIVLLKNENHLLPL-SKSIKKIAVVGPNADNS 405

Query: 444 VAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCD--DVACKSNNSIFAA-SEAAKT 496
           +A++GNY G P + ++ + G         +V Y+   +  +       + FAA +   K 
Sbjct: 406 IAVLGNYNGTPSKIVTALDGIKAKLGTNGSVVYEKAVNFTNAMLPEGKTDFAALTSRVKD 465

Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
           ADA I + G+   +E E +          DR  + LP  QT+ +  +    K PV+ V+M
Sbjct: 466 ADAIIFVGGISPQLEGEEMKVNEPGFNSGDRTTILLPTVQTEAMKALKATGK-PVVFVMM 524

Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           +   + I + +   NI AI+ A Y G+  G AIADV+FG +NP GRLP+T+Y  D     
Sbjct: 525 TGSALAIPWEQ--ENIPAIVNAWYGGQAAGTAIADVLFGDYNPSGRLPVTFYKSD----- 577

Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
                 L   D      RTY++++G  LYPFGYGLSYT F+Y  L    T++        
Sbjct: 578 ----ADLPAFDDYRMENRTYRYFSGQALYPFGYGLSYTTFRYEGLKVPTTVK-------- 625

Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
                     +K R P                +   N G+  G +VV +Y     +    
Sbjct: 626 ----------NKVRIP--------------VSIQLTNTGAKGGEEVVQLYISYQGQPIKK 661

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
            +K + GFQRV++  G+ K IKF+     +L I       L P G+  I VG G
Sbjct: 662 PLKALKGFQRVWLNRGQTKTIKFLLTP-DALAIAGENGKLLNPKGKLRISVGGG 714


>gi|291548352|emb|CBL21460.1| Beta-glucosidase-related glycosidases [Ruminococcus sp. SR1/5]
          Length = 697

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 245/743 (32%), Positives = 383/743 (51%), Gaps = 105/743 (14%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           + + LV+RMTL+EK  QL   A  + RLG+P Y WW+E LHGV+  G           AT
Sbjct: 9   KAEALVARMTLEEKASQLRYDAPAIKRLGIPAYNWWNEGLHGVARAGQ----------AT 58

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
            FP  I   A+F+     ++   V+TE RA YN            GLT+WSPN+N+ RDP
Sbjct: 59  VFPQAIGMAAAFDRKSVAEMAGIVATEGRAKYNAYSVNGDRDIYKGLTFWSPNVNIFRDP 118

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDP++     V++V+ LQ           N   +K ++C KH+A   V +
Sbjct: 119 RWGRGHETYGEDPYLTKELGVSFVKALQG----------NGDTMKAAACAKHFA---VHS 165

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
                R+ FDA  + +DMEET+L  FE  VKE    +VM +YNR NG P C  P  L + 
Sbjct: 166 GPEALRHEFDAEASAKDMEETYLPAFEGLVKEAKVEAVMGAYNRTNGEPCCGSP-TLQKK 224

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
           +RGEW   G+ V+DC +I+   ++H  + D+  ++ A  +  G DL+CG  Y +    A 
Sbjct: 225 LRGEWKFQGHFVSDCWAIRDFHEHH-MVTDTAVESAALAINNGCDLNCGNTYLHIM-KAY 282

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
           ++G V E  I ++   L+T    LG FDGS +Y +L   ++ S  +++ A +AA +  VL
Sbjct: 283 EKGLVTEETITRAAVRLFTTRYLLGLFDGS-EYDNLSYMEVESPRHLDAAEKAAEKSFVL 341

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
           LKN+   LPL+  K+KT+ ++GP+A++  A+IGNY G   RY++   G   Y      + 
Sbjct: 342 LKNN-GILPLDKEKLKTIGIIGPNADSRQALIGNYHGTASRYITIQEGIQDYVGDDVRIL 400

Query: 472 YKTGCDDVACKSNNSIFA------ASEAAKTADATIILAGLDLSVEAE---------SLD 516
              GCD    ++ +  F       A   A+ +D  I+  GLD ++E E         S D
Sbjct: 401 TSRGCDLFRDRTEHLAFTRDRIAEAKVVAENSDVVILCMGLDETLEGEEGDTGNSYVSGD 460

Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
           +ED+ LPG Q +L+  +A+  K PV+  +++   +D+ +A    +   +LW  YPG +GG
Sbjct: 461 KEDIELPGVQRELMEAIADTGK-PVVFCLLAGSDLDLKYAAEKFDAVMMLW--YPGCQGG 517

Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLP-LTSMPLRPVDSLGYPGRTYKFYNGPTLY 635
           +A A V+FG+ +P G+LP+T+Y  + ++ LP  T   ++        GRTY++      +
Sbjct: 518 KAAAKVLFGEISPSGKLPVTFY--ESLEELPDFTDYSMK--------GRTYRYMERKAQF 567

Query: 636 PFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYF 695
           PFGYGL+Y++           + V+  +++ C               G  +N        
Sbjct: 568 PFGYGLTYSK-----------VAVDKAEVKTC---------------GQKIN-------- 593

Query: 696 EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACK 755
             +V+ QN G+ D  DVV +Y K      A     + GFQR+F++AG  ++I+      K
Sbjct: 594 -VEVEVQNNGAYDTEDVVQIYVKNIDSKNAIPNPMLAGFQRIFLKAGECRKIEIPIWE-K 651

Query: 756 SLNIVDYAANTLLPAGEHTIFVG 778
           +  +VD     +    +  I+ G
Sbjct: 652 AFTVVDETGKRMEEGKKFEIYAG 674


>gi|160881137|ref|YP_001560105.1| glycoside hydrolase family 3 [Clostridium phytofermentans ISDg]
 gi|160429803|gb|ABX43366.1| glycoside hydrolase family 3 domain protein [Clostridium
           phytofermentans ISDg]
          Length = 717

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 252/752 (33%), Positives = 378/752 (50%), Gaps = 107/752 (14%)

Query: 61  YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
           +  R  +LV +MTL+EKV Q    A  +PRL +  Y +W+EALHGV+  G          
Sbjct: 10  FQQRATELVKKMTLEEKVFQTLHSAPSIPRLDIKAYNYWNEALHGVARAGV--------- 60

Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVA 172
            AT FP  I   A+F+E L ++I   +STE R  +N  +         GLT+WSPN+N+ 
Sbjct: 61  -ATVFPQAIGLAATFDEDLIEEIADTISTEGRGKFNAQQKYGDHDIYKGLTFWSPNVNIF 119

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY- 231
           RDPRWGR  ET GEDPF+ G     +V G+Q   GH+         LK ++C KH+A + 
Sbjct: 120 RDPRWGRGHETFGEDPFLSGTLGGRFVDGIQ---GHDETY------LKAAACAKHFAVHS 170

Query: 232 ---DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
              D+       R+ F+A V+EQD+ ET+L  F+  VKE    +VM +YNR NG P C  
Sbjct: 171 GPEDI-------RHSFNAEVSEQDLRETYLPAFKKLVKEHKVEAVMGAYNRTNGEPCCGS 223

Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
             LL   +RGEW+  G++ +DC +I+   ++H   +++ E +VA  +  G DL+CG  Y 
Sbjct: 224 KTLLEDILRGEWEFVGHVTSDCWAIKDFHEHHMVTSNAVE-SVALAMNRGCDLNCGNLYV 282

Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDICSDENIELAA 406
           N    AV+ G V+E  ID +L  L+T  M+LG FD   S  + ++    + +  + EL  
Sbjct: 283 NLL-QAVRDGLVEEETIDTALIRLFTTRMKLGLFDKEESIPFNTITYDQVDTKSSKELNI 341

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
           +A+++ +VLLKN+ N LPLN  K+ +V V+GP+AN   A++GNY G    Y++ + G   
Sbjct: 342 KASKKCVVLLKNEDNILPLNPKKITSVGVIGPNANNRNALVGNYEGTASEYITVLEGIKQ 401

Query: 467 Y----ANVTYKTGCDDVACK------SNNSIFAASEAAKTADATIILAGLDLSVEAE--- 513
                  V +  GC     K       N+ I       + +D  I   GLD  +E E   
Sbjct: 402 VVPEDVRVYFSEGCHLFKNKLSNLSQENDRIAEVRAVCEHSDVVIACLGLDPGLEGEEGD 461

Query: 514 ------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
                 S D++ L LPG Q  ++  + E  K PVIL+++S   + + +A  + +I AIL 
Sbjct: 462 QGNQFASGDKKTLALPGIQEDVLKTIYECGK-PVILILLSGSALAVPWA--DEHIPAILQ 518

Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
             YPG +GGRAIA+++FG  NP G+LP+T+Y          T+  L          RTY+
Sbjct: 519 GWYPGAQGGRAIAELIFGDGNPEGKLPVTFYR---------TTEELPEFTDYAMKNRTYR 569

Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
           +     LYPFGYGLSYT F++ LL       VN + L    N+                 
Sbjct: 570 YMKNEALYPFGYGLSYTTFEHTLL------YVNTDTLGKGSNV----------------- 606

Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
                   E  V  +N G  +GS     Y K   E A     Q+ G ++V +  G  K I
Sbjct: 607 --------ECMVRVKNTGDYEGSVTTQAYVKYVGEDAPNC--QLKGLKKVSLLPGEEKDI 656

Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
               +  ++  + +     +L  GE+ +++ +
Sbjct: 657 MIELDD-RAFGLYNEEGEFILNQGEYELYLSD 687


>gi|333381510|ref|ZP_08473192.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332830480|gb|EGK03108.1| hypothetical protein HMPREF9455_01358 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 738

 Score =  387 bits (993), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 249/754 (33%), Positives = 382/754 (50%), Gaps = 95/754 (12%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + F D+ L    RV DLVSR+TL+EKV Q+ +    + RL +P Y WW+E LHG+     
Sbjct: 24  YPFRDTKLSTDKRVSDLVSRLTLEEKVLQMLNNTPAIERLNIPAYNWWNECLHGIGR--- 80

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLT 163
            T +       T FP  I   A+++  L K +  A+S E RA+YN   A        GLT
Sbjct: 81  -TEYK-----VTVFPQAIGMAAAWDARLLKDVANAISDEGRAIYNDASAKGNYSIYHGLT 134

Query: 164 YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
           YW+PN+N+ RDPRWGR  ET GEDP++ G    ++V GLQ  +         S+ LK ++
Sbjct: 135 YWTPNVNIFRDPRWGRGQETYGEDPYLTGALGKSFVAGLQGDD---------SQYLKAAA 185

Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
           C KHYA   V +     R+ F+  VT  D+ +T+L  F   V +   + VMC+YN  +G 
Sbjct: 186 CAKHYA---VHSGPENTRHTFNTFVTTFDLWDTYLPAFRDLVVDAKVAGVMCAYNAFSGE 242

Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           P C +  L+ + +R +W   GY+ +DC +I     +HK   D+K  A A  + +G D+DC
Sbjct: 243 PCCGNNLLMQEILRDKWGFTGYVTSDCGAIDDFYRHHKTHPDAKY-AAADAVYSGTDIDC 301

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDEN 401
           G        +AV+ G + E  ID SLK L+ +  RLG FD +   ++  +    + S  +
Sbjct: 302 GNEAYKALVDAVKTGLITEEQIDISLKRLFEIRFRLGMFDPAEDVKFSKIPLSVLESQPH 361

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
            +LA +  RE IVLLKN+ N LPL S K+K VAV+GP+A+  V+++GNY G P + ++P 
Sbjct: 362 KDLALKITRESIVLLKNENNFLPL-SKKLKKVAVIGPNADNEVSVLGNYNGFPTQIITPY 420

Query: 462 AGFSGY---ANVTYKTGCDDVACKSNN--SIFAASEAAKTADATIILAGLDLSVEAESL- 515
                      V Y+ G D V    N+   I A ++  K  D  I   G+   +E E + 
Sbjct: 421 KAIKNKLKNTEVIYEKGIDFVKPSENSKEEIAALAKRLKGMDVVIFAGGISPELEGEEMP 480

Query: 516 ---------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
                    DR  + LP  QT+L+ Q  +  + P + V+M+  G  IA    + N+ AIL
Sbjct: 481 VKIEGFTGGDRTSIKLPKIQTELM-QALKAERIPTVFVMMT--GSAIAAEWESQNVPAIL 537

Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
            A Y G++ G AIADV+FG +NP G+LP+T+Y  D       + +P    +S     RTY
Sbjct: 538 NAWYGGQDAGTAIADVLFGDYNPSGKLPVTFYTKD-------SDLP--AFNSYEMKNRTY 588

Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV 686
           ++++G  LYPFGYGLSYT+F+Y+ +    +I+   N                        
Sbjct: 589 RYFDGQVLYPFGYGLSYTKFEYSPIQMPASIKAGEN------------------------ 624

Query: 687 NDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK--QVIGFQRVFVRAGRN 744
                    E  +  +N G TDG +VV +Y           +    +  F+R+ ++AG +
Sbjct: 625 --------MEVSITVKNTGKTDGEEVVQLYISHDNNGTNRQLPLYALKSFERISLKAGES 676

Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           K + F  +  + + + D      +  G+  +++G
Sbjct: 677 KSVTFKLSP-REMALADEDGVLKMTKGKSKLYIG 709


>gi|365120422|ref|ZP_09338009.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363647477|gb|EHL86692.1| hypothetical protein HMPREF1033_01355 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 735

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 254/758 (33%), Positives = 392/758 (51%), Gaps = 97/758 (12%)

Query: 46  GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
           G   ++F F +  L +  RV DLVSR+TL+EK+ Q+ + A  + RLG+P Y+WW+E LHG
Sbjct: 21  GKAQNTFPFQNPDLSFEKRVDDLVSRLTLEEKISQMLNKAPAIERLGIPAYDWWNECLHG 80

Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA----- 160
           V      T +       T FP  I   A+++++L++++  +++ E RA+Y+   +     
Sbjct: 81  VGR----TPYK-----VTVFPQAIGMAATWDDALFQQVASSIADEGRAIYHDAISKGVHE 131

Query: 161 ---GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSR 217
              GLTYW+PNIN+ RDPRWGR  ET GEDP++ G     +V GLQ           + +
Sbjct: 132 IYHGLTYWTPNINIFRDPRWGRGQETYGEDPYLTGTLGKAFVNGLQGD---------DPK 182

Query: 218 PLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
            LK S+C KHYA   V +   + R+ F+  V+  D+ +T+L  F   V +   SSVMC+Y
Sbjct: 183 YLKASACAKHYA---VHSGPEISRHFFNTEVSMYDLWDTYLPAFRDLVVDAKVSSVMCAY 239

Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
           N + G P C +  L+   +R +W   GY+ +DC +I   +  HK  AD+   +    L  
Sbjct: 240 NALAGQPCCGNDLLMQDILRKQWKFTGYVTSDCGAIDDFL-KHKTHADAAHASADAVLH- 297

Query: 338 GLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQD 395
           G DL+CGQ       +AV+QG + E  ID+S+K L+    RLG FD +   +Y       
Sbjct: 298 GTDLECGQNIYVKLVDAVKQGLITEAQIDESVKRLFMTRFRLGLFDPADRVKYADTPLSV 357

Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
           +  DE+  LA + +RE +VLLKND N LPL    +K +AV+GP+A+ +  ++GNY G P 
Sbjct: 358 LECDEHKALALKMSRESVVLLKND-NVLPLRK-NLKKIAVIGPNADDSTVVLGNYNGFPS 415

Query: 456 RYMSPIAGFSG----YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
           + ++P+            V Y    D V      ++ A  E  K  D  I + G+   +E
Sbjct: 416 KVITPLEAIRSKVGKRTQVIYDRAIDCVKPSDEKTLNALIERLKGVDQVIFVGGISPRLE 475

Query: 512 AESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
            E L          DR  + LP  QT+L+ ++ E A  PVI V+M+   + I +   + N
Sbjct: 476 GEELPISVDGFRGGDRTTIALPEVQTELMKKMKE-AGLPVIFVMMTGSALGIEW--ESQN 532

Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY 621
           I AIL A Y G+  G+AIADV+FG +NP G+LP+T+Y  D       + +P  P  +   
Sbjct: 533 IPAILNAWYGGQFAGQAIADVLFGDYNPSGKLPVTFYRSD-------SDLP--PFGAFSM 583

Query: 622 PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRC 681
             RTY+++ G  LYPFG+GLSYT F Y++                               
Sbjct: 584 ANRTYRYFKGEALYPFGFGLSYTMFDYSV------------------------------- 612

Query: 682 PGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY-SKPPAEIAATYIKQVIGFQRVFVR 740
           P V V+  +  +  +  V  +N+G  +G +VV +Y S    E A   I  + GF+RV+++
Sbjct: 613 PQV-VSGGKVGEPIKVSVKVKNIGKKNGDEVVQLYLSHEGVEKAP--ITALKGFKRVYLK 669

Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           AG  K + F  +  + +++ D      +  G+ TI+ G
Sbjct: 670 AGEEKTLSFEISP-RDMSLPDDNGIITVFPGKKTIYAG 706


>gi|313202830|ref|YP_004041487.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312442146|gb|ADQ78502.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 742

 Score =  385 bits (989), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 262/749 (34%), Positives = 388/749 (51%), Gaps = 92/749 (12%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + F D+S     RVKDLVSR+TLDEK  Q+   A  + RLG+  Y WW+EALHGV+  G 
Sbjct: 38  YPFQDTSKTIDERVKDLVSRLTLDEKAGQMLHNAPAIKRLGILPYSWWNEALHGVARTG- 96

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLT 163
                     AT FP  +   A+F+E L  +IGQA+S EA A YN+ +        +G+T
Sbjct: 97  ---------RATVFPENVGLAATFDEDLVYRIGQAISDEAWAKYNIAQRLENYGQYSGIT 147

Query: 164 YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
           +++PN+N+ RDPRWGR  ET GEDPF+  R  V YV+G+Q  +          + LK ++
Sbjct: 148 FYAPNVNIFRDPRWGRGQETYGEDPFLTSRMGVAYVKGMQGND---------PKYLKTAA 198

Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
           C KHY    V +     R+ +DA    +D  ET++  FE  VKEG   SVMC+YNR  G 
Sbjct: 199 CAKHYV---VHSGPEALRHSYDAEPPMKDFMETYVPAFETLVKEGKVESVMCAYNRTFGK 255

Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           P C    LL+  +R +W   GY+  DC +IQ    +H    DS E A A  +K+G++L+C
Sbjct: 256 PCCGSSFLLHDLLREKWGFTGYVTTDCWAIQNFYLHHGAAKDSLE-ACALAIKSGVNLNC 314

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDE 400
           G  + N+   AV++G V E ++D++L  L     RLG FD SP    Y  + ++ I S +
Sbjct: 315 GNEF-NYLPAAVRKGLVTEKEVDEALSQLLRTRFRLGLFD-SPNENPYAKIKEEVIGSQQ 372

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY--- 457
           NI+LA EAA + +VLL+N  NTLPL    +K++ VVGP+A     ++GNY G+  R    
Sbjct: 373 NIDLAYEAAAKSLVLLQNKNNTLPLKK-DMKSLYVVGPYAANQDILLGNYNGVNSRLTTI 431

Query: 458 MSPIAG-FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATII--LAGLDLSVEAES 514
           M  I G  S   +V Y+ G +  A   N+  ++  EAA       +  ++G+    E ES
Sbjct: 432 MQAIVGKVSAGTSVNYRIGVEPSAPNKNSMNYSIGEAADADAVVAVFGISGVFEGEEGES 491

Query: 515 L------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
                  DR DL LP  Q   + ++ +  K P+ILV+   GG  I   E    + AIL+ 
Sbjct: 492 TASTSRGDRLDLNLPQNQLDYLRELKKKCKKPIILVL--TGGSPICTPELADMVDAILFV 549

Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
            YPG+EGG A+ADV+FG  NP GRL IT+         P +   L   +     GRTY++
Sbjct: 550 WYPGQEGGHAVADVIFGDVNPSGRLCITF---------PKSVSQLPAFEDYSMKGRTYRY 600

Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
                LYPFG+GLSYT + Y+       I+ + +K++  ++++ T+  S           
Sbjct: 601 MTEEPLYPFGFGLSYTNYSYS------NIKTDKDKIKKGQSVHVTATVS----------- 643

Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
                         N G T G +V  +Y       A T +  + G +RV + AG +K + 
Sbjct: 644 --------------NTGKTAGEEVAQLYITDVKASAPTPLYALKGTKRVKLAAGESKEVS 689

Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFV 777
           F     + + +V      ++  G+  +++
Sbjct: 690 FEVTP-QMMELVTVTGEKVIEPGDFKVYI 717


>gi|238578959|ref|XP_002388893.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
 gi|215450599|gb|EEB89823.1| hypothetical protein MPER_12044 [Moniliophthora perniciosa FA553]
          Length = 658

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 258/700 (36%), Positives = 364/700 (52%), Gaps = 81/700 (11%)

Query: 96  YEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY 155
           Y WWSEAL+  S              ATSFP  I   A+F++ L   I   +STEARA  
Sbjct: 1   YNWWSEALNFSS--------------ATSFPAPITMGATFDDGLIHAIATVISTEARAFN 46

Query: 156 NLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLN 215
           N+ R GL +++PNIN  +DPRWGR  ETPGEDPF + +Y    V GLQ   G  N     
Sbjct: 47  NVNRGGLDFFTPNINPFKDPRWGRGQETPGEDPFHISQYVYQLVTGLQGGVGPTN----- 101

Query: 216 SRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMC 275
              LK+++ CKH+AAYD++N  GV R+ FDA+VT QD+ E +   F+ C+++   +S+MC
Sbjct: 102 ---LKIAADCKHWAAYDLEN-LGVSRFEFDAKVTMQDLAEFYSPSFQSCIRDAKVASIMC 157

Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDL--HGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           SYN VNGIPSCA+  LL    R  W L    +I  DC ++  +   H +  D   +  A 
Sbjct: 158 SYNAVNGIPSCANRYLLQTLARDFWGLGEEQWITGDCGAVGNIFARHHY-TDDPANGTAV 216

Query: 334 TLKAGLDLDC---GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVS 390
            L AG D+DC      Y+   G A+ +  V E  +  ++   Y  L+RL +         
Sbjct: 217 ALNAGTDIDCDSGAAAYSQNLGQALNRSLVSEDQLRTAVTRQYNSLVRLSW--------- 267

Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
               D+ ++   +LA +AA EGIVLLKND   LPL S+ VK VAVVGP ANAT  M  NY
Sbjct: 268 ---DDVNTEPAQQLAYQAAVEGIVLLKND-GILPLASS-VKKVAVVGPMANATTQMQSNY 322

Query: 451 AGIPCRYMSPIAGF--SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATII-LAGLD 507
            GI    +SP   F  +G+ NVT+  G       S+ S F+A+ AA      +  + G+D
Sbjct: 323 NGIAPFLVSPQQAFRNAGF-NVTFANGTG--LNSSDTSGFSAAIAAADDADVVFYVGGID 379

Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
            ++E E  DR ++   G Q  L+ Q+A + K P+I++ M  G VD +    NT++ A++W
Sbjct: 380 TTIEREDRDRPEISWTGNQLALVQQLASLGK-PLIVLQMGGGQVDSSSLRDNTSVNALIW 438

Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
            GYPG+ GG A+ D++ GK  P GRLPIT Y   YV   P+T M LRP  S   PGRTYK
Sbjct: 439 GGYPGQSGGTALVDLITGKQAPAGRLPITQYPASYVDGFPMTDMTLRPSSS--NPGRTYK 496

Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
           +Y G  ++ FG+GL YT F     S   +  V        ++L      S  +  GV   
Sbjct: 497 WYTGAPIFEFGFGLHYTTFDAEWASGGDSFSV--------QDL-----VSSAKNSGVAHV 543

Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
           DL   D   F V   N G+     V +++S+  A  +    K+++ + RV       K I
Sbjct: 544 DLGVLD--TFNVTVTNSGTVASDYVALLFSRTTAGPSPAPNKELVSYTRV-------KGI 594

Query: 748 KFVFNACKSLNI-------VDYAANTLLPAGEHTIFVGNG 780
           +   ++  SL +        D   N +L  GE+ + +  G
Sbjct: 595 EPGASSAASLKVTLGAVARTDEQGNRVLYPGEYVLLLDTG 634


>gi|410098444|ref|ZP_11293422.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409222318|gb|EKN15263.1| hypothetical protein HMPREF1076_02600 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 738

 Score =  384 bits (986), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 250/766 (32%), Positives = 380/766 (49%), Gaps = 98/766 (12%)

Query: 45  LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
           L  Q   + F +  LP  +RV+D++SR+TL+EKVQ +   A  VPRLG+P Y WW+EALH
Sbjct: 19  LTAQTYDYPFRNPDLPLDVRVQDIISRLTLEEKVQLMKHAAPAVPRLGIPAYNWWNEALH 78

Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN----LGRA 160
           GV+               T FP  I   A+F+    +K+G   S+E RA++N     G+ 
Sbjct: 79  GVARTK---------EKVTVFPQAIGMAATFDTEALQKMGDMTSSEGRALFNEDLKAGKT 129

Query: 161 -----GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLN 215
                GLTYW+PNIN+ RDPRWGR  ET GEDP++  +     V GL   EG+      N
Sbjct: 130 GEIYRGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAKMGSAIVHGL---EGN------N 180

Query: 216 SRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMC 275
              LK  +C KHYA   V +    +R+ +DARV+  D+ +T+L  F   V +     VMC
Sbjct: 181 PEYLKSVACAKHYA---VHSGPEHNRHSYDARVSMYDLWDTYLPAFRELVTKAKVHGVMC 237

Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTL 335
           +YNR  G P C   +LL   +R +W   GY+ +DC ++      HK  ++  E AVA  +
Sbjct: 238 AYNRFEGTPCCGHNELLQDILRNQWKFDGYVTSDCWAVSDFAKYHKTHSNDTE-AVADAV 296

Query: 336 KAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGK 393
             G DL+CG  Y       V++G + E DI+ SL  L+ +  +LG +D + +  Y S+G+
Sbjct: 297 LNGTDLECGNLYQKLQ-QGVEKGLISEKDINVSLARLFEIQFKLGMYDPADRVPYASIGR 355

Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
           + I  D + + A E A++ +VLLKN++N LPLN++K+K +A++GP+ +    ++ NY G 
Sbjct: 356 EVIECDAHKKHAYEMAQKSMVLLKNNKNILPLNASKIKRIALIGPNMDNGSTLLANYFGT 415

Query: 454 PCRYMSPIAG----FSGYANVTYKTGCDDV-ACKSNNSIFAASEAAKTADATIILAGLDL 508
           P   ++P       F     +   TG   V   +   S    +  AK AD  I + G+  
Sbjct: 416 PSEIITPYKSLQKRFGNSIQIDTLTGVGIVQKLEGAPSFAQVAAQAKKADIIIFVGGISA 475

Query: 509 SVE-------------AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAF 555
             E               S DR  + LP  QT+L+ ++ +  + P+ILV MS  G  ++F
Sbjct: 476 DYEGEAGDAGAAGYGGFASGDRTTMKLPPVQTELMKELKKTGR-PLILVNMS--GSVMSF 532

Query: 556 AETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP 615
              + N  AIL A Y G+  G AI DV+FG +NP GR+P+T Y  D  + LP        
Sbjct: 533 DWESRNADAILQAWYGGQAAGDAITDVLFGDYNPAGRMPLTTYMND--EDLP-------D 583

Query: 616 VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
            +      RTY+++ G   YPFGYGLSYT F Y  L    T+                  
Sbjct: 584 FEDYSMANRTYRYFKGDVRYPFGYGLSYTTFGYAPLQNASTV------------------ 625

Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY-SKPPAEIAATYIKQVIGF 734
                         +  +  +      N G   G +VV +Y S P        ++ + GF
Sbjct: 626 --------------KTGESIQVTTTVTNTGKRAGDEVVQLYISHPQNGNTRVPLRALKGF 671

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           +R+ +  G ++++ F  +  + L++VD   N +   G   +++G G
Sbjct: 672 KRIHLDTGESRQVTFTLSP-EELSLVDEKGNQVEKEGTVELYIGGG 716


>gi|323447708|gb|EGB03620.1| hypothetical protein AURANDRAFT_72703 [Aureococcus anophagefferens]
          Length = 744

 Score =  384 bits (986), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 249/748 (33%), Positives = 363/748 (48%), Gaps = 101/748 (13%)

Query: 45  LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
           L     +  FCD++L   +R  D VSRMT+ EK+  L      +  LGLP Y WWSEA  
Sbjct: 30  LNATFEALPFCDATLAIDLRAADAVSRMTIPEKIDALDTKTGPIASLGLPAYNWWSEASS 89

Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
           GV    P T F        ++P  + T  SFN +LW+  G A+  EARA+ N G A  TY
Sbjct: 90  GVMGSRPTTKF--------AYP--VTTAMSFNRTLWRATGAAIGREARALMNAGAAYSTY 139

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           W+P +N+AR+PRWGR  E PGEDP++ G YA  +V G Q       A   +   L+ S+C
Sbjct: 140 WAPVVNLAREPRWGRNIEVPGEDPYLTGEYATEFVGGFQ-------AAPEDPYHLQASAC 192

Query: 225 CKHYAAYDVDNWKGVD-----RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           CKHY A +++N +  D     R H D+ VT++D+ ++++ PF+ CV++G  SS+MCSYN 
Sbjct: 193 CKHYVANELENTRQPDGEQWDRQHVDSNVTQRDLVDSYMVPFQACVEKGKVSSLMCSYNA 252

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
           VNG+PSCA+  LL    R  W   GYI +DCD+   + D H + A + E+AVA  LKAG 
Sbjct: 253 VNGVPSCANDWLLRTVARDAWHFDGYITSDCDADSNVYDAHHYAA-TPEEAVADVLKAGT 311

Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVS----LGKQD 395
           D+DC  +      +A+ +G + E D+D  L  L+ V +RLG FD S         L + D
Sbjct: 312 DVDCQSFVGQHARSALDKGLITEADMDARLVNLFKVRLRLGHFDLSFDAAKPRGPLDEID 371

Query: 396 ----ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
               +CSD +++ + E   +   LLKND   LPL  +   T AVVGP+A  + A  G Y 
Sbjct: 372 ADAVVCSDAHLDASMEGLAQSATLLKND-GALPLKPSG--TAAVVGPNALLSKADAGYYG 428

Query: 452 GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
                                                        ADA ++  G DL+  
Sbjct: 429 -----------------------------------------PTDAADAVVLAVGTDLTWA 447

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA--FAETNTNIKAILWAG 569
           AE  D   +     Q +LI+ VA  +  PV++V+ SA  +D+    A ++  + A++  G
Sbjct: 448 AEGKDATSIVFTAAQLELIDAVATASATPVVVVVFSATPLDLTPLLARSDGKVGAVVHVG 507

Query: 570 YPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL---------- 619
            P     + + D+++G+ +  GR   T Y   Y   + +    +RP  S           
Sbjct: 508 QPSVTV-KGLGDLLYGRRSFAGRAVQTVYPAAYADQISIFDFNMRPGPSAFARPDCATNE 566

Query: 620 ------GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT 673
                   PGRTY+FY    + PFG+GLSYT F Y + S   T  V+L  L+        
Sbjct: 567 SACPRGTNPGRTYRFYVDEPVVPFGFGLSYTTFAYAVRSAPTT--VDLAPLRAAYAGVAA 624

Query: 674 SDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP-AEIAATYIKQVI 732
           +          L +D     Y    VD  N G  D  DVV+ +  PP A +    +K++ 
Sbjct: 625 ARGDGGPAFLSLHDDAAAATY---AVDVTNTGDIDADDVVLGFVTPPGAGVDGVPLKELF 681

Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIV 760
           GF+RV V+AG  K + +++ A      V
Sbjct: 682 GFERVHVKAGETKTV-YLYPALSKFKTV 708


>gi|350295750|gb|EGZ76727.1| glycoside hydrolase [Neurospora tetrasperma FGSC 2509]
          Length = 839

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 257/664 (38%), Positives = 348/664 (52%), Gaps = 90/664 (13%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD +     R   LV ++T+DEK+  L D A G  R+GLP+Y WWSE LHGV+   PG  
Sbjct: 37  CDVTGTAPERAASLVDQLTIDEKLVNLVDQALGASRIGLPKYAWWSEGLHGVAG-SPGVT 95

Query: 115 FDDV---IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
           F+        ATSF   I   ASF++ L  ++G A+STEARA  N G  GL YW+PN+N 
Sbjct: 96  FNTTGYPFSYATSFANAINLGASFDDDLVYEVGTAISTEARAFANFGFGGLDYWTPNVNP 155

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
            +DPRWGR  ETPGEDP  +  Y    + GL   EG+E          KV + CKHYAAY
Sbjct: 156 YKDPRWGRGAETPGEDPLHIKGYVKAMLAGL---EGNETVR-------KVIATCKHYAAY 205

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV----------- 280
           D++ W G+ RY F+A VT QD+ E +L PF+ C ++    S+MCSYN +           
Sbjct: 206 DLERWHGLTRYEFEAIVTLQDLSEYYLPPFQQCARDSKVGSIMCSYNALTIRDMAGGSKP 265

Query: 281 -------NGIPSCADPKLLNQTVRGEWDL---HGYIVADCDSI-QVMVDNHKFLADSKED 329
                     P+CA+  L+   +R  W+    + YI +DC++I   + DNH F + +  +
Sbjct: 266 DEIINLTTAQPACANTYLMT-ILRDHWNWTEHNNYITSDCNAILDFLPDNHNF-SQTPAE 323

Query: 330 AVAQTLKAGLDLDC---GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--- 383
           A A   KAG D  C   G   T+  G A  Q  + E  ID +L+ LY  L+R G+ D   
Sbjct: 324 AAAAAYKAGTDTVCEVSGSPLTDVVG-AYNQSLLPEAVIDTALRRLYEGLIRAGYLDHGR 382

Query: 384 -----------GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKT 432
                       SP Y +L   D+ +    ELA  +A EGIVLLKN  + LPL+ +  K 
Sbjct: 383 SAVAGGDGGSFSSPAYDALNWNDVNTPSTQELALRSATEGIVLLKNSGSLLPLDFSG-KK 441

Query: 433 VAVVGPHANATVAMIGNYAGIPCRYMSPI-AGFSGYANVTYKTGCDDVACKSNNSIFAAS 491
           VA++G  ANAT  M G Y+GIP  Y +P+ A      +++Y  G    A   +     A 
Sbjct: 442 VALIGHWANATGTMRGPYSGIPPFYHNPLYAAQQLNLSLSYANGPVVNASDPDTWTAPAL 501

Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
            AA+ AD  +   G D +V +E LDRE +  P  Q +L++++A + K PV+ VI     V
Sbjct: 502 AAAEGADVVLYFGGTDTTVASEDLDRESIAWPEAQMKLLSELAGLGK-PVV-VIQLGDQV 559

Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSM 611
           D +    N N+ +ILW GYPG+ GG A+ DV+ GK  P GRLP+T Y   YV  +PLT M
Sbjct: 560 DDSSLLNNGNVSSILWVGYPGQSGGTAVFDVLTGKKAPAGRLPVTQYPEGYVDEVPLTEM 619

Query: 612 PLRPVD-----------------------------SLGYPGRTYKFYNGPTLYPFGYGLS 642
            LRP +                             +L  PGRTYK+Y+ P L PFGYGL 
Sbjct: 620 ALRPFNHSSSNLEEEVSVQGGASLTIQARSTPGNKTLSSPGRTYKWYSTPVL-PFGYGLH 678

Query: 643 YTQF 646
           YT F
Sbjct: 679 YTTF 682


>gi|413919686|gb|AFW59618.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 475

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 199/448 (44%), Positives = 279/448 (62%), Gaps = 17/448 (3%)

Query: 337 AGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGK 393
           AGLDL+CG +    T  AVQ GK+ E+D+D+++      LMRLGFFDG P+   + +LG 
Sbjct: 31  AGLDLNCGTFLAQHTVAAVQAGKLSESDVDRAVTNNLVTLMRLGFFDGDPRELPFGNLGP 90

Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
            D+C+  N ELA EAAR+GIVLLKN    LPL++  +K++AV+GP+ANA+  MIGNY G 
Sbjct: 91  SDVCTPSNQELAREAARQGIVLLKN-TGKLPLSATSIKSMAVIGPNANASFTMIGNYEGT 149

Query: 454 PCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASEAAKTADATIILAGLDLSVEA 512
           PC+Y +P+ G        Y+ GC +V C  N+  + AA++AA +AD T+++ G D S+E 
Sbjct: 150 PCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATKAAASADVTVLVVGADQSIER 209

Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
           ESLDR  L LPG Q QL++ VA  + GP ILV+MS G  DI+FA+++  I AILW GYPG
Sbjct: 210 ESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFDISFAKSSDKIAAILWVGYPG 269

Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
           E GG AIADV+FG  NP GRLP+TWY   + + +P+T M +RP  S GYPGRTY+FY G 
Sbjct: 270 EAGGAAIADVLFGYHNPSGRLPVTWYPESFTK-VPMTDMRMRPDPSTGYPGRTYRFYTGD 328

Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD 692
           T+Y FG GLSYT F ++L+S  K + + L +   C            +CP V      C+
Sbjct: 329 TVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACLT---------EQCPSVEAEGAHCE 379

Query: 693 DY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
              F+  +  +N G   G   V ++S PPA +     K ++GF++V +  G+   + F  
Sbjct: 380 GLAFDVHLRVRNAGERSGGHTVFLFSSPPA-VHNAPAKHLLGFEKVSLEPGQAGVVAFKV 438

Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGN 779
           + CK L++VD   N  +  G HT+ VG+
Sbjct: 439 DVCKDLSVVDELGNRKVALGSHTLHVGD 466


>gi|189201569|ref|XP_001937121.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187984220|gb|EDU49708.1| beta-xylosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 756

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 252/736 (34%), Positives = 385/736 (52%), Gaps = 44/736 (5%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           + S   CD +   + R   LV+ M   EK+  L   + GV RLGLP Y WW EALHGV+ 
Sbjct: 29  LKSNAICDVTASPAKRAAALVAAMQTQEKLDNLVSKSKGVARLGLPAYNWWGEALHGVAG 88

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
             PG +F      ATSFP  +L +A+F++ L  +I   +  EARA  N G A + +W+P+
Sbjct: 89  A-PGINFTGPYRTATSFPMPLLMSAAFDDDLIHQIAIVIGNEARAFGNGGIAPVDFWTPD 147

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           IN  RDPRWGR +ETPGED   +  Y  + + GL+  +             K+ + CKHY
Sbjct: 148 INPFRDPRWGRGSETPGEDILRIKGYTKSLLSGLEGDKAQR----------KIIATCKHY 197

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
             YD+++W G DR+ FDA++T QD+ E F+ PF+ C ++    S MCSYN VNG+P+CAD
Sbjct: 198 VGYDMEDWNGTDRHSFDAKITTQDLAEYFMPPFQQCARDSKVGSFMCSYNAVNGVPTCAD 257

Query: 289 PKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
             +L   +R  W   D + YI +DC++++ +   HK++A  +E A A     G+DL C  
Sbjct: 258 TYVLEDILRKHWNWTDSNNYITSDCEAVKDISLRHKYVATLQE-ATAIAFNNGMDLSCEY 316

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIEL 404
             ++    A  QG +  + ID++L   Y  L+  G+FDG +  Y +LG QDI + E  +L
Sbjct: 317 SGSSDIPGAFSQGLLNVSVIDRALTRQYEGLVHAGYFDGAAATYANLGVQDINTPEAQKL 376

Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM-SPI-A 462
             + A EG+ LLKND +TLPL+      VA+VG  AN +  + G Y+G P  Y+ +P+ A
Sbjct: 377 VLQVAAEGLTLLKND-DTLPLSLKSGSKVAMVGFWANDSSKLSGIYSG-PAPYLHNPVYA 434

Query: 463 GFSGYANVTYKTG-CDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLW 521
           G     ++   TG     +  ++N    A +AAK +D  +   GLD S  AE  DR D+ 
Sbjct: 435 GNKLGLDMAVATGPILQKSGAADNWTTKALDAAKKSDTILYFGGLDPSAAAEGSDRTDIS 494

Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
            P  Q  LI ++A  A G  ++VI     VD         + +++WA +PG++GG A+  
Sbjct: 495 WPSAQIDLITKLA--ALGKPLVVIALGDMVDHMPILNMKGVNSLIWANWPGQDGGTAVMQ 552

Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
           V+ G+    GRLPIT Y   Y Q L +  M LRP  +   PGRTY++YN  ++ PFG+GL
Sbjct: 553 VITGEHAIAGRLPITQYPAKYTQ-LSMLDMNLRPGGN--NPGRTYRWYN-ESVQPFGFGL 608

Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
            YT+F     S   ++ VN+  +      ++               DL C D    +V  
Sbjct: 609 HYTKFAAKFGS-NSSLTVNIQDIMKSCTKDHP--------------DL-C-DVPPIEVAV 651

Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
            N G+     + + + K         +K ++ + R+   +G   +   +     +L+ VD
Sbjct: 652 TNKGNRTSDFIALAFIKGEVGPKPYPLKTLVSYARLRDISGSQTKTASLALTLGTLSRVD 711

Query: 762 YAANTLLPAGEHTIFV 777
            + N +   GE+T+ +
Sbjct: 712 QSGNLVAYPGEYTLLL 727


>gi|373955483|ref|ZP_09615443.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373892083|gb|EHQ27980.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 738

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 257/756 (33%), Positives = 376/756 (49%), Gaps = 99/756 (13%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + F + +L    RV DLV RMTL+EKV Q+ + A  + RLG+P Y WW+E LHGV+    
Sbjct: 31  YPFNNPALSMDERVADLVGRMTLEEKVSQMLNSAPAIERLGVPAYNWWNECLHGVAR--- 87

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG--------RAGLT 163
            T F       T +P  I   A+++++    +G   + E RA+YN            GLT
Sbjct: 88  -TPFK-----VTVYPQAIAMAATWDKTSMHVMGDYTAEEGRAVYNESIKNDKHDIYLGLT 141

Query: 164 YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
           YW+PNIN+ RDPRWGR  ET GEDPF+ G     +V+GLQ  +          R LK + 
Sbjct: 142 YWTPNINIFRDPRWGRGQETYGEDPFLTGEMGSAFVKGLQGDD---------PRYLKAAG 192

Query: 224 CCKHYAAYDVDNWKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVN 281
           C KHYA +      G +  R+ F+  +++ D+ +T+L  F   V +   + VMC+YN   
Sbjct: 193 CAKHYAVHS-----GPEDLRHKFNTDISDYDLWDTYLPAFRKLVVDAKVTGVMCAYNAFK 247

Query: 282 GIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV--DNHKFLADSKEDAVAQTLKAGL 339
           G P C    L+N  +  +W   GY+ +DC  I      + H+   D+ E A A  +  G 
Sbjct: 248 GQPCCGSDLLMNSILHDKWKFTGYVTSDCGGIDDFYRENTHQTQPDA-ESAAADAVLHGT 306

Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDIC 397
           D++CG         AV+ GK+ E  ID+SLK L++V  +LG FD +   +Y  +GK  + 
Sbjct: 307 DVECGNVTYKSLVKAVKDGKLSEKQIDQSLKRLFSVRFKLGMFDPADAVKYNQIGKDALE 366

Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
           +  +   A + A + IVLLKN+ N LPL S  +K +AV+GP+A+  V+++GNY G P R 
Sbjct: 367 APAHGAQALKMAHQSIVLLKNEGNLLPL-SKNLKKIAVLGPNADNAVSVLGNYNGTPSRI 425

Query: 458 MSPIAGFSGY----ANVTYKTGCDDVACKSNNSIFAASEA-AKTADATIILAGLDLSVEA 512
           ++ + G          V Y    D VA  +    +AA  A  K ADA I + G+   +E 
Sbjct: 426 VTALQGIKNKLPAGTEVIYDKAVDYVADSAARYNYAAMAAKVKDADAIIYIGGISPELEG 485

Query: 513 ESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
           E +          DR  + LPG QT+L+  +    K PV+ V+M+  G  IA      N+
Sbjct: 486 EEMPVSKPGFHGGDRSTILLPGVQTELLKALKATGK-PVVFVMMT--GSAIATPWEAENL 542

Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
            AI+ A Y G+  G AIADV+FG +NP GRLP+T+Y  D      L S     +D+    
Sbjct: 543 PAIVNAWYGGQAAGTAIADVLFGDYNPAGRLPVTFYGSDK----DLPSFTDYSMDN---- 594

Query: 623 GRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP 682
            RTY+++ G  LY FGYGLSY++F+Y  L                       DA  T   
Sbjct: 595 -RTYRYFKGKPLYAFGYGLSYSKFEYAPL-----------------------DAPLT--- 627

Query: 683 GVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
                 L+  +     V   N    DG +V  +Y         T I+ + GF+R  ++AG
Sbjct: 628 ------LKAGEALTVHVKVTNKSKMDGEEVTELYLSHIGIKQKTAIRALKGFERTLIKAG 681

Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
             K I F  ++   L+I D   N +  +G+  I VG
Sbjct: 682 ETKDITFKLSSA-DLSITDLNGNLVKASGKIAISVG 716


>gi|319641744|ref|ZP_07996426.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
 gi|317386631|gb|EFV67528.1| beta-glucosidase [Bacteroides sp. 3_1_40A]
          Length = 702

 Score =  382 bits (982), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 244/751 (32%), Positives = 376/751 (50%), Gaps = 104/751 (13%)

Query: 63  IRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGA 122
           +RVKDLV+R+TL+EKV  +   +  +PRLG+P Y+WW+EALHGV+           +   
Sbjct: 1   MRVKDLVARLTLEEKVLLMQHHSPAIPRLGIPAYDWWNEALHGVART---------LEKV 51

Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYN----LGRAG-----LTYWSPNINVAR 173
           T FP  I   A+F+    +K+G   STE RA++N     G+ G     LTYW+PNIN+ R
Sbjct: 52  TVFPQAIGMAATFDTEALQKMGDITSTEGRALFNEDWKAGKTGTRYRGLTYWTPNINIFR 111

Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
           DPRWGR  ET GEDP++  +     VRGL+  + H          LK  +C KHYA +  
Sbjct: 112 DPRWGRGQETYGEDPYLTAKMGAAIVRGLEGEDPHY---------LKSVACAKHYAVHSG 162

Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
             +   +R+ FDAR +  D+ +T++  F   V +     VMC+YNR+NG P C +  LL 
Sbjct: 163 PEY---NRHSFDARPSVFDLWDTYMPAFRELVTKAKVHGVMCAYNRLNGQPCCGNDPLLV 219

Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN 353
             +R +W   GY+ +DC +++   + HK   +    A++  L AG DL+CG  Y +    
Sbjct: 220 DILRNQWHFDGYVTSDCWALKDFAEFHKTHPEHT-IAMSDALLAGTDLECGNLY-HLLAE 277

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAARE 411
            V++G   E DI+ SL  L+T+L ++G FD + +  Y S+G++ +  + + + A   A+E
Sbjct: 278 GVKKGLHSERDINVSLSRLFTILFKIGMFDPAERVPYSSIGREVLECEAHKQHAERMAKE 337

Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR----YMSPIAGFSGY 467
            IVLL+N  + LPL+++K+K++A++GP+A+     + NY G P      YMS        
Sbjct: 338 SIVLLENKNHILPLDASKIKSIALIGPNADNGQTQLANYFGTPSEIVTPYMSLKRRLGDK 397

Query: 468 ANVTYKTGCDDV-ACKSNNSIFAASEAAKTADATIILAGLDLSVE-------------AE 513
             + Y  G   V   K   S    +  A  +D  + ++G+    E               
Sbjct: 398 IKINYLPGVGIVDKLKDAPSFVQVAHKAAQSDVIVFVSGISADYEGEAGDAGAAGYGGFA 457

Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
           S DR  + LP  Q +L+ ++ +  + P+I+V MS  G  ++F   + N  A+L A Y G+
Sbjct: 458 SGDRTTMQLPLVQIELLKKLKKTGR-PLIIVNMS--GSVMSFEWESQNADALLQAWYGGQ 514

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
             G AI DV+FG  NP GR+P+T Y  D         +P  P ++    GRTY+++ G  
Sbjct: 515 AAGDAIVDVLFGHCNPAGRMPLTTYKSD-------NDLP--PFENYSMLGRTYRYFKGEP 565

Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
            YPFGYGLSYT F Y                            S  +C    V++    D
Sbjct: 566 RYPFGYGLSYTTFAY----------------------------SDVQC----VDETHTGD 593

Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPP----AEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
                V   N G  DG +VV +Y   P     +I    +K   GF+R+ ++ G +  + F
Sbjct: 594 TARVTVTVSNTGDCDGDEVVQLYVVHPQDGRKQIPLCALK---GFKRIHLKRGESTSVSF 650

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
                + L + +   N +   G+ T+FVG G
Sbjct: 651 TLTP-EELALTETDGNLVEKNGQVTLFVGGG 680


>gi|336408348|ref|ZP_08588841.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
 gi|423248801|ref|ZP_17229817.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
           CL03T00C08]
 gi|423253750|ref|ZP_17234681.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
           CL03T12C07]
 gi|335937826|gb|EGM99722.1| hypothetical protein HMPREF1018_00856 [Bacteroides sp. 2_1_56FAA]
 gi|392655379|gb|EIY49022.1| hypothetical protein HMPREF1067_01325 [Bacteroides fragilis
           CL03T12C07]
 gi|392657742|gb|EIY51373.1| hypothetical protein HMPREF1066_00827 [Bacteroides fragilis
           CL03T00C08]
          Length = 722

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 255/737 (34%), Positives = 389/737 (52%), Gaps = 90/737 (12%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           +  D S P ++RV+ L+ +MTL EKV QL   +  +PRL LP Y +W+E LHGV+  G  
Sbjct: 50  IIGDLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           T F   I  A+++ TV++          K++  A+STEAR  Y     GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAA 230
           RDPRWGR  ET GEDP +  R  V +V+GLQ              P  LK  +  KH+ A
Sbjct: 160 RDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ-----------GDHPTYLKTVATIKHFVA 208

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
            + +N    +R+   +++  + + E +   +E CVKE +A SVM +YN  NG+P      
Sbjct: 209 NNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHW 264

Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
           LL+  +R EW   G++V+DC +I VM   H+ + +S E+A A  + +G DL+CG  Y   
Sbjct: 265 LLDDVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEA 408
              AV+QG + E  ID++L  + T   +LG FD      Y    K+ +   +  ELA EA
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEA 383

Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----F 464
           A + +VLLKND   LPLN  K+K+VAVVGP A+     +G Y+G P   +S + G     
Sbjct: 384 AVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKGVKELI 440

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
                VTY  G       S +SI   ++  K AD  ++  G D  +  E+ D   ++LP 
Sbjct: 441 GKKGKVTYLNGMGT----SADSI---AQVVKGADIVLVALGSDEKMARENHDMPSIYLPE 493

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q +L+ ++ +V   P I+++   G   +     +T+I AI+ A YPG+E GRA+A+++F
Sbjct: 494 EQEKLLKEIYQV--NPRIVLVFHTGN-PLTSEWADTHIPAIMQAWYPGQEAGRALANLLF 550

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G  NP G+LP+T Y  +  + LP        +D   + GRTY++  G  LY FG+GLSYT
Sbjct: 551 GNENPSGKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYGFGHGLSYT 602

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
            F+++ +    T+Q               SDA            L+C       V+  N 
Sbjct: 603 SFEFDNIQGNDTLQ---------------SDAI-----------LQCS------VELSNS 630

Query: 705 GSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
           G   G +VV VY         TY +K+++ F++V + +G  K++ F   A + L++ +  
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689

Query: 764 ANTLLPAGEHTIFVGNG 780
              +L +G++T+F+G+G
Sbjct: 690 KWRML-SGKYTLFIGSG 705


>gi|280977785|gb|ACZ98610.1| glucosidase [Cellulosilyticum ruminicola]
          Length = 711

 Score =  381 bits (979), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 248/742 (33%), Positives = 378/742 (50%), Gaps = 87/742 (11%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
             K+LV +M L EK  QL   A  + RLG+P Y WW+EALHGV+  G           AT
Sbjct: 7   EAKELVRQMDLLEKASQLRYDAPAIKRLGIPTYNWWNEALHGVARAGV----------AT 56

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYWSPNINVARDP 175
            FP  I   A F+E    +I   ++ E RA YN            G+T+W+PNIN+ RDP
Sbjct: 57  VFPQAIGLAAMFDEEKLGEIADIIAIEGRAKYNQFSQKEDRDIYKGMTFWAPNINIFRDP 116

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDP++  R  V +++GLQ         D N   LK ++C KH+A   V +
Sbjct: 117 RWGRGHETYGEDPYLTARLGVAFIKGLQG--------DENEDYLKAAACAKHFA---VHS 165

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
               DR+HFDA V+++D+ ET+L  FE  VKE +   VM +YNRVNG P+C    LL   
Sbjct: 166 GPEEDRHHFDAIVSKKDLYETYLPAFEAAVKEANVIGVMGAYNRVNGEPACGSKTLLVDI 225

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
           ++ +W   GYIV+DC +I+     H     + E A A  +  G +L+CG  Y +    A 
Sbjct: 226 LKKDWGFDGYIVSDCWAIRDFHTEHMVTHTAAESA-ALAINNGCELNCGNTYLHML-EAH 283

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
           Q+G VKE  I ++ + L  + M+LG FD + +Y  +         + E+A EA+R  +V+
Sbjct: 284 QEGLVKEEIITEAAEKLMRIRMQLGLFDKNCKYNEIPYAVNDCKVHREVALEASRRSMVM 343

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
           LKND   LPLN  K+K++ ++GP AN    + GNY G   RY + + G   Y      V 
Sbjct: 344 LKND-GILPLNKDKLKSIGIIGPTANNRTVLEGNYNGTASRYTTFVEGIQDYVGDDVRVY 402

Query: 472 YKTGCDDVACKSNNSIFA---ASEA---AKTADATIILAGLDLSVEAES---------LD 516
           Y  GC   A   +N  +     +EA   A+ +D  ++  GLD ++E E           D
Sbjct: 403 YSEGCHLFANGMSNLAWENDREAEALIVAEQSDVVVLCLGLDSTIEGEQGDTGNAFAGGD 462

Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
           +  L L G Q QL+ +V  V K PVILV+ +   + I +A+ + N  AI    YPG +GG
Sbjct: 463 KLSLNLIGRQQQLLEKVVAVGK-PVILVLSTGSAMAINYADEHCN--AIFQTWYPGAQGG 519

Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
           +A+A ++FG+++P G+LP+T+Y          T+  L   +      RTY++     LYP
Sbjct: 520 KALAQLLFGEYSPSGKLPVTFYK---------TTEELPAFEDYSMKDRTYRYMPNEALYP 570

Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           FGYGLSY   K       ++++V L+  +     N+++  +K                ++
Sbjct: 571 FGYGLSYADIK------VQSVKV-LDGAKGEEITNFSAGQTK----------------YK 607

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
            KV+ +N  + D  DVV +Y K      A     +  F+ VF++AG +K +       K+
Sbjct: 608 VKVELENKSNVDSYDVVQIYIKDMESQYAVPNFSLCSFKSVFLKAGESKEVTLNVGE-KA 666

Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
             +++     ++ + +  +F+G
Sbjct: 667 FTVINEEGKRIVDSKKFKLFIG 688


>gi|424661938|ref|ZP_18098975.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
           616]
 gi|404578249|gb|EKA82984.1| hypothetical protein HMPREF1205_02324 [Bacteroides fragilis HMW
           616]
          Length = 722

 Score =  380 bits (977), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 250/731 (34%), Positives = 384/731 (52%), Gaps = 78/731 (10%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           +  D S P ++RVK L+ +MTL EK  QL   +  +PRL LP Y +W+E LHGV+  G  
Sbjct: 50  IIGDLSQPIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           T F   I  A+++ TV++          K++  A+STEAR  Y     GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           RDPRWGR  ET GEDP++  R  V +V+GLQ   G   A       LK  +  KH+ A +
Sbjct: 160 RDPRWGRNEETYGEDPYLTSRLGVAFVKGLQ---GDHPAY------LKTVATIKHFVANN 210

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
            +N    +R+   +++  + + E +   +E CVKE D  SVM +YN  NG+P      LL
Sbjct: 211 EEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEADVQSVMTAYNAFNGVPPSGSRWLL 266

Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
            + +R EW   G++V+DC +I VM   H+ + +S E+A A  + +G DL+CG  Y     
Sbjct: 267 GEVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEKLV 325

Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAAR 410
            AV+QG + E  ID++L  + T   +LG FD      Y    K+ +   +  ELA EAA 
Sbjct: 326 QAVKQGLISEATIDQALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEAAV 385

Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANV 470
           + +VLLKN+ N LPL+  K K+VAVVGP A+     +G Y+G P   ++ + G       
Sbjct: 386 KSVVLLKNE-NLLPLSKEKTKSVAVVGPFADHN--YLGGYSGQPPYSITLLKGVKDLMGK 442

Query: 471 TYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLI 530
             K    +    S +SI A   A K  D  ++  G D  +  E+ D   ++LP  Q +L+
Sbjct: 443 RGKVNYLNGIGASRDSIVA---AVKGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLL 499

Query: 531 NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
             + +V   P I+++  +G   +     +T+I AI+ A YPG+E GRA+A+++FG  NP 
Sbjct: 500 KAIYQV--NPRIVLVFHSGN-PLTSEWADTHIPAIMQAWYPGQEAGRALANLLFGNENPS 556

Query: 591 GRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNL 650
           G+LP+T Y  +  + LP        +D   + GRTY++  G  LY FG+GLSYT F+++ 
Sbjct: 557 GKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYSFGHGLSYTSFEFD- 607

Query: 651 LSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGS 710
                 IQ N                          + L+ D   +  V+  N G   G 
Sbjct: 608 -----NIQGN--------------------------DTLQPDAILQCSVELSNSGQLAGE 636

Query: 711 DVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
           +VV VY         TY +K+++ F++V + +G  K++ F   A + L++ +     +L 
Sbjct: 637 EVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDGKWRML- 694

Query: 770 AGEHTIFVGNG 780
           +G++T+F+G+G
Sbjct: 695 SGKYTLFIGSG 705


>gi|375357164|ref|YP_005109936.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
 gi|301161845|emb|CBW21389.1| putative glycosyl hydrolase [Bacteroides fragilis 638R]
          Length = 722

 Score =  380 bits (976), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 253/737 (34%), Positives = 387/737 (52%), Gaps = 90/737 (12%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           +  D S P ++RV+ L+ +MTL EKV QL   +  +PRL LP Y +W+E LHGV+  G  
Sbjct: 50  IIGDLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           T F   I  A+++ TV++          K++  A+STEAR  Y     GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAA 230
           RDPRWGR  ET GEDP +  R  V +V+GLQ              P  LK  +  KH+ A
Sbjct: 160 RDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ-----------GDHPTYLKTVATIKHFVA 208

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
            + +N    +R+   +++  + + E +   +E CVKE +A SVM +YN  NG+P      
Sbjct: 209 NNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHW 264

Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
           LL+  +R EW   G++V+DC +I VM   H+ + +S E+A A  + +G DL+CG  Y   
Sbjct: 265 LLDDVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEA 408
              AV+QG + E  ID++L  + T   +LG FD      Y    K+ +   +  ELA EA
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEA 383

Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----F 464
           A + +VLLKND   LPLN  K+K+VAVVGP A+     +G Y+G P   +S + G     
Sbjct: 384 AVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKGVKELI 440

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
                VTY  G       S +SI   ++  K AD  ++  G D  +  E+ D   ++LP 
Sbjct: 441 GKKGKVTYLNGMGT----SADSI---AQVVKGADIVLVALGSDEKMARENHDMPSIYLPE 493

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q +L+ ++ +V   P I+++   G   +     +T+I AI+ A YPG+E GRA+A+++F
Sbjct: 494 EQEKLLKKIYQV--NPRIVLVFHTGN-PLTSEWADTHIPAIMQAWYPGQEAGRALANLLF 550

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G  NP G+LP+T Y  +  + LP        +D   + GRTY++  G  LY FG+GLSYT
Sbjct: 551 GNENPSGKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYGFGHGLSYT 602

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
            F+++       IQ N                          + L+ D   +  V+  N 
Sbjct: 603 SFEFD------NIQGN--------------------------DTLQPDAILQCSVELSNS 630

Query: 705 GSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
           G   G +VV VY         TY +K+++ F++V + +G  K++ F   A + L++ +  
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689

Query: 764 ANTLLPAGEHTIFVGNG 780
              +L +G++T+F+G+G
Sbjct: 690 KWRML-SGKYTLFIGSG 705


>gi|383117083|ref|ZP_09937830.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
 gi|251947612|gb|EES87894.1| hypothetical protein BSHG_0813 [Bacteroides sp. 3_2_5]
          Length = 722

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 253/737 (34%), Positives = 387/737 (52%), Gaps = 90/737 (12%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           +  D S P ++RV+ L+ +MTL EKV QL   +  +PRL LP Y +W+E LHGV+  G  
Sbjct: 50  IIGDLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           T F   I  A+++ TV++          K++  A+STEAR  Y     GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAA 230
           RDPRWGR  ET GEDP +  R  V +V+GLQ              P  LK  +  KH+ A
Sbjct: 160 RDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ-----------GDHPTYLKTVATIKHFVA 208

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
            + +N    +R+   +++  + + E +   +E CVKE +A SVM +YN  NG+P      
Sbjct: 209 NNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHW 264

Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
           LL+  +R EW   G++V+DC +I VM   H+ + +S E+A A  + +G DL+CG  Y   
Sbjct: 265 LLDDVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEA 408
              AV+QG + E  ID++L  + T   +LG FD      Y    K+ +   +  ELA EA
Sbjct: 324 LVQAVEQGLISEVAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEA 383

Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----F 464
           A + +VLLKND   LPLN  K+K+VAVVGP A+     +G Y+G P   +S + G     
Sbjct: 384 AVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKGVKELI 440

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
                VTY  G       S +SI   ++  K AD  ++  G D  +  E+ D   ++LP 
Sbjct: 441 GKKGKVTYLNGMGT----SADSI---AQVVKGADIVLVALGSDEKMARENHDMPSIYLPE 493

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q +L+ ++ +V   P I+++   G   +     +T+I AI+ A YPG+E GRA+A+++F
Sbjct: 494 EQEKLLKKIYQV--NPRIVLVFHTGN-PLTSEWADTHIPAIMQAWYPGQEAGRALANLLF 550

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G  NP G+LP+T Y  +  + LP        +D   + GRTY++  G  LY FG+GLSYT
Sbjct: 551 GNENPSGKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYGFGHGLSYT 602

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
            F+++       IQ N                          + L+ D   +  V+  N 
Sbjct: 603 SFEFD------NIQGN--------------------------DTLQPDAILQCSVELSNS 630

Query: 705 GSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
           G   G +VV VY         TY +K+++ F++V + +G  K++ F   A + L++ +  
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689

Query: 764 ANTLLPAGEHTIFVGNG 780
              +L +G++T+F+G+G
Sbjct: 690 KWRML-SGKYTLFIGSG 705


>gi|423258868|ref|ZP_17239791.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
           CL07T00C01]
 gi|423264161|ref|ZP_17243164.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
           CL07T12C05]
 gi|387776448|gb|EIK38548.1| hypothetical protein HMPREF1055_02068 [Bacteroides fragilis
           CL07T00C01]
 gi|392706427|gb|EIY99550.1| hypothetical protein HMPREF1056_00851 [Bacteroides fragilis
           CL07T12C05]
          Length = 722

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 253/737 (34%), Positives = 387/737 (52%), Gaps = 90/737 (12%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           +  D S P ++RV+ L+ +MTL EKV QL   +  +PRL LP Y +W+E LHGV+  G  
Sbjct: 50  IIGDLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           T F   I  A+++ TV++          K++  A+STEAR  Y     GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAA 230
           RDPRWGR  ET GEDP +  R  V +V+GLQ              P  LK  +  KH+ A
Sbjct: 160 RDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ-----------GDHPTYLKTVATIKHFVA 208

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
            + +N    +R+   +++  + + E +   +E CVKE +A SVM +YN  NG+P      
Sbjct: 209 NNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHW 264

Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
           LL+  +R EW   G++V+DC +I VM   H+ + +S E+A A  + +G DL+CG  Y   
Sbjct: 265 LLDDVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEA 408
              AV+QG + E  ID++L  + T   +LG FD      Y    K+ +   +  ELA EA
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEA 383

Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----F 464
           A + +VLLKND   LPLN  K+K+VAVVGP A+     +G Y+G P   +S + G     
Sbjct: 384 AVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKGVKELI 440

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
                VTY  G       S +SI   ++  K AD  ++  G D  +  E+ D   ++LP 
Sbjct: 441 GKKGKVTYLNGMGT----SADSI---AQVVKGADIVLVALGSDEKMARENHDMPSIYLPE 493

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q +L+ ++ +V   P I+++   G   +     +T+I AI+ A YPG+E GRA+A+++F
Sbjct: 494 EQEKLLKEIYQV--NPRIVLVFHTGN-PLTSEWADTHIPAIMQAWYPGQEAGRALANLLF 550

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G  NP G+LP+T Y  +  + LP        +D   + GRTY++  G  LY FG+GLSYT
Sbjct: 551 GNENPSGKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYGFGHGLSYT 602

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
            F+++       IQ N                          + L+ D   +  V+  N 
Sbjct: 603 SFEFD------NIQGN--------------------------DTLQPDAILQCSVELSNS 630

Query: 705 GSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
           G   G +VV VY         TY +K+++ F++V + +G  K++ F   A + L++ +  
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689

Query: 764 ANTLLPAGEHTIFVGNG 780
              +L +G++T+F+G+G
Sbjct: 690 KWRML-SGKYTLFIGSG 705


>gi|451851086|gb|EMD64387.1| glycoside hydrolase family 3 protein [Cochliobolus sativus ND90Pr]
          Length = 763

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 247/736 (33%), Positives = 384/736 (52%), Gaps = 44/736 (5%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +S+   CD + P   R   LV+ M   EK+  L   + GV RLGLP Y WW EALHGV+ 
Sbjct: 31  LSTNAICDVNAPPHERAAALVAAMEPQEKLDNLVSKSKGVSRLGLPAYNWWGEALHGVAG 90

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
             PG  F +    ATSFP  IL +A+F++ L  KI   +  EARA  N G A + YW+P+
Sbjct: 91  A-PGIKFVEPYKNATSFPMPILMSAAFDDDLIFKIANIIGNEARAFGNGGVAPVDYWTPD 149

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           IN  RD RWGR +E+PGED   +  Y    + GL+  +             K+ + CKHY
Sbjct: 150 INPVRDIRWGRASESPGEDIRRIKGYTKALLAGLEGDQAQR----------KIIATCKHY 199

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
             YD++ W G DR++F A++T QD+ E ++ PF+ C ++    S MCSYN VNGIP+CAD
Sbjct: 200 VGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNGIPTCAD 259

Query: 289 PKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
             +L   +R  W   D + YI +DC+++  + +NHK++ ++     A     G+DL C  
Sbjct: 260 TYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYV-ETLAQGTALAFAKGMDLSCEY 318

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIEL 404
             ++    A  QG +  + IDK+L   Y  L+  G+FDG+   Y +L  +DI + E  +L
Sbjct: 319 TGSSDIPGAWAQGLLNISVIDKALTRQYEGLVHAGYFDGAKATYANLSYKDINTPEARQL 378

Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI-AG 463
           + +   EG+V+LKND +TLPL   K   VA++G  AN +  + G Y+G P    SP+ AG
Sbjct: 379 SLQVTSEGLVMLKND-HTLPLPLTKGSKVAMIGFWANDSSKLQGIYSGPPPYRHSPVFAG 437

Query: 464 FSGYANVTYKTG-CDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
                ++    G     +   +N    A +AA+ +D  +   G D +V  E  DR  +  
Sbjct: 438 EQMGLDMAIAWGPMIQNSSVPDNWTTNALDAAEKSDYILYFGGQDWTVAQEGYDRTTISF 497

Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGV-DIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
           P  Q  L+ ++A++ K    LV+++ G + D +   +   + +I+WA +PG++GG AI +
Sbjct: 498 PQVQIDLLTKLAKLGKP---LVVITLGDMTDHSPLLSMEGVNSIIWANWPGQDGGPAILN 554

Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
           VV G   P GRLPIT Y  DYV+ L +  M LRP      PGRTY+++N  ++ PFG+GL
Sbjct: 555 VVSGAHAPAGRLPITEYPADYVK-LSMLDMNLRPHTE--SPGRTYRWFN-ESVQPFGFGL 610

Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
            YT F+ +  S  + +  ++ ++     L+  +   K  C           +    +V  
Sbjct: 611 HYTTFEASFAS-EEGLTYDIEEI-----LDGCTQQYKDLC-----------EVAPLEVTV 653

Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
            N G+     V + + K         +K +I + R+    G  K+   +      L  VD
Sbjct: 654 ANKGNRTSDFVALAFIKGEVGPKPYPLKTLITYGRLRDIHGGAKKSASLPLTLGELARVD 713

Query: 762 YAANTLLPAGEHTIFV 777
            + NT++  GE+T+ +
Sbjct: 714 QSGNTVIYPGEYTLLL 729


>gi|265765457|ref|ZP_06093732.1| beta-xylosidase [Bacteroides sp. 2_1_16]
 gi|263254841|gb|EEZ26275.1| beta-xylosidase [Bacteroides sp. 2_1_16]
          Length = 722

 Score =  380 bits (975), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 253/737 (34%), Positives = 387/737 (52%), Gaps = 90/737 (12%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           +  D S P ++RV+ L+ +MTL EKV QL   +  +PRL LP Y +W+E LHGV+  G  
Sbjct: 50  IIGDLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           T F   I  A+++ TV++          K++  A+STEAR  Y     GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAA 230
           RDPRWGR  ET GEDP +  R  V +V+GLQ              P  LK  +  KH+ A
Sbjct: 160 RDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ-----------GDHPTYLKTVATIKHFVA 208

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
            + +N    +R+   +++  + + E +   +E CVKE +A SVM +YN  NG+P      
Sbjct: 209 NNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHW 264

Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
           LL+  +R EW   G++V+DC +I VM   H+ + +S E+A A  + +G DL+CG  Y   
Sbjct: 265 LLDDVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEA 408
              AV+QG + E  ID++L  + T   +LG FD      Y    K+ +   +  ELA EA
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEA 383

Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----F 464
           A + +VLLKND   LPLN  K+K+VAVVGP A+     +G Y+G P   +S + G     
Sbjct: 384 AVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKGVKELI 440

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
                VTY  G       S +SI   ++  K AD  ++  G D  +  E+ D   ++LP 
Sbjct: 441 GKKGKVTYLNGMGT----SADSI---AQVVKGADIVLVALGSDEKMARENHDMPSIYLPE 493

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q +L+ ++ +V   P I+++   G   +     +T+I AI+ A YPG+E GRA+A+++F
Sbjct: 494 EQEKLLKKIYQV--NPRIVLVFHTGN-PLTSEWADTHIPAIMQAWYPGQEAGRALANLLF 550

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G  NP G+LP+T Y  +  + LP        +D   + GRTY++  G  LY FG+GLSYT
Sbjct: 551 GNENPSGKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYGFGHGLSYT 602

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
            F+++       IQ N                          + L+ D   +  V+  N 
Sbjct: 603 SFEFD------NIQGN--------------------------DTLQPDAILQCSVELSNS 630

Query: 705 GSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
           G   G +VV VY         TY +K+++ F++V + +G  K++ F   A + L++ +  
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689

Query: 764 ANTLLPAGEHTIFVGNG 780
              +L +G++T+F+G+G
Sbjct: 690 KWRML-SGKYTLFIGSG 705


>gi|423281966|ref|ZP_17260851.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
           615]
 gi|404582453|gb|EKA87147.1| hypothetical protein HMPREF1204_00389 [Bacteroides fragilis HMW
           615]
          Length = 722

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 253/737 (34%), Positives = 386/737 (52%), Gaps = 90/737 (12%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           +  D S P ++RV+ L+ +MTL EKV QL   +  +PRL LP Y +W+E LHGV+  G  
Sbjct: 50  IIGDLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           T F   I  A+++ TV++          K++  A+STEAR  Y     GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAA 230
           RDPRWGR  ET GEDP +  R  V +V+GLQ              P  LK  +  KH+ A
Sbjct: 160 RDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ-----------GDHPTYLKTVATIKHFVA 208

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
            + +N    +R+   +++  + + E +   +E CVKE +A SVM +YN  NG+P      
Sbjct: 209 NNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHW 264

Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
           LL+  +R EW   G++V+DC +I VM   H+ + +S E+A A  + +G DL+CG  Y   
Sbjct: 265 LLDDVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEA 408
              AV+QG + E  ID++L  + T   +LG FD      Y    K+ +   +  ELA EA
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEA 383

Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----F 464
           A + +VLLKND   LPLN  K+K+VAVVGP A+     +G Y+G P   +S + G     
Sbjct: 384 AVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKGVKELI 440

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
                VTY  G       S +SI   ++  K AD  ++  G D  +  E+ D   ++LP 
Sbjct: 441 GKKGKVTYLNGMGT----SADSI---AQVVKGADIVLVALGSDEKMARENHDMPSIYLPE 493

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q +L+ ++ +V   P I ++   G   +     +T+I AI+ A YPG+E GRA+A+++F
Sbjct: 494 EQEKLLKEIYQV--NPRIALVFHTGN-PLTSEWADTHIPAIMQAWYPGQEAGRALANLLF 550

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G  NP G+LP+T Y  +  + LP        +D   + GRTY++  G  LY FG+GLSYT
Sbjct: 551 GNENPSGKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYGFGHGLSYT 602

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
            F+++       IQ N                          + L+ D   +  V+  N 
Sbjct: 603 SFEFD------NIQGN--------------------------DTLQPDAILQCSVELSNS 630

Query: 705 GSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
           G   G +VV VY         TY +K+++ F++V + +G  K++ F   A + L++ +  
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689

Query: 764 ANTLLPAGEHTIFVGNG 780
              +L +G++T+F+G+G
Sbjct: 690 KWRML-SGKYTLFIGSG 705


>gi|451996250|gb|EMD88717.1| glycoside hydrolase family 3 protein [Cochliobolus heterostrophus
           C5]
          Length = 763

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 249/739 (33%), Positives = 379/739 (51%), Gaps = 50/739 (6%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +S+   CD + P   R   LV+ M   EK+  L   + GV RLGLP Y WW EALHGV+ 
Sbjct: 31  LSTNAICDVNAPPHERAAALVAAMEPQEKLDNLVSKSKGVSRLGLPAYNWWGEALHGVAG 90

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
             PG  F +    ATSFP  IL +A+F++ L  KI   +  EARA  N G A + YW+P+
Sbjct: 91  A-PGIKFVEPYKNATSFPMPILMSAAFDDDLIFKIANIIGNEARAFGNGGVAPMDYWTPD 149

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
           IN  RD RWGR +E+PGED   +  Y    + GL+  +             K+ + CKHY
Sbjct: 150 INPVRDIRWGRASESPGEDIRRIKGYTKALLAGLEGDQAQR----------KIIATCKHY 199

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
             YD++ W G DR++F A++T QD+ E ++ PF+ C ++    S MCSYN VNG+P+CAD
Sbjct: 200 VGYDMEAWGGYDRHNFSAKITMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNGVPTCAD 259

Query: 289 PKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
             +L   +R  W   D + YI +DC+++  + +NHK++ ++     A     G+DL C  
Sbjct: 260 TYVLQTILRDHWNWTDSNNYITSDCEAVADISENHKYV-ETLAQGTALAFAKGMDLSCEY 318

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIEL 404
             ++    A  QG +  + IDK+L   Y  L+  G+FDG+   Y +L   DI + E  +L
Sbjct: 319 SGSSDIPGAWSQGLLNLSVIDKALTRQYEGLVHAGYFDGAKATYANLSYNDINTPEARQL 378

Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI-AG 463
           + +   EG+V+LKND +TLPL   K   VA++G  AN +  + G Y+G P    SP+ AG
Sbjct: 379 SLQVTSEGLVMLKND-HTLPLPLTKGSKVAMIGFWANDSSKLQGIYSGPPPYRHSPVFAG 437

Query: 464 FSGYANVTYKTG-CDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
                ++    G     +   +N    A +AA+ +D  +   G D +V  E  DR  +  
Sbjct: 438 EQMGLDMAIAWGPMIQNSSVPDNWTTNALDAAEKSDYILYFGGQDWTVAQEGYDRTTISF 497

Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGV-DIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
           P  Q  L+ ++A++ K    LV+++ G + D +   +   I +I+WA +PG++GG AI +
Sbjct: 498 PQVQIDLLAKLAKLGKP---LVVITLGDMTDHSPLLSMEGINSIIWANWPGQDGGPAILN 554

Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
           V+ G   P GRLPIT Y  DYV+ L +  M LRP      PGRTY+++N  ++ PFG+GL
Sbjct: 555 VISGVHAPAGRLPITEYPADYVK-LSMLDMNLRP--HAESPGRTYRWFN-ESVQPFGFGL 610

Query: 642 SYTQFKYNLLS---FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
            YT F+    S    T  IQ           L+  +   K  C           +    +
Sbjct: 611 HYTTFEAGFASEEGLTYDIQ---------ETLDSCTQQYKDLC-----------EVAPLE 650

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
           V   N G+     V + + K         +K +I + R+    G  K+   +      L 
Sbjct: 651 VTVANKGNRTSDFVALAFIKGEVGPKPYPLKTLITYGRLRDIHGGAKKSASLPLTLGELA 710

Query: 759 IVDYAANTLLPAGEHTIFV 777
            VD + NT++  GE+T+ +
Sbjct: 711 RVDQSGNTVIYPGEYTLLL 729


>gi|365135698|ref|ZP_09343911.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
           4_3_54A2FAA]
 gi|363612160|gb|EHL63713.1| hypothetical protein HMPREF1032_03710 [Subdoligranulum sp.
           4_3_54A2FAA]
          Length = 643

 Score =  379 bits (974), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 231/610 (37%), Positives = 336/610 (55%), Gaps = 61/610 (10%)

Query: 61  YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
           ++ R + LV++MTL+EKV Q+   A  + RLG+P Y WW+E LHGV   G          
Sbjct: 4   FAQRARALVAQMTLEEKVSQMRYDAPAIERLGIPAYNWWNECLHGVGRSGT--------- 54

Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVA 172
            AT FP  I   ASF+ESL + + QA+S EARA YN  +         GLT+WSPNIN+ 
Sbjct: 55  -ATVFPQPIGMAASFDESLLEHVAQAISDEARAKYNQYKTFGETGIYQGLTFWSPNINLF 113

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           RDPRWGR  ET GEDP + GR    ++RGLQ+ E        +S+  K+ +  KH+AA+ 
Sbjct: 114 RDPRWGRGHETYGEDPLLTGRMGTAFIRGLQEGE--------DSQYRKLDATVKHFAAHS 165

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
                   R+ F+A V+ +DM +++L  F  C++    ++VM +YNR+NG P+CA    L
Sbjct: 166 GPE---AGRHSFNAEVSAEDMADSYLWAFRYCIEHAKPAAVMGAYNRINGEPACASSTYL 222

Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
              +  EW   GY+V+DC +IQ + +NH    + KE A A  +  G  L+CG+ Y ++  
Sbjct: 223 KGVLYEEWKFDGYVVSDCGAIQDINENHHVTKNEKESA-ALAVNNGCQLNCGKAY-HWVK 280

Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREG 412
            AV+ G + E  +  +++ L+    RLG FD    Y S+    I   ++ EL  + A+E 
Sbjct: 281 AAVEDGLISEDTVTCAVERLFEARFRLGMFDSDCVYDSIPMNVIECRKHRELNRKMAQES 340

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA--NV 470
           IVLLKN+   LPLN    KT+AV+GP+A+    ++GNY G P  + + + G    A   V
Sbjct: 341 IVLLKNN-GILPLNPE--KTIAVIGPNADDKTVLLGNYNGTPSHWTTLLRGIQDQARGEV 397

Query: 471 TYKTGCDDVACK----SNNSIFAASEAAKTADATIILAGLDLSVE---------AESLDR 517
            Y  G   V  +    +   +  A   AK AD  ++  GL   +E         A+S DR
Sbjct: 398 YYARGSVLVEKEALPWAEKPLHEAIYTAKAADVVVLCLGLSPLLEGEEGDAYNGADSGDR 457

Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
           +D+ LP  Q QL+  + +  K PV+LV +S G VD+   + +    AIL   YPG EGG 
Sbjct: 458 KDISLPDIQQQLLCAILDTEK-PVVLVNVSGGCVDL--RQADERCAAILQCFYPGAEGGN 514

Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
           A+AD++FG+ +P GRLP+T+Y    V+ LP       P       GRTY+F++G  LYPF
Sbjct: 515 ALADILFGRVSPSGRLPVTFYRT--VEDLP-------PFTDYSMKGRTYRFFDGKPLYPF 565

Query: 638 GYGLSYTQFK 647
           G+GL+Y   K
Sbjct: 566 GHGLTYADIK 575


>gi|373954937|ref|ZP_09614897.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373891537|gb|EHQ27434.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 723

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 256/756 (33%), Positives = 389/756 (51%), Gaps = 101/756 (13%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D   P  +RV+DL+S++TL+EKV Q+ D +  VPRL LP+Y WW+EALHGV+  G   
Sbjct: 24  YLDPFNPTDVRVRDLISKLTLEEKVHQMMDVSPSVPRLNLPKYNWWNEALHGVARSGV-- 81

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYW 165
                   AT FP  I   A+F++ L K+   A+S EARAMYN            GLT+W
Sbjct: 82  --------ATIFPQAIALGATFDQDLAKRESTAISDEARAMYNAAMVNGYNEKYGGLTFW 133

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ-DVEGHENATDLNSRPLKVSSC 224
           +PNIN+ RDPRWGR  ET GEDPF+  +  V +++GLQ D   H          LKV++C
Sbjct: 134 TPNINIFRDPRWGRGQETYGEDPFLTSQIGVAFIQGLQGDDPEH----------LKVAAC 183

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
            KH+A   V +     R+ F+A  + +D+ ET+L  F+  V      +VMC+YNR N   
Sbjct: 184 AKHFA---VHSGPERLRHSFNAIASPKDLRETYLPAFKALVN-ARVEAVMCAYNRTNSEV 239

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
            C    LL+Q +R EW   G++V+DC +I      HK +    E AVA  +K G+DL+CG
Sbjct: 240 CCGSNLLLDQILRDEWHFTGHVVSDCGAIVDFYMGHKVVPGQPE-AVALAVKHGVDLNCG 298

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVSLGKQDICSDEN 401
             Y      AV++G + E +IDK+L  L     +LG FD    SP Y ++    I S ++
Sbjct: 299 DEYPALI-EAVKRGLITEKEIDKALATLLKTRFKLGLFDPKQNSP-YNNIPVSVINSTDH 356

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
             LA E A + IVLLKN++  LPL +  +    + GP+A +  A++GNY G+     + +
Sbjct: 357 RALAKEVALKSIVLLKNEK-CLPLKN-NLSKYYITGPNAASVDALMGNYYGVNPHMSTIL 414

Query: 462 AGFSGY----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-- 515
            G +G     + + YK G   +   +NN I   +  AK +D T ++ G+   +E E    
Sbjct: 415 EGIAGAIQPGSQMQYKPGIL-LDRDNNNPIDWTTGDAKASDVTFVVMGITGLLEGEEGEA 473

Query: 516 -------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
                  DR D  LP  Q   + ++ +  K  V+ +I   GG  +  +E +    A+L A
Sbjct: 474 IASPNYGDRLDYNLPKNQIDFLRKIRKGNKNKVVAII--TGGSPMNLSEVHELADAVLLA 531

Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
            YPGEEGG A+AD++FGK +P GRLP+T+         P +   L P +     GRTY++
Sbjct: 532 WYPGEEGGNAVADILFGKVSPSGRLPVTF---------PKSFAQLPPYEDYSMKGRTYRY 582

Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
                +Y FGYGLSY+ + Y+ L+        L++ Q  +N+   ++   T         
Sbjct: 583 MTAEPMYTFGYGLSYSTYTYSSLT--------LSEKQIKKNMTIIAETMVT--------- 625

Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVY-SKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
                         N G  +G +VV +Y + P  E    Y   + GF+RV ++AG ++++
Sbjct: 626 --------------NTGKMEGEEVVQLYITVPQTEKNPQY--SLKGFKRVNLKAGESRKV 669

Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
           +F       +  VD   + +L +G + + +G    S
Sbjct: 670 QFQITP-DLMKSVDANGSEVLLSGSYVVRIGGASPS 704


>gi|53712125|ref|YP_098117.1| beta-xylosidase [Bacteroides fragilis YCH46]
 gi|52214990|dbj|BAD47583.1| beta-xylosidase [Bacteroides fragilis YCH46]
          Length = 722

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 253/737 (34%), Positives = 387/737 (52%), Gaps = 90/737 (12%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           +  D S P ++RV+ L+ +MTL EKV QL   +  +PRL LP Y +W+E LHGV+  G  
Sbjct: 50  IIGDLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           T F   I  A+++ TV++          K++  A+STEAR  Y     GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAA 230
           RDPRWGR  ET GEDP +  R  V +V+GLQ              P  LK  +  KH+ A
Sbjct: 160 RDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ-----------GDHPTYLKTVATIKHFVA 208

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
            + +N    +R+   +++  + + E +   +E CVKE +A SVM +YN  NG+P      
Sbjct: 209 NNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHW 264

Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
           LL+  +R EW   G++V+DC +I VM   H+ + +S E+A A  + +G DL+CG  Y   
Sbjct: 265 LLDDVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEA 408
              AV+QG + E  ID++L  + T   +LG FD      Y    K+ +   +  ELA EA
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEA 383

Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----F 464
           A + +VLLKND   LPLN  K+K+VAVVGP A+     +G Y+G P   +S + G     
Sbjct: 384 AVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKGVKELI 440

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
                VTY  G       S +SI   ++  K AD  ++  G D  +  E+ D   ++LP 
Sbjct: 441 GKKGKVTYLNGMGT----SADSI---AQVVKGADIVLVALGSDEKMARENHDMPSIYLPE 493

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q +L+ ++ +V   P I+++   G   +     +T+I AI+ A YPG+E GRA+A+++F
Sbjct: 494 GQEKLLKEIYQV--NPRIVLVFHTGN-PLTSEWADTHIPAIMQAWYPGQEAGRALANLLF 550

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G  NP G+LP+T Y  +  + LP        +D   + GRTY++  G  LY FG+GLSYT
Sbjct: 551 GNENPSGKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYGFGHGLSYT 602

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
            F+++       IQ N                          + L+ D   +  V+  N 
Sbjct: 603 SFEFD------NIQGN--------------------------DTLQPDAILQCSVELSNS 630

Query: 705 GSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
           G   G +VV VY         TY +K+++ F++V + +G  K++ F   A + L++ +  
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689

Query: 764 ANTLLPAGEHTIFVGNG 780
              +L +G++T+F+G+G
Sbjct: 690 KWRML-SGKYTLFIGSG 705


>gi|451821678|ref|YP_007457879.1| periplasmic beta-glucosidase BglX [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
 gi|451787657|gb|AGF58625.1| periplasmic beta-glucosidase BglX [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
          Length = 710

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 245/744 (32%), Positives = 383/744 (51%), Gaps = 101/744 (13%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           + K+LVS+MTL E+ +QL   A  +  L + +Y WW+E LHGV+  G           AT
Sbjct: 15  KAKELVSKMTLQERAEQLTYKAPAIKHLNISRYNWWNEGLHGVARAGT----------AT 64

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
            FP  I   A F++ L +KI   ++TE RA YN            GLT+WSPN+N+ RDP
Sbjct: 65  VFPQAIGLAAIFDDELLEKIAGIIATEGRAKYNENSKKEDKDIYKGLTFWSPNVNIFRDP 124

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDP++  R  V +V+GLQ  E +          LK+++C KH+A +    
Sbjct: 125 RWGRGHETYGEDPYLTSRLGVAFVKGLQGDEKY----------LKIAACAKHFAVHS--G 172

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
            +G+ R+ F+A V+++D+ ET+L  FE CVKE D  +VM +YNR N  P C    LL   
Sbjct: 173 PEGL-RHEFNAVVSKKDLYETYLPAFEACVKEADVEAVMGAYNRTNDEPCCGSSLLLKDI 231

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
           +RG+W   G++V+DC +I      H   + + E A A  +K G DL+CG  Y      A 
Sbjct: 232 LRGKWQFKGHVVSDCWAIADFHLYHGVTSTATESA-ALAIKNGCDLNCGNVYLQML-LAY 289

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
           ++G V E DI ++ + L    +RLG FD   ++  +        E+ E++  A+R+ IV+
Sbjct: 290 KEGLVTEEDITRAAERLMATRIRLGMFDEECEFNKIPYTMNDCKEHHEVSLMASRKSIVM 349

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF-----SGYANV 470
           L+N+   LPL+ +K+K++ ++GP+A++ + + GNY G   +Y++ + G      S    +
Sbjct: 350 LRNN-GLLPLDKSKLKSIGIIGPNADSELMLKGNYFGTASKYITVLEGIHEAVDSENIRI 408

Query: 471 TYKTGC-------DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------S 514
            Y  GC        D+A + ++ +  A   A+ +D  I+  GLD S+E E         +
Sbjct: 409 FYSEGCHLYKDRVQDLA-EPDDRMAEAVTVAEHSDVVILCLGLDSSIEGEQGDAGNSDGA 467

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
            D+ +L LPG Q +L+ +V  +A G  ++V++ AG   +       N  AIL A YPG  
Sbjct: 468 GDKLNLNLPGKQQELLEKV--IATGKPVIVVLGAGSA-LTLQGQEENCAAILNAWYPGSF 524

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
           GGRAIAD++FGK +P G+LP+T+Y          T+  L          RTY++    +L
Sbjct: 525 GGRAIADLIFGKCSPSGKLPVTFYK---------TTEELPEFTDYSMKNRTYRYMKNESL 575

Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
           YPFG+GL+Y++ + + LS                     SD SK    GV          
Sbjct: 576 YPFGFGLTYSKVQLSDLS--------------------VSDISKD-FEGV---------- 604

Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
            E  +   NVG+ D  +V+  Y K      A     +  F+RV +  G +K +K   N  
Sbjct: 605 -EVSIKISNVGNFDIEEVLQCYIKDLESKYAVDNHSLSAFKRVALNKGESKVVKMTINK- 662

Query: 755 KSLNIVDYAANTLLPAGEHTIFVG 778
           ++  +V+   + +L + +  +FVG
Sbjct: 663 RAFEVVNDEGDRILDSKKFKLFVG 686


>gi|169611757|ref|XP_001799296.1| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
 gi|160702362|gb|EAT83185.2| hypothetical protein SNOG_08993 [Phaeosphaeria nodorum SN15]
          Length = 755

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 252/737 (34%), Positives = 384/737 (52%), Gaps = 57/737 (7%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
            CD +   + R   LV  M  +EK+  L     GV RLGLP+Y WW EALHGV+   PG 
Sbjct: 33  ICDVTAAPAERAAALVEAMQTNEKLDNL---MRGVTRLGLPKYNWWGEALHGVAGA-PGI 88

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVAR 173
           +F      ATSFP  +L +A+F++ L  KI   +  EARA  N G A + +W+P+IN  R
Sbjct: 89  NFTGAYKTATSFPMPLLMSAAFDDDLIFKIANIIGNEARAFGNGGVAPVDFWTPDINPFR 148

Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
           DPRWGR +ETPGED   +  Y  + + GL+  +             K+ + CKHY  YD+
Sbjct: 149 DPRWGRGSETPGEDIVRIKGYTKHLLAGLEGDKPQR----------KIIATCKHYVGYDM 198

Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
           + W G+DR+ F+A++  QD+ E ++ PF+ C ++    S MCSYN VNG+P+CAD  +L 
Sbjct: 199 EAWGGIDRHSFNAKINMQDLAEYYMPPFQQCARDSKVGSFMCSYNAVNGVPTCADTYVLQ 258

Query: 294 QTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
             +R  W+    + YI +DC++++ +   HK+ A +  +       AG+D  C    ++ 
Sbjct: 259 TILRDHWNWTESNNYITSDCEAVKDISLKHKY-AKTNAEGTGLAFTAGMDNSCEYTGSSD 317

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIELAAEAA 409
              A  Q  +    ID++LK  Y  L+R G+FDG +  Y +LG +DI + E  +L+ + A
Sbjct: 318 IPGAFNQSYLSIPTIDRALKRQYEGLVRAGYFDGAAATYANLGVKDINTPEAQQLSLQVA 377

Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM-SPI-AGFSGY 467
            EG+VLLKND +TLPL+      VA++G  AN T  + G Y+G P  Y+ SP+ AG    
Sbjct: 378 SEGLVLLKND-DTLPLSLTNGSKVAMLGFWANDTSKLSGIYSG-PAPYLRSPVWAGQKLG 435

Query: 468 ANVTYKTGCDDVACKSNNS-----IFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
            ++   +G   +  +SN+S        A  AA+ +D  +   GLD S  AE  DR  +  
Sbjct: 436 LDMAIASG--PILQQSNSSTRDNWTTNALAAAEKSDYILYFGGLDPSAAAEGFDRNSIAW 493

Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
           P  Q  LI ++A + K  V+LV+     +D +       + +++WA +PG++GG A+  V
Sbjct: 494 PTAQVDLIKKLAAIGKPLVVLVLGDL--MDNSPLLELDGVNSVIWANWPGQDGGSAVMQV 551

Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
           V G     GRLPIT Y  +Y + L +  M +RP  S   PGRTY+++NG  + PFG GL 
Sbjct: 552 VTGAVAVAGRLPITQYPANYTE-LSMLDMNMRPSSS--SPGRTYRWFNG-AVQPFGTGLH 607

Query: 643 YTQFKYNLLSFTKTIQVNLNKL-QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
           YT F     +   TI+ +++ + + C N  Y    S    P                V  
Sbjct: 608 YTTFDAKFAA-NSTIEYDISNITKECTN-QYPDTCSVPSIP----------------VAV 649

Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF-VRAGRNKRIKFVFNACKSLNIV 760
            N G+     + + + K     A   +K +I + RV  V+ G+ K  +       +L  V
Sbjct: 650 TNSGNRTSDFIALAFIKGENGPAPYPLKTLISYTRVRDVKGGQTKSAEMQL-TLGNLARV 708

Query: 761 DYAANTLLPAGEHTIFV 777
           D   NT+L  GE+T+ +
Sbjct: 709 DQMGNTVLYPGEYTVLL 725


>gi|320161274|ref|YP_004174498.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
 gi|319995127|dbj|BAJ63898.1| beta-D-xylosidase [Anaerolinea thermophila UNI-1]
          Length = 712

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 245/753 (32%), Positives = 377/753 (50%), Gaps = 99/753 (13%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ +   P   RV DL+SRMTL+EK+ Q+ +    +PRLG+P Y++WSEALHGV+  G  
Sbjct: 7   LYLNPDAPLEERVNDLISRMTLEEKISQMCNSCAAIPRLGIPAYDYWSEALHGVARNGK- 65

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-----LGRA----GLT 163
                    AT FP  I   A+++  L +++  A+++EARA ++      G+     GLT
Sbjct: 66  ---------ATVFPQAIGMAATWDTELIERVADAIASEARAKFHETLRKFGKTDIYQGLT 116

Query: 164 YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
            WSPNIN+ RDPRWGR  ET GEDP++ G     +VRGLQ  + H          LK ++
Sbjct: 117 MWSPNINIFRDPRWGRGQETWGEDPYLTGEMGAAFVRGLQGKDPHY---------LKTAA 167

Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
           C KHY    V +    +R+ F+A VT +++ +T+L  F+  V E    +VM +YNR  G 
Sbjct: 168 CAKHYT---VHSGPEKERHTFNAIVTRRELFDTYLPAFKKLVTEAKVEAVMGAYNRTLGE 224

Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD- 342
           P C  P LL + +R +W   G++V+DC +I     +H+   D  E A A  +K G D+  
Sbjct: 225 PCCGSPYLLKEILRNQWGFKGHVVSDCGAINDFHLHHQVTKDGAESA-ALGIKNGCDMAC 283

Query: 343 -CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDIC 397
            C   Y N T  A+ +G + E DID +L+       +LG FD  PQ    Y  +    + 
Sbjct: 284 ICTYSYENLT-EALNRGLITEEDIDHALRNTLRTRFKLGLFD--PQEKVPYAHISMSVVG 340

Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
            + + +LA E A +  VLLKN  + LP+    VK++ +VGP+A     ++GNY G+    
Sbjct: 341 CEAHRKLAYETAVKSAVLLKNHNHILPVKP-DVKSILIVGPNAGNVHVLLGNYYGLSDSM 399

Query: 458 MSPIAGFSGY----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE 513
            + + G  G       + +  G      K   + ++ + AA + D  I   GL   +E E
Sbjct: 400 TTFMEGLVGRLPEGVRMEFMPGSLLTDSKKIKNDWSVASAA-SFDLVIAFMGLSPLLEGE 458

Query: 514 --------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
                   + DRED+ LP  Q + I  +A  A G  I+++++ GG  IA       ++AI
Sbjct: 459 EGEAILSDNGDREDIALPKAQQEYIRDLA--ATGAKIVLVLT-GGSAIALNGIEDLVEAI 515

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           LW GYPG+EGGRAIAD++FG  +P G+LPIT+         P+++  L P        RT
Sbjct: 516 LWVGYPGQEGGRAIADLIFGDHSPSGKLPITF---------PVSTDQLPPFREYSMKERT 566

Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVL 685
           Y++     L+PFG+GLSYTQF+Y  L     +                            
Sbjct: 567 YRYMTSSPLFPFGFGLSYTQFEYKNLQLEHPV---------------------------- 598

Query: 686 VNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNK 745
              L   +      +  NVG  +G +VV VY           ++++I FQRV ++ G   
Sbjct: 599 ---LSAGEALRGTFELANVGEYEGEEVVQVYLSDLEASTIVPLQKLISFQRVRLKPGETV 655

Query: 746 RIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           ++ F     +++ ++D   N +L  G+  + +G
Sbjct: 656 QLSFAIQP-EAMMMIDDEGNQVLEPGKFKLTIG 687


>gi|295134875|ref|YP_003585551.1| beta-glucosidase [Zunongwangia profunda SM-A87]
 gi|294982890|gb|ADF53355.1| beta-glucosidase [Zunongwangia profunda SM-A87]
          Length = 735

 Score =  378 bits (971), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 251/758 (33%), Positives = 387/758 (51%), Gaps = 99/758 (13%)

Query: 47  LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV 106
           +  S F F D+ L    R+ DL+SR+TL+EK QQ+ + +  + RLG+P Y+WW+EALHG+
Sbjct: 27  IDKSEFDFYDTDLSMDERIDDLISRLTLEEKAQQMLNASPAIERLGIPAYDWWNEALHGL 86

Query: 107 SNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LG 158
              G           AT FP  I   A+F++ L  K+  A+S EARA +N          
Sbjct: 87  GRSGV----------ATVFPQAIGMGATFDDDLILKVSTAISDEARANFNNAVKHGYHRK 136

Query: 159 RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
             GLT+W+PN+N+ RDPRWGR  ET GEDP++  +    +V+GLQ           N + 
Sbjct: 137 YGGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTSKLGEAFVKGLQGD---------NDKY 187

Query: 219 LKVSSCCKHYAAYDVDNWKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
           LK ++  KHYA +      G +  R+ F+A V+E+D+ ET+L  F+  V + +  ++MC+
Sbjct: 188 LKTAAAAKHYAVH-----SGPEKLRHEFNADVSEKDLWETYLPAFKTLV-DANVETIMCA 241

Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
           YN  NG P CA+ +L+N  +R +W  +G++V+DC ++Q  V  H  + +S E A A  ++
Sbjct: 242 YNSTNGEPCCANNRLINDILRDKWGFNGHVVSDCWALQDFVSGHD-IVESPEAAAALAVE 300

Query: 337 AGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQ 394
            G++L+CG  Y NF   AV+ G V E  +DK L  L     +LG FD   S  Y  +G +
Sbjct: 301 VGIELNCGDTY-NFLAKAVEDGLVSEELVDKRLHKLLETRFKLGLFDPEESNPYNKIGVE 359

Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
            + SDE+  LA E AR+ IVLLKND   LPL +   K   + GP+A     ++GNY G+ 
Sbjct: 360 VMNSDEHRALARETARKSIVLLKND-GVLPLKNNLSKYF-ITGPNATNIEVLLGNYHGVN 417

Query: 455 CRYMSPIAGFSG----YANVTYKTGCDDVACKSNNSIFAASEAAKTADATII---LAGLD 507
              ++ + G +      + + Y+ G   +   + N    AS  A  +DAT +   ++GL 
Sbjct: 418 PDMVTVLEGIAKAIKPESQLQYRMGT-RLNLPNENPQDWASPNAGNSDATFVVMGISGLL 476

Query: 508 LSVEAESL------DREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAETNT 560
              E ES+      DR D  LP  Q   + +V+E A+  PV+ ++   GG  +   E + 
Sbjct: 477 EGEEGESIASPTFGDRMDYNLPQNQIDYLQKVSEAAEDRPVVAIV--TGGSPMNLTEVHK 534

Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
              A+L   YPGEEGG A+AD++FGK +P GRLPIT+         P+T   L   +   
Sbjct: 535 LADAVLLVWYPGEEGGNAVADIIFGKNSPSGRLPITF---------PMTIEDLPAYEDYT 585

Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
             GRTYK+ +   +YPFGYGLSYT F+Y+ +  +K        +                
Sbjct: 586 MEGRTYKYMDVVPMYPFGYGLSYTDFEYSEIKLSKDKIKKKESV---------------- 629

Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
                          E ++   N G  +  +VV VY K     +     +++ F+ + ++
Sbjct: 630 ---------------EARISVTNTGDFEADEVVQVYLKDVKASSRVPNFELVAFKNIHLK 674

Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
            G +K + F     + L+ +D      L  G   I++G
Sbjct: 675 RGESKELTFEITP-EMLSFIDDNGKEKLEKGAFEIYIG 711


>gi|60680313|ref|YP_210457.1| glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
 gi|60491747|emb|CAH06504.1| putative glycosyl hydrolase [Bacteroides fragilis NCTC 9343]
          Length = 722

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 252/737 (34%), Positives = 386/737 (52%), Gaps = 90/737 (12%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           +  D S P ++RV+ L+ +MTL EKV QL   +  +PRL LP Y +W+E LHGV+  G  
Sbjct: 50  IIGDLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           T F   I  A+++ TV++          K++  A+STEAR  Y     GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAA 230
           RDPRWGR  ET GEDP +  R  V +V+GLQ              P  LK  +  KH+ A
Sbjct: 160 RDPRWGRNEETYGEDPHLTSRLGVAFVKGLQ-----------GDHPTYLKTVATIKHFVA 208

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
            + +N    +R+   +++  + + E +   +E CVKE +A SVM +YN  NG+P      
Sbjct: 209 NNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHW 264

Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
           LL+  +R EW   G++V+DC +I VM   H+ + +S E+A A  + +G DL+CG  Y   
Sbjct: 265 LLDDVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEA 408
              AV+QG + E  ID++L  + T   +LG FD      Y    K+ +   +  ELA EA
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEA 383

Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----F 464
           A + +VLLKND   LPLN  K+K+VAVVGP A+     +G Y+G P   +S + G     
Sbjct: 384 AVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKGVKELI 440

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
                VTY  G       S +SI   ++  K AD  ++  G D  +  E+ D   ++LP 
Sbjct: 441 GKKGKVTYLNGMGT----SADSI---AQVVKGADIVLVALGSDEKMARENHDMPSIYLPE 493

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q + + ++ +V   P I+++   G   +     +T+I AI+ A YPG+E GRA+A+++F
Sbjct: 494 EQEKFLKKIYQV--NPRIVLVFHTGN-PLTSEWADTHILAIMQAWYPGQEAGRALANLLF 550

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G  NP G+LP+T Y  +  + LP        +D   + GRTY++  G  LY FG+GLSYT
Sbjct: 551 GNENPSGKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYGFGHGLSYT 602

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
            F+++       IQ N                          + L+ D   +  V+  N 
Sbjct: 603 SFEFD------NIQGN--------------------------DTLQPDAILQCSVELSNS 630

Query: 705 GSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
           G   G +VV VY         TY +K+++ F++V + +G  K++ F   A + L++ +  
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689

Query: 764 ANTLLPAGEHTIFVGNG 780
              +L +G++T+F+G+G
Sbjct: 690 KWRML-SGKYTLFIGSG 705


>gi|423269271|ref|ZP_17248243.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
           CL05T00C42]
 gi|423273165|ref|ZP_17252112.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
           CL05T12C13]
 gi|392701693|gb|EIY94850.1| hypothetical protein HMPREF1079_01325 [Bacteroides fragilis
           CL05T00C42]
 gi|392708197|gb|EIZ01305.1| hypothetical protein HMPREF1080_00765 [Bacteroides fragilis
           CL05T12C13]
          Length = 722

 Score =  377 bits (969), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 252/737 (34%), Positives = 387/737 (52%), Gaps = 90/737 (12%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           +  D S P ++RV+ L+ +MTL EKV QL   +  +PRL LP Y +W+E LHGV+  G  
Sbjct: 50  IIGDLSQPVAVRVRTLIQQMTLAEKVAQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           T F   I  A+++ TV++          K++  A+STEAR  Y     GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAA 230
           RDPRWGR  ET GE+P +  R  V +V+GLQ              P  LK  +  KH+ A
Sbjct: 160 RDPRWGRNEETYGEEPHLTSRLGVAFVKGLQ-----------GDHPTYLKTVATIKHFVA 208

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
            + +N    +R+   +++  + + E +   +E CVKE +A SVM +YN  NG+P      
Sbjct: 209 NNEEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEANAQSVMTAYNAFNGVPPSGSHW 264

Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
           LL+  +R EW   G++V+DC +I VM   H+ + +S E+A A  + +G DL+CG  Y   
Sbjct: 265 LLDDVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEK 323

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEA 408
              AV+QG + E  ID++L  + T   +LG FD      Y    K+ +   +  ELA EA
Sbjct: 324 LVQAVEQGLISEAAIDRALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEA 383

Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----F 464
           A + +VLLKND   LPLN  K+K+VAVVGP A+     +G Y+G P   +S + G     
Sbjct: 384 AVKSVVLLKNDA-LLPLNKEKIKSVAVVGPFADYN--YLGGYSGQPPYSVSLLKGVKELI 440

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
                VTY  G       S +SI   ++  K AD  ++  G D  +  E+ D   ++LP 
Sbjct: 441 GKKGKVTYLNGMGT----SADSI---AQVVKGADIVLVALGSDEKMARENHDMPSIYLPE 493

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q +L+ ++ +V   P I+++   G   +     +T+I AI+ A YPG+E GRA+A+++F
Sbjct: 494 GQEKLLKEIYQV--NPRIVLVFHTGN-PLTSEWADTHIPAIMQAWYPGQEAGRALANLLF 550

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G  NP G+LP+T Y  +  + LP        +D   + GRTY++  G  LY FG+GLSYT
Sbjct: 551 GNENPSGKLPMTIYKTE--EQLPDI------LDFDMWKGRTYRYMKGEPLYGFGHGLSYT 602

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
            F+++       IQ N                          + L+ D   +  V+  N 
Sbjct: 603 SFEFD------NIQGN--------------------------DTLQPDAILQCSVELSNS 630

Query: 705 GSTDGSDVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
           G   G +VV VY         TY +K+++ F++V + +G  K++ F   A + L++ +  
Sbjct: 631 GQLAGEEVVQVYVSRENTPVYTYPLKKLVAFKKVKLASGEKKKVDFTI-APRELSVWEDG 689

Query: 764 ANTLLPAGEHTIFVGNG 780
              +L +G++T+F+G+G
Sbjct: 690 KWRML-SGKYTLFIGSG 705


>gi|372209074|ref|ZP_09496876.1| glycoside hydrolase family protein [Flavobacteriaceae bacterium
           S85]
          Length = 727

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 260/758 (34%), Positives = 385/758 (50%), Gaps = 102/758 (13%)

Query: 51  SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
           SFL  D S+    R + LVS+MTL EK+ QL + A  + RL +P Y+WW+EALHGV+  G
Sbjct: 20  SFLDTDKSI--EERAEILVSQMTLKEKIAQLKNTAPAISRLKVPDYDWWNEALHGVARNG 77

Query: 111 PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGL 162
                      AT FP  I   A+F+  L  ++  A+STEARA Y + +        AGL
Sbjct: 78  K----------ATIFPQGIGIGATFDPDLALRVASAISTEARAKYTISQQMGNHSRYAGL 127

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+W+PN+N+ RDPRWGR  ET GEDP+++ +  V +V+GLQ         D N   LK +
Sbjct: 128 TFWTPNVNIFRDPRWGRGQETFGEDPYLMTQMGVAFVKGLQ-------GDDPNY--LKSA 178

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           +C KHYA   V +     R  F+A  T+QD+ ET+L  FE  VK+ +   VM ++N V G
Sbjct: 179 ACAKHYA---VHSGPESLRLEFNAVPTQQDLYETYLPAFEALVKDANVEGVMPAHNAVFG 235

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
            P  A+  LL   +R  W   GY+V DC +I+ +   HK++ DS+  A A  LKAG +L+
Sbjct: 236 APMAANKFLLTDVLRDRWGFDGYVVTDCGAIKQIKVGHKYV-DSEVAAAAVALKAGTNLN 294

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVSLGKQDICSD 399
           CG  Y      A+ QG V E  + +  K L+    RLG FD       Y  +G + I S 
Sbjct: 295 CGATYKELK-KAIDQGLVTEELVHERTKQLFKTRFRLGMFDKDLSKNPYSKIGPELIHSK 353

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           E+IELA EAA++ IV+LKN  N LPL +  +K   V GP AN++  ++G+Y G+    ++
Sbjct: 354 EHIELAREAAQKSIVMLKNKNNLLPLPT-DIKVPYVTGPFANSSDMLMGSYYGVSPGVVT 412

Query: 460 PIAGFSGY----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
            +AG +       ++ Y++G      K+ N    A   A  +D TI + GL    E E +
Sbjct: 413 ILAGITDAVSLGTSLNYRSGALPFQ-KNINPKNWAPNVAGMSDVTICVVGLTADREGEGV 471

Query: 516 ---------DREDLWLPGYQTQLINQVAEVAK-GPVILVIMSAGGVDIAFAETNTNIKAI 565
                    DR DL LP  Q   + Q+A   K  P++LVI S   V +   E + +  AI
Sbjct: 472 DAIASNHKGDRLDLKLPENQINYVKQLAAKKKDKPLVLVIASGSPVSLEGIEEHCD--AI 529

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           L   YPGE+GG A+ADV+FGK +P G LP+T+         P +   L         GRT
Sbjct: 530 LQIWYPGEQGGNAVADVLFGKVSPTGHLPMTF---------PKSVAQLPDYKDYSMKGRT 580

Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVL 685
           YK+     ++PFG+GL+Y                                 SKT    ++
Sbjct: 581 YKYMTEEPMFPFGFGLTY---------------------------------SKTEFKNLV 607

Query: 686 VND--LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI--KQVIGFQRVFVRA 741
           V D  LR  +  +  V+  NVG  D  ++V +Y  P ++     +    +  F+RV ++ 
Sbjct: 608 VEDAKLRKKESLKVSVEVTNVGDFDIDEIVQLYISPKSQKEGEGLPFTTLKAFKRVALKK 667

Query: 742 GRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           G  ++++F  +  +SL +++     +   G + + VGN
Sbjct: 668 GETQKVEFTIHP-ESLKVINVKGQKVWRKGAYKVTVGN 704


>gi|171695518|ref|XP_001912683.1| hypothetical protein [Podospora anserina S mat+]
 gi|170948001|emb|CAP60165.1| unnamed protein product [Podospora anserina S mat+]
          Length = 805

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 253/743 (34%), Positives = 371/743 (49%), Gaps = 91/743 (12%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAH--------------------GV 88
           ++  L CD++     R   LV  + + EK+  L ++                      G 
Sbjct: 30  LAKTLACDTTASPPARAAALVQALNITEKLVNLVEYVKSREAPLGISIQLITPHSMSLGA 89

Query: 89  PRLGLPQYEWWSEALHGVSNVGPGTHFDDV---IPGATSFPTVILTTASFNESLWKKIGQ 145
            R+GLP Y WW+EALHGV+   PG  F+        ATSF   I   A+F+  L  ++  
Sbjct: 90  ERIGLPAYAWWNEALHGVA-ASPGVSFNQAGQEFSHATSFANTITLAAAFDNDLVYEVAD 148

Query: 146 AVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE------------------TPGED 187
            +STEARA  N   AGL YW+PNIN  +DPRWGR  E                  TPGED
Sbjct: 149 TISTEARAFSNAELAGLDYWTPNINPYKDPRWGRGHEVCYLSLLFRAVQLLRTQKTPGED 208

Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
           P  +  Y    + GL   EG +          KV + CKH+AAYD++ W+G  RY F+A 
Sbjct: 209 PVHIKGYVQALLEGL---EGRDKIR-------KVIATCKHFAAYDLERWQGALRYRFNAV 258

Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL---HG 304
           VT QD+ E +L+PF+ C ++    S MCSYN +NG P+CA   L++  +R  W+    + 
Sbjct: 259 VTSQDLSEYYLQPFQQCARDSKVGSFMCSYNALNGTPACASTYLMDDILRKHWNWTEHNN 318

Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAVQQGKV 360
           YI +DC++IQ  + N    + +   A A    AG D  C        T+  G A  Q  +
Sbjct: 319 YITSDCNAIQDFLPNFHNFSQTPAQAAADAYNAGTDTVCEVPGYPPLTDVIG-AYNQSLL 377

Query: 361 KETDIDKSLKYLYTVLMRLGFFD-GSPQ-YVSLGKQDICSDENIELAAEAAREGIVLLKN 418
            E  ID++L+ LY  L+R G+ D  SP  Y  +    + + +   LA ++A +GIVLLKN
Sbjct: 378 SEEIIDRALRRLYEGLIRAGYLDSASPHPYTKISWSQVNTPKAQALALQSATDGIVLLKN 437

Query: 419 DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYK--TGC 476
           +   LPL+    KT+A++G  ANAT  M+G Y+GIP  Y +PI   +   NVT+    G 
Sbjct: 438 N-GLLPLDLTN-KTIALIGHWANATRQMLGGYSGIPPYYANPIYAATQL-NVTFHHAPGP 494

Query: 477 DDVACKSNNSIFA--ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVA 534
            + +  S N  +   A  AA  +D  + L G DLS+ AE  DR+ +  P  Q  L+  +A
Sbjct: 495 VNQSSPSTNDTWTSPALSAASKSDIILYLGGTDLSIAAEDRDRDSIAWPSAQLSLLTSLA 554

Query: 535 EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
           ++ K P I+  +    VD     +N NI +ILW GYPG+ GG A+ +++ G  +P  RLP
Sbjct: 555 QMGK-PTIVARL-GDQVDDTPLLSNPNISSILWVGYPGQSGGTALLNIITGVSSPAARLP 612

Query: 595 ITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
           +T Y   Y  ++PLT+M LRP  +   PGRTY++Y  P L PFG+GL YT F      F 
Sbjct: 613 VTVYPETYTSLIPLTAMSLRPTSA--RPGRTYRWYPSPVL-PFGHGLHYTTFTAKFGVF- 668

Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
           +++ +N+ +L    N  Y                L    + +  V   N G      V +
Sbjct: 669 ESLTINIAELVSNCNERY----------------LDLCRFPQVSVWVSNTGELKSDYVAL 712

Query: 715 VYSKPPAEIAATYIKQVIGFQRV 737
           V+ +         IK ++G++R+
Sbjct: 713 VFVRGEYGPEPYPIKTLVGYKRI 735


>gi|325192664|emb|CCA27085.1| unnamed protein product [Albugo laibachii Nc14]
          Length = 2278

 Score =  375 bits (963), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 256/769 (33%), Positives = 397/769 (51%), Gaps = 97/769 (12%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFA--HG-VPRLGLPQYEWWSEALHGVSN 108
           F FC+SSL   +RV+DL+ R+ LDEKV+ L   A  HG +PRLG+P+Y W +  +HGV +
Sbjct: 34  FPFCNSSLSLDLRVEDLLQRLQLDEKVRMLTARASTHGSIPRLGVPEYNWGANCVHGVQS 93

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY------NLGRA-- 160
              GTH       ATSFP  +   A F+ +   K+ Q +  E RA+       N  R   
Sbjct: 94  TC-GTH------CATSFPNPVNLGAIFDPNEIYKMAQVIGKELRALRLEGARENYARGPH 146

Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GL  WSPNIN+ RDPRWGR  ETP EDP+V  +Y V Y +GLQ  EG       +SR L
Sbjct: 147 IGLDCWSPNININRDPRWGRAMETPSEDPYVNAKYGVAYTKGLQ--EGQ------DSRFL 198

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           +     KHY AY  +N+ G DR  FDA V+  D  +T+   FE  V +G A  +MCSYN 
Sbjct: 199 QAVVTLKHYLAYSYENYGGTDRTQFDAIVSAYDFADTYFPAFEASVVDGKAKGIMCSYNS 258

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
           +NGIP+CA+ K LNQ +R + +  GYI +D  +IQ + D HK+     E A    +++G+
Sbjct: 259 LNGIPTCAN-KWLNQLLRDDLEFDGYITSDTGAIQGIFDGHKYTKTLCE-ATKIAMESGV 316

Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSD 399
           D+  G  Y N             + ID++++    +  +LG FD        G +D+ + 
Sbjct: 317 DICSGNAYWNCLKQLANSTNFSAS-IDEAIRRTLKLRFQLGLFDAIGDQPHFGPEDVRTA 375

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR--- 456
           ++++L+ + AR+ IVLL+N  NTLPL       +AV+GPH+     ++GNY G  C    
Sbjct: 376 KSLQLSLDLARKSIVLLQNHGNTLPLRLG--LRIAVIGPHSMTRRGIMGNYYGQLCHGDY 433

Query: 457 -----YMSP---IAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDL 508
                  SP   I   +G  N  +  GC  +   S      A +A +TAD  ++  G+D+
Sbjct: 434 DEVRCIQSPLEAIQSVNGRNNTHHVNGC-GINDTSTAEFDDALQAVRTADVAVLFLGIDI 492

Query: 509 SVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG--GVD--IAFAETNTNIKA 564
           S+E ES DR+++ +P  Q +L+  +  VA  P ++V+ + G  G++  I +A++      
Sbjct: 493 SIERESKDRDNIDVPHIQLELLKAI-RVAGKPTVVVLFNGGILGIEKLILYADS------ 545

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           +L A YPG  G +AIA+++FG  NP G+LP+T Y  +++  + + SM +       YPGR
Sbjct: 546 VLEAFYPGFFGAQAIAEILFGSINPSGKLPVTMYRSNFINDVDMKSMSM-----TLYPGR 600

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           +Y++Y    +Y FG+GLSYT F         +IQ              + D+  TR    
Sbjct: 601 SYRYYTEVPVYSFGWGLSYTTF---------SIQ--------------SIDSHDTRA--- 634

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT-----YIKQVIGFQRVFV 739
            +N +       +++   N G   G +V+  + + P +I AT       +Q+  + RV +
Sbjct: 635 -MNHVLTAQPKMYRILITNNGKYYGEEVLFAFFR-PLDIHATGPVESLQQQLFNYTRVRL 692

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG---GVSFP 785
             G  + +       ++L + D   N  +  G + + + NG    ++FP
Sbjct: 693 DPGDMREVPLHVKD-ENLALHDRNGNLCVFEGFYELIISNGVEEQLTFP 740


>gi|255284060|ref|ZP_05348615.1| beta-glucosidase [Bryantella formatexigens DSM 14469]
 gi|255265405|gb|EET58610.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Marvinbryantia formatexigens DSM 14469]
          Length = 700

 Score =  374 bits (961), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 251/749 (33%), Positives = 376/749 (50%), Gaps = 104/749 (13%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R + LV++MT++EK  QL   A  + RLG+P Y WW+EALHGV+  G           AT
Sbjct: 9   RAEALVAQMTVEEKASQLKYDAPAIKRLGIPAYNWWNEALHGVARAGQ----------AT 58

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
            FP  I   A+F+E+L  +I   ++TE RA YN   A        GLT+WSPN+N+ RDP
Sbjct: 59  VFPQAIGLGATFDEALLGEIADVIATEGRAKYNAYAAKEDRDIYKGLTFWSPNVNIFRDP 118

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDP +  R  V +V+GLQ           +   +K ++C KH+A   V +
Sbjct: 119 RWGRGHETYGEDPCLTSRLGVAFVKGLQG----------DGETMKAAACAKHFA---VHS 165

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
                R+ F+A  + +DMEET+L  FE  VKE D  +VM +YNR NG   CA P +L + 
Sbjct: 166 GPEAVRHEFNAEASAKDMEETYLPAFEALVKEADVEAVMGAYNRTNGEACCASP-VLQKI 224

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
           +R +W   G+ V+DC +I+   ++H   A +KE A A  + +G DL+CG  Y +   +A 
Sbjct: 225 LREDWGFEGHFVSDCWAIRDFHEHHMLTATAKESA-AMAINSGCDLNCGNTYLHIL-HAY 282

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
           + G V E  I ++   L+T    LG FDGS +Y  +    + S E++ LA +AA E  VL
Sbjct: 283 RDGLVSEETITEAAVRLFTTRFLLGLFDGS-EYDDIPYTVVESKEHLALAEKAALESAVL 341

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
           LKN+   LPL   +++TV V+GP+A++  A+ GNY G   RY +   G   Y      V 
Sbjct: 342 LKNN-GILPLKKERLRTVGVIGPNADSRAALAGNYHGTASRYETIQQGLQDYLGEDVRVL 400

Query: 472 YKTGC---DDVACK---SNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
              GC   +D   K   + + +  A   A+ +D  I+  GLD ++E E         S D
Sbjct: 401 TSVGCALSEDRTEKLALAGDRLAEAQIVAENSDVVILCLGLDETLEGEEGDTGNSYASGD 460

Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
           +E L LP  Q  L+  VA   K PV+L +MS   +D+++A  + +    LW  YPG +GG
Sbjct: 461 KETLLLPEAQRDLMEAVAATGK-PVVLCMMSGSDLDMSYAAEHFDAILQLW--YPGSQGG 517

Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
            A A ++FG+ +P G+LP+T+Y  + ++ LP         +     GRTY++   P  YP
Sbjct: 518 SAAAKLLFGEVSPSGKLPVTFY--ETLEELP-------AFEDYSMKGRTYRYMGHPAQYP 568

Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           FG+GL+Y   +                          +DA+        +     +    
Sbjct: 569 FGFGLTYGDVR-------------------------VTDAN--------IRGASAEGDLT 595

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
             V  +N G+    +V+ +Y K      A     +  F R+ + AG  K I+    A ++
Sbjct: 596 LAVTAENAGNAVTDEVLQIYVKCTDSANAVPNPALAAFGRIHLEAGEKKTIEMTVPA-RA 654

Query: 757 LNIVDYAANTLLPAGEHTIFVGNGGVSFP 785
             +VD A    + + +   FV   GVS P
Sbjct: 655 FTVVDEAG---VRSRDGKQFVIYAGVSQP 680


>gi|109897152|ref|YP_660407.1| beta-glucosidase [Pseudoalteromonas atlantica T6c]
 gi|109699433|gb|ABG39353.1| Beta-glucosidase [Pseudoalteromonas atlantica T6c]
          Length = 733

 Score =  372 bits (956), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 244/749 (32%), Positives = 375/749 (50%), Gaps = 86/749 (11%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+ LP + R++ L+  MTL EK  QL +    + RLGLP+Y++W+EALHGV+  G   
Sbjct: 26  WFDTQLPTNERIESLIDAMTLKEKASQLVNGNVAIERLGLPEYDFWNEALHGVARNG--- 82

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYW 165
                   AT FP  I   A+F++ L  +    +S EARA +N+          +GLT+W
Sbjct: 83  -------RATVFPQAIGMAATFDQDLLLQAATVISDEARAKFNVSSEIGNRSKYSGLTFW 135

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN+ RDPRWGR  ET GEDP++  +     V GLQ           + + LK ++  
Sbjct: 136 TPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGD---------HPKYLKTAAAA 186

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A   V +     R+ FDA  +E+DM ET+   FE  V E D  +VM +YNRVNG P+
Sbjct: 187 KHFA---VHSGPEALRHEFDAIASEKDMYETYFPAFEALVTEADVETVMAAYNRVNGHPA 243

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
                LLN  +R +W   G+IV+DC  +    + HK  A++ E A A  +  G DL+CG 
Sbjct: 244 GGSDFLLNTVLRDKWGFSGHIVSDCWGLADFHEYHKVTANAVESA-ALAINTGTDLNCGS 302

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIE 403
            YT    +AV+ G V E  ID  L  +     +LGFFD      Y S+    + SD + +
Sbjct: 303 VYTALP-DAVEAGLVDEKTIDTRLHKVLATKFKLGFFDPKDDNPYNSISADVVNSDAHAD 361

Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
           +A E A + IVLL+N+   LPL+   ++ V V GP A+++  ++GNY G+  +  + + G
Sbjct: 362 VAYEMAVKSIVLLQNENQVLPLDK-NIRNVYVTGPFASSSEVLLGNYYGLSGKTTNILDG 420

Query: 464 FSGYANV----TYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES----- 514
            +   +V     YK G        N   +   EA +  D  I + GL  + E E      
Sbjct: 421 ITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEAIA 480

Query: 515 ----LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGY 570
                DR  L LP +Q + + ++ +    PVI+V+ +  G  +   E      AI++A Y
Sbjct: 481 SPHKGDRLSLDLPEHQIEFLRKLRKDNDKPVIVVLTA--GTPVNVTEIAQLADAIVFAWY 538

Query: 571 PGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYN 630
           PG+EGG+A+AD++FG+ +P GRLPIT+         P +   L P D     GRTY++  
Sbjct: 539 PGQEGGKAVADILFGERSPSGRLPITF---------PKSEAQLPPYDDYSMQGRTYRYMT 589

Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
              +YPFG+GLSY   K++ ++           L +   L+ T     T           
Sbjct: 590 EEPMYPFGFGLSYATVKFDNIT-----------LGNAEALSSTDGQKGT----------- 627

Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
                +  V+  N G+ +  +VV +Y K P       I+ + GFQR+ +  G+  ++ F 
Sbjct: 628 ----LDVSVNVTNTGTRELEEVVQLYLKTPNAGIDQPIQSLKGFQRIKLAPGQTGQVSFT 683

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            +  K L  ++     +L  G++ + VGN
Sbjct: 684 VSK-KQLYSINAKGKPVLLEGDYHVIVGN 711


>gi|371776901|ref|ZP_09483223.1| beta-glucosidase [Anaerophaga sp. HS1]
          Length = 720

 Score =  371 bits (952), Expect = 1e-99,   Method: Compositional matrix adjust.
 Identities = 245/722 (33%), Positives = 364/722 (50%), Gaps = 94/722 (13%)

Query: 47  LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV 106
           LQ  S  F D +L    R K L+S +TL EK+  LG     V RL +P Y WW+EALHGV
Sbjct: 22  LQGQSTNFRDEALDIETRAKALLSELTLKEKISLLGYNNPPVERLQIPAYNWWNEALHGV 81

Query: 107 SNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------ 160
           +  G           AT FP  I   A+F+ +L  +I  A+STEAR+ YN+ R+      
Sbjct: 82  ARAGE----------ATVFPQAIALAATFDTTLVYRIADAISTEARSKYNINRSKGFQNQ 131

Query: 161 --GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
             G+T+W+PNIN+ RDPRWGR  ET GEDPF+       +V+GLQ  E          R 
Sbjct: 132 YLGITFWTPNINIFRDPRWGRGQETYGEDPFLTASMGKAFVKGLQGSE--------PERR 183

Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
           LK ++  KH+A   V +    DR+HF+A V E+D+ ET+L  F+  V+ G  +++MC+YN
Sbjct: 184 LKTAAGAKHFA---VHSGPEADRHHFNAVVDEKDLRETYLPAFKALVENG-VTTIMCAYN 239

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
           RVNG P C    LL   +R EW   G +V DC ++  +   HK +  ++ +  A  +KAG
Sbjct: 240 RVNGEPCCTGKTLLQDILRDEWGFKGQVVTDCWALDDIWLRHKTIP-TRVEVAAAAVKAG 298

Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDI 396
           ++LDC         +A+++  +    +D +L       ++LGF+D      Y   G   +
Sbjct: 299 VNLDCANILQEDVQDAIEKRLLTLEQVDSALLPTLQTQLKLGFYDDPSHSPYRHYGIDSV 358

Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
            +  +I LA EAA + +VLLKND   LPL    + ++ VVG +A +  A+ GNY G+   
Sbjct: 359 NNSYHISLAKEAAEKSMVLLKND-GILPLKKDTISSIMVVGENAASISALTGNYHGLSGN 417

Query: 457 YMSPIAGFSGYA----NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
            ++ + G         +V Y  GC      ++ S F    AA   D TI + GL   +E 
Sbjct: 418 MVTFVEGLVKAGGPGMSVQYDYGC----SFADTSHFGGIWAAGFTDVTIAVIGLSPLLEG 473

Query: 513 E---------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
           E           D++DL +P      + ++ E    PVI V+     +DI+  E   +  
Sbjct: 474 EHGDAFLSNWGGDKKDLRMPRSHEIYLKKLRESHNHPVIAVVTGGSALDISAIEPYAD-- 531

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
           AI++A YPGE+GG A+AD++FG+ +P GRLPIT+Y    ++ LP       P        
Sbjct: 532 AIIYAWYPGEQGGTALADLIFGEVSPSGRLPITFYKD--IKDLP-------PYHDYNMTN 582

Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
           RTY+++ G  LYPFGYGLSYT F Y  LS                             P 
Sbjct: 583 RTYRYFQGDVLYPFGYGLSYTSFHYEWLS----------------------------KPS 614

Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
             V++   DD     +   N G+ D  +V+ VY   P +I    ++++ GF R+ ++AG+
Sbjct: 615 TKVSE---DDIISVNIAVTNTGTMDADEVIQVYIVYP-DIERMPLRELKGFSRIHIKAGQ 670

Query: 744 NK 745
            +
Sbjct: 671 TQ 672


>gi|348684866|gb|EGZ24681.1| hypothetical protein PHYSODRAFT_325770 [Phytophthora sojae]
          Length = 805

 Score =  370 bits (949), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 257/765 (33%), Positives = 386/765 (50%), Gaps = 92/765 (12%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR-----LGLPQYEWWSEALHGVSN 108
           FC++SL  + RV+DL+SR+ L EK   L   A   PR     +GLP+Y W +  +HGV +
Sbjct: 37  FCNTSLSTADRVEDLLSRLPLQEKATLL--TARASPRGNMSSIGLPEYNWGANCVHGVQS 94

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG---------R 159
              GT+        TSFP  +   A F+  +   + Q +  E RA++  G          
Sbjct: 95  TC-GTNC------PTSFPNPVNLGAIFDPQVVFDMAQVIGWELRALWLEGATENYKGGPH 147

Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GL  WSPNIN+ RDPRWGR TETP EDP V  +Y V Y RGLQ  EG       + R L
Sbjct: 148 LGLDCWSPNININRDPRWGRNTETPSEDPLVNSKYGVAYTRGLQ--EGKRQ----DPRFL 201

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           +     KHYAAY  +N+ GV+R  FDA V+  D  +T+   F   V +G+A  VMCSYN 
Sbjct: 202 QAVVTLKHYAAYSYENYGGVNRMEFDAIVSPYDFADTYFPAFRSSVVDGNAKGVMCSYNS 261

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
           VNGIP CA+ +L+   +RG     GY+ +D  +++ + D H + ADS+ +A    + AG 
Sbjct: 262 VNGIPMCANKELVETLLRGTLGFDGYVTSDSGAVEAISDMHHY-ADSQCEAARLAILAGT 320

Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDIC 397
           D++ G+ Y       V   +++E  +D +L++   +   LG FD      Y ++   ++ 
Sbjct: 321 DINSGKSYEACLKTLVDDNQLEEKALDDALRHTLKLRFELGLFDPIDDQPYWNVTPSEVN 380

Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR- 456
           +     L+  A R+ +V+L+N+ + LPL   K   +AV+GPHA +   ++GNY G  C  
Sbjct: 381 TAAAKALSLNATRKSLVMLQNNASVLPLQ--KGVKLAVLGPHAKSKRGLLGNYLGQMCHG 438

Query: 457 ----------YMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
                      +  I   +G +N T+  GC  ++  S      A  AAK ADA ++  G+
Sbjct: 439 DYDEVGCVQTPLDAIRAANGASNTTFAEGC-GISGNSTAGFEKAVAAAKEADAVVLFLGI 497

Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
           D S+E E  DR ++ LP  Q QL+ +V  V + P ++V+++ GGV I   E      A++
Sbjct: 498 DKSIEGEVGDRNNIDLPNIQMQLLQRVHAVGR-PTVVVLIN-GGV-IGAEEIIERTDALV 554

Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
            A YPG  G RA+ADV+FG  NP G+LP+T Y  DYV  + + SM     D   +PGRTY
Sbjct: 555 EAFYPGFFGARAMADVLFGDTNPSGKLPVTMYRSDYVDQVEMKSM-----DMTAHPGRTY 609

Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT----SDASKTRCP 682
           +++ G  ++PFG+GLSYT F  ++ S T       N   H  N  ++    SD +     
Sbjct: 610 RYFKGEPVFPFGWGLSYTTFSLSVDSGT-------NSSSHSNNAAFSGGEVSDTANVTIS 662

Query: 683 GVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP-------PAEIAATYIKQVIGFQ 735
            V+ ND                G   G +VV+ + +P       PA +     +Q+  +Q
Sbjct: 663 VVVKND----------------GEVAGDEVVLAFFRPVNSNVTGPATLLN---EQLFDYQ 703

Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           RV +    +  + F      +L + D   N     G + + V NG
Sbjct: 704 RVSLGPLDSTEVSFTIER-STLALPDEEGNLASFPGSYEVIVSNG 747


>gi|423279990|ref|ZP_17258903.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
           610]
 gi|404584326|gb|EKA88991.1| hypothetical protein HMPREF1203_03120 [Bacteroides fragilis HMW
           610]
          Length = 722

 Score =  370 bits (949), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 245/731 (33%), Positives = 378/731 (51%), Gaps = 78/731 (10%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           +  D S P ++RVK L+ +MTL EK  QL   +  +PRL LP Y +W+E LHGV+  G  
Sbjct: 50  IIGDLSQPIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGEV 109

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
           T F   I  A+++ TV++          K++  A+STEAR  Y     GLTYWSP IN+A
Sbjct: 110 TVFPQAINLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMA 159

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           RDPRWGR  ET GEDP++  R  V +V+GLQ   G   A       LK  +  KH+ A +
Sbjct: 160 RDPRWGRNEETYGEDPYLTSRLGVAFVKGLQ---GDHPAY------LKTVATIKHFVANN 210

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
            +N    +R+   +++  + + E +   +E CVKE    SVM +YN  NG+P      LL
Sbjct: 211 EEN----NRFSSSSQIPTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSRWLL 266

Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
            + +R EW   G++V+DC +I VM   H+ + +S E+A A  + +G DL+CG  Y     
Sbjct: 267 GEVLRKEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEKLV 325

Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAAR 410
            AV+QG + E  ID++L  + T   +LG FD      Y    K+ +   +  ELA EAA 
Sbjct: 326 QAVKQGLISEATIDQALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEAAV 385

Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANV 470
           + +VLLKN+ N LPL+  K K+VAVVGP A+     +G Y+G P   ++ + G       
Sbjct: 386 KSVVLLKNE-NLLPLSKEKTKSVAVVGPFADHN--YLGGYSGQPPYSVTLLKGVKDLMGK 442

Query: 471 TYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLI 530
             K    +    S +SI A   A K  D  ++  G D  +  E+ D   ++LP  Q +L+
Sbjct: 443 RGKVNYLNGIGASRDSIVA---AVKGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLL 499

Query: 531 NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
             + +V   P I+++  +G   +     + +I AI+ A YPG+E GRA+AD++FG  NP 
Sbjct: 500 KAIYQV--NPRIVLVFHSGN-PLTSEWADVHIPAIMQAWYPGQEAGRALADLLFGNENPS 556

Query: 591 GRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNL 650
           G+LP+T Y  +    LP        +D   + GRTY++     LY FG+GLSYT F ++ 
Sbjct: 557 GKLPMTIYRAE--DQLPDI------LDFDMWKGRTYRYMKEDPLYGFGHGLSYTSFGFDG 608

Query: 651 LSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGS 710
           +  + T++                  ++ +C                 V+  N G   G 
Sbjct: 609 IQGSDTLK----------------SGARLQC----------------SVELSNTGKWTGE 636

Query: 711 DVVIVYSKPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
           +VV VY         TY +K+++ F++V +  G  KR++F     + L++ +   N  + 
Sbjct: 637 EVVQVYVSRENTPVYTYPLKKLVAFKKVKLAPGEKKRVEFNI-PPRELSVWE-NGNWRML 694

Query: 770 AGEHTIFVGNG 780
            G++T+F+G+G
Sbjct: 695 TGKYTLFIGSG 705


>gi|164428543|ref|XP_964543.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
 gi|157072187|gb|EAA35307.2| hypothetical protein NCU00709 [Neurospora crassa OR74A]
          Length = 786

 Score =  370 bits (949), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 247/635 (38%), Positives = 331/635 (52%), Gaps = 91/635 (14%)

Query: 85  AHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV---IPGATSFPTVILTTASFNESLWK 141
           A G  RLGLP+Y WWSE LHGV+   PG  F+        ATSF   I   ASF++ L  
Sbjct: 8   ALGASRLGLPKYAWWSEGLHGVAG-SPGVKFNTTGYPFSYATSFANAINLGASFDDDLVY 66

Query: 142 KIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRG 201
           ++G A+STEARA  N G  GL YW+PN+N  +DPRWGR  ETPGEDP  +  Y    + G
Sbjct: 67  EVGTAISTEARAFANFGFGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYVKAILAG 126

Query: 202 LQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
           L   EG+E          KV + CKHYAAYD++ W G+ RY F+A VT QD+ E +L PF
Sbjct: 127 L---EGNETVR-------KVIATCKHYAAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPF 176

Query: 262 EMCVKEGDASSVMCSYNRV-----------------NGIPSCADPKLLNQTVRGEWDL-- 302
           + C ++    S+MCSYN +                    P+CA P L+   +R  W+   
Sbjct: 177 QQCARDSKVGSIMCSYNALTIRDMASGKPDEEINLTTAQPACAKPYLMT-ILRDHWNWTE 235

Query: 303 -HGYIVADCDSI-QVMVDNHKFLADSKEDAVAQTLKAGLDLDC---GQYYTNFTGNAVQQ 357
            + YI +DC++I   + DNH F + +  +A A   KAG D  C   G   T+  G A  Q
Sbjct: 236 HNNYITSDCNAILDFLPDNHNF-SQTPAEAAAAAYKAGTDTVCEVSGSPLTDVVG-AYNQ 293

Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFD---------------GSPQYVSLGKQDICSDENI 402
             + E  ID +L+ LY  L+R G+ D                SP Y +L  +D+ +    
Sbjct: 294 SLLPEAVIDTALRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDALNWEDVNTPSTQ 353

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI- 461
           ELA  +A EGIVLLKN  + LPL+ +  K VA++G  ANAT  M G Y+GIP  Y +P+ 
Sbjct: 354 ELALRSATEGIVLLKNAGSLLPLDFSG-KKVALIGHWANATGTMRGPYSGIPPFYHNPLY 412

Query: 462 AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLW 521
           A      + +Y  G    A   +     A  AA+ AD  +   G D +V +E LDRE + 
Sbjct: 413 AAQQLNLSFSYANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTVASEDLDRESIA 472

Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
            P  Q QL++++A + K   ++VI     VD +    N N+ +ILW GYPG+ GG A+ D
Sbjct: 473 WPETQMQLLSELAGLGK--PLVVIQLGDQVDDSSLLNNGNVSSILWVGYPGQSGGTAVFD 530

Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD------------------------ 617
           V+ GK  P GRLP+T Y   YV  +PLT M LRP +                        
Sbjct: 531 VLTGKKAPAGRLPVTQYPEGYVDEVPLTEMALRPFNYSSSSNLEQEVSVQGRGSLTIQPR 590

Query: 618 ------SLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
                 +L  PGRTYK+Y+ P L PFGYGL YT F
Sbjct: 591 STPGNKTLSSPGRTYKWYSSPVL-PFGYGLHYTTF 624


>gi|336463686|gb|EGO51926.1| hypothetical protein NEUTE1DRAFT_125528 [Neurospora tetrasperma
           FGSC 2508]
          Length = 788

 Score =  369 bits (948), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 245/639 (38%), Positives = 332/639 (51%), Gaps = 94/639 (14%)

Query: 85  AHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDV---IPGATSFPTVILTTASFNESLWK 141
           A G  R+GLP+Y WWSE LHGV+   PG  F+        ATSF   I   ASF++ L  
Sbjct: 8   ALGASRIGLPKYAWWSEGLHGVAG-SPGVTFNTTGYPFSYATSFANAINLGASFDDDLVY 66

Query: 142 KIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRG 201
           ++G A+STEARA  N G  GL YW+PN+N  +DPRWGR  ETPGEDP  +  Y    + G
Sbjct: 67  EVGTAISTEARAFANFGFGGLDYWTPNVNPYKDPRWGRGAETPGEDPLHIKGYVKAMLAG 126

Query: 202 LQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
           L   EG+E          KV + CKHYAAYD++ W G+ RY F+A VT QD+ E +L PF
Sbjct: 127 L---EGNETVR-------KVIATCKHYAAYDLERWHGLTRYEFEAIVTLQDLSEYYLPPF 176

Query: 262 EMCVKEGDASSVMCSYNRV-----------------NGIPSCADPKLLNQTVRGEWDL-- 302
           + C ++    S+MCSYN +                    P+CA+  L+   +R  W+   
Sbjct: 177 QQCARDSKVGSIMCSYNALTIRDMAGGNPDEIINLTTAQPACANTYLMT-ILRDHWNWTE 235

Query: 303 -HGYIVADCDSI-QVMVDNHKFLADSKEDAVAQTLKAGLDLDC---GQYYTNFTGNAVQQ 357
            + YI +DC++I   + DNH F + +  +A A   KAG D  C   G   T+  G A  Q
Sbjct: 236 HNNYITSDCNAILDFLPDNHNF-SQTPAEAAAAAYKAGTDTVCEVSGSPLTDVVG-AYNQ 293

Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFD---------------GSPQYVSLGKQDICSDENI 402
             + E  ID +L+ LY  L+R G+ D                SP Y +L  +D+ +    
Sbjct: 294 SLLPEAVIDTALRRLYEGLIRAGYLDHGRSSAVAGGDGGSFSSPAYDALNWEDVNTPSTQ 353

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI- 461
           ELA  +A EGIVLLKN  + LPL+ +  K VA++G  ANAT  M G Y+GIP  Y +P+ 
Sbjct: 354 ELALRSATEGIVLLKNSGSLLPLDFSSGKKVALIGHWANATGTMRGPYSGIPPFYHNPLY 413

Query: 462 AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLW 521
           A      + +Y  G    A   +     A  AA+ AD  +   G D +V +E LDRE + 
Sbjct: 414 AAQQLNLSFSYANGPVVNASDPDTWTAPALAAAEGADVVLYFGGTDTTVASEDLDRESIA 473

Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
            P  Q +L++++A + K   ++VI     VD +F   N N+ +ILW GYPG+ GG A+ D
Sbjct: 474 WPKAQMKLLSELAGLGK--PLVVIQLGDQVDDSFLLENGNVSSILWVGYPGQSGGTAVFD 531

Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD------------------------ 617
           V+ GK  P GRLP+T Y   YV  +PLT M LRP +                        
Sbjct: 532 VLTGKKAPAGRLPVTQYPEGYVDEVPLTEMALRPFNHSSSTSSSSNPEEEVSVQGSGSLT 591

Query: 618 ----------SLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
                     +L  PGRTYK+Y+ P L PFGYGL YT F
Sbjct: 592 IQPRSTPGNKTLSSPGRTYKWYSNPVL-PFGYGLHYTTF 629


>gi|291530120|emb|CBK95705.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum 70/3]
          Length = 689

 Score =  369 bits (946), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 231/637 (36%), Positives = 350/637 (54%), Gaps = 76/637 (11%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D  L    R   L   ++ +E+ QQL   A  + + GLP Y WW+E LHGV+  G   
Sbjct: 4   YKDKQLSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT-- 61

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   A+F++ +  ++G+ VSTEARAMYN            GLT W
Sbjct: 62  --------ATVFPQAIALAAAFDKDMMCRVGEVVSTEARAMYNSAAKHGDTDIYKGLTLW 113

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN+ RDPRWGR  ET GEDP++  R  VN+V+G+Q  E +          L+ ++C 
Sbjct: 114 APNINIFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGEEKY----------LRAAACA 163

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A   V +     R+ FDARV+E+D+EET+L  F+  VKEG    VM +YNRVNG PS
Sbjct: 164 KHFA---VHSGPESLRHEFDARVSEKDLEETYLPAFKALVKEGRVEGVMGAYNRVNGEPS 220

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           CA  KL+ +    EW   GY V+DC +I+    NHK + D+   + A  LKAG D++CG 
Sbjct: 221 CASEKLMGKLR--EWGFDGYFVSDCGAIRDFHTNHK-ITDTAPQSAAMALKAGCDVNCGN 277

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
            Y +    A+++G + + DI  +  +     +RLG  D + ++  L    I  D N  L+
Sbjct: 278 TYLHILA-ALEEGLITKQDIRTACIHALRTRIRLGQLDDN-EFDDLPFDIIACDGNKALS 335

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG-- 463
            EAA + +VLL ND   LPL+ +++ ++AV+GP+A++  A++GNY G P R ++ + G  
Sbjct: 336 LEAAEKSMVLLHND-GILPLDKSRISSIAVIGPNADSRAALLGNYEGTPDRSVTFLEGIQ 394

Query: 464 --FSGYANVTYKTGCDDVACKSN------NSIFAASEAAKTADATIILAGLDLSVEAE-- 513
             F G   V Y  GC     ++       +    A  A + AD T++  GLD ++E E  
Sbjct: 395 DAFDG--RVYYAEGCQLFRDRTQGLALPGDRYAEAVAACEAADVTVVCVGLDSTLEGEEG 452

Query: 514 -----SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
                S D+ DL LP  Q  L+ ++ +  K P+I+V+ +   V+     T     A++ A
Sbjct: 453 DTENKSGDKPDLRLPEVQRVLLQKLKDTGK-PLIIVLAAGSSVN-----TECEGNALINA 506

Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP-LTSMPLRPVDSLGYPGRTYK 627
            YPG+ GG+A+A+++FG+ +P G+LP+T+Y      MLP  T   ++         RTY+
Sbjct: 507 WYPGQYGGKALAEILFGEVSPSGKLPVTFYKS--ADMLPDFTDYSMK--------NRTYR 556

Query: 628 FYNGPT--LYPFGYGLSYTQFKYNLLSFT-KTIQVNL 661
           F +  +  LYPFGYGL+Y+ F+   +S+   T+ VN+
Sbjct: 557 FCDDESNVLYPFGYGLTYSHFECGDISYKDNTLAVNV 593


>gi|374316077|ref|YP_005062505.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
           str. Grapes]
 gi|359351721|gb|AEV29495.1| beta-glucosidase-like glycosyl hydrolase [Sphaerochaeta pleomorpha
           str. Grapes]
          Length = 701

 Score =  368 bits (945), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 251/757 (33%), Positives = 370/757 (48%), Gaps = 116/757 (15%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           + K LV+ M+L E   QL   A  +PRLGLP+Y WW+EALHG +  G           AT
Sbjct: 9   QAKQLVAHMSLKEMFSQLLHEAPAIPRLGLPRYNWWNEALHGAARSGT----------AT 58

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
            FP  I   A F++   K+I   +STE RA YN   A        GLT WSPN+N+ RDP
Sbjct: 59  VFPQAIGLAAMFDDVFLKEIATVISTEQRAKYNTFSALGDRGIYKGLTLWSPNVNIFRDP 118

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDP++  +  V++++GLQ           +   LK ++C KH+A   V +
Sbjct: 119 RWGRGQETYGEDPYLASQLGVSFIQGLQG----------DGPYLKTAACVKHFA---VHS 165

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
                R+ F+A V+ +D+ ET+L  FE CVKEG+ ++VM +Y+ VNG P C  P L+   
Sbjct: 166 GPEPLRHDFNAIVSRKDLYETYLPAFEACVKEGEVNAVMGAYSAVNGEPCCGSPFLITDI 225

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
           +R +W   G  ++DC +I+    NH  +  ++ D+VA  L AG DL+CG  Y +    A 
Sbjct: 226 LRNDWGFEGMYISDCWAIRDFHLNHA-VTKNQVDSVALALNAGCDLNCGCEYLSLE-KAY 283

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
           QQG +    I ++   + T    LG F     Y ++G +   ++E+ ++A +A+   +VL
Sbjct: 284 QQGLIDRKTITQACIRVMTTRFALGLFSEDCTYSNIGYEQNDTEEHRKVAFKASCNSLVL 343

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
           LKND   LPL+S  +  +A++GP+A++  A+ GNY G    Y + + GF         V 
Sbjct: 344 LKND-GMLPLDSRSLHAIAIIGPNADSREALWGNYHGTSSTYTTVLEGFRKTLGESVKVK 402

Query: 472 YKTGCD------DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
           Y  G        +   + N+ I  A   A  +D  I+  G D +VE E         + D
Sbjct: 403 YSQGSAIQKEKLERLAEPNDRIAEAIAVATVSDTIILCLGYDETVEGEMHDDGNGGWAGD 462

Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
           ++DL LP  Q  L+  VA   K P++LV++S G +D    E   N+KA+L   YPG+EGG
Sbjct: 463 KQDLRLPPCQRALLKAVASTGK-PIVLVLLSGGAIDPEI-ERFPNVKALLQGWYPGQEGG 520

Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY--PGRTYKFYNGPTL 634
            AIA  + G  NP G LP+T+Y  + V  LP         D   Y   GRTY++     L
Sbjct: 521 LAIAHTILGLNNPSGHLPVTFYRSETV--LP---------DFCDYRMEGRTYRYVQEKVL 569

Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
           YPFG+GLSYT F Y  LS  K    NL                                 
Sbjct: 570 YPFGFGLSYTTFSYGNLSTGKQADGNL--------------------------------- 596

Query: 695 FEFKVDFQNVGSTDGSDVVIVYSK------PPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
            E      N G+ +G +VV +Y        PP  +       + GF  + ++ G +K + 
Sbjct: 597 -ELSFIVSNSGNREGREVVQIYCHSDHPFFPPNPV-------LCGFTSLVLQPGEHKTVT 648

Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFP 785
               A ++ + +D     +   G   ++VGN   + P
Sbjct: 649 QTILA-EAFSAIDPEGKRIALKGWFDLYVGNHQKALP 684


>gi|7671419|emb|CAB89360.1| beta-glucosidase-like protein [Arabidopsis thaliana]
 gi|9758998|dbj|BAB09525.1| unnamed protein product [Arabidopsis thaliana]
          Length = 411

 Score =  368 bits (944), Expect = 8e-99,   Method: Compositional matrix adjust.
 Identities = 188/417 (45%), Positives = 268/417 (64%), Gaps = 11/417 (2%)

Query: 377 MRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
           MRLGFFDG+P+   Y  LG +D+C+ EN ELA E AR+GIVLLKN   +LPL+ + +KT+
Sbjct: 1   MRLGFFDGNPKNQPYGGLGPKDVCTVENRELAVETARQGIVLLKNSAGSLPLSPSAIKTL 60

Query: 434 AVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASE 492
           AV+GP+AN T  MIGNY G+ C+Y +P+ G       T Y  GC +V C +   + +A  
Sbjct: 61  AVIGPNANVTKTMIGNYEGVACKYTTPLQGLERTVLTTKYHRGCFNVTC-TEADLDSAKT 119

Query: 493 AAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVD 552
            A +ADAT+++ G D ++E E+LDR DL LPG Q +L+ QVA+ A+GPV+LVIMS GG D
Sbjct: 120 LAASADATVLVMGADQTIEKETLDRIDLNLPGKQQELVTQVAKAARGPVVLVIMSGGGFD 179

Query: 553 IAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMP 612
           I FA+ +  I +I+W GYPGE GG AIADV+FG+ NP G+LP+TWY   YV+ +P+T+M 
Sbjct: 180 ITFAKNDEKITSIMWVGYPGEAGGIAIADVIFGRHNPSGKLPMTWYPQSYVEKVPMTNMN 239

Query: 613 LRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
           +RP  S GY GRTY+FY G T+Y FG GLSYT F + L+   K + +NL++ Q CR+   
Sbjct: 240 MRPDKSNGYLGRTYRFYIGETVYAFGDGLSYTNFSHQLIKAPKFVSLNLDESQSCRSPEC 299

Query: 673 TS-DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQV 731
            S DA    C   +    R D  FE ++  +NVG  +G++ V +++ PP E+  +  KQ+
Sbjct: 300 QSLDAIGPHCEKAVGE--RSD--FEVQLKVRNVGDREGTETVFLFTTPP-EVHGSPRKQL 354

Query: 732 IGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHL 788
           +GF+++ +       ++F  + CK L +VD      L  G H + VG+   SF I +
Sbjct: 355 LGFEKIRLGKKEETVVRFKVDVCKDLGVVDEIGKRKLALGHHLLHVGSLKHSFNISV 411


>gi|325970053|ref|YP_004246244.1| beta-glucosidase [Sphaerochaeta globus str. Buddy]
 gi|324025291|gb|ADY12050.1| Beta-glucosidase [Sphaerochaeta globus str. Buddy]
          Length = 698

 Score =  367 bits (943), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 253/752 (33%), Positives = 372/752 (49%), Gaps = 117/752 (15%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R ++LV RM L + + QL   A  +  LG+P Y WW+E LHG +  G           AT
Sbjct: 6   RAQELVERMNLPQMMSQLRHDAPAIESLGIPAYNWWNEGLHGSARSGT----------AT 55

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYWSPNINVARDP 175
            FP  I   + F+      +   VSTE RA YNL           GLT WSPN+N+ RDP
Sbjct: 56  VFPQAIGLASLFDPDFLYAVASVVSTEQRAKYNLFTHENDRDIYKGLTVWSPNVNIFRDP 115

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDP++  R AV ++RGLQ  EG           LK +SC KH+AA+    
Sbjct: 116 RWGRGQETFGEDPYLTARLAVAFIRGLQG-EGP---------VLKTASCVKHFAAHS--- 162

Query: 236 WKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
             G +  R+ F+A V ++D+EET+L  F   VKE  A +VM +Y+ +N  P CA   L+ 
Sbjct: 163 --GPEPLRHGFNAVVGKKDLEETYLPAFASAVKEAKADAVMGAYSALNDEPCCASSFLME 220

Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN 353
           +T+R  W   G  ++DC +I+    NHK +  ++E++ A  LK G DL CG  Y +    
Sbjct: 221 ETLRLRWGFEGMYISDCWAIRDFHLNHK-VTKNEEESAALALKRGCDLACGCEYQSLE-K 278

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGI 413
           A Q+G +    I K+   + T   +LG FD    Y +LG + + SDE+  LA EA+   +
Sbjct: 279 AFQKGLITREQIKKAAIRVMTTRFKLGQFDQGTAYDTLGLESLDSDEHAALAFEASCRSL 338

Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----AN 469
           VLLKND   LPL    V  +AV+GP+A++  A+ GNY G   RY++ + G   Y      
Sbjct: 339 VLLKNDA-LLPLKKEAVSCLAVIGPNADSRQALWGNYHGTSSRYVTILEGLRDYVGSSTR 397

Query: 470 VTYKTGCD------DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------S 514
           + Y  G +      +   K ++ +  A   AK +D  ++  GL+ +VE E         +
Sbjct: 398 ILYSEGSNLTKNKVERLAKDDDRLSEAVFMAKASDVVVLCLGLNETVEGEMHDDGNGGWA 457

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
            D++DL LP  Q +L+  VAE  K P+I+V++S G +D    E   N+KA++ A YPG+E
Sbjct: 458 GDKDDLRLPLCQRKLLKAVAETGK-PIIVVLLSGGSLDPEI-EQYANVKALIQAWYPGQE 515

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP-T 633
           GG+AIA +++G   P G+LP+T+Y  +  ++ P T   L          RTY++ + P  
Sbjct: 516 GGKAIAHLLYGALCPSGKLPVTFYKAE-AKLPPFTDYSL--------IRRTYRYCDDPDV 566

Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
           LYPFG+GLSY  F + L                       S A +T   GV    L    
Sbjct: 567 LYPFGFGLSYASFSFCL-----------------------SAAQETEQNGVAATVL---- 599

Query: 694 YFEFKVDFQNVGSTDGSDVVIVY------SKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
                   +N  + D   VV +Y        PP  +       + G + V ++AG   +I
Sbjct: 600 -------VRNTSALDARTVVQLYLAMEGKDLPPHPV-------LCGMKSVHLKAGEETQI 645

Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            F+    K    V    N     G +T++ G+
Sbjct: 646 TFILEE-KQFTAVQEDGNRYAVRGGYTLYAGS 676


>gi|266619450|ref|ZP_06112385.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
 gi|288869013|gb|EFD01312.1| beta-glucosidase [Clostridium hathewayi DSM 13479]
          Length = 714

 Score =  367 bits (943), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 243/701 (34%), Positives = 356/701 (50%), Gaps = 77/701 (10%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           ++ D S     RV+DLVS+MTL+EKV QL   A  V RLG+P Y WW+EALHGV+  G  
Sbjct: 4   VYLDESRTDEERVRDLVSQMTLEEKVSQLRYDAPAVERLGIPSYNWWNEALHGVARAG-- 61

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY-----NLGRA---GLTY 164
                    AT FP  I   A F+E+L +KIG   + E RA Y     N  R    G+T+
Sbjct: 62  --------AATVFPQAIGLAAMFDEALLEKIGDVTALEGRAKYHEAVRNGDRGLYKGITF 113

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           WSPNIN+ RDPRWGR  ET GEDP + GR    Y++G+Q           N + LK ++C
Sbjct: 114 WSPNINIFRDPRWGRGHETYGEDPCLTGRMGTAYIKGMQG----------NGKRLKAAAC 163

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
            KH+AA+     KG  R+ F++ V+++D+ ET+   FE CVKE     VM  YNR+NG  
Sbjct: 164 VKHFAAHSGPE-KG--RHSFNSVVSKKDLTETYFPAFERCVKEAGVEGVMGGYNRLNGEA 220

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
           +C    L+ + +R +W   GY V+DC +I+     H  L D+ +++ A  LK+G DL+CG
Sbjct: 221 ACGSHHLITEILREKWGFDGYYVSDCGAIKDF-HMHHGLTDTPQESAALALKSGCDLNCG 279

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIEL 404
             Y +   +A  QG V   DID+++ +L    MRLG FD   ++  +  +     E+  L
Sbjct: 280 AVYLHVM-SAYNQGLVSAEDIDRAVTHLMMTRMRLGMFDQHTEFDEIPYEINDCAEHHGL 338

Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG- 463
           A +AA E +VLLKND   LPL+   +KTVAV+GP+ ++   + GNY G      + + G 
Sbjct: 339 ALKAAEESMVLLKND-GILPLDKTALKTVAVIGPNGDSEEILKGNYNGTATEKYTILEGI 397

Query: 464 ----------FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE 513
                     F    +  Y+   +++A ++++ +  A   A  +D   +  GL+ ++E E
Sbjct: 398 RAVLGKETRIFCSEGSHLYRDNVENLA-EADDRLKEAVSMAVRSDVVFLCLGLNGTLEGE 456

Query: 514 S---------LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
                      D+ DL LP  Q +L+  V      PVIL++ +   + I +A  + +  A
Sbjct: 457 EGDANNSYAGADKADLNLPESQMRLLKAVCGTGT-PVILLLAAGSAMAINYAAEHCS--A 513

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           IL   YPG+ GG A A ++ G+  P GRLP+T+Y          T+  L         GR
Sbjct: 514 ILHIWYPGQMGGLAAARLLTGEAVPSGRLPVTFYQ---------TTEELPEFTDYSMKGR 564

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           TY++     LYPFGYGLSY  F+Y   S  K  Q         R     ++ SK  C  +
Sbjct: 565 TYRYMEREALYPFGYGLSYGDFEY---SNFKAEQTEAGPDGQVRFSVKITNRSKAECDEI 621

Query: 685 LVNDLRCDDYFEFK------VDFQNVGSTDGSDVVIVYSKP 719
               +R  D  E         DF+ +    G  V + ++ P
Sbjct: 622 AEVYVRIADS-ELAAPGGSLADFRRIHMKAGESVTVPFTLP 661


>gi|313145345|ref|ZP_07807538.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
 gi|313134112|gb|EFR51472.1| beta-glucosidase [Bacteroides fragilis 3_1_12]
          Length = 722

 Score =  367 bits (941), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 244/724 (33%), Positives = 374/724 (51%), Gaps = 78/724 (10%)

Query: 60  PYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVI 119
           P ++RVK L+ +MTL EK  QL   +  +PRL LP Y +W+E LHGV+  G  T F   I
Sbjct: 57  PIAVRVKTLIQQMTLAEKASQLVSESDSIPRLNLPAYNYWNECLHGVARAGEVTVFPQAI 116

Query: 120 PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGR 179
             A+++ TV++          K++  A+STEAR  Y     GLTYWSP IN+ARDPRWGR
Sbjct: 117 NLASTWDTVLV----------KRVASAISTEARLKYLEIGKGLTYWSPTINMARDPRWGR 166

Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
             ET GEDP++  R  V +V+GLQ   G   A       LK  +  KH+ A + +N    
Sbjct: 167 NEETYGEDPYLTSRLGVAFVKGLQ---GDHPAY------LKTVATIKHFVANNEEN---- 213

Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
           +R+   +++  + + E +   +E CVKE    SVM +YN  NG+P      LL + +R E
Sbjct: 214 NRFSSSSQIPTKQLYEYYFPAYEACVKEAGVQSVMTAYNAFNGVPPSGSRWLLGEVLRKE 273

Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGK 359
           W   G++V+DC +I VM   H+ + +S E+A A  + +G DL+CG  Y      AV+QG 
Sbjct: 274 WGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGVNSGCDLECGTTYKEKLVQAVKQGL 332

Query: 360 VKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAAREGIVLLK 417
           + E  ID++L  + T   +LG FD      Y    K+ +   +  ELA EAA + +VLLK
Sbjct: 333 ISEATIDQALTRVLTARFKLGEFDPMELVPYNHYDKKLLAGKKFAELAYEAAVKSVVLLK 392

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCD 477
           N+ N LPL+  K K+VAVVGP A+     +G Y+G P   ++ + G         K    
Sbjct: 393 NE-NLLPLSKEKTKSVAVVGPFADHN--YLGGYSGQPPYSVTLLKGVKDLMGKRGKVNYL 449

Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVA 537
           +    S +SI A   A K  D  ++  G D  +  E+ D   ++LP  Q +L+  + +V 
Sbjct: 450 NGIGASRDSIVA---AVKGVDVVLVALGSDEKMARENHDMTSIYLPEEQEKLLKAIYQV- 505

Query: 538 KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW 597
             P I+++  +G   +     + +I AI+ A YPG+E GRA+AD++FG  NP G+LP+T 
Sbjct: 506 -NPRIVLVFHSGN-PLTSEWADVHIPAIMQAWYPGQEAGRALADLLFGNENPSGKLPMTI 563

Query: 598 YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
           Y  +    LP        +D   + GRTY++     LY FG+GLSYT F ++ +  + T+
Sbjct: 564 YRAE--DQLPDI------LDFDMWKGRTYRYMKEDPLYGFGHGLSYTSFGFDGIQGSDTL 615

Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
           +                              L+C       V+  N G   G +VV VY 
Sbjct: 616 KSG--------------------------TTLQCS------VELSNTGKWTGEEVVQVYV 643

Query: 718 KPPAEIAATY-IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
                   TY +K+++ F++V +  G  KR++F     + L++ +   N  +  G++T+F
Sbjct: 644 SRENTPVYTYPLKKLVAFKKVKLAPGEKKRVEFNI-PPRELSVWE-NGNWRMLTGKYTLF 701

Query: 777 VGNG 780
           +G+G
Sbjct: 702 IGSG 705


>gi|268610157|ref|ZP_06143884.1| glycoside hydrolase family 3 protein [Ruminococcus flavefaciens
           FD-1]
          Length = 690

 Score =  367 bits (941), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 243/724 (33%), Positives = 359/724 (49%), Gaps = 110/724 (15%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D SL    R +DL +R+TL+E+  QL   A  V RL +P Y WWSE LHGV+  G   
Sbjct: 4   YKDKSLSAQERAEDLTNRLTLEEQASQLKYDAPAVDRLDIPAYNWWSEGLHGVARAGT-- 61

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   A F+E    K+G  +  EARA YN   A        GL  W
Sbjct: 62  --------ATMFPQAIGLAAMFDEEAMNKVGSIIGDEARAKYNEYSAHGDHDIYKGLCLW 113

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPN+N+ RDPRWGR  ET GEDP++  R  V + +GLQ  EG           LK ++C 
Sbjct: 114 SPNVNIFRDPRWGRGQETYGEDPYLTTRLGVAFAKGLQG-EGE---------VLKTAACA 163

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH A   V +     R+ FDA  + +DMEET+L  FE  VKE     VM +YNRVNG P+
Sbjct: 164 KHLA---VHSGPEAIRHEFDAVASPKDMEETYLPAFEALVKEAKVEGVMGAYNRVNGEPA 220

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           CA   L+ +    EW   GY V+DC +I+    NH     + E A A  LK G DL+CG 
Sbjct: 221 CASKFLMGKL--DEWGFDGYFVSDCWAIRDFHTNHMVTKTAPESA-AMALKLGCDLNCGN 277

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
            Y +   +A  +G + + DI K+  +L    +RLG FD   +Y  L    + ++EN   A
Sbjct: 278 TYLHLL-HAYNEGLINDEDIKKACTHLMRTRVRLGMFDDETEYDKLDYSIVANEENKAYA 336

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG-- 463
            + +   +V+LKN+   LPL+ +K+KT+ V+GP+A++  A+ GNY G   RY++ + G  
Sbjct: 337 RKCSERSMVMLKNN-GILPLDPSKIKTIGVIGPNADSRPALEGNYNGRADRYITFLEGIQ 395

Query: 464 --FSGY-----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE--- 513
             F G       +  YK  C  +A  +++ +  A    + +D  ++  GLD ++E E   
Sbjct: 396 DAFGGRVLYSEGSHLYKDRCMGLAV-ADDRLSEAEIVTEHSDVVVLCVGLDATIEGEEGD 454

Query: 514 ------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
                 S D+ DL LP  Q +L+  V    K PVI+V  +   +++       +  A++ 
Sbjct: 455 TGNEFSSGDKNDLRLPEAQRKLVETVMRKGK-PVIIVTAAGSAINV-----EADCDALIH 508

Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
           A YPG+ GG A+AD++FGK +P G+LP+T+Y  D  ++   T   ++        GRTY+
Sbjct: 509 AWYPGQFGGTALADILFGKISPSGKLPVTFYT-DTTKLPEFTDYSMK--------GRTYR 559

Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
           +     LYPFGYGL+Y++ + + L F                     +  K         
Sbjct: 560 YTQDNILYPFGYGLTYSKTEVSDLKF---------------------ENGKASVKVTNTG 598

Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
           D   +D  +F +        +GSD V  YS             + GF+RVF++ G +  +
Sbjct: 599 DFDTEDVVQFYI------KGEGSDYVPFYS-------------LCGFRRVFLKKGESTVV 639

Query: 748 KFVF 751
           +   
Sbjct: 640 EVTL 643


>gi|410648100|ref|ZP_11358515.1| beta-glucosidase [Glaciecola agarilytica NO2]
 gi|410132388|dbj|GAC06914.1| beta-glucosidase [Glaciecola agarilytica NO2]
          Length = 733

 Score =  366 bits (940), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 242/748 (32%), Positives = 372/748 (49%), Gaps = 86/748 (11%)

Query: 56  DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHF 115
           D+ LP   R+  L+  MTL EK  QL +    + RLGLP+Y++W+EALHGV+  G     
Sbjct: 28  DTQLPTQKRIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEALHGVARNG----- 82

Query: 116 DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYWSP 167
                 AT FP  I   A+F++ L  K    +S EARA +N+          +GLT+W+P
Sbjct: 83  -----RATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGNRSKYSGLTFWTP 137

Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
           NIN+ RDPRWGR  ET GEDP++  +     V GLQ           + + LK ++  KH
Sbjct: 138 NINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGD---------HPKYLKTAAAAKH 188

Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
           +A   V +     R+ FDA  + +DM ET+   FE  V E +  +VM +YNRVNG P+  
Sbjct: 189 FA---VHSGPEALRHEFDAIASPKDMYETYFPAFEALVTEANVETVMAAYNRVNGHPAGG 245

Query: 288 DPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
              LLN  +R +W   G++V+DC  +      HK  A++ E A A  +  G DL+CG  Y
Sbjct: 246 SDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESA-ALAINTGTDLNCGAVY 304

Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELA 405
            N   +AV+ G V E  IDK L  +     +LGFFD      Y ++    + S+ + ++A
Sbjct: 305 -NALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDDNPYNNISADVVNSEAHAQVA 363

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
            E A + IVLL+N  N LPL+   ++ + V GP A+++  ++GNY G+  +  + + G +
Sbjct: 364 YEMAVKSIVLLQNKNNILPLDR-NIRNLYVTGPFASSSEVLLGNYYGLSGKTTNILDGIT 422

Query: 466 GYANV----TYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES------- 514
              +V     YK G        N   +   EA +  D  I + GL  + E E        
Sbjct: 423 ANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEAIASP 482

Query: 515 --LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
              DR  L LP +Q   + ++ +    PVI+V+ +  G  +   E      AI++A YPG
Sbjct: 483 HKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTA--GTPVNLTEIAELADAIVFAWYPG 540

Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
           +EGG+A+AD++FG+ +P GRLPIT+         P +   L P D     GRTY++    
Sbjct: 541 QEGGKAVADILFGERSPSGRLPITF---------PKSEAQLPPYDDYSMQGRTYRYMTQE 591

Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD 692
            +YPFG+GLSY Q K++ ++   T Q   +K +   N+  T                   
Sbjct: 592 PMYPFGFGLSYAQVKFDNITLGNT-QALASKNELQENMTVT------------------- 631

Query: 693 DYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFN 752
                 V+  N G  +  +VV +Y K P    +  +  + GF R+ + AG+ +++ F   
Sbjct: 632 ------VNVTNTGEREFEEVVQLYLKTPDAGVSQPLHSLKGFTRIKLAAGQTEQVLFNI- 684

Query: 753 ACKSLNIVDYAANTLLPAGEHTIFVGNG 780
             K L  ++     +L  G++++ VGN 
Sbjct: 685 PKKHLYSINEQGKPVLLKGQYSVIVGNA 712


>gi|363742357|ref|XP_003642627.1| PREDICTED: probable beta-D-xylosidase 5-like [Gallus gallus]
          Length = 748

 Score =  366 bits (939), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 254/762 (33%), Positives = 386/762 (50%), Gaps = 98/762 (12%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL---GDFAHG----VPRLGLPQYEWWS 100
           +   F F D +LP+  R++DL+ R+T  E V Q+   G   +G    +PRLG+  Y W +
Sbjct: 23  EAQPFPFRDPTLPWHRRLEDLLGRLTPAEMVLQMARGGALGNGPAPPIPRLGIAPYNWNT 82

Query: 101 EALHGVSNVGPGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--- 156
           E L G          D   PG AT+FP  +   A+F+  L  ++  A +TE RA +N   
Sbjct: 83  ECLRG----------DAEAPGWATAFPQALGLAAAFSPELVYRVANATATEVRAKHNSFV 132

Query: 157 -LGR----AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
             GR     GL+ +SP +N+ R P WGR  ET GEDP++    A ++V+GLQ        
Sbjct: 133 AAGRYDDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPYLTAELATSFVQGLQGQ------ 186

Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
              + R +K S+ CKH++ +       V R  FDA+V E+D   TFL  F+ CV+ G + 
Sbjct: 187 ---HPRYIKASAGCKHFSVHGGPENIPVSRLSFDAKVLERDWHTTFLPQFQACVRAG-SY 242

Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
           S MCSYNR+NG+P+CA+ KLL   +RGEW   GY+V+D  ++++++  H++     E A+
Sbjct: 243 SFMCSYNRINGVPACANKKLLTDILRGEWGFEGYVVSDEGAVELILLGHRYTHTFLETAI 302

Query: 332 AQTLKAGLDLDCGQYYTN----FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ 387
           A ++ AGL+L+      N        A+  G +    +   ++ L+   +RLG FD    
Sbjct: 303 A-SVNAGLNLELSYGMRNNVFMHIPKALAMGNITLEMLRDRVRPLFYTRLRLGEFDPPAM 361

Query: 388 --YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
             Y +L    + S E+  L+ EAA +  VLLKN ++TLPL     K +AVVGP A+    
Sbjct: 362 NPYNALELSVVQSSEHRNLSLEAAIKSFVLLKNQRDTLPLRELHGKRLAVVGPFADNPRV 421

Query: 446 MIGNYAGIP-CRYM-SPIAGFSGY-ANVTYKTGCDDVAC--KSNNSIFAASEAAKTADAT 500
           + G+YA +P  +Y+ +P  G     ANV++  GC +  C   S + +     A + AD  
Sbjct: 422 LFGDYAPVPEPQYIYTPRRGLQTLPANVSFAAGCREPRCWVYSRDEV---ENAVRGADVV 478

Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAETN 559
           ++  G  + VE E+ DR+DL LPG+Q QL+      A G PVIL++ +AG +D+++A+ +
Sbjct: 479 LVCLGTGIDVEMEARDRKDLSLPGHQLQLLQDAVRAAAGHPVILLLFNAGPLDVSWAQLH 538

Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGK--FNPGGRLPITWYNGDYVQMLPLTSMPLRPVD 617
             + AIL   +P +  G AIA V+ GK   +P GRLP TW  G  +  +P       P++
Sbjct: 539 DGVGAILACFFPAQATGLAIASVLLGKQGASPAGRLPATWPAG--MHQVP-------PME 589

Query: 618 SLGYPGRTYKFYNGPT-LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDA 676
           +    GRTY++Y     LYPFGYGLSYT F Y  L  +  +      L  C NL+ +   
Sbjct: 590 NYTMEGRTYRYYGQEAPLYPFGYGLSYTTFHYRDLVLSPPV------LPICANLSVS--- 640

Query: 677 SKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQR 736
                                 V  +N G  D  +VV +Y +           Q++ F+R
Sbjct: 641 ----------------------VVLENTGPRDSEEVVQLYLRWEQPSVPVPRWQLVAFRR 678

Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           V V AG   ++ F   A +      +     L  G  T+F G
Sbjct: 679 VAVPAGGATKLSFGVTAAQR---AVWMQQWHLEPGAFTLFAG 717


>gi|332307852|ref|YP_004435703.1| glycoside hydrolase family protein [Glaciecola sp. 4H-3-7+YE-5]
 gi|332175181|gb|AEE24435.1| glycoside hydrolase family 3 domain protein [Glaciecola sp.
           4H-3-7+YE-5]
          Length = 733

 Score =  365 bits (938), Expect = 4e-98,   Method: Compositional matrix adjust.
 Identities = 241/750 (32%), Positives = 373/750 (49%), Gaps = 86/750 (11%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+ LP   R+  L+  MTL EK  QL +    + RLGLP+Y++W+EALHGV+  G   
Sbjct: 26  WFDTQLPTQERIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEALHGVARNG--- 82

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYW 165
                   AT FP  I   A+F++ L  K    +S EARA +N+          +GLT+W
Sbjct: 83  -------RATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGNRSKYSGLTFW 135

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN+ RDPRWGR  ET GEDP++  +     V GLQ           + + LK ++  
Sbjct: 136 TPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGD---------HPKYLKTAAAA 186

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A   V +     R+ FDA  + +DM ET+   FE  + E +  +VM +YNRVNG P+
Sbjct: 187 KHFA---VHSGPEALRHEFDAIASPKDMYETYFPAFEALITEANVETVMAAYNRVNGHPA 243

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
                LLN  +R +W   G++V+DC  +      HK  A++ E A A  +  G DL+CG 
Sbjct: 244 GGSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESA-ALAINTGTDLNCGA 302

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIE 403
            Y N   +AV+ G V E  IDK L  +     +LGFFD      Y ++    + S+ + +
Sbjct: 303 VY-NALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDDNPYNNISADVVNSEAHAQ 361

Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
           +A E A + IVLL+N  N LPL+   ++ + V GP A+++  ++GNY G+  +  + + G
Sbjct: 362 VAYEMAVKSIVLLQNKNNILPLDR-NIRNLYVTGPFASSSEVLLGNYYGLSGKTTNILDG 420

Query: 464 FSGYANV----TYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES----- 514
            +   +V     YK G        N   +   EA +  D  I + GL  + E E      
Sbjct: 421 ITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEAIA 480

Query: 515 ----LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGY 570
                DR  L LP +Q   + ++ +    PVI+V+ +  G  +   E      AI++A Y
Sbjct: 481 SPHKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTA--GTPVNLTEIAELADAIVFAWY 538

Query: 571 PGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYN 630
           PG+EGG+A+AD++FG+ +P GRLPIT+         P +   L P D     GRTY++  
Sbjct: 539 PGQEGGKAVADILFGERSPSGRLPITF---------PKSEAQLPPYDDYSMQGRTYRYMT 589

Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
              +YPFG+GLSY Q K++ ++   T Q   +K +   N+  T                 
Sbjct: 590 QEPMYPFGFGLSYAQVKFDNITLGNT-QALASKNEPQENMTVT----------------- 631

Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
                   V+  N G  +  +VV +Y K P    +  +  + GF R+ + AG+ +++ F 
Sbjct: 632 --------VNVTNTGEREFEEVVQLYLKTPDAGVSQPLHSLKGFTRIKLAAGQTEQVLFS 683

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
               K L  ++     +L  G++++ VGN 
Sbjct: 684 I-PKKHLYSINEQGKPVLLKGQYSVIVGNA 712


>gi|317474362|ref|ZP_07933636.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316909043|gb|EFV30723.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 723

 Score =  365 bits (938), Expect = 5e-98,   Method: Compositional matrix adjust.
 Identities = 254/759 (33%), Positives = 372/759 (49%), Gaps = 115/759 (15%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + + SL  + R  DLVSR+TL+EK+  + + +  V RLG+  YEWW+EALHGV+  G   
Sbjct: 25  YQNKSLSPTERAADLVSRLTLEEKITLMQNNSSAVKRLGIKPYEWWNEALHGVARNGL-- 82

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYW 165
                   AT +P  I   ASFN++L  ++  ++S EAR  Y   R         GLT+W
Sbjct: 83  --------ATVYPQAIGMGASFNDTLLYQVFTSISDEARVKYRQAREAGNYKRYTGLTFW 134

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN+ RDPRWGR  ET GEDP++  R  ++ V GLQ  +        N++  K  +C 
Sbjct: 135 TPNINIFRDPRWGRGQETYGEDPYLTSRMGLSVVNGLQGPQ--------NTKYNKTHACA 186

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KHYA +    W   +R+ F+A  +  +D+ ET+L  F+  V +G+   VMC+YNR  G P
Sbjct: 187 KHYAVHSGPEW---NRHSFNAENINPRDLWETYLPAFQDLVIQGNVKEVMCAYNRFEGDP 243

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLA-----DSKEDAVAQTLKAGL 339
            C   +LL   +R EW+  G +V+DC +I    DN  F        +K DA A  + +G 
Sbjct: 244 CCGSDRLLINILRNEWNYKGLVVSDCGAI----DNFYFKGRHETHKNKADASAAAVLSGT 299

Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSD 399
           DL+CG+ YT    +AV++G + E+ ID+SL  L      LG  D +  +  L    +   
Sbjct: 300 DLECGRSYTGLI-SAVKEGLINESAIDQSLCRLMKARFELGEMDDTTPWDQLPDSLLSCH 358

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
            + +LA + ARE + LL+N +N LPL+  K  TVA++GP+AN +V    NY G P   ++
Sbjct: 359 AHQQLALQMARESMTLLQNHKNILPLD--KEMTVALIGPNANDSVMQWANYNGFPVHTIT 416

Query: 460 PIAGFSGY---ANVTYKTGCDDVACK------SNNSIFAASEAAKTADATIILAGLDLSV 510
            + G + Y     + Y    +    K        N I A    A  AD  I   G+  S+
Sbjct: 417 LLEGLTQYLPQERLIYIPQKNIEVQKYPWVNYYPNDIQAVINQAAKADVIIYAGGISASL 476

Query: 511 EAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
           E E +          DR  + LP  Q +L+  +    K P++ V  S  G  +     + 
Sbjct: 477 EGEEMDVDAEGFRGGDRTTIELPNVQRKLVKALKATGK-PIVFVNFS--GCAMGLQPESQ 533

Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
              AIL A YPG+ GG AIA+V+FG +NP GRLPIT+Y  D    LP         +   
Sbjct: 534 ICDAILQAWYPGQAGGTAIAEVLFGDYNPAGRLPITFYKKD--NQLP-------DFEDYN 584

Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
             GRTY++ N   LYPFG+GLSYT F Y+      T  +   KL                
Sbjct: 585 MQGRTYRYLNYEPLYPFGHGLSYTTFSYS------TPFIENGKL---------------- 622

Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
                            KV   N G+ +G +V+ +Y K   +     +K + GFQR+ + 
Sbjct: 623 -----------------KVKVTNSGNYNGDEVIQLYIKRYDDPDGP-LKTLRGFQRIHIP 664

Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVG 778
           AG+   + F   +  +    D  +NT+ P  G + I VG
Sbjct: 665 AGQTSEVSFPLTS-DTFTWWDKDSNTVHPLQGRYKILVG 702


>gi|402493386|ref|ZP_10840139.1| beta-glucosidase [Aquimarina agarilytica ZC1]
          Length = 734

 Score =  365 bits (936), Expect = 6e-98,   Method: Compositional matrix adjust.
 Identities = 258/754 (34%), Positives = 379/754 (50%), Gaps = 105/754 (13%)

Query: 51  SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
           +F + D++  +  R K LV+ +TL+EK+  + D +  + RL +P+Y WW+E LHGV+  G
Sbjct: 38  NFEWFDTNKSFEKRAKALVASLTLEEKISLMVDQSAPIDRLNIPEYNWWNECLHGVARNG 97

Query: 111 PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN----LGR----AGL 162
                      AT FP  I   A+F++ L  K+  A+STEARA +N    +G     AGL
Sbjct: 98  R----------ATVFPQAIGLAATFDQDLIFKVADAISTEARAKFNASIAIGNRGKYAGL 147

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+W+PNIN+ RDPRWGR  ET GEDP++  +  VN+V+GLQ   G+      + + LK +
Sbjct: 148 TFWTPNINIFRDPRWGRGQETYGEDPYLTSQIGVNFVKGLQ---GN------HPKYLKSA 198

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           +C KHYA   V +     R+ FDA  +++DM ET+L  FE  VKE     VM +YNRVNG
Sbjct: 199 ACAKHYA---VHSGPEELRHEFDAIASKKDMAETYLPAFEALVKEAKVEGVMGAYNRVNG 255

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF--LADSKEDAVAQTLKAGLD 340
             +CA P LL + ++  W   GYIV+DC     + D HKF  +  + E++ A  L  GL+
Sbjct: 256 EGACASPYLLEKLLKDTWGFKGYIVSDC---WALSDLHKFHKVTQTAEESAAAALNVGLN 312

Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICS 398
           ++CG  Y    G A++QG   E  +D  L++      +LGFFD S    Y  +    + S
Sbjct: 313 VNCGNVYPALDG-AIKQGLTSEKQLDNVLQHQLLTRFKLGFFDPSNNNPYNKITTDVVDS 371

Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM 458
           + +  +A EAA++ IVLLKN+ N L      +K+V V GP+A     ++GNY G+  +  
Sbjct: 372 EAHRAIALEAAQKSIVLLKNNNNLL-PLKKDLKSVYVAGPNAAREDVLLGNYYGVTSKTQ 430

Query: 459 SPIAGF----SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
           + + G     S   ++ YK G        N   ++  E ++ AD  II+ GL  + E E 
Sbjct: 431 TILDGIVSKVSAGTSINYKQGLLPFQKNVNPIDWSTGEISR-ADVGIIVMGLSGNYEGEE 489

Query: 515 ---------LDREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAETNTNIKA 564
                     DR D+ LP  Q   I ++     G P++LV+   GG  IA  E    + A
Sbjct: 490 GEAIASESKGDRVDIRLPQNQIDYIKKIKAKNTGNPLVLVL--TGGSPIAMPEVYDLVDA 547

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           I++A YPGEEGG+A+AD++FG   P G+LPIT+         P +   L P +     GR
Sbjct: 548 IVFAWYPGEEGGQAVADILFGDVVPSGKLPITF---------PKSVDDLPPYNDYAMKGR 598

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           TYK+      +PFG+GLSYT FKY+ L                    Y   AS       
Sbjct: 599 TYKYMTKTPQFPFGFGLSYTSFKYDNLKV------------------YKEKAS------- 633

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
                             N G+ D  +V  VY   P       +  ++GF RV ++AG  
Sbjct: 634 --------------FSITNNGNVDAEEVAQVYVSSPNAGKGDPLNTLVGFTRVSLKAGAT 679

Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           K++   F+  K+    D     +   G +TI VG
Sbjct: 680 KQVSIPFSK-KAFVQFDSDGKEITRKGTYTIHVG 712


>gi|333995841|ref|YP_004528454.1| beta-glucosidase [Treponema azotonutricium ZAS-9]
 gi|333737309|gb|AEF83258.1| periplasmic beta-glucosidase (Gentiobiase)(Cellobiase)
           (Beta-D-glucoside glucohydrolase) [Treponema
           azotonutricium ZAS-9]
          Length = 706

 Score =  365 bits (936), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 240/750 (32%), Positives = 386/750 (51%), Gaps = 106/750 (14%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R+K+++S+MTL+EKV QL   A  V   G+P+Y WW+E LHGV+  G           AT
Sbjct: 6   RIKEMISKMTLEEKVSQLSYDAPAVESAGIPKYNWWNECLHGVARAGL----------AT 55

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGRA------GLTYWSPNINVARDP 175
            FP  I   A+F+E+  + +  A+S E RA YN  + R       GLT+W+PN+N+ RDP
Sbjct: 56  VFPQAIALAATFDEAFIRSVADAISDEGRAKYNEAVKRGNRSQYYGLTFWTPNVNIFRDP 115

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDP++ GR  + +++GLQ           ++  LKV++C KHYA   V +
Sbjct: 116 RWGRGQETYGEDPYLTGRIGLAFMKGLQGD---------DTEHLKVAACAKHYA---VHS 163

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
                R+ FDA V+++D+ ET+L  F++ V+ G   +VM +YNR  G P      LL + 
Sbjct: 164 GPEKLRHTFDAVVSKKDLFETYLPAFKLLVENG-VEAVMGAYNRTLGEPCGGSTYLLKEI 222

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
           +RG W   G++ +DC +I+   +NHK +  S E++ A  L AG DL+CG  Y   T +  
Sbjct: 223 LRGRWGFKGHVTSDCWAIRDFHENHK-VTKSPEESAAMALNAGCDLNCGCTYPYLTVSH- 280

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGI 413
           ++G V +  ID +L  L     +LG FD   Q  Y +LG   +  +++  LA EAA++ I
Sbjct: 281 KKGLVTDETIDTALTRLLRTRFKLGLFDPPEQDPYRNLGNDIVGCEKHRNLALEAAQKSI 340

Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYAN---- 469
           VLLKND N LPL+ +  + + ++GP A   + ++ NY G+  R ++ + G +        
Sbjct: 341 VLLKNDSNILPLDDS-ARKILLMGPGAANILTLLANYYGMSSRLVTILEGLAEKIKTKTA 399

Query: 470 VTYKTGCDDVACKSN---NSIFAASEAAKTA--------DATIILAGLDLSVEAE----- 513
           ++++     +  + N   N  F ++     A        D  I + GLD S+E E     
Sbjct: 400 ISFEYRQGSLMYEPNHLSNVPFGSTGVDAEAPIYGLDEIDLVIAVYGLDGSMEGEEGDSI 459

Query: 514 ----SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAG 569
               + DR+ + LP +Q   + ++ +  K  V+++    GG  IAF E   +  A+L+A 
Sbjct: 460 ASDANGDRDTIELPSWQLNFLRRIRKAGKKVVLIL---TGGSPIAFPEDLAD--AVLFAW 514

Query: 570 YPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFY 629
           YPGE+GG A+AD++FG  +P G+LPIT+         P ++  L P D     GRTY++ 
Sbjct: 515 YPGEQGGNAVADILFGDVSPSGKLPITF---------PQSTAQLPPYDDYALKGRTYRYM 565

Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
               LYPFG+GLSYT F+++      +++++ +K+    ++                   
Sbjct: 566 KETPLYPFGFGLSYTSFRFD------SVELSSSKISAGNSV------------------- 600

Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
                 + KV   N G  D  +VV +Y              + GF+R+ + AG++  ++ 
Sbjct: 601 ------KAKVQVSNTGKRDAEEVVQLYIAKDNRSEDEPASSLRGFRRLKILAGKSASVEI 654

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
              A     I    A+ L+P G +T+   +
Sbjct: 655 ELPASAFETINAEGASVLIP-GSYTVIAAD 683


>gi|358061481|ref|ZP_09148135.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
           WAL-18680]
 gi|356700240|gb|EHI61746.1| hypothetical protein HMPREF9473_00197 [Clostridium hathewayi
           WAL-18680]
          Length = 695

 Score =  364 bits (934), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 218/603 (36%), Positives = 326/603 (54%), Gaps = 66/603 (10%)

Query: 68  LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
           LV +MTL+E+  Q+   A  VPRLG+P Y WW E LHGV+  G           AT FP 
Sbjct: 13  LVEQMTLEERASQMRYDAPAVPRLGIPAYNWWGEGLHGVARAGT----------ATMFPQ 62

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLG--------RAGLTYWSPNINVARDPRWGR 179
            I   A F+  L ++I   VSTE RA YN            GLT+WSPN+N+ RDPRWGR
Sbjct: 63  AIAMAAMFDVELTEEIANVVSTEGRAKYNQFCEEGDRDIYKGLTFWSPNVNIFRDPRWGR 122

Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
             ET GEDP++  R    +VRGLQ    H          LK+++C KH+A   V +    
Sbjct: 123 GHETYGEDPYLTSRLGTAFVRGLQGDGEH----------LKIAACAKHFA---VHSGPEA 169

Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
            R+ F A  +++D+ ET+L  FE CVKE    SVM +YN  +G P CA+  L+ + +RG+
Sbjct: 170 LRHEFWADTSKKDLWETYLPAFEACVKEAHVESVMGAYNSYHGEPCCANTLLMEEILRGQ 229

Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGK 359
           W   G+ V+DC +I+    N+  + D+  ++ A  +K G DL+CG  Y      A ++G 
Sbjct: 230 WGFEGHFVSDCWAIRDFHMNY-MVTDTAMESAALAVKKGCDLNCGNTYLQVL-KACEEGL 287

Query: 360 VKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKND 419
           + +  + +++  L+T    LG  + + +Y  +  + +   E+ ELA EAAR  +VLLKND
Sbjct: 288 LDDACVTEAVVRLFTTRYLLGMGEET-EYDDIPYEVVECKEHRELAVEAARRSMVLLKND 346

Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTG 475
              LPL++ K+ T+AV+GP+A+   A+IGNY G    Y + + G          V Y  G
Sbjct: 347 -GLLPLHAEKLNTIAVIGPNADNRTALIGNYHGTSSCYTTILEGIQDAVGEDVRVLYAEG 405

Query: 476 CD------DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SLDREDL 520
           C       +    + + +  A   AK +D  ++  GLD ++E E         S D++DL
Sbjct: 406 CHLFKDRVEHLAVAGDRLSEARIVAKHSDVVVLCVGLDETLEGEEGDTGNSHASGDKKDL 465

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
            LP  Q +L+ ++  + K PV++  MS   +D++ A+        +W  YPG EGGRA+A
Sbjct: 466 LLPESQRRLMEEILNLGK-PVVVCNMSGSAIDLSLAQEKAGAVIQVW--YPGAEGGRALA 522

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
           D++FGK +P G+LP+T+Y         L ++P  P +     GRTY++     LYPFG+G
Sbjct: 523 DLLFGKASPSGKLPVTFYKD-------LENLP--PFEDYSMDGRTYRYLTAEPLYPFGFG 573

Query: 641 LSY 643
           L+Y
Sbjct: 574 LTY 576


>gi|410639677|ref|ZP_11350222.1| beta-glucosidase [Glaciecola chathamensis S18K6]
 gi|410140558|dbj|GAC08409.1| beta-glucosidase [Glaciecola chathamensis S18K6]
          Length = 733

 Score =  363 bits (933), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 241/750 (32%), Positives = 372/750 (49%), Gaps = 86/750 (11%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+ LP   R+  L+  MTL EK  QL +    + RLGLP+Y++W+EALHGV+  G   
Sbjct: 26  WFDTQLPTQKRIDLLIDAMTLKEKTSQLVNGNVAIERLGLPEYDFWNEALHGVARNG--- 82

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYW 165
                   AT FP  I   A+F++ L  K    +S EARA +N+          +GLT+W
Sbjct: 83  -------RATVFPQAIGMAATFDQHLLLKAASVISDEARAKFNVSSEIGNRSKYSGLTFW 135

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN+ RDPRWGR  ET GEDP++  +     V GLQ           + + LK ++  
Sbjct: 136 TPNINIFRDPRWGRGQETYGEDPYLTAQMGKAMVNGLQGD---------HPKYLKTAAAA 186

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A   V +     R+ FDA  + +DM ET+   FE  V E +  +VM +YNRVNG P+
Sbjct: 187 KHFA---VHSGPEALRHEFDAIASPKDMYETYFPAFEALVTEANVETVMAAYNRVNGHPA 243

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
                LLN  +R +W   G++V+DC  +      HK  A++ E A A  +  G DL+CG 
Sbjct: 244 GGSDFLLNTVLRDKWGFSGHVVSDCWGLADFHQYHKVTANAVESA-ALAINTGTDLNCGA 302

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIE 403
            Y N   +AV+ G V E  IDK L  +     +LGFFD      Y ++    + S+ + +
Sbjct: 303 VY-NALPDAVEAGLVDEKTIDKRLSKVLATKFKLGFFDPKDDNPYNNISADVVNSEAHAQ 361

Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
           +A E A + IVLL+N  N LPL+   ++ + V GP A+++  ++GNY G+  +  + + G
Sbjct: 362 VAYEMAVKSIVLLQNKNNILPLDR-NIRNLYVTGPFASSSEVLLGNYYGLSGKTTNILDG 420

Query: 464 FSGYANV----TYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES----- 514
            +   +V     YK G        N   +   EA +  D  I + GL  + E E      
Sbjct: 421 ITANVSVGTTINYKQGILPYQANVNPIDWTTGEAKQMGDVIIAVMGLSGAYEGEEGEAIA 480

Query: 515 ----LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGY 570
                DR  L LP +Q   + ++ +    PVI+V+ +  G  +   E      AI++A Y
Sbjct: 481 SPHKGDRLSLDLPEHQIAFLRKLRKDNDKPVIVVLTA--GTPVNLTEIAELADAIVFAWY 538

Query: 571 PGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYN 630
           PG+EGG+A+AD++FG+ +P GRLPIT+         P +   L P D      RTY++  
Sbjct: 539 PGQEGGKAVADILFGERSPSGRLPITF---------PKSEAQLPPYDDYSMQERTYRYMT 589

Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
              +YPFG+GLSY Q K++ ++   T Q   +K +   N+  T                 
Sbjct: 590 QEPMYPFGFGLSYAQVKFDNITLGNT-QALASKNEPQENMTVT----------------- 631

Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
                   V+  N G  +  +VV +Y K P    +  +  + GF R+ + AG+ +++ F 
Sbjct: 632 --------VNVTNTGEREFEEVVQLYLKTPDAGVSQPLHSLKGFTRIKLAAGQTEQVLFN 683

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
               K L  ++     +L  G++++ VGN 
Sbjct: 684 I-PKKHLYSINAQGKPVLLKGQYSVIVGNA 712


>gi|409385818|ref|ZP_11238358.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
 gi|399206850|emb|CCK19273.1| Beta-glucosidase [Lactococcus raffinolactis 4877]
          Length = 695

 Score =  363 bits (933), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 243/698 (34%), Positives = 362/698 (51%), Gaps = 104/698 (14%)

Query: 68  LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
           +VS+MTL EK+ Q+   A  + RL +P Y +W+E LHGV+  G           AT FP 
Sbjct: 15  IVSQMTLAEKISQIDFDASAIERLNIPHYNYWNEGLHGVARAGV----------ATVFPQ 64

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDPRWGR 179
            I   A+F+  L K I + +S E RA YN            GLT+WSPNIN+ RDPRWGR
Sbjct: 65  AIGLAATFDTELVKHIAEVISIEGRAKYNAYTKHGDRDIYKGLTFWSPNINLFRDPRWGR 124

Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
             ET GEDPF+  +  V +++GLQ  EG         + L++++C KH+A   V +    
Sbjct: 125 GQETYGEDPFLTAQIGVAFIKGLQG-EG---------KYLRLAACTKHFA---VHSGPEA 171

Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
           DR++FDA V  +D+ E +L  F+  ++E D  S M +YN +NG P+C + +L+ +T+ G+
Sbjct: 172 DRHYFDAVVNPKDLNEFYLPQFKAAIEEADVESFMGAYNAINGQPACVNEELIAKTLLGK 231

Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGK 359
           W   G++V+D  +++ + +NH +   + E  +A  +K G +L C    ++    AV +G 
Sbjct: 232 WGFEGHVVSDYAALEDVHENHHYTQTAAE-TMALAMKIGTNL-CAGKISDALFEAVGKGL 289

Query: 360 VKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKND 419
           V ET+I  S+  LYT  +RLG F     Y ++  +   S E+  L+ +AA + +VLLKND
Sbjct: 290 VTETEITASVVKLYTTHVRLGMFAEDNDYDTIPYEVNASAEHEMLSLKAAEKSMVLLKND 349

Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----FSGYANVTYKTG 475
            N LPL+ +++K+VAV+GP A    A+ GNYAG    Y + ++G     S  A VTY  G
Sbjct: 350 -NFLPLSQSEIKSVAVIGPTARNIGALEGNYAGTANHYETFVSGIQQALSNQARVTYALG 408

Query: 476 CDDVACKSNNSIFAASE-------AAKTADATIILAGLDLSVEAE---------SLDRED 519
           C   A  + +S+  A+E       AA+ AD  ++  GLD ++E E         S D+  
Sbjct: 409 CHLYADHAESSLSRANERESEAIIAAEHADIAVLCVGLDPTIEGEQGDAGNVYGSGDKPS 468

Query: 520 LWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAI 579
           L LPG Q +LI +V E  K  VILV+ S   + +   E +T +KAI+ A YPG  GG A+
Sbjct: 469 LSLPGQQKRLIEKVLETGK-TVILVLTSGSALSLEGLEKHTGVKAIIQAWYPGAHGGTAL 527

Query: 580 ADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGY 639
           A+++ GK +P G+LP+T+      Q LP  S             RTY+      LYPFGY
Sbjct: 528 ANILLGKVSPSGKLPVTFCKD--TQGLPDFS-------DYSMAERTYQNTQLEVLYPFGY 578

Query: 640 GLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKV 699
           GL+Y   +       KT+Q+                                 D     V
Sbjct: 579 GLTYGHAE------IKTLQL---------------------------------DDLTLSV 599

Query: 700 DFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
             +N G  D  +V+ VY K  +E A    K +I F+R+
Sbjct: 600 TAENKGDYDIEEVIQVYVKINSEFAPKNHK-LIAFKRI 636


>gi|167751044|ref|ZP_02423171.1| hypothetical protein EUBSIR_02029 [Eubacterium siraeum DSM 15702]
 gi|167655962|gb|EDS00092.1| glycosyl hydrolase family 3 C-terminal domain protein [Eubacterium
           siraeum DSM 15702]
          Length = 691

 Score =  363 bits (933), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 229/639 (35%), Positives = 349/639 (54%), Gaps = 78/639 (12%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D  L    R   L   ++ +E+ QQL   A  + + GLP Y WW+E LHGV+  G   
Sbjct: 4   YKDKQLSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT-- 61

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   A+F++ +  ++G+ +STEARAMYN            GLT W
Sbjct: 62  --------ATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIYKGLTLW 113

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN+ RDPRWGR  ET GEDP++  R  VN+V+G+Q  E +          L+ ++C 
Sbjct: 114 APNINIFRDPRWGRGHETYGEDPYLTSRLGVNFVKGIQGEEEY----------LRAAACA 163

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A   V +     R+ FDARV+E+DMEET+L  F+  VKEG    VM +YNRVNG PS
Sbjct: 164 KHFA---VHSGPESLRHEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNRVNGEPS 220

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           CA  KL+ +    EW   GY V+DC +I+     HK + D+   + A  LKAG D++CG 
Sbjct: 221 CASEKLMGKLR--EWGFDGYFVSDCWAIRDFHTTHK-ITDTAPQSAAMALKAGCDVNCGN 277

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
            Y +    A+++G + + +I  +  +     +RLG  D + ++  L    I  D N  L+
Sbjct: 278 TYLHILA-ALEEGLITKQNIRTACIHALRTRIRLGQLDDN-EFDDLPFDIIACDGNKALS 335

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG-- 463
            EAA + +VLL ND   LPL+ +++ ++AV+GP+A++  A++GNY G P R ++ + G  
Sbjct: 336 LEAAEKSMVLLHND-GILPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVTFLEGIQ 394

Query: 464 --FSGYANVTYKTGCDDVACKSN------NSIFAASEAAKTADATIILAGLDLSVEAE-- 513
             F G   V Y  GC     ++       +    A  A + AD T++  GLD ++E E  
Sbjct: 395 DAFDG--RVYYAEGCQLFRDRTQGLALPGDRYAEAVAACEAADVTVVCVGLDATLEGEEG 452

Query: 514 -------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
                  S D+ DL LP  Q  L+ ++ +  K P+I+V+ +   V+     T     A++
Sbjct: 453 DTGNEFASGDKPDLRLPEVQRVLLQKLKDTGK-PLIIVLAAGSSVN-----TECEGNALI 506

Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP-LTSMPLRPVDSLGYPGRT 625
            A YPG+ GG+A+A+++FG+ +P G+LP+T+Y      MLP  T   ++         RT
Sbjct: 507 NAWYPGQYGGKALAEILFGEVSPSGKLPVTFYKS--ADMLPDFTDYSMK--------NRT 556

Query: 626 YKFYNGPT--LYPFGYGLSYTQFKYNLLSFT-KTIQVNL 661
           Y+F +  +  LYPFGYGL+Y+ F+   +S+   T+ VN+
Sbjct: 557 YRFCDDESNVLYPFGYGLTYSHFECGDISYKDNTLAVNV 595


>gi|291556907|emb|CBL34024.1| Beta-glucosidase-related glycosidases [Eubacterium siraeum V10Sc8a]
          Length = 691

 Score =  363 bits (932), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 230/639 (35%), Positives = 348/639 (54%), Gaps = 78/639 (12%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D  L    R   L   ++ +E+ QQL   A  + + GLP Y WW+E LHGV+  G   
Sbjct: 4   YKDKQLSAYERAAALADTLSTEEQAQQLKYDAPAIEKAGLPSYNWWNEGLHGVARAGT-- 61

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   A+F++ +  ++G+ +STEARAMYN            GLT W
Sbjct: 62  --------ATVFPQAIALAAAFDKDMMYRVGEVISTEARAMYNSAAKHGDTDIYKGLTLW 113

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN+ RDPRWGR  ET GEDP++  R  V++V+G+Q  E +          L+ ++C 
Sbjct: 114 APNINIFRDPRWGRGHETYGEDPYLTSRLGVSFVKGIQGEEEY----------LRAAACA 163

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A   V +     R+ FDARV+E+DMEET+L  F+  VKEG    VM +YNRVNG PS
Sbjct: 164 KHFA---VHSGPESLRHEFDARVSEKDMEETYLPAFKALVKEGRVEGVMGAYNRVNGEPS 220

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           CA  KL+ +    EW   GY V+DC +I+     HK + D+   + A  LKAG D++CG 
Sbjct: 221 CASEKLMGKLR--EWGFDGYFVSDCWAIRDFHTTHK-ITDTAPQSAAMALKAGCDVNCGN 277

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
            Y +    A+++G + + DI  +  +     +RLG  D + ++  L    I  D N  L+
Sbjct: 278 TYLHILA-ALEEGLITKQDIRTACIHALRTRIRLGQLDDN-EFDDLPFDIIACDGNKALS 335

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG-- 463
            EAA + +VLL ND   LPL+ +++ ++AV+GP+A++  A++GNY G P R ++ + G  
Sbjct: 336 LEAAEKSMVLLHND-GILPLDKSRISSIAVIGPNADSRAALLGNYNGTPDRSVTFLEGIQ 394

Query: 464 --FSGYANVTYKTGCDDVACKSN------NSIFAASEAAKTADATIILAGLDLSVEAE-- 513
             F G   V Y  GC     ++       +    A  A + AD T+I  GLD ++E E  
Sbjct: 395 DAFDG--RVYYAEGCQLFRDRTQGLALPGDRYAEAVAACEAADVTVICVGLDATLEGEEG 452

Query: 514 -------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
                  S D+ DL LP  Q  L+  + +  K P+I+V+ +   V+     T     A++
Sbjct: 453 DTGNEFASGDKPDLRLPEVQRVLLQNLKDTGK-PLIIVLAAGSSVN-----TECEGNALI 506

Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP-LTSMPLRPVDSLGYPGRT 625
            A YPG+ GG+A+A+++FG+ +P G+LP+T+Y      MLP  T   ++         RT
Sbjct: 507 NAWYPGQYGGKALAEILFGEVSPSGKLPVTFYKS--ADMLPDFTDYSMK--------NRT 556

Query: 626 YKFYNGPT--LYPFGYGLSYTQFKYNLLSFT-KTIQVNL 661
           Y+F +  +  LYPFGYGL+Y+ F+   +S+   T+ VN+
Sbjct: 557 YRFCDDESNVLYPFGYGLTYSHFECGDVSYKDNTLAVNV 595


>gi|348684865|gb|EGZ24680.1| family 3 glycoside hydrolase [Phytophthora sojae]
          Length = 769

 Score =  362 bits (930), Expect = 4e-97,   Method: Compositional matrix adjust.
 Identities = 238/666 (35%), Positives = 352/666 (52%), Gaps = 65/666 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR-----LGLPQYEWWSEALHGVSN 108
           FC++SL  + RV+DL+SR+ L EK   L   A   PR     +GLP+Y W +  +HGV +
Sbjct: 36  FCNTSLSTADRVEDLLSRLPLQEKATLL--TARASPRGNMSSIGLPEYNWGANCVHGVQS 93

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG---------R 159
              GT+        TSFP  +   A F+  +   + Q +  E RA++  G          
Sbjct: 94  TC-GTNC------PTSFPNPVNLGAIFDPQVVFDMAQVIGWELRALWLEGATENYKGGPH 146

Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GL  WSPNIN+ RDPRWGR TETP EDP V  +Y V Y RGLQ  EG       + R L
Sbjct: 147 LGLDCWSPNININRDPRWGRNTETPSEDPLVNSKYGVAYTRGLQ--EGKRQ----DPRFL 200

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           +     KHYAAY  +N+ GV+R  FDA V+  D  +T+   F   V +G+A  VMCSYN 
Sbjct: 201 QAVVTLKHYAAYSYENYGGVNRMEFDAIVSPYDFADTYFPAFRSSVVDGNAKGVMCSYNS 260

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
           VNGIP CA+ +L+   +RG     GY+ +D  +++ + D H + ADS+ +A    + AG 
Sbjct: 261 VNGIPMCANKELVETLLRGTLGFDGYVTSDSGAVEAISDMHHY-ADSQCEAARLAILAGT 319

Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDIC 397
           D++ G+ Y       V   +++E  +D +L++   +   LG FD      Y ++   ++ 
Sbjct: 320 DINSGKSYEACLKTLVDDNQLEEKALDDALRHTLKLRFELGLFDPIDDQPYWNVTPSEVN 379

Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR- 456
           +     L+  A R+ +V+L+N+ + LPL   K   +AV+GPHA +   ++GNY G  C  
Sbjct: 380 TAAAKALSLNATRKSLVMLQNNASVLPLQ--KGVKLAVLGPHAKSKRGLLGNYLGQMCHG 437

Query: 457 ----------YMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
                      +  I   +G +N T+  GC  ++  S      A  AAK ADA ++  G+
Sbjct: 438 DYDEVGCVQTPLDAIRAANGASNTTFAEGC-GISGNSTAGFEKAVAAAKEADAVVLFLGI 496

Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
           D S+E E  DR ++ LP  Q QL+ +V  V + P ++V+++ GGV I   E      A++
Sbjct: 497 DKSIEGEVGDRNNIDLPNIQMQLLQRVHAVGR-PTVVVLIN-GGV-IGAEEIIERTDALV 553

Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
            A YPG  G RA+ADV+FG  NP G+LP+T Y  DYV  + + SM     D   +PGRTY
Sbjct: 554 EAFYPGFFGARAMADVLFGDTNPSGKLPVTMYRSDYVDQVEMKSM-----DMTAHPGRTY 608

Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT----SDASKTRCP 682
           +++ G  ++PFG+GLSYT F  ++ S T       N   H  N  ++    SD +     
Sbjct: 609 RYFKGEPVFPFGWGLSYTTFSLSVDSGT-------NSSSHSNNAAFSGGEVSDTANVTIS 661

Query: 683 GVLVND 688
            V+ ND
Sbjct: 662 VVVKND 667


>gi|443695317|gb|ELT96258.1| hypothetical protein CAPTEDRAFT_179825 [Capitella teleta]
          Length = 750

 Score =  360 bits (925), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 246/768 (32%), Positives = 387/768 (50%), Gaps = 96/768 (12%)

Query: 41  RFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL----GDFAHGVPRLGLPQY 96
           RF+     + SF F + SLP   R+ DL+SR+T+++ + Q     G F  G+ RLG+   
Sbjct: 26  RFAPSSHALDSFPFRNVSLPIETRLNDLISRLTIEDAINQTVARYGKFTPGIERLGIKPI 85

Query: 97  EWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN 156
           E+ +E L GV               AT FP  +   ASF+  L +++  AVS E RA YN
Sbjct: 86  EYITECLRGVRR-----------ENATGFPQALGLAASFSRDLMQRVATAVSVEVRAFYN 134

Query: 157 -------LGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
                   G  G+T +SP IN+ R P WGR  ET GEDP++ G  A  YV GLQ  +   
Sbjct: 135 HDIQRETYGAHGITCFSPVINILRHPLWGRNQETYGEDPYLSGELASQYVSGLQGDD--- 191

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
                  R L+VS+ CKH+ A+   +   V ++ FDA++ E+D++ TFL  F+ C+    
Sbjct: 192 ------PRYLRVSAGCKHFDAHGGPDTIPVRKFGFDAKIEERDLQMTFLPAFKKCIA-AK 244

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
             +VMCS+N +NG+PSCA+ +LL   +R +W   G++V+D  +++ +   H +   S E 
Sbjct: 245 PYNVMCSFNSINGVPSCANKRLLTDVLRAQWGYEGFVVSDDAAVEYIFTEHHY-NSSFET 303

Query: 330 AVAQTLKAGLDLD-CGQY---YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
           A  + +K+G +++  G++   Y   T  A+ +  + + ++ ++++ ++     LG FD  
Sbjct: 304 AAVEAIKSGCNMELVGKFDPSYWQLT-KALNEHLITKDELMENVRPVFLTRFLLGEFDPP 362

Query: 386 P--QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
               +  + K  + S E+  LA EAA +  VLLKND+N LPL    +KTVAVVGP +N T
Sbjct: 363 ALNPFNQITKDVVLSAEHQRLALEAAVKSFVLLKNDRNFLPLLKNSLKTVAVVGPMSNYT 422

Query: 444 VAMIGNYA--GIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
             +IG+Y+    P   ++P+ G    A NV + +GC +  C    +   A+ A   A   
Sbjct: 423 DGLIGDYSTDTDPSLILTPLHGIKKLAPNVQFASGCSNSTCTDYRATDVAA-AVDGAQVV 481

Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAETN 559
            +  G    VEAE+ DR D+ LPG Q QL+      A G PV+L++ + G +D+ FA+  
Sbjct: 482 FVALGTGFIVEAENNDRSDIVLPGAQLQLLKDAVYHANGRPVVLLLFNGGPLDVTFAQLT 541

Query: 560 TNIKAILWAGYPGEEGGRAIADVVF---GKFNPGGRLPITWYNGDYVQMLP-LTSMPLRP 615
           + I +I+   +P    G AI  ++    G  +P GRLP+TW    Y+  +P +T   ++ 
Sbjct: 542 SGIVSIVECFFPAMMTGEAIYRMLINNEGISSPAGRLPLTW--PAYLNQVPNITDYTMK- 598

Query: 616 VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
                  GRTY++Y    LYPFGYGLSYTQFKY+ L  T    + + K Q  R       
Sbjct: 599 -------GRTYRYYTEDPLYPFGYGLSYTQFKYSDLKVTP---LEVTKGQEIR------- 641

Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIV-----YSKPPAEIAATYIKQ 730
                                 KV   N+G  D  +V I+      S P  EI      Q
Sbjct: 642 ---------------------VKVKVTNIGLYDADEVRIIVVQAYVSWPKTEIPVPRW-Q 679

Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           ++ F R+ + +G+++ ++    A       +      +  GE T+++G
Sbjct: 680 LVAFDRIHIASGKSETVELTIEASLLEVWQNPETGFDILEGEMTLYIG 727


>gi|333494646|gb|AEF56854.1| putative glycosyl hydrolase [synthetic construct]
          Length = 743

 Score =  360 bits (924), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 245/768 (31%), Positives = 379/768 (49%), Gaps = 109/768 (14%)

Query: 46  GLQMSSF----LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE 101
           G  M+S     ++ D +L +  R +DLVSRMTL+EK+ Q+   A  + RLG+P Y WW+E
Sbjct: 18  GSHMASMTQIPVYRDENLSFEERARDLVSRMTLEEKIAQMQHEAPSIERLGVPAYNWWNE 77

Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR-- 159
           ALHGV+  G           +T FP  I   A+F+  L +K    +STE RA Y+  +  
Sbjct: 78  ALHGVARAGV----------STMFPQAIGMAATFDAELIEKTADVISTEGRARYHEFQRK 127

Query: 160 ------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
                  GLT+WSP IN+ RDPRWGR  ET GEDP++  R AV+++RG+Q          
Sbjct: 128 GDRDIYKGLTFWSPTINIDRDPRWGRGQETYGEDPYLTSRLAVSFIRGIQG--------- 178

Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
              R LK ++C KH+A   V +    +R+ F+A V+++D+ ET+L  FE  VKE   + V
Sbjct: 179 -RGRYLKAAACAKHFA---VHSGPESERHQFNAEVSQKDLWETYLPAFEASVKEAKVAGV 234

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           M +YNRVNG P C    LL   +RGEW+  GY+ +DC +I+ + + H  +  + E++ A 
Sbjct: 235 MGAYNRVNGEPCCGSGTLLGDVLRGEWEFGGYVTSDCWAIKDINEGHG-VTKTIEESSAL 293

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
            +K+G DL+CG  Y +    A + G + E +ID ++  L    MRLG FD   +  Y S+
Sbjct: 294 AVKSGCDLNCGCAYASLV-KAYRAGLIGEKEIDTAVHRLMLTRMRLGMFDAPEKVPYSSI 352

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
             +     E+   A E A + +VLL+N    LPL+ +++++VAV+GP+A++ VA+ GNY 
Sbjct: 353 PYEKNDCAEHRAFALEVAEKSLVLLRNRSGFLPLDRSRIRSVAVIGPNADSRVALEGNYN 412

Query: 452 GIPCRYMSPIAGF----SGYANVTYKTGCD------DVACKSNNSIFAASEAAKTADATI 501
           G    Y++ + G        A V Y  G            + N+ +  A+ AA+ AD  +
Sbjct: 413 GTASEYVTVLDGIREAVGDRARVYYAEGSHLFRNSMGGLSQKNDRLAEAAAAAERADVAV 472

Query: 502 ILAGLDLSVEAE---------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVD 552
           +  GL+  +E E         + D+ DL LPG Q +L+  V      PV+LV++S   + 
Sbjct: 473 VCLGLNRDIEGEEGDPSNEYPAGDKRDLRLPGLQEELLETVKATGT-PVVLVLLSGSALA 531

Query: 553 IAFAETNTNIKAILWAGYPGE--EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
           + +A+ N +  A++ A YPG   EG R     +FG   P G  P          +   TS
Sbjct: 532 VNWADENAD--AVVQAWYPGAQAEGRRG---ALFGIIRPAGGFP------SRSTVRTRTS 580

Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
                +     P        G  LYPFGYGLSYT+F+Y  L                   
Sbjct: 581 RIFGTIHENRLP-----LLQGDPLYPFGYGLSYTKFQYGDLKLA---------------- 619

Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
                           +++   +  E  V  +N G  D  +VV +Y +           Q
Sbjct: 620 ---------------ASEIPAGEDAEVSVTVRNAGERDSDEVVQLYLQDLESSVPVPKWQ 664

Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           + GF+RV ++ G +  ++F   A + + ++D     +L  G   ++ G
Sbjct: 665 LAGFRRVHLKPGESAGVRFTV-AARQMALIDEDGRCVLEPGGFRVYAG 711


>gi|291240563|ref|XP_002740191.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 747

 Score =  360 bits (923), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 244/723 (33%), Positives = 366/723 (50%), Gaps = 91/723 (12%)

Query: 42  FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG-------VPRLGLP 94
           FS +   +  F F ++SLP+S RV DLV R+TL+E V Q+     G       + RLG+ 
Sbjct: 15  FSLISTILGDFPFRNTSLPWSERVDDLVGRLTLEEIVLQMSRGGTGSNGPAPPIDRLGIG 74

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            Y W +E LHG    GP          ATSFP      A+F+  L ++I  A + E RA 
Sbjct: 75  PYSWNTECLHGDVAAGP----------ATSFPQAFGLAATFDAVLIEQIANATAYEVRAK 124

Query: 155 YNL--------GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
           YN            GL+ +SP IN+AR P WGRI ET GEDP++ G  A +YV GLQ   
Sbjct: 125 YNNYAKHKEYGDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASYVNGLQ--- 181

Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
           G+      + R +  ++ CKH+ AY         R  FDA+V+++D+  TFL  F  C++
Sbjct: 182 GN------HPRYVTANAGCKHFDAYAGPEDIPSSRSTFDAKVSDRDLRMTFLPAFHECIQ 235

Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
            G   S+MCSYN +NG+P+CA+ KLL   +R EW+  GY+++D  +++ + D H +  D 
Sbjct: 236 AG-THSLMCSYNSINGVPACANKKLLTDILRTEWNFTGYVISDQSAVEKVYDAHHYTKDM 294

Query: 327 KEDAVAQTLKAGLDLDCGQYYTN----FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
            + A+A  + +GL+L+      +     T  AV+QG V    +   +  L+   MRLG F
Sbjct: 295 LDTAIA-CVNSGLNLELSSNLEDNVMMQTTKAVKQGNVTMKTVKARVSPLFYTRMRLGEF 353

Query: 383 DGSPQYVSLGKQD---ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPH 439
           D  P+     K D   I S E+ EL+ +AA +  VLLKN+   LPL   K+  +AVVGP 
Sbjct: 354 D-PPEMNPYSKLDLSIIQSQEHQELSLKAAAKSFVLLKNENRFLPLKE-KIDKLAVVGPL 411

Query: 440 ANATVAMIGNYAGIPCRY-MSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTA 497
           A+   A+ G+Y+  P  Y ++P  G +  A N +Y +GCD+  C+  +S    S A   A
Sbjct: 412 ADNVDALYGDYSATPNNYTVTPRNGLARLAGNTSYASGCDNPKCRKYDSGQVKS-AVSGA 470

Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
           D  ++  G    +E+E  DR +L LPG Q  L+    +    PVIL++ +AG +D+++A 
Sbjct: 471 DMVVVCVGTGTDIESEGNDRHELALPGKQLSLLQDAVKFGTKPVILLLFNAGPLDVSWAV 530

Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFG---KFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
            N  ++ I+   +P +  G A+  +      + NP GRLP+TW      Q+ P+T   ++
Sbjct: 531 ENPAVQTIVACFFPAQATGDALYRMFMNTSPESNPAGRLPMTWPR-SMEQVPPMTDYTMK 589

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
                   GRTY++ +   L+PFG+GLSYT FKY                       Y +
Sbjct: 590 --------GRTYRYSDADPLFPFGFGLSYTLFKY-----------------------YNT 618

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
            AS T         ++  D     +   NVG   G +V+ VY             Q++GF
Sbjct: 619 SASPTV--------IKSCDTVTIPLTVTNVGDFPGDEVMQVYISWSNASVTVPKLQLVGF 670

Query: 735 QRV 737
           +RV
Sbjct: 671 RRV 673


>gi|373852136|ref|ZP_09594936.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
 gi|372474365|gb|EHP34375.1| Beta-glucosidase [Opitutaceae bacterium TAV5]
          Length = 740

 Score =  359 bits (921), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 243/705 (34%), Positives = 355/705 (50%), Gaps = 72/705 (10%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           F D  L    RV+DLVSR+TL EKV Q+   A  +PRLG+P Y +W+E LHGV+  G   
Sbjct: 23  FRDPDLALDHRVRDLVSRLTLAEKVSQMEHAAAAIPRLGIPAYNYWNECLHGVARNG--- 79

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-----------GL 162
                   AT FP +I   A+++  L  ++  A+S EARA ++   A           GL
Sbjct: 80  -------RATVFPQIIGLAATWDTDLVYRVATAISDEARAKHHAALARQGFAQTQQYQGL 132

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+W+PNIN+ RDPRWGR  ET GEDP +  R A  +VRGLQ         D     LK++
Sbjct: 133 TFWTPNINLFRDPRWGRGQETWGEDPHLTARLAAAFVRGLQG--------DTPDTHLKLA 184

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           +C KHYA   V +    +R+ F+ARVT  D+ +++L  FE  V+     SVM +YNR   
Sbjct: 185 ACAKHYA---VHSGPENERHTFNARVTPHDLWDSYLPAFEHLVRHARVESVMGAYNRTLD 241

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
            P CA   LL   +R  W   G++V+DC +++ + + H+   D  E A A  L  G DL 
Sbjct: 242 EPCCASQFLLLDILRERWGFEGHVVSDCWALRDIHETHRITTDPVESA-ALALTKGCDLA 300

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-----PQYVSLGKQDIC 397
           CG  +    G AVQ+G + E DID++L        +LG FD +     P       + I 
Sbjct: 301 CGTTF-ELLGEAVQRGLITEADIDRALSRHLRARFKLGMFDPADDNRNPWSNPPAPEAIV 359

Query: 398 S-DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
           +   +  LA EAA    VLL+N  + LPL    V+++ + GP A    A++GNY G+P R
Sbjct: 360 TCAAHTALACEAAVASCVLLQNHNHILPLRP-DVRSIYITGPLAATQDALLGNYYGLPPR 418

Query: 457 YMSPIAGFSGY----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
            ++ + G +          Y+ G      K N   +A  + A + D TI   GL   +E 
Sbjct: 419 AITLLDGLAAALPEGIRADYRPGALLSTPKQNALEWAEFDCA-SCDVTIACLGLTALLEG 477

Query: 513 E-------SL--DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
           E       SL  DR+D+ LP  Q   +  +  + +G  ++VI+  GG  ++       ++
Sbjct: 478 EEGEAIASSLHGDRDDISLPPPQRLFLESL--IQRGARVIVILF-GGSALSLGPLADKVE 534

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
           AILWAGYPG+EGGRA+AD++ G+ +P GRLPIT+Y  + +  LP       P  +    G
Sbjct: 535 AILWAGYPGQEGGRALADILLGRASPSGRLPITFY--ENINDLP-------PYANYSMRG 585

Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV-NLNKLQHCRNLNYTSDASKTRCP 682
           RT+++++G   +PFG+GL+YT+F Y+ L  +      N + L     L  T D       
Sbjct: 586 RTHRWFDGTPAWPFGFGLTYTRFTYSDLRVSDVYSPGNDSPLCGSVLLTNTGDHEAAEIV 645

Query: 683 GVLVNDLRCDDY----FEFKVDFQNVGSTDGSDVVIVYSKPPAEI 723
            + + D           E   DF  V    G    + +S PP  I
Sbjct: 646 QIYLTDFDAPGNGPVPRENLADFHRVTLAPGQSRRVEFSIPPEHI 690


>gi|317057539|ref|YP_004106006.1| glycoside hydrolase family protein [Ruminococcus albus 7]
 gi|315449808|gb|ADU23372.1| glycoside hydrolase family 3 domain protein [Ruminococcus albus 7]
          Length = 691

 Score =  358 bits (920), Expect = 5e-96,   Method: Compositional matrix adjust.
 Identities = 223/625 (35%), Positives = 337/625 (53%), Gaps = 68/625 (10%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D +L    R + L   MT +E+  QL   A  V RLG+P Y WW+E +HG++  G   
Sbjct: 4   YLDETLSAQERAEALTDEMTTEEQASQLRYDAPAVERLGIPAYNWWNEGIHGLARSGV-- 61

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   A F++ L KK  +  S EARA YN            GLT W
Sbjct: 62  --------ATMFPQAIGLAAMFDDELTKKTAEVTSEEARAKYNAYSGEEDRDIYKGLTLW 113

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN+ RDPRWGR  ET GEDP++  +  +  VRGLQ           + + +K ++C 
Sbjct: 114 APNINIFRDPRWGRGHETFGEDPYLTTKNGMAVVRGLQG----------DGKVIKAAACA 163

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A   V +     R+ FDA+   +DMEET+L  FE  VKE    SVM +YNRVNG P+
Sbjct: 164 KHFA---VHSGPEAIRHSFDAKANAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPA 220

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           CA   L+++    EW+  GY V+DC +I+   +NH   A++ E + A  LKAG D++CG 
Sbjct: 221 CASNFLMDKL--KEWEFDGYFVSDCWAIRDFHENHMVTANAIE-STAMALKAGCDVNCGC 277

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
            Y N    A+++G V + DI  +  +L    +RLG FD   +Y  +    +   E+  ++
Sbjct: 278 TYQNLL-VALEKGAVTKEDIRTACVHLMRTRIRLGMFDKKTEYDDIPYDKVACKEHKAIS 336

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
            E A + +V+L+N+   LP++++K KT+AV+GP+A++  A+ GNY G+  RY + + G  
Sbjct: 337 LECAEKSLVMLENN-GILPVDTSKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQ 395

Query: 466 GY--ANVTYKTGC----DDVA--CKSNNSIFAASEAAKTADATIILAGLDLSVEAE---- 513
                 V +  GC    D V+   ++ +    A  AAK AD TI+  GLD ++E E    
Sbjct: 396 DRFDGRVIFAEGCHLYKDRVSNLAQAGDRYAEAVAAAKFADMTILCLGLDATIEGEEGDT 455

Query: 514 -----SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
                S D+  L LP  Q +L+ ++  V K PV+ V+ +   ++     T +   A++ A
Sbjct: 456 GNEFSSGDKNGLTLPPPQRELVKKIMAVGK-PVVTVVCAGSAIN-----TESKPDALIHA 509

Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
            YPG EGG+A+A+V+FG  +P G+LP+T+Y  D  ++   T   ++        GRTY++
Sbjct: 510 FYPGAEGGKALAEVLFGDVSPSGKLPVTFYE-DTDKLPEFTDYSMK--------GRTYRY 560

Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSF 653
                LYPFGYGL+Y   K   + +
Sbjct: 561 TTENVLYPFGYGLTYGSVKVTKVEY 585


>gi|291544853|emb|CBL17962.1| Beta-glucosidase-related glycosidases [Ruminococcus champanellensis
           18P13]
          Length = 697

 Score =  358 bits (919), Expect = 7e-96,   Method: Compositional matrix adjust.
 Identities = 226/625 (36%), Positives = 335/625 (53%), Gaps = 68/625 (10%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + + SL    R +DL  R+T++E+  QL   A  +PRLG+P Y WW+E LHGV+  G   
Sbjct: 9   YLNPSLTPDERAEDLADRLTVEEQASQLRYDALPIPRLGIPAYNWWNEGLHGVARAGT-- 66

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYW 165
                   AT FP  I   A+F+ +L  +IG+  +TEARA +   R         GLT W
Sbjct: 67  --------ATMFPQAIGMAATFDTALLHQIGEITATEARAKHMAAREHGDFDIYKGLTLW 118

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN+ RDPRWGR  ET GEDPF+  R  V +V+G+Q  EG         + LK ++C 
Sbjct: 119 APNINLFRDPRWGRGHETYGEDPFLTARLGVAFVKGMQG-EG---------KVLKAAACA 168

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A   V +     R+ FDA+V+ +D+EE++L  F   V E     VM +YNRVNG PS
Sbjct: 169 KHFA---VHSGPEALRHSFDAQVSPKDLEESYLPAFHALVAEAKVEGVMGAYNRVNGEPS 225

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           CA P L+++    +W   GY V+DC +IQ    +H    +  E A A  L+ G DL+CG 
Sbjct: 226 CASPMLMDKL--HQWGFAGYFVSDCWAIQDFHKHHGVTKNVTESA-ALALRTGCDLNCGN 282

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
            Y  +   A+++G +   DI ++   +    +RLG FD  P + +     I S  +  ++
Sbjct: 283 TYL-YVLAALEEGLIDAADIRRACIRVLRTRIRLGLFDPEPHFAACTYDTIASPAHKAVS 341

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
              A + +VLLKND   LPL+ +K+  +AV+GP+A++  A+ GNY G   RY++ + G  
Sbjct: 342 LSCAEKSMVLLKND-GILPLDLSKLHAIAVIGPNADSRAALEGNYCGTADRYVTFLEGIQ 400

Query: 466 GY--ANVTYKTGCDDVACKSNNSIFA------ASEAAKTADATIILAGLDLSVEAE---- 513
                 V Y  GC     +++N   A      A  AA+ +D  I+  GLD ++E E    
Sbjct: 401 DAFPGRVHYAQGCHLYKDRTSNLAMADDRYAEALAAAEASDVVILCLGLDATLEGEEGDT 460

Query: 514 -----SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
                S D+ DL LP  Q +L+ ++  V K PVILV+ +   ++    E + N  A+L A
Sbjct: 461 GNEFSSGDKADLRLPPPQCKLLEKLHAVGK-PVILVLAAGSALN---PEISCN--AVLQA 514

Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
            YPG+ GG+A+A ++FGK +P G+LP+T+Y          T+  L          RTY++
Sbjct: 515 WYPGQCGGQALAHILFGKVSPSGKLPVTFYE---------TAEQLPDFTDYSMQNRTYRY 565

Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSF 653
                LYPFGYGL+Y +     LS+
Sbjct: 566 ARNNVLYPFGYGLTYGKIVCTELSY 590


>gi|348684872|gb|EGZ24687.1| family 3 glycoside hydrolase [Phytophthora sojae]
          Length = 805

 Score =  358 bits (918), Expect = 8e-96,   Method: Compositional matrix adjust.
 Identities = 256/779 (32%), Positives = 383/779 (49%), Gaps = 92/779 (11%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR-----LGLPQYEWWSEA 102
           +   F FCD+SL  S RV+DL+ R+ LDEKV  L   A   P+     +GLP+Y W +  
Sbjct: 30  EHQKFPFCDASLSTSERVEDLLRRLPLDEKVTLLT--ARASPKGNMSSIGLPEYNWGANC 87

Query: 103 LHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG---- 158
           +HGV +   GT+       ATSFP  +   A F+      + Q +  E RA++  G    
Sbjct: 88  VHGVQSTC-GTNC------ATSFPNPVNLGAIFDPQAVFDMAQVIGWELRALWLEGAREN 140

Query: 159 -----RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
                  GL  WSPNIN+ RDPRWGR  ETP EDP V  +Y V Y RGLQ  EG +    
Sbjct: 141 YAAGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTRGLQ--EGKDK--- 195

Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
              R L+     KHYAAY  +++ G+DR  F+A+V+  D  +T+L  F   V EG A  V
Sbjct: 196 ---RFLQAVVTLKHYAAYSYEHYDGIDRMAFNAQVSRYDFADTYLPAFHASVVEGKAKGV 252

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           MCSYN VNG+P CA+ +L  + +R      GYI +D  +I+ +     +   S  +A   
Sbjct: 253 MCSYNSVNGMPMCANEQLNTKLLREALGFDGYITSDSGAIEGIYRQRHY-TKSLCEAGRL 311

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSL 391
            + +G D++ G  Y     + V  G++ E  +D +++    +   LG FD      Y  +
Sbjct: 312 AIMSGTDVNSGSVYKKCLADLVTSGQLPEKAVDDAMRRTLKLRFELGLFDPIDDQPYWHV 371

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
              ++   E+ +L+ E  R+ IVLL+N  N LPL   K K +AV+GPHA A  A++GNY 
Sbjct: 372 APSEVGKTESKQLSLELTRKSIVLLQNHGNVLPLR--KGKKLAVIGPHAKAKRALLGNYL 429

Query: 452 GIPCR-----------YMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
           G  C             +  I   +G +N  Y  G   +   S     AA  AA+ ADA 
Sbjct: 430 GQMCHGDYLEVGCVQTPLEAITAANGASNTVYAKGS-GINDTSTADFDAAEAAARGADAV 488

Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
           ++  G+D S+E E+ DRE++ +P  Q QL+ +V    K P ++V+ + GGV +   E   
Sbjct: 489 VLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFN-GGV-VGAEELIL 545

Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
           +   +  A YPG  G +A++D++FG   P G+LP+T Y  +Y+  + + SM +       
Sbjct: 546 HTDGVAEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYINSVDMKSMSM-----TK 600

Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
           YPGR+Y++Y    ++PFG+GLSYT+F   L                        D     
Sbjct: 601 YPGRSYRYYKEVPVFPFGWGLSYTKFTLAL------------------------DGEMPD 636

Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP----PAEIAATYIKQVIGFQR 736
            P V+  DL         V   N G   G +VV  + +P        AA   +Q+  ++R
Sbjct: 637 DPIVITRDLDQ----TVTVIVSNDGDLVGDEVVFAFFRPLNVNATGDAALLNEQLFDYRR 692

Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG---GVSFPIHLNFNY 792
           V +R  + +++ F      +L +VD + N     G + + + NG    V+F IHL   Y
Sbjct: 693 VSLRPTQYRKLTFRIQQ-STLAMVDDSGNKASFPGFYEVIITNGVHERVTFAIHLVGKY 750


>gi|332638085|ref|ZP_08416948.1| glycoside hydrolase family 3 protein [Weissella cibaria KACC 11862]
          Length = 713

 Score =  358 bits (918), Expect = 8e-96,   Method: Compositional matrix adjust.
 Identities = 236/743 (31%), Positives = 375/743 (50%), Gaps = 100/743 (13%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           + K +V +MT+DEK+ Q+   A  + RL +P+Y +W+EALHGV+  G           AT
Sbjct: 13  QAKVIVDQMTIDEKIGQIKYEAPAIERLNIPEYNYWNEALHGVARAGV----------AT 62

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
            FP  I   A+F++ L   I   + TE RA YN            GLT+WSPN+N+ RDP
Sbjct: 63  VFPQAIGLAATFDDQLINDIADVIGTEGRAKYNEFTKHEDRDIYKGLTFWSPNVNIFRDP 122

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDPF+  ++ V +++GLQ            ++ LK+++  KH+A +    
Sbjct: 123 RWGRGHETYGEDPFLTSKFGVAFIKGLQG----------QAKYLKLAATAKHFAVHS--G 170

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
            +G+ R+ FDA V+++D+ ET+L  F+  V+E D  S+M +YN V+G+P+     LL   
Sbjct: 171 PEGL-RHGFDAVVSDKDLYETYLPAFKAAVEEADVESIMTAYNAVDGVPASVSEMLLRDI 229

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
           +  +W   G++V+D  + + + +NHK+  D+ E  +   +KAGL+L  G    +    A+
Sbjct: 230 LHDKWSFEGHVVSDYMAPEDVHENHKYTKDAAE-TMGLAIKAGLNLVAGHIEQSLH-EAL 287

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
            +G V E +I  ++  LY   +RLG F    +Y ++  +   +  +  L+  AA +  VL
Sbjct: 288 NRGLVTEEEITNAVISLYATRVRLGMFATDNEYDAIPYEANDTKAHNNLSEIAAEKSFVL 347

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
           LKND   LPL    ++ +AVVGP+A++ +A++GNY G P R  + + G          V 
Sbjct: 348 LKND-GVLPLRKETMEAIAVVGPNAHSEIALLGNYFGTPSRSYTILEGIQERLGDDVRVH 406

Query: 472 YKTGC----DDVA---CKSNNSIFAASEAAKTADATIILAGLDLSVEAE---------SL 515
           Y  G     D  A    K++     A  AA+ +D  + + GLD ++E E         + 
Sbjct: 407 YSIGSGVFQDHAAEPLAKADERESEAIIAAEHSDVIVAVLGLDSTIEGEEGDAGNSQGAG 466

Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
           D+ +L LPG Q QL+ ++  V K PV++++ S   + +   E + N++AI+   YPG  G
Sbjct: 467 DKPNLSLPGRQRQLLERLLAVGK-PVVVLLASGSSLQLDGLENHPNLRAIMQIWYPGARG 525

Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLY 635
           G A+ADV+FG  +P G+LP+T+Y           +  L   +     GRTY++     LY
Sbjct: 526 GLAVADVLFGTVSPSGKLPVTFYK---------NTDNLPAFEDYNMAGRTYRYMTEEALY 576

Query: 636 PFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYF 695
           PFGYGL+Y+              V L+ LQ  ++   T+ A+                  
Sbjct: 577 PFGYGLTYS-------------SVELSDLQ-VKSYEETATAT------------------ 604

Query: 696 EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACK 755
              V  QN G+ D  +VV VY K      A    Q+ GF+RVF+  G  + I F     +
Sbjct: 605 ---VTIQNTGNFDTDEVVQVYVKDLESEFAVPNAQLKGFKRVFLGKGSKQTITFDLR-PQ 660

Query: 756 SLNIVDYAANTLLPAGEHTIFVG 778
              + D   +  + +    I VG
Sbjct: 661 DFEVFDEQGHNFIDSNRFEISVG 683


>gi|282877070|ref|ZP_06285912.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
           buccalis ATCC 35310]
 gi|281300752|gb|EFA93079.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
           buccalis ATCC 35310]
          Length = 721

 Score =  358 bits (918), Expect = 8e-96,   Method: Compositional matrix adjust.
 Identities = 250/761 (32%), Positives = 372/761 (48%), Gaps = 111/761 (14%)

Query: 51  SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
           ++ F D+ L +  R  DL  R+TL+EK   + + +  VPRLG+ Q++WW EALHG +  G
Sbjct: 23  TYPFQDARLSFEQRADDLCKRLTLEEKAGLMQNNSKPVPRLGIKQFQWWGEALHGSARTG 82

Query: 111 PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GL 162
                      AT FP  I   ASF++ L  ++    STEARA YN+            +
Sbjct: 83  L----------ATVFPQTIGMAASFDDELLLQVFNIASTEARAKYNVAAKKGYFDTSWSV 132

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           + W+PN+N+ RDPRWGR  ET GEDP++  R     V GLQ  +G         +  K  
Sbjct: 133 SLWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGCAVVEGLQGGKGPH-------KYYKAF 185

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVN 281
           +C KH+A +    W   +R+      V+ +D  ET+L  F+  V+ G    VMC+YN ++
Sbjct: 186 ACAKHFAVHSGPEW---NRHSISIDDVSPRDFHETYLPAFKHLVQVGGVKEVMCAYNSID 242

Query: 282 GIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN--HKFLADSKEDAVAQTLKAGL 339
           G P C+D +LL Q +R EW   G +V+DC +I  +     H+   D+   A A+ +K G 
Sbjct: 243 GEPCCSDQRLLEQLLRDEWGFKGIVVSDCGAIDDIWRKGFHEVEPDAAH-ASARAVKGGT 301

Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDIC 397
           D+ CGQ Y +    AV+ GKV E  IDKSLK L    M+LG FD     ++ ++  +D+ 
Sbjct: 302 DMSCGQTYGSLP-EAVRLGKVTEERIDKSLKRLIVGRMQLGEFDPDSITRWNAISMKDVS 360

Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
           +  + E+A + ARE + LL N  + LPL S ++K V V+GP+AN +V M GNY G P   
Sbjct: 361 TPASREVALKMARETMTLLHNPMHALPL-SKQLKQVVVMGPNANDSVMMWGNYNGTPHHT 419

Query: 458 MSPIAGFS---GYANVTYKTGCDDVAC--KSNNSIFAASEAAKTAD-ATII--------L 503
           ++ + G     G   V +  GC  V    + N ++       +  D  T+I        L
Sbjct: 420 VTILDGIRRKIGAQRVKFIEGCGLVEPHRRGNQALTTQQLVEEVGDNKTVIFVGGISPQL 479

Query: 504 AGLDLSVEAESL---DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
            G  L VEA+     DR  + LP  Q ++I  +    K    +++++  G  I      T
Sbjct: 480 EGEQLEVEAKGFKGGDRVTIELPQVQREMIAALHAAGKQ---VIMVNCSGSAIGLVPEVT 536

Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
           +  AIL A YPGE GG A+ADV+FG +NP G+LP+T+Y  D       + +P    D L 
Sbjct: 537 HTDAILQAWYPGERGGEAVADVLFGDYNPAGKLPVTFYRDD-------SQLP----DYLD 585

Query: 621 Y--PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
           Y    RTY+++ G  L+PFG+GLSYT FK                    RN   T     
Sbjct: 586 YNMRNRTYRYFKGKPLFPFGHGLSYTSFKIGKAKM--------------RNGKLT----- 626

Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
                               V  +N G  DG +VV +Y     +     IK + GF+R+ 
Sbjct: 627 --------------------VSVKNTGKRDGEEVVQLYISCLDDPNGP-IKSLRGFKRMA 665

Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTL-LPAGEHTIFVG 778
           ++AG  + +       KS    D   NT+ +  G++ ++ G
Sbjct: 666 LQAGEQRTVTLNLPR-KSFERFDEQTNTIRVVPGKYRVYYG 705


>gi|373460527|ref|ZP_09552278.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
 gi|371955145|gb|EHO72949.1| hypothetical protein HMPREF9944_00542 [Prevotella maculosa OT 289]
          Length = 699

 Score =  357 bits (916), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 240/741 (32%), Positives = 377/741 (50%), Gaps = 101/741 (13%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           + + L++ MTLDEK+ Q+ +   G+PRLG+  Y+WW+E LHGV   G           AT
Sbjct: 12  KARRLINMMTLDEKISQMMNETPGIPRLGIKPYDWWNEGLHGVGRDGR----------AT 61

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYWSPNINVARDP 175
            FP  I   A+FN +L ++IG A++TE RA YN+ +         GLT+WSPNIN+ RDP
Sbjct: 62  VFPQPIGMGATFNPALIRQIGDAIATEGRAKYNVAQRNNNYARYTGLTFWSPNINIFRDP 121

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAAYDV 233
           RWGR  ET GEDPF+ G   + YV+G+Q            + P  LKV++C KHYA   V
Sbjct: 122 RWGRGMETYGEDPFLTGTLGIAYVQGMQ-----------GNDPFYLKVAACGKHYA---V 167

Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
            +     R+  +   T++D+ ET+L  F+M V++G   ++M +YNRV G        LL 
Sbjct: 168 HSGPEATRHEANVSPTKRDLFETYLPAFKMLVQQGHVEAIMGAYNRVYGEACSGSKYLLT 227

Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN 353
             +R +W   G+IV+DCD++  +   HK +  ++ +A A  +KAGL+++CG  +      
Sbjct: 228 DVLRKQWGFRGHIVSDCDAVADIHAGHK-IVKTEAEACAIAIKAGLNIECGHTFEAMK-Q 285

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGF--FDGSPQYVSLGKQDICSDENIELAAEAARE 411
           AV Q  + E +ID++L  L    ++LG   +D    Y  + + +ICS E+I LA +AA E
Sbjct: 286 AVAQKLLTEQEIDRALLPLMMTRLKLGILEYDAECPYNEVKETEICSPEHIALARKAATE 345

Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF-----SG 466
            +VLLKN+   LPL+   + T+ + GP A+ +  ++GNY GI  RY + + G      SG
Sbjct: 346 SMVLLKNN-GILPLDK-NLHTLFIAGPGASDSFWLMGNYFGISNRYCTYLQGIADKVSSG 403

Query: 467 YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES---------LDR 517
            A V ++    + +  + N+I  A + A  A+ TI++ G + ++E E           DR
Sbjct: 404 TA-VNFRPAFGE-STPTKNTINWALDEAIAAEKTIVVMGNNGNLEGEEGESIASETRGDR 461

Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
             + LP  Q + +  +     G   +V++  GG  I   E +    A++ A YPG+EGG 
Sbjct: 462 VSMRLPASQMKFLRDLKARKNG---IVVVLTGGSPIDVREISRLADAVVMAWYPGQEGGY 518

Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
           A+AD++FG  N  GRLP+T+         P ++  L P +     GRTYK+      YPF
Sbjct: 519 ALADLLFGDENFSGRLPVTF---------PESTDALPPFEDYAMKGRTYKYQTAHIQYPF 569

Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
           GYGLSYT   Y                 H +      +    +  G+ V+ +        
Sbjct: 570 GYGLSYTTVTY----------------AHAK-----VETMPQKGRGMTVSAV-------- 600

Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
               +N G+    +V  VY   P       +  ++ F+R+ ++ G  + ++F     + L
Sbjct: 601 ---LKNTGNKAVDEVAQVYLSAPGAGTTAALASLVAFKRIGLQPGEQQLVRFDIPFDRLL 657

Query: 758 NIVDYAANTLLPAGEHTIFVG 778
            + +     LL  G +TI VG
Sbjct: 658 TVQEDGTAQLL-KGNYTITVG 677


>gi|427385932|ref|ZP_18882239.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
           12058]
 gi|425726971|gb|EKU89834.1| hypothetical protein HMPREF9447_03272 [Bacteroides oleiciplenus YIT
           12058]
          Length = 732

 Score =  357 bits (915), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 249/776 (32%), Positives = 384/776 (49%), Gaps = 99/776 (12%)

Query: 34  VFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGL 93
           +F+        +G+Q ++  F +  +    RV DL+SR+TL++K Q L      V   G 
Sbjct: 14  IFLSTGAAAQSIGIQ-NNPAFLNQEMSMEARVADLMSRLTLEQKAQLLNHRGKTVVVDGF 72

Query: 94  P-QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR 152
             + + W++ LHGV    P           T+FPT I   A+++  L  ++   +S EAR
Sbjct: 73  SIRADQWNQCLHGVKWTEP----------TTNFPTSIALGATWDTELIHRVATVISDEAR 122

Query: 153 AMYNLGR---------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
           A+YN  +          GL Y SP IN++R+P WGRI E  GEDP+  GR  V YV+GLQ
Sbjct: 123 AIYNGWKQDPEFRGEHKGLIYRSPVINISRNPYWGRINEIFGEDPYHTGRMGVAYVKGLQ 182

Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
             + H          LK++S  KHYA  +V+    VDR    A+V E+ + E +L  F+ 
Sbjct: 183 GDDSHY---------LKLASTLKHYAVNNVE----VDRMKLSAQVPERMLYEYWLPHFKD 229

Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
           C+ EG A SVM SYN +NG+P+  +  LL   ++ +W   G++V+D   ++ MV+ H   
Sbjct: 230 CIVEGKAQSVMASYNAINGVPNNINKLLLTDILKNQWGHEGFVVSDLGGVKTMVEGHHQR 289

Query: 324 ADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
             S E+AV +++ AG D    +Y   +  +A+++G + E  ++ +L+ +  V  RLG FD
Sbjct: 290 QISCEEAVGRSIMAGCDFSDAEY-EKYIPDALRKGYLTEERLNDALRRVLLVRFRLGEFD 348

Query: 384 --GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
              S  Y  +    I   E+  L+ EAAR+ IVLLKN++  LP++ + +K VAV+GP+A+
Sbjct: 349 DFKSVPYSRISPDVIGCKEHRNLSLEAARKSIVLLKNEKKLLPIDRSIIKRVAVIGPYAD 408

Query: 442 ATVAMIGNYAGIPCRYMSPIAGFSGYA----NVTYKTGCDDVACKSNN------------ 485
             +   GNY G+P   ++P+ G          V Y  G      K               
Sbjct: 409 --LFNQGNYGGVPKDPVTPLQGIKNAVGNNVEVLYCKGAQITPVKVRKGQPIPPRFDKEA 466

Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
            +  A E A+ +D   +  G    +E E  DR+ L LPG Q +L+  V EV K  V++V+
Sbjct: 467 EMKKAVEMARNSDVVFLFVGTTADIEVEGRDRKTLVLPGNQNELVKAVYEVNK-KVVVVL 525

Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
           MSAG V  A  E   NI A+L A +PG+EGG AIADV+FG +NPGG+LP T Y  D  + 
Sbjct: 526 MSAGPV--AVPEVKKNIPAVLQAWWPGDEGGNAIADVLFGDYNPGGKLPYTMYASD--EQ 581

Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQ 665
           +P T       +     G TY +     L+ FG+GLSY++F Y+ L  +           
Sbjct: 582 VPSTD------EYDISKGFTYMYLKKKPLFAFGHGLSYSKFHYSDLQIS----------- 624

Query: 666 HCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAA 725
                           P V VND          +  +N+G   G +VV +Y +       
Sbjct: 625 ---------------SPVVSVNDT-----VSVVLKVKNMGKRTGEEVVQLYVRDVKAKVV 664

Query: 726 TYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA-ANTLLPAGEHTIFVGNG 780
              K++ GF+R+ ++    + I+ +    KSL   D +  + L+  G   I +G+ 
Sbjct: 665 RPTKELRGFKRIALQPNEEQEIRLML-PVKSLAFYDESIGDFLVEPGSFEILLGSA 719


>gi|291240561|ref|XP_002740190.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 763

 Score =  355 bits (912), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 241/640 (37%), Positives = 345/640 (53%), Gaps = 69/640 (10%)

Query: 47  LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVP--RLGLPQYEWWSEALH 104
           L  +++ F ++SL +  RV DLVSR+TLDE V Q+   +   P  RLG+  Y W SE LH
Sbjct: 21  LISAAYPFQNTSLSWEERVDDLVSRLTLDEMVLQMARTSPAPPIDRLGIKPYVWNSECLH 80

Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-------- 156
           GV  V P    D +   AT+FP  I   ASF+  L   + +A+  E RA +N        
Sbjct: 81  GV--VPP----DGL---ATAFPQSIGLAASFSPDLLSDVAKAIGLEVRAKHNDYVQRGVY 131

Query: 157 LGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
               GL+ +SP IN+AR P WGR  ET GEDPF++G     YVRGLQ           + 
Sbjct: 132 QEHTGLSCFSPVINIARHPLWGRNQETYGEDPFLIGELGSAYVRGLQGD---------HP 182

Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
           R +  ++ CKH+  +       V R+ FDA+V E+D + TFL  F  CVK G   SVMCS
Sbjct: 183 RYVLANAGCKHFDVHGGPEDIPVSRFSFDAKVFERDWQMTFLPAFHECVKAG-VYSVMCS 241

Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
           YNR+N +P+CA+ +LL   +R EW   GY+V+D  +++ ++ +H +  DS  D VA  + 
Sbjct: 242 YNRINEVPACANTRLLTDILRKEWGFDGYVVSDEGAVEFIMTSHHY-TDSIVDTVASAVN 300

Query: 337 AGLDLDC------GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--- 387
           AG +LD       G Y     G+AV  GK+KE  + + +K L+   MRLG FD  P+   
Sbjct: 301 AGCNLDLAFPVGDGMYIK--IGDAVTAGKIKEKTVVERVKPLFYTRMRLGEFD-PPELNP 357

Query: 388 YVSLGKQDICSDENIELAAEAAREG-----IVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
           Y +L    + S+E+ ELA +AA +       VLLK +   LPL++  V  +AV+GP A+ 
Sbjct: 358 YANLNLSVVQSEEHRELAVKAALQSFVLLNFVLLKREGRVLPLDTL-VNKLAVIGPFADN 416

Query: 443 TVAMIGNYAGIPCR--YMSPIAGFSGYANVTYKT-GCDDVACKSNNSIFAASEAAKTADA 499
              + G+Y+  P +   ++P  G S  A  T  T GC    C +  S    + A   AD 
Sbjct: 417 PSYLFGDYSPNPDKEFVVTPCKGLSNAARDTRCTPGCLTAPCTTYFSEMVKA-AVTGADL 475

Query: 500 TIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAET 558
            ++  G  + +EAE +DR DL LPG Q QL+  V + A G P+IL++ +AG +DI +A  
Sbjct: 476 IVVCLGTGVKIEAEFVDRSDLSLPGKQFQLLQDVVKYANGKPIILLLFNAGPLDIVWAVE 535

Query: 559 NTNIKAILWAGYPGEEGGRAIADVVF-------GKFNPGGRLPITWYNGDYVQMLPLTSM 611
           N  I+ I+   +P +  G A+  +         G  NPGGRLPITW         P +  
Sbjct: 536 NPAIQVIVACFFPSQATGDALYRMFMNTHGVDTGNGNPGGRLPITW---------PRSMN 586

Query: 612 PLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLL 651
            + P+ +    GRTY+++NG  L+PFGYGLSY  F Y+ L
Sbjct: 587 QVPPMTNYTMEGRTYRYFNGDPLFPFGYGLSYGSFSYSSL 626


>gi|167519969|ref|XP_001744324.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777410|gb|EDQ91027.1| predicted protein [Monosiga brevicollis MX1]
          Length = 721

 Score =  354 bits (909), Expect = 8e-95,   Method: Compositional matrix adjust.
 Identities = 219/623 (35%), Positives = 328/623 (52%), Gaps = 50/623 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF-------AHGVPRLGLPQYEWWSEALHGV 106
           FCD SL +  R  DL  R+TLDE  QQL  +       A GVPRLGL  Y + +E LHG+
Sbjct: 44  FCDLSLDFRDRAWDLAQRLTLDELAQQLNTYSFTPQAYAPGVPRLGLRNYSYHAEGLHGI 103

Query: 107 SNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LG 158
            +       + V   AT +P V    A+ N SL  ++   + TE RA+ N         G
Sbjct: 104 RDA------NVVNYPATLYPQVTAMAATANASLIHEMSTIMGTELRAVNNRAQELGEIFG 157

Query: 159 RAG-LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSR 217
           R G L+ + P +N+ RD RWGR  E+  EDP++ G YAVN+V GL+           +S+
Sbjct: 158 RGGALSIYGPTMNIIRDGRWGRSQESVSEDPWLNGLYAVNFVLGLEQRN--------SSK 209

Query: 218 PLKVSSCCKHYAAYDVDNWKG-VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
            L+ ++ CKH  AY  + +   + R+ F+A + E D+ +T+L  F  CV+ G    +MCS
Sbjct: 210 YLQAATSCKHLFAYSFEGYNNTLTRHSFNAVIDELDIHDTYLPAFRACVELGHVQQIMCS 269

Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
           YN VNGIP+CA   + N  VR  W   G IV+DCD++  + + H +   + EDAV   L+
Sbjct: 270 YNSVNGIPACARGDVQNDRVRKAWGFEGLIVSDCDAVADIYNTHNY-TRTPEDAVTVALQ 328

Query: 337 AGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQ 394
            G DLDCG +Y+    +AVQQ       + +S+  +  +   LG FD   S  Y  LG++
Sbjct: 329 GGCDLDCGDFYSQHLASAVQQNLTTLAALQQSMTRVLEMRFLLGEFDPDTSVPYRQLGRE 388

Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLN-SAKVKTVAVVGPHANATVAMIGNYAG- 452
            I +    + +  A+RE +VLL+N    LP+  SA +K VA++GP+ N T  M+G     
Sbjct: 389 AIDTPFARDSSLRASRESVVLLENRIKLLPVTLSADIK-VALIGPYVNLTTIMMGGKLDY 447

Query: 453 IPCRYMSPIAGFS--GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSV 510
            P    +   GF   G  ++T   GC+ +      ++  A + A  AD  ++  GL   +
Sbjct: 448 TPSFITTYFQGFQAIGITHLTSSPGCN-ITAPLPGALDKAVQIATQADLVVLTLGLSSDI 506

Query: 511 EAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG------VDIAFAETNTNIKA 564
           E E  DRE L LP  Q  L + ++       ++V++  GG      +    A T T I+A
Sbjct: 507 EHEGGDRETLGLPTPQQDLYDAISAAIPSSKLVVVLVNGGPVSVDRIKYGIARTPTIIEA 566

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
                Y G+  G A+A+ +FG+ NP G LP T +  +    +P T M LRP  + G+PGR
Sbjct: 567 F----YGGQSAGTALAETIFGQNNPSGTLPYTVFFSNITAHVPFTDMHLRPDAATGFPGR 622

Query: 625 TYKFYNGPTLYPFGYGLSYTQFK 647
           T++F++ P ++PFG+GLSY+ F 
Sbjct: 623 THRFFDAPVMWPFGHGLSYSTFS 645


>gi|326789672|ref|YP_004307493.1| beta-glucosidase [Clostridium lentocellum DSM 5427]
 gi|326540436|gb|ADZ82295.1| Beta-glucosidase [Clostridium lentocellum DSM 5427]
          Length = 704

 Score =  353 bits (907), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 219/611 (35%), Positives = 321/611 (52%), Gaps = 63/611 (10%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           +   LV++M L EK   L   +  + RLG+P Y WWSEALHGV+  G           AT
Sbjct: 8   KAGQLVAQMDLLEKASMLRYDSPAIKRLGVPTYNWWSEALHGVARAGV----------AT 57

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
            FP  I   A F+E    +I   ++TEARA YN            G+T W+PNIN+ RDP
Sbjct: 58  VFPQAIGMAAMFDEEYLYEIADIIATEARAKYNEFAKKEDRDIYKGMTLWAPNINIFRDP 117

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDP++  R  V ++ GLQ  E H           K ++C KH+A   V +
Sbjct: 118 RWGRGHETYGEDPYLTSRLGVAFIHGLQGDENHHY--------WKAAACAKHFA---VHS 166

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
               +R+HFDA V+++D+ ET+L  FE  V +G  + +M +YNRVNG P+C    LL   
Sbjct: 167 GPEEERHHFDAVVSKKDLYETYLPAFEAAVTKGKVAGMMGAYNRVNGEPACGSKVLLQDI 226

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
           ++ EW   GY+V+DC +I+     H     + E A A  +  G  L+CG  Y +    A 
Sbjct: 227 LKEEWGFDGYVVSDCWAIRDFHTEHMVTHTATESA-ALAINNGCQLNCGNTYLHML-QAY 284

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
           ++G V E  I KS + L  + M+LG FD + +Y  +  +      + ++A + AR  +VL
Sbjct: 285 KEGLVTEETITKSAQKLMAIRMKLGLFDKNCEYNKIPYEVNDCKVHRDIALDVARRSMVL 344

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVT 471
           LKN+   LPLN  + K + V+GP AN+   + GNY G   RY + + G   Y    A V 
Sbjct: 345 LKNN-GILPLNLKQTKAIGVIGPTANSRTVLQGNYFGTASRYTTFLEGIQDYVGDAARVY 403

Query: 472 YKTGC----DDVACKS--NNSIFAASEAAKTADATIILAGLDLSVEAE---------SLD 516
           Y  GC    + ++  S  N+ +  A   A+ +D  I+  GLD S+E E         + D
Sbjct: 404 YAEGCHLFKNSISGLSWENDRLSEALIVAEQSDVVILCLGLDASIEGEQGDTGNAFAAGD 463

Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
           + DL L G Q  L+ +V ++ K P IL++ S   + I  A+     +AIL   YPG+ GG
Sbjct: 464 KSDLNLIGRQQLLLEEVLKIGK-PTILILSSGSAMAIHTAQEYC--EAILETWYPGQSGG 520

Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
           +A+A ++FG+++P G+LPIT+Y          T+  L         GRTY++     LYP
Sbjct: 521 KALAQLLFGEYSPSGKLPITFYK---------TTEELPDFRDYSMAGRTYRYMKNEALYP 571

Query: 637 FGYGLSYTQFK 647
           FGYGL+Y + +
Sbjct: 572 FGYGLNYAKVE 582


>gi|288924872|ref|ZP_06418809.1| beta-glucosidase [Prevotella buccae D17]
 gi|288338659|gb|EFC77008.1| beta-glucosidase [Prevotella buccae D17]
          Length = 721

 Score =  353 bits (907), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 242/741 (32%), Positives = 371/741 (50%), Gaps = 96/741 (12%)

Query: 62  SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
           S   K++++RMT+ EK+ QL + +  +  LG+  Y+WWSE LHGV   G           
Sbjct: 31  SRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR---------- 80

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYWSPNINVAR 173
           AT FP  I   A+F+E+L ++IG AV+TE RA +N+ R        AGLT+WSPN+N+ R
Sbjct: 81  ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPNVNIFR 140

Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
           DPRWGR  ET GEDP + G     YVRGLQ  +            LK  +C KHYA   V
Sbjct: 141 DPRWGRGMETYGEDPLLSGMLGTAYVRGLQGDDAFY---------LKTGACAKHYA---V 188

Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
            +     R+  D   + +D+ ET+L  F+M V++G   +VM +YNRV G P      LL 
Sbjct: 189 HSGPEGTRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLT 248

Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN 353
             +R  W  +G+IV+DCD+I      H+++  + E+A A  +KAGL+++CG  +    G 
Sbjct: 249 DILRKSWGFNGHIVSDCDAINDFYGGHRYV-KTPEEACAAAIKAGLNVECGHTFKAMQG- 306

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFF--DGSPQYVSLGKQDICSDENIELAAEAARE 411
           A+ QG + E D+D++L  L    ++LG    D +  Y S  + +ICS  +  LA  AA E
Sbjct: 307 ALDQGLLAEADLDRALFPLVMTRLKLGILEPDSACPYNSYDESEICSPAHTALALRAADE 366

Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGY 467
            +VLLKN+   LPL+   ++T+ V GP A+    ++GNY G+  RY + + G     S  
Sbjct: 367 AMVLLKNN-GILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVSRVSSG 424

Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES---------LDRE 518
            +V ++     +  + N+  +A +EA   A+  I++ G + ++E E           DR 
Sbjct: 425 TSVNFRPAFMQITEELNDMNWAVNEAC-AAEVAIVVMGNNGNMEGEEGEAIASASRGDRV 483

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
            + LP  Q   + +V +  KG  I+V+++ GG  I   E +    A++ A YPG+EGG A
Sbjct: 484 GIGLPASQMNYLRRV-KARKGGRIVVVLT-GGSPIDLREISKLADAVVMAWYPGQEGGEA 541

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           + D++FG  N  GRLPIT+         P     L   D     GRTYK+ +G  +YPFG
Sbjct: 542 LGDLLFGDKNFSGRLPITF---------PADVDSLPAFDDYSMNGRTYKYMSGNVMYPFG 592

Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
           YGLSY +  Y                         +DA        +V  ++  +    +
Sbjct: 593 YGLSYGRVTY-------------------------TDAR-------VVGRIKKGEPLAVE 620

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC-KSL 757
           V   N G     +V   Y   P     + +  ++GF+RV +       +K VF    + L
Sbjct: 621 VVLTNNGDRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPP--KSSVKAVFKIVPERL 678

Query: 758 NIVDYAANTLLPAGEHTIFVG 778
             +    ++ L  G +T+ +G
Sbjct: 679 MTIQSDGSSKLLKGNYTLTIG 699


>gi|301118693|ref|XP_002907074.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
 gi|262105586|gb|EEY63638.1| glycoside hydrolase, putative [Phytophthora infestans T30-4]
          Length = 809

 Score =  353 bits (906), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 252/775 (32%), Positives = 386/775 (49%), Gaps = 88/775 (11%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR-----LGLPQYEWWSEA 102
           +   F FC++SL  + RV+DL+ R+ LDEKV  L   A   P+     +GLP+Y W +  
Sbjct: 31  EHQQFAFCNASLSTAERVEDLLRRLPLDEKVTLLT--ARASPKGNMSSIGLPEYNWGANC 88

Query: 103 LHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG---- 158
           +HGV +   GT+       ATSFP  +   A F+      + Q V  E RA++  G    
Sbjct: 89  VHGVQSTC-GTNC------ATSFPNPVNLGAIFDPRAVFDMAQVVGWELRALWLEGAREN 141

Query: 159 -----RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
                  GL  WSPNIN+ RDPRWGR  ETP EDP V  +Y V Y +GLQ  EG +    
Sbjct: 142 YATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTKGLQ--EGKDK--- 196

Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
              R L+     KHYAAY  +++ G+DR  F+A V+  D  +T+L  FE  V  G A  V
Sbjct: 197 ---RFLQAVVTLKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFEASVVHGKAKGV 253

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           MCSYN VNG+P CA+ +L ++ +R      GYI +D  +I  +     +     E     
Sbjct: 254 MCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAIAGIYHQRHYTKTLCEAGRLA 313

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSL 391
            L +G D++ G  Y       V  G++ E  +D +++    +   LG FD      Y  +
Sbjct: 314 IL-SGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFDPIDDQPYWHV 372

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
              ++ + E+ +L+ + +R+ IVLL+N  N LPL  AK K +AV+GPHA A  A++GNY 
Sbjct: 373 APNEVNTAESKQLSLDLSRKSIVLLQNHGNILPL--AKGKKLAVIGPHAAAKRALLGNYL 430

Query: 452 GIPCR--------YMSPIAGFS---GYANVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
           G  C           +P+   +   G +N  Y  G   +   S      A  AA+ A+  
Sbjct: 431 GQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKG-SGINDTSTGGFDEAEAAARKAETV 489

Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
           ++  G+D S+E E+ DRE++ +P  Q QL+ +V    K P ++V+ + GGV +   E   
Sbjct: 490 VLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFN-GGV-VGAEELIL 546

Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
           +   ++ A YPG  G +A++D++FG   P G+LP+T Y  +YV     TS+ ++ +    
Sbjct: 547 HTDGVVEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYV-----TSVDMKSMSMTK 601

Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
           YPGR+Y++Y    ++PFG+GLSYT+F   L S                    +S  +   
Sbjct: 602 YPGRSYRYYKEVPVFPFGWGLSYTRFTMALDS--------------------SSGVTDPS 641

Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP----PAEIAATYIKQVIGFQR 736
            P V+   L         V   N G+  G +VV  + +P        AA   +Q+  ++R
Sbjct: 642 EPIVVTRQLDQ----TVTVILSNDGNLVGDEVVFAFFRPLKVNATGNAALLNEQLFDYRR 697

Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG---GVSFPIHL 788
           V +R  + +++KF      +L +VD + N     G + + + NG    V+F IHL
Sbjct: 698 VSLRPTQYRKLKFRIQQ-STLAMVDDSGNQASFPGFYEVIITNGVHERVTFAIHL 751


>gi|301090543|ref|XP_002895482.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
 gi|262098232|gb|EEY56284.1| beta-glucosidase, putative [Phytophthora infestans T30-4]
          Length = 809

 Score =  353 bits (905), Expect = 2e-94,   Method: Compositional matrix adjust.
 Identities = 251/777 (32%), Positives = 392/777 (50%), Gaps = 92/777 (11%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR-----LGLPQYEWWSEA 102
           +   F FC++SL  + RV+DL+ R+ LDEKV  L   A   P+     +GLP+Y W +  
Sbjct: 31  EHQQFAFCNASLSTAERVEDLLRRLPLDEKVTLLT--ARASPKGNMSSIGLPEYNWGANC 88

Query: 103 LHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG---- 158
           +HGV +   GT+       ATSFP  +   A F+      + Q V  E RA++  G    
Sbjct: 89  VHGVQSTC-GTNC------ATSFPNPVNLGAIFDPRAVFDMAQVVGWELRALWLEGAREN 141

Query: 159 -----RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
                  GL  WSPNIN+ RDPRWGR  ETP EDP V  +Y V Y +GLQ  EG +    
Sbjct: 142 YATGPHLGLDCWSPNININRDPRWGRNMETPSEDPLVNSKYGVAYTKGLQ--EGKDK--- 196

Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
              R L+     KHYAAY  +++ G+DR  F+A V+  D  +T+L  FE  V  G A  V
Sbjct: 197 ---RFLQAVVTLKHYAAYSYEHYDGIDRMAFNAVVSRYDFADTYLPAFEASVVHGKAKGV 253

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           MCSYN VNG+P CA+ +L ++ +R      GYI +D  +I   + + +    +  +A   
Sbjct: 254 MCSYNSVNGMPMCANEQLNSKLLRDALGFDGYITSDSGAI-AGIYHQRHYTKTLCEAGRL 312

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSL 391
            + +G D++ G  Y       V  G++ E  +D +++    +   LG FD      Y  +
Sbjct: 313 AILSGTDVNSGSVYKQCLAELVTSGQLPEKAVDDAMRRTLKLRFELGLFDPIDDQPYWHV 372

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
              ++ + E+ +L+ + +R+ IVLL+N  N LPL  AK K +AV+GPHA A  A++GNY 
Sbjct: 373 APNEVNTAESKQLSLDLSRKSIVLLQNHGNILPL--AKGKKLAVIGPHAAAKRALLGNYL 430

Query: 452 GIPCR--------YMSPIAGFS---GYANVTYK--TGCDDVACKSNNSIFAASEAAKTAD 498
           G  C           +P+   +   G +N  Y   +G +D +    +   AA+  A+T  
Sbjct: 431 GQMCHGDYLEVGCVQTPLEAITIANGASNTLYAKGSGINDTSTAGFDEAEAAARKAET-- 488

Query: 499 ATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAET 558
             ++  G+D S+E E+ DRE++ +P  Q QL+ +V    K P ++V+ + GGV +   E 
Sbjct: 489 -VVLFLGIDTSIEREAWDRENIDMPNIQMQLLKRVRRAGK-PTVVVLFN-GGV-VGAEEL 544

Query: 559 NTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDS 618
             +   ++ A YPG  G +A++D++FG   P G+LP+T Y  +YV     TS+ ++ +  
Sbjct: 545 ILHTDGVVEAFYPGFFGAQAVSDILFGDAIPSGKLPVTMYPSNYV-----TSVDMKSMSM 599

Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
             YPGR+Y++Y    ++PFG+GLSYT+F   L S                    +S  + 
Sbjct: 600 TKYPGRSYRYYKEVPVFPFGWGLSYTRFTMALDS--------------------SSGVTD 639

Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP----PAEIAATYIKQVIGF 734
              P V+   L         V   N G+  G +VV  + +P        AA   +Q+  +
Sbjct: 640 PSEPIVVTRQLDQ----TVTVILSNDGNLVGDEVVFAFFRPLKVNATGNAALLNEQLFDY 695

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG---GVSFPIHL 788
           +RV +R  + +++KF      +L +VD + N     G + + + NG    V+F IHL
Sbjct: 696 RRVSLRPTQYRKLKFRIQQ-STLAMVDDSGNQASFPGFYEVIITNGVHERVTFAIHL 751


>gi|336275603|ref|XP_003352555.1| hypothetical protein SMAC_01389 [Sordaria macrospora k-hell]
 gi|380094444|emb|CCC07823.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 833

 Score =  352 bits (904), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 259/760 (34%), Positives = 361/760 (47%), Gaps = 121/760 (15%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CDS+     R   LV ++T+DEK+  L D + G PRLGLP Y WWSE LHGV+   PG  
Sbjct: 37  CDSTASAPDRAASLVEQLTIDEKLVNLVDQSKGAPRLGLPPYAWWSEGLHGVAG-SPGVV 95

Query: 115 FDDV---IPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
           F+        ATSF  VI   A+ ++ L  ++G A+STEARA    G  GL YW+PNIN 
Sbjct: 96  FNTSGYPFSYATSFANVITLGAALDDDLVYEVGTAISTEARAFAKFGFGGLDYWTPNINP 155

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
            +DPRWGR  ETPGEDP  +  Y    V GL+           N    KV + CKH+AAY
Sbjct: 156 YKDPRWGRGAETPGEDPLRIKGYVKAMVAGLEG----------NGTVRKVIATCKHFAAY 205

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMC---------------- 275
           D++ W+G+ RY FDA V+ QD+ E +L PF+ C ++    S+MC                
Sbjct: 206 DLERWRGLTRYDFDAVVSLQDLSEYYLPPFQQCARDSRVGSIMCRYVSFFLPPFPSFPRL 265

Query: 276 ----------------SYNRVNGIPSCADPKLLNQTVRGEWDL---HGYIVADCDSIQ-V 315
                           SYN +NG P+CA   L+   +R  W+    + YI +DC++IQ  
Sbjct: 266 VTRQSGNQVDIVDNFRSYNALNGTPACASTYLMTNILRDHWNWTNHNNYITSDCNAIQDF 325

Query: 316 MVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAVQQGKVKETDIDKSLKY 371
           + DNH F + +  +A A    AG D  C       YT+  G A  Q  + E+ ID +L+ 
Sbjct: 326 LPDNHNF-SQTPAEAAAAAYIAGTDTVCEVSGWPPYTDVVG-AYNQSLLSESVIDTALRR 383

Query: 372 LYTVLMRLGFFD-GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKV 430
           LY  L+R G+ D G P   S  K    S                      + LPL+    
Sbjct: 384 LYEGLIRAGYLDHGRPASSSPDKAPFSS---------------------PDFLPLDLTG- 421

Query: 431 KTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFA 489
           KTVA++G  ANAT  + G Y+G+P  Y +P+        +  Y  G    +  ++    A
Sbjct: 422 KTVALIGHWANATRTIRGPYSGLPPFYHNPMYAVRQLKLSFYYANGPVVNSTDADTWTAA 481

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A  AA++AD  +   G D +V +E LDRE +  P  Q  LI ++A+V K   ++VI    
Sbjct: 482 AMLAAESADVVLYFGGTDTTVASEDLDRESIAWPKTQLTLIEKLAQVGK--PMVVIQLGD 539

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
            VD      N NI +ILW GYPG+ GG A+ DV+ GK    GRLP+T Y   YV  +PLT
Sbjct: 540 QVDDTPLLNNKNISSILWVGYPGQSGGTAVFDVLTGKKASAGRLPVTQYPAGYVDEVPLT 599

Query: 610 SMPLRPVD--------------------------------SLGYPGRTYKFYNGPTLYPF 637
            M LRP +                                +L  PGRTYK+Y  P L PF
Sbjct: 600 EMGLRPFNHSSSTTSSDVSQSGVEEGNGLTIQTRSTRGNKTLSSPGRTYKWYPRPVL-PF 658

Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
           GYGL YT F    +S + +   N +      +++  S  +   C  +    L    +  F
Sbjct: 659 GYGLHYTPFN---ISLSLSTSSNASSTTDNTSISIRSLLTSQTCTAI---HLDLCPFSPF 712

Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
            V   N GS     V +++           +K ++G++RV
Sbjct: 713 SVSITNTGSHTSDYVALLFLSGKFGPKPDPLKTLVGYKRV 752


>gi|255590044|ref|XP_002535159.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
 gi|223523880|gb|EEF27223.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
          Length = 449

 Score =  352 bits (902), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 183/457 (40%), Positives = 279/457 (61%), Gaps = 13/457 (2%)

Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQD 395
           +D++CG Y  N+T +AV++ KV E++ID++L  L+++ MRLG F+G+P    Y  +    
Sbjct: 1   MDVNCGNYLKNYTKSAVEKKKVSESEIDRALHNLFSIRMRLGLFNGNPTKLPYGDISADQ 60

Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
           +CS E+  +A EAAR+GIVLLKN    LPL+ +K  ++A++GP+A+ +  ++GNYAG PC
Sbjct: 61  VCSQEHQAVALEAARDGIVLLKNSNQLLPLSKSKTTSLAIIGPNADNSTILVGNYAGPPC 120

Query: 456 RYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
           + ++P  G   Y   T Y  GC  VAC S+ +I  A + AK AD  +++ GLD + E E 
Sbjct: 121 KTVTPFQGLQNYIKTTKYHPGCSTVAC-SSAAIDQAIKIAKEADQVVLVMGLDQTQEREE 179

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
            DR DL LPG Q +LI  VA  AK PV+LV++  G VDI+FA+ + NI  ILWAGYPGE 
Sbjct: 180 HDRVDLVLPGKQQELIISVARAAKKPVVLVLLCGGPVDISFAKYDRNIGGILWAGYPGEA 239

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
           GG A+A+++FG  NPGGRLP+TWY  D+ + +P+T M +RP  S GYPGRTY+FY G  +
Sbjct: 240 GGIALAEIIFGNHNPGGRLPVTWYPQDFTK-VPMTDMRMRPQPSSGYPGRTYRFYKGKKV 298

Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD-D 693
           + FGYGLSY+ + Y L+S T+  +++L      +  N +    KT      + +  C+  
Sbjct: 299 FEFGYGLSYSNYSYELVSVTQN-KISLRSSIDQKAENSSPIGYKTISE---IEEELCERS 354

Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
            F   V  +N G   G   V+++++     +   IK++I FQ V + AG N  I++  N 
Sbjct: 355 KFSVTVRVKNQGEMTGKHPVLLFARQDKPGSGGPIKKLIAFQSVKLNAGENAEIEYKVNP 414

Query: 754 CKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNF 790
           C+ L+  +     ++  G   + VG+    +PI++  
Sbjct: 415 CEHLSRANEDGLMVMEEGSQYLLVGDK--EYPINITI 449


>gi|402308386|ref|ZP_10827395.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
 gi|400375830|gb|EJP28725.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
          Length = 721

 Score =  350 bits (897), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 240/741 (32%), Positives = 371/741 (50%), Gaps = 96/741 (12%)

Query: 62  SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
           S   K++++RMT+ EK+ QL + +  +  LG+  Y+WWSE LHGV   G           
Sbjct: 31  SRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR---------- 80

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYWSPNINVAR 173
           AT FP  I   A+F+E+L ++IG AV+TE RA +N+ +        AGLT+WSPN+N+ R
Sbjct: 81  ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVAQKLKNYSRNAGLTFWSPNVNIFR 140

Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
           DPRWGR  ET GEDP + G     YVRGLQ  +            LK  +C KHYA   V
Sbjct: 141 DPRWGRGMETYGEDPLLSGMLGTAYVRGLQGDDAFY---------LKTGACAKHYA---V 188

Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
            +     R+  D   + +D+ ET+L  F+M V++G   +VM +YNRV G P      LL 
Sbjct: 189 HSGPEGTRHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLT 248

Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN 353
             +R  W  +G+IV+DCD+I      H+++  + E+A A  +KAGL+++CG  +    G 
Sbjct: 249 DILRKSWGFNGHIVSDCDAINDFYGGHRYV-KTPEEACAAAIKAGLNVECGHTFKAMQG- 306

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFF--DGSPQYVSLGKQDICSDENIELAAEAARE 411
           A+ QG + E D+D++L  L    ++LG    D +  Y S  + +ICS  +  LA  AA E
Sbjct: 307 ALDQGLLAEADLDRALFPLVMTRLKLGILEPDSACPYNSYDESEICSPAHTALALRAADE 366

Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGY 467
            +VLLKN+   LPL+   ++T+ V GP A+    ++GNY G+  RY + + G     S  
Sbjct: 367 AMVLLKNN-GILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVSRVSSG 424

Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES---------LDRE 518
            +V ++     +  + N+  +A +EA   A+  I++ G + ++E E           DR 
Sbjct: 425 TSVNFRPAFMQITEELNDMNWAVNEAC-AAEVAIVVMGNNGNMEGEEGEAIASASRGDRV 483

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
            + LP  Q   + +V +  KG  I+V+++ GG  I   + +    A++ A YPG+EGG A
Sbjct: 484 GIGLPASQLNYLRRV-KARKGGRIVVVLT-GGSPIDLRKISKLADAVVMAWYPGQEGGEA 541

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           + D++FG  N  GRLPIT+         P     L   D     GRTYK+ +G  +YPFG
Sbjct: 542 LGDLLFGDKNFSGRLPITF---------PADVDSLPAFDDYSMNGRTYKYMSGNVMYPFG 592

Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
           YGLSY +  Y                         +DA        +V  ++  +    +
Sbjct: 593 YGLSYGRVTY-------------------------TDAR-------VVGRIKKGEPLAVE 620

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC-KSL 757
           V   N G     +V   Y   P     + +  ++GF+RV +       +K VF    + L
Sbjct: 621 VVLTNNGDRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPP--KSSVKAVFKIVPERL 678

Query: 758 NIVDYAANTLLPAGEHTIFVG 778
             +    ++ L  G +T+ +G
Sbjct: 679 MTIQSDGSSKLLKGNYTLTIG 699


>gi|315607899|ref|ZP_07882892.1| beta-glucosidase [Prevotella buccae ATCC 33574]
 gi|315250368|gb|EFU30364.1| beta-glucosidase [Prevotella buccae ATCC 33574]
          Length = 721

 Score =  350 bits (897), Expect = 2e-93,   Method: Compositional matrix adjust.
 Identities = 241/741 (32%), Positives = 369/741 (49%), Gaps = 96/741 (12%)

Query: 62  SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
           S   K++++RMT+ EK+ QL + +  +  LG+  Y+WWSE LHGV   G           
Sbjct: 31  SRHAKEIIARMTVSEKISQLMNESPAIEHLGIKPYDWWSEGLHGVGRDGR---------- 80

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYWSPNINVAR 173
           AT FP  I   A+F+E+L ++IG AV+TE RA +N+ R        AGLT+WSPN+N+ R
Sbjct: 81  ATVFPQPIALGATFDEALVREIGDAVATEGRAKFNVARKLKNYSRNAGLTFWSPNVNIFR 140

Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
           D RWGR  ET GEDP + G     YVRGLQ  +            LK  +C KHYA +  
Sbjct: 141 DLRWGRGMETYGEDPLLSGMLGTAYVRGLQGDDAFY---------LKTGACAKHYAVHSG 191

Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
                  R+  D   + +D+ ET+L  F+M V++G   +VM +YNRV G P      LL 
Sbjct: 192 PEGT---RHEADIHPSRRDLFETYLPQFKMLVQQGRVEAVMSAYNRVYGEPCGGSKYLLT 248

Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN 353
             +R  W  +G+IV+DCD+I      H+++  + E+A A  +KAGL+++CG  +    G 
Sbjct: 249 DILRKSWGFNGHIVSDCDAINDFYGGHRYV-KTPEEACAAAIKAGLNVECGHTFKAMQG- 306

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFF--DGSPQYVSLGKQDICSDENIELAAEAARE 411
           A+ QG + E D+D++L  L    ++LG    D +  Y S  + +ICS  +  LA  AA E
Sbjct: 307 ALDQGLLAEADLDRALFPLVMTRLKLGILEPDSACPYNSYDESEICSPAHTALALRAADE 366

Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGY 467
            +VLLKN+   LPL+   ++T+ V GP A+    ++GNY G+  RY + + G     S  
Sbjct: 367 AMVLLKNN-GILPLDK-NIRTLFVAGPGASDAFYLMGNYFGLSNRYSTYLQGIVSRVSSG 424

Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES---------LDRE 518
            +V ++     +  + N+  +A +EA   A+  I++ G + ++E E           DR 
Sbjct: 425 TSVNFRPAFMQITEELNDMNWAVNEAC-AAEVAIVVMGNNGNMEGEEGEAIASASRGDRV 483

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
            + LP  Q   + +V +  KG  I+V+++ GG  I   E +    A++ A YPG+EGG A
Sbjct: 484 GIGLPASQLNYLRRV-KARKGGRIVVVLT-GGSPIDLREISKLADAVVMAWYPGQEGGEA 541

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           + D++FG  N  GRLPIT+         P     L   D     GRTYK+ +G  +YPFG
Sbjct: 542 LGDLLFGDKNFSGRLPITF---------PADVDSLPAFDDYSMNGRTYKYMSGNVMYPFG 592

Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
           YGLSY +  Y                         +DA        +V  ++  +    +
Sbjct: 593 YGLSYGRVTY-------------------------TDAR-------VVGRIKKGEPLAVE 620

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC-KSL 757
           V   N G     +V   Y   P     + +  ++GF+RV +       +K VF    + L
Sbjct: 621 VVLTNNGDRTIDEVAQAYIATPTAGKGSPMASLVGFRRVSIPP--KSSVKAVFKIVPERL 678

Query: 758 NIVDYAANTLLPAGEHTIFVG 778
             V    ++ L  G +T+ +G
Sbjct: 679 MTVQSDGSSKLLKGNYTLTIG 699


>gi|167537541|ref|XP_001750439.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163771117|gb|EDQ84789.1| predicted protein [Monosiga brevicollis MX1]
          Length = 834

 Score =  349 bits (895), Expect = 4e-93,   Method: Compositional matrix adjust.
 Identities = 258/763 (33%), Positives = 370/763 (48%), Gaps = 91/763 (11%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRM-TLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           SS+ FCD+ L    R+KDLVSR+ T D   Q     +  +  +GLP Y W + A+HG+ N
Sbjct: 105 SSYPFCDTKLSVDDRLKDLVSRVSTADAATQLRARESAQIDNIGLPAYYWGTNAIHGMQN 164

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG-RAGLTYWSP 167
                  D   P  TSFP     +A+FN SL K +G+ +  E RA YN     GL  WSP
Sbjct: 165 TA--CLADGQCP--TSFPAPNGLSATFNYSLVKDMGRIIGRELRAYYNTKFHNGLDTWSP 220

Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
            IN +RDPRWGR  E+PGE PFV G+Y   Y  GLQ      N  D +     V+   KH
Sbjct: 221 TINPSRDPRWGRNVESPGESPFVCGQYGAAYTEGLQ------NGDDKDYTQAVVT--LKH 272

Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
           + AY V+++  V RY ++A V+E D+ +T+   +E  VK      VMCSYN +NG+P+C 
Sbjct: 273 WVAYSVEDYDNVTRYEYNAIVSEYDLMDTYFPGWEYVVKNAKPLGVMCSYNSLNGVPTCG 332

Query: 288 DPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
           +P  L   +R +W   GYI +D DSI  +  +H + +++   A    L  G D+D G  Y
Sbjct: 333 NPA-LTAYLREDWGFEGYITSDSDSIHCIWADHHYESNAVL-ATRDGLLGGCDIDSGDTY 390

Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDICSDENIELA 405
            +    AV Q  V  + +D +L   Y +   LG FD   +  Y  +   ++    + E +
Sbjct: 391 ADNLEAAVNQSLVNRSAVDAALTNSYRMRFNLGLFDPNVTNAYDRISADEVGMSSSQETS 450

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC---------- 455
             AAR+ + LLKND  TLP   A  K VAV+G  +N+   ++GNY G  C          
Sbjct: 451 LLAARKSMTLLKNDGQTLPF--ATGKKVAVIGKSSNSAEDILGNYVGPICPSGAFDCVQT 508

Query: 456 RYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
            Y    A   G A     T  DDVA      I  A + A  AD  ++L   +     E  
Sbjct: 509 LYQGVAAANQGGAT----TLSDDVA-----DINTAIQLAMDAD-QVVLTISNYGQAGEGK 558

Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
           DR  + L   Q +L+  V +V K P  +V+++ G + + + +     +AIL A  PG  G
Sbjct: 559 DRTYIGLDTDQQELVAAVLKVGK-PTAIVMLNGGLISLDWIKDEA--QAILVAFAPGVHG 615

Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY-----------PGR 624
           G+A+A+ +FG  NPGG+LP+T Y  DYV  +   +M ++ V  L             PGR
Sbjct: 616 GQAVAETIFGANNPGGKLPVTMYASDYVNDVDFLNMSMQAVAVLHLMNVNGERDDTGPGR 675

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           +YK+Y G  LYPF YGLSYT F                      NL+++     T     
Sbjct: 676 SYKYYTGEPLYPFAYGLSYTTF----------------------NLSWSPAPPMT----T 709

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-------IKQVIGFQRV 737
             + LR      +     N GS  G +VV  + KP +E   T        IK++ GFQRV
Sbjct: 710 FTSTLRS---TTYTATVTNTGSVGGDEVVFAFYKPKSESLKTLPVGNPVPIKEIFGFQRV 766

Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
            +  G++ ++ F  NA ++L  V    +  L +GE  I +  G
Sbjct: 767 ALGPGQSTQVTFELNA-ETLAQVTLDGHRELHSGEFEIELTRG 808


>gi|219887077|gb|ACL53913.1| unknown [Zea mays]
 gi|224035251|gb|ACN36701.1| unknown [Zea mays]
 gi|413919685|gb|AFW59617.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 405

 Score =  348 bits (893), Expect = 6e-93,   Method: Compositional matrix adjust.
 Identities = 182/408 (44%), Positives = 254/408 (62%), Gaps = 17/408 (4%)

Query: 377 MRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
           MRLGFFDG P+   + +LG  D+C+  N ELA EAAR+GIVLLKN    LPL++  +K++
Sbjct: 1   MRLGFFDGDPRELPFGNLGPSDVCTPSNQELAREAARQGIVLLKN-TGKLPLSATSIKSM 59

Query: 434 AVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN-SIFAASE 492
           AV+GP+ANA+  MIGNY G PC+Y +P+ G        Y+ GC +V C  N+  + AA++
Sbjct: 60  AVIGPNANASFTMIGNYEGTPCKYTTPLQGLGANVATVYQPGCTNVGCSGNSLQLDAATK 119

Query: 493 AAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVD 552
           AA +AD T+++ G D S+E ESLDR  L LPG Q QL++ VA  + GP ILV+MS G  D
Sbjct: 120 AAASADVTVLVVGADQSIERESLDRTSLLLPGQQPQLVSAVANASSGPCILVVMSGGPFD 179

Query: 553 IAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMP 612
           I+FA+++  I AILW GYPGE GG AIADV+FG  NP GRLP+TWY   + + +P+T M 
Sbjct: 180 ISFAKSSDKIAAILWVGYPGEAGGAAIADVLFGYHNPSGRLPVTWYPESFTK-VPMTDMR 238

Query: 613 LRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
           +RP  S GYPGRTY+FY G T+Y FG GLSYT F ++L+S  K + + L +   C     
Sbjct: 239 MRPDPSTGYPGRTYRFYTGDTVYAFGDGLSYTSFAHHLVSAPKQLALQLAEGHACLT--- 295

Query: 673 TSDASKTRCPGVLVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQV 731
                  +CP V      C+   F+  +  +N G   G   V ++S PPA +     K +
Sbjct: 296 ------EQCPSVEAEGAHCEGLAFDVHLRVRNAGERSGGHTVFLFSSPPA-VHNAPAKHL 348

Query: 732 IGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           +GF++V +  G+   + F  + CK L++VD   N  +  G HT+ VG+
Sbjct: 349 LGFEKVSLEPGQAGVVAFKVDVCKDLSVVDELGNRKVALGSHTLHVGD 396


>gi|255572559|ref|XP_002527213.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
 gi|223533389|gb|EEF35139.1| Thermostable beta-glucosidase B, putative [Ricinus communis]
          Length = 454

 Score =  347 bits (891), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 178/449 (39%), Positives = 272/449 (60%), Gaps = 13/449 (2%)

Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG---SPQYVSLGKQD 395
           +D++CG Y      +AV +GK++E DID++L  L++V +RLG FDG   +  +  LG +D
Sbjct: 1   MDINCGSYAIRNAQSAVDKGKLREEDIDRALLNLFSVQLRLGLFDGDRINGHFSKLGPED 60

Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
           +C++E+ +LA EAAR+GIVLLKN++  LPLN   V ++A++GP AN   ++ G+Y G  C
Sbjct: 61  VCTEEHKKLALEAARQGIVLLKNEKKFLPLNKKAVSSLAIIGPLANNGGSLGGDYTGYSC 120

Query: 456 RYMSPIAGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
              S   G   Y   T Y  GC +V+C S++    A   AKTAD  I++AG+DLS E E 
Sbjct: 121 NPQSLFDGVQAYIKRTSYAVGCSNVSCDSDDQFPEAIHIAKTADFVIVVAGIDLSQETED 180

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
            DR  L LPG Q  L++ VA  +K PVILV+   G VD++FA+ ++ I +ILW GYPGE 
Sbjct: 181 RDRISLLLPGKQMALVSYVAAASKKPVILVLTGGGPVDVSFAKRDSRIASILWIGYPGEA 240

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
           G +A+AD++FG++NPGGRLP+TWY   +   +P+  M +R   + GYPGRTY+FY G  +
Sbjct: 241 GAKALADIIFGEYNPGGRLPMTWYPESFTN-VPMNDMNMRANPNRGYPGRTYRFYTGERV 299

Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQV--NLNKLQHCRNLNYTSDASKTRCPGVLVNDL-RC 691
           Y FG GLSYT + Y  LS    + +  +L      R L+   D    R   + ++++  C
Sbjct: 300 YGFGEGLSYTNYAYKFLSAPSKLSLSGSLTATSRKRILHQRGD----RLDYIFIDEISSC 355

Query: 692 DDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
           +   F  ++   NVG  DGS VV+++S+ P     T  KQ++GF+R+   + ++     +
Sbjct: 356 NSLRFTVQISVMNVGDMDGSHVVMLFSRVPQVSEGTPEKQLVGFERINTVSHKSTETSIL 415

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            + CK L+I +     ++P G H + +G+
Sbjct: 416 LDPCKHLSIANGQGKRIMPVGSHVLLLGD 444


>gi|332377068|gb|AEE64772.1| Xyl3A [Ruminococcus albus 8]
          Length = 691

 Score =  347 bits (890), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 223/628 (35%), Positives = 331/628 (52%), Gaps = 74/628 (11%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D SL    R + L   MT +E+  QL   A  + RLG+P Y WW+E +HG++  G   
Sbjct: 4   YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSGV-- 61

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   A F++ L K+  +  S EARA YN            GLT W
Sbjct: 62  --------ATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIYKGLTLW 113

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN+ RDPRWGR  ET GEDP++  +     VRGLQ           + + +K ++C 
Sbjct: 114 APNINIFRDPRWGRSHETFGEDPYLTAQNGKAVVRGLQG----------DGKVMKAAACA 163

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A   V +     R+ FDA+   +DMEET+L  FE  VKE    SVM +YNRVNG P+
Sbjct: 164 KHFA---VHSGPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPA 220

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           CA   L+ +    EW+  GY V+DC +I+   ++H   A++ E A A  LKAG D++CG 
Sbjct: 221 CASDYLMEKL--KEWEFDGYFVSDCWAIRDFHEHHMVTANAVESA-AMALKAGCDVNCGC 277

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
            Y N    A+ +G + +  I  +  +L    +RLG FD    +  +    +   E+  ++
Sbjct: 278 TYQNLLA-ALDKGLITKEQIRTACVHLMRTRIRLGMFDKHTDFDDIPYSKVACAEHKAVS 336

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG-- 463
            E A + +VLLKN+   LPL+  K KT+AV+GP+A++  A+ GNY G+  RY + + G  
Sbjct: 337 LECAEKSLVLLKNN-GILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQ 395

Query: 464 --FSGYANVTYKTGCDDVACKSNNSIFAASE-------AAKTADATIILAGLDLSVEAE- 513
             F G   V +  GC  +  KS + +  A +       AAK AD  I+  GLD ++E E 
Sbjct: 396 DRFEG--RVIFAEGC-HLYKKSISGLAQAGDRYAEAVAAAKNADLVIMCVGLDATIEGEE 452

Query: 514 --------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
                   S D+  L LP  Q  L+ ++  V K PV+ V+ +   ++     T +   A+
Sbjct: 453 GDTGNEFSSGDKNGLTLPPPQKILVEKIMSVGK-PVVTVVCAGSAIN-----TESQPDAL 506

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           + A YPG EGG+A+A+V+FG  +P G+LP+T+Y  D  ++   T   ++        GRT
Sbjct: 507 IHAFYPGAEGGKALAEVLFGDVSPSGKLPVTFYE-DTDKLPEFTDYSMK--------GRT 557

Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSF 653
           Y++     L+PFGYGL+Y   K N + +
Sbjct: 558 YRYTTDNILFPFGYGLTYGGVKVNAVEY 585


>gi|291240559|ref|XP_002740189.1| PREDICTED: hypothetical protein [Saccoglossus kowalevskii]
          Length = 745

 Score =  347 bits (889), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 230/658 (34%), Positives = 351/658 (53%), Gaps = 66/658 (10%)

Query: 42  FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL---GDFAHG----VPRLGLP 94
           FS +   +S F F ++SLP++ RV+DLV R+ L+E V Q+   G +++G    + RL + 
Sbjct: 15  FSLISTILSDFPFRNTSLPWNKRVEDLVGRLKLEEIVLQMSRGGRYSNGPAPPIDRLNIG 74

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            Y W +E L G  + GP          ATSFP      A+F+  L K+I  A + E RA 
Sbjct: 75  PYSWNTECLRGDLSAGP----------ATSFPQAFGLAATFDAVLIKQIANATAYEVRAK 124

Query: 155 YNL--------GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
           YN            GL+ +SP IN+AR P WGRI ET GEDP++ G  A ++V GLQ   
Sbjct: 125 YNNYTKHKEYGDHKGLSCFSPVINIARHPLWGRIQETYGEDPYLSGTLAASFVTGLQ--- 181

Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
           G+      + R +  ++ CKH+ AY         R  FDA+V+++D+  TFL  F  C++
Sbjct: 182 GN------HPRYVTANAGCKHFDAYAGPENIPSSRSTFDAKVSDRDLRMTFLPAFHECIQ 235

Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
            G   S+MCSYN +NG+P+CA+ KLL   +R EW+  GY+++D  +++ + D H +  D 
Sbjct: 236 AG-TYSLMCSYNSINGVPACANKKLLTDILRTEWNFTGYVISDQSAVEKVYDAHHYTKDM 294

Query: 327 KEDAVAQTLKAGLDLDCGQYYTN----FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
            + A+A  + +GL+L+     T+     T  AV+QG V    +   +  L+   MRLG F
Sbjct: 295 LDTAIA-CVNSGLNLELSSNLTDNVMMQTTKAVKQGNVTMKTVKARVSPLFYTRMRLGEF 353

Query: 383 DGSPQYVSLGKQD---ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPH 439
           D  P+     K D   I S E+ EL+ +AA +  VLLKN+   LPL   K+  +AVVGP 
Sbjct: 354 D-PPEMNPYSKLDLSIIQSQEHQELSLKAAAKSFVLLKNENRFLPLKE-KIDKLAVVGPF 411

Query: 440 ANATVAMIGNYA-GIPCRYMSPIAGFSGYANV--TYKTGCDDVACKSNNSIFAASEAAKT 496
            +  + + G+ +  +    ++P  G S  A +  T+ +GC   AC   +   +  +A   
Sbjct: 412 GDNPIEIYGSKSPDVSNLTVTPRYGLSKIARLATTFASGCLSPACTEYDPK-STKQAIDR 470

Query: 497 ADATIILAGLDLSVEAESLDREDLWLPGYQTQLI-NQVAEVAKGPVILVIMSAGGVDIAF 555
            D  ++  G    VE E+ DR +L LPG Q +L+ + V   A  PVIL++ +AG +DI +
Sbjct: 471 VDMVVVCLGTGNEVENEAHDRSELTLPGQQLRLLQDAVTFAADKPVILLLFNAGPLDITW 530

Query: 556 AETNTNIKAILWAGYPGEEGGRAIADVVFGK--FNPGGRLPITWYNGDYVQMLPLTSMPL 613
           A +N  I  I+   +P +  G A+  +       NPGGRLPITW         P +   +
Sbjct: 531 AVSNPAIPVIVECFFPAQTTGTALYHLFVNSPGSNPGGRLPITW---------PKSMSQV 581

Query: 614 RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN 671
            P++     GRTY+++NG  L+PFGYGLSYT F Y+ L  T +  +     + C ++N
Sbjct: 582 PPMEDYTMEGRTYRYFNGDPLFPFGYGLSYTTFHYSDLLITPSTPI-----KPCSSIN 634


>gi|125534110|gb|EAY80658.1| hypothetical protein OsI_35835 [Oryza sativa Indica Group]
          Length = 511

 Score =  346 bits (888), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 195/486 (40%), Positives = 277/486 (56%), Gaps = 16/486 (3%)

Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETD 364
           Y+ +DCD++  + D H +   S ED VA ++KAG+D++CG Y       AVQ+G + E D
Sbjct: 16  YVASDCDAVATIRDAHHYTL-SPEDTVAVSIKAGMDVNCGNYTQVHAMAAVQKGNLTEKD 74

Query: 365 IDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
           ID++L  L+ V MRLG FDG P+    Y  LG  D+CS  +  LA EAA++GIVLLKND 
Sbjct: 75  IDRALVNLFAVRMRLGHFDGDPRSNAVYGHLGAADVCSPAHKSLALEAAQDGIVLLKNDA 134

Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA--NVTYKTGCDD 478
             LPL  + V ++AV+GP+A+   A+ GNY G PC   +P+ G  GY      +  GCD 
Sbjct: 135 GALPLQPSAVTSLAVIGPNADNLGALHGNYFGPPCETTTPLQGIKGYLGDRARFLAGCDS 194

Query: 479 VACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAK 538
            AC    +  AA+ A+ ++D  ++  GL    E E LDR  L LPG Q  LI  VA  A+
Sbjct: 195 PACAVAATNEAAALAS-SSDHVVLFMGLSQKQEQEGLDRTSLLLPGEQQGLITAVANAAR 253

Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
            PVILV+++ G VD+ FA+ N  I AIL AGYPG+ GG AIA V+FG  NP GRLP+TWY
Sbjct: 254 RPVILVLLTGGPVDVTFAKDNPKIGAILLAGYPGQAGGLAIAKVLFGDHNPSGRLPVTWY 313

Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLL-SFTKTI 657
             ++ + +P+T M +R   + GYPGR+Y+FY G T+Y FGYGLSY++F   +  SF+ + 
Sbjct: 314 PEEFTK-VPMTDMRMRADPATGYPGRSYRFYQGNTVYNFGYGLSYSKFSRRMFSSFSTSN 372

Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL---RCDDY-FEFKVDFQNVGSTDGSDVV 713
             NL+ L          D         LV ++   RC    F   V+ QN G  DG   V
Sbjct: 373 AGNLSLLAGVMARRAGDDGGGMSS--YLVKEIGVERCSRLVFPAVVEVQNHGPMDGKHSV 430

Query: 714 IVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEH 773
           ++Y + P +      +Q+IGF+   V+ G    + F  + C+  + V      ++  G H
Sbjct: 431 LMYLRWPTKSGGRPARQLIGFRSQHVKVGEKAMVSFEVSPCEHFSWVGEDGERVIDGGAH 490

Query: 774 TIFVGN 779
            + VG+
Sbjct: 491 FLMVGD 496


>gi|340369765|ref|XP_003383418.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 748

 Score =  346 bits (887), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 253/767 (32%), Positives = 370/767 (48%), Gaps = 106/767 (13%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF-------AHGVPRLGLPQYEWWSEALH 104
           F F D SLP   RVKD+V +++LD+ V+Q+          A G+P+  +  Y+W +E L 
Sbjct: 27  FPFRDPSLPIEERVKDIVDQLSLDQLVEQMAHGGAGSNGPAPGIPKFNIKPYQWGTECLS 86

Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG------ 158
           G  N G           ATSFP  I   ASFN  L K++  A + E RA           
Sbjct: 87  GDVNAG----------DATSFPMSIGMAASFNYDLLKQVSNATAYEVRAKNTAAVLNGSY 136

Query: 159 --RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
               GL+ WSP +N+ RDPRWGR  ET GEDP++ G     +V GLQ             
Sbjct: 137 AFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAFVTGLQ-----------GD 185

Query: 217 RPLKV--SSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
            P  V  ++ CKH+  +       + R  FDA VT  D   TFL  F+ CV+ G A S+M
Sbjct: 186 DPTYVIANAGCKHFDVHGGPEDTPLPRASFDANVTMIDWRMTFLPQFKACVEAG-ALSLM 244

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD-----SKED 329
           CSYNR+NG+P+CA+ KLL   +R EW+  GY+V+D  +++ +V  H +  D     +   
Sbjct: 245 CSYNRINGVPACANKKLLTDILRNEWNFKGYVVSDQGALENIVTQHHYAPDFVTAAADAA 304

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF---DGSP 386
                L+ G     G    +   +AV++G V    +  ++  L+ V  +LG F   D + 
Sbjct: 305 NAGTCLEDGNSEGKGGNVFDNLDDAVEKGLVSVDTLKDAVSRLFYVRTKLGEFDPPDNNN 364

Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNT---LPLNSAKVKTVAVVGPHANAT 443
            Y ++    I SDE+I+L+ +AA E IVL+KND +    LPL +   K   VVGP     
Sbjct: 365 PYANIPLSIIQSDEHIKLSIQAAMETIVLMKNDNDGSPFLPLAADDFKKACVVGPFIENA 424

Query: 444 VAMIGNYAG--IPCRYMSPIAGFS----GYANVTYKTGCDD-VACKSNNSIFAASEAAKT 496
             M G+Y+   +    ++P+AG      G   + Y+ GC D  AC+  +  +    A + 
Sbjct: 425 DTMFGDYSPTMMTDYIVTPLAGIKTTQIGSDLLNYEDGCTDGPACEIYDG-YKVRTACEG 483

Query: 497 ADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG--PVILVIMSAGGVDIA 554
            D  I+ AGL   +E E  D  D++LPG+Q  L+   AE A G  P+IL++ +A  +DI+
Sbjct: 484 VDLVIVTAGLSRYLEHEGHDISDIYLPGHQMSLLTD-AESASGSAPIILLLFNANPLDIS 542

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A++N    AIL A YPG+E G AIA+V+ G +NP GRLP TW          L  +P  
Sbjct: 543 YAKSNPRFAAILEAYYPGQEAGVAIANVLTGSYNPAGRLPNTW-------PASLDQVP-- 593

Query: 615 PVDSLGY--PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
             D + Y    RTY+++    LYPFGYGLS+T F Y+ L+   T   N            
Sbjct: 594 --DMIDYTMKERTYRYFTQEPLYPFGYGLSFTTFNYSDLNVASTANTN------------ 639

Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
                              +      V   N G+ DG +V   Y K      A  I Q++
Sbjct: 640 ------------------GEGSIAVSVTVMNTGTMDGDEVTQAYVKWDNVAEAPNI-QLV 680

Query: 733 GFQRVFVRAGRNKRIKFVFNACK-SLNIVDYAANTLLPAGEHTIFVG 778
           G  R F+  G++  + F     +  + I        +P G +++FVG
Sbjct: 681 GVSRKFISKGQSITVSFTIKPEQLQVWINGDDGKWSIPGGTYSLFVG 727


>gi|340368019|ref|XP_003382550.1| PREDICTED: probable beta-D-xylosidase 2-like [Amphimedon
           queenslandica]
          Length = 742

 Score =  345 bits (886), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 247/732 (33%), Positives = 369/732 (50%), Gaps = 97/732 (13%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQ-------LGDFAHGVPRLGLPQYEWWSEALH 104
           F F ++SL    RVKD+V  +TL+E V+Q       L   A G+PRL +  Y+W +E L 
Sbjct: 24  FPFQNTSLSIEDRVKDIVDNLTLEELVEQMAHGGATLNGPAPGIPRLHINPYQWGTECLS 83

Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG------ 158
           G  NV  G         ATSFP  I   ASFN  L K++  A + E RA +         
Sbjct: 84  G--NVSAGD--------ATSFPMPIGMAASFNYDLLKRVTNATAYEVRAKHAAAVKDGSY 133

Query: 159 --RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
               GL+ WSP +N+ RDPRWGR  ET GEDP++ G     YV GLQ   G+      NS
Sbjct: 134 AFHTGLSCWSPVLNIMRDPRWGRNQETYGEDPYLSGYLGQAYVNGLQ---GN------NS 184

Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
           R +  ++ CKH+  +         R+ FDA+V+ +D   TFL  F+ CV+ G A S+MCS
Sbjct: 185 RYIIANAGCKHFDVHGGPENIPTSRFSFDAKVSMRDWRMTFLPQFKACVEAG-ALSLMCS 243

Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
           YNR+NG+P+CA+  LL   +R EWD  GY+V+D  +++ +V  H +  D  + A      
Sbjct: 244 YNRINGVPACANKALLTDILRNEWDFKGYVVSDQGALEFIVIEHHYAPDFMKAAADAANA 303

Query: 337 AGL--DLDCGQYYTNFTG---NAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYV 389
                D + G+ + N      +AV+   V    +  ++  L+ V M+LG FD   +  Y 
Sbjct: 304 GTCLEDGNIGRKFFNVFEHLVDAVKNNLVSVDTLKNAVSRLFYVRMKLGEFDPPDNNPYA 363

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQN----TLPLNSAKVKTVAVVGPHANATVA 445
           ++    I SD +I L+ +AA E IVL+KND       LP+ + +VK   +VGP ++    
Sbjct: 364 NIPLSVIQSDAHINLSLQAAMESIVLMKNDDGFRSPFLPITN-EVKKACMVGPFSDDPEV 422

Query: 446 MIGNYAGIPCR--YMSPIAGFS----GYANVTYKTGCDD-VACKSNNSIFAASEAAKTAD 498
           + G+Y+    R   ++ +AG      G   + Y  GC+D  AC++ +S    S A    +
Sbjct: 423 LFGDYSPTLMRDYVITSLAGLKNANIGTDTLNYAVGCEDGPACRNYDSAKVRS-ACDGVE 481

Query: 499 ATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAK-GPVILVIMSAGGVDIAFAE 557
             I+ AGL   +E+E  D  D+ LPG+Q  L+      +K   VIL++ +A  +DI +A+
Sbjct: 482 LIIVTAGLSKHLESEGKDLSDINLPGHQLDLMQDAEAASKNASVILILFNASPLDIRYAK 541

Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD 617
           T+  I  IL A YPG+  G+AIA+V+ G++NP GRLP TW         P +   +  + 
Sbjct: 542 TDPRIVGILEAYYPGQTAGKAIANVLTGEYNPSGRLPNTW---------PASLDQVPGIT 592

Query: 618 SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDAS 677
           +     RTY+++    LYPFGYGLSYT F Y+                   NLN +S A+
Sbjct: 593 NYTMKERTYRYFTQEPLYPFGYGLSYTTFHYS-------------------NLNISSTAT 633

Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
            +    + V+ L             N GS DG++V  VY      I+     Q++G  + 
Sbjct: 634 ASGAGMIAVSVL-----------VTNTGSMDGTEVTQVYVW--CNISYAPKLQLVGVNKD 680

Query: 738 FVRAGRNKRIKF 749
           F+  G+   + F
Sbjct: 681 FISKGKTLEVSF 692


>gi|390956994|ref|YP_006420751.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
 gi|390411912|gb|AFL87416.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
          Length = 742

 Score =  345 bits (885), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 254/768 (33%), Positives = 367/768 (47%), Gaps = 112/768 (14%)

Query: 5   VSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIR 64
           + S+   + ++ L V S  +  A GS++P    +              ++ D S P   R
Sbjct: 3   LRSVALSTAAVLLSVASCVSASAQGSNAPASGGE--------------VYRDMSRPIEDR 48

Query: 65  VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATS 124
           + DL+ R TL EK  QL     GVPRLGLP +  W++ LHGV +  P           T 
Sbjct: 49  ITDLIKRFTLQEKAMQLNHTNRGVPRLGLPMWGGWNQTLHGVWSKQP----------TTL 98

Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG------LTYWSPNINVARDPRWG 178
           FP      A+++  L   +  A+S EARA+YN    G      L Y SP IN++RDPRWG
Sbjct: 99  FPIPTAMGATWDPELVHTVADAMSDEARALYNAHAEGPRTPHGLVYRSPVINISRDPRWG 158

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           RI E   EDP + GR  V YVRGLQ         DL    LK+++  KH+A  +V++   
Sbjct: 159 RIQEVFSEDPLLTGRMGVAYVRGLQ-------GDDLQH--LKLAATVKHFAVNNVES--- 206

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
             R H +A V E+++ E +L  +   + E  A SVM SYN +NG+P   +  LL   +R 
Sbjct: 207 -GRQHLNADVDERNLFEFWLPHWRAAIMEAHAQSVMSSYNAINGMPDAVNHWLLTDVLRK 265

Query: 299 EWDLHGYIVADCDSIQVMVDNH--------KFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
           +W   G++  D  ++ ++            +  ++    A A  ++AG D D  ++ TN 
Sbjct: 266 KWGFDGFVTDDLGAVALLSGTRATNTSEPGQHFSEDPVVAAAAAIRAGNDSDDVEFETNL 325

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAE 407
              AVQ+G + E D+D +L+ +  V  RLG +D  PQ   Y  +G   + S  + +L+  
Sbjct: 326 P-LAVQRGLLTEKDVDGALRNVLRVGFRLGAYD-PPQASKYSRIGMDVVRSQAHRDLSQR 383

Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY 467
            A E + LL N +  LPL   +VK+VAV+GP A       GNY G P    S   G    
Sbjct: 384 VAEESMTLLLNRRQFLPLQRDQVKSVAVIGP-AGGEAYETGNYYGTPAVKTSVTEGLRAL 442

Query: 468 ----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
                 V Y+ G   V    +  I  A+  A+ +D  ++  G +L VEAE  DR DL LP
Sbjct: 443 LGSGVKVEYEKGAGYVDLADDKEIERAANLARKSDVVVLCLGTNLQVEAEGRDRRDLNLP 502

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
           G Q +L+  V   A   V LV+M+AG + + +A  + ++ AIL A YPGE GG AIA  +
Sbjct: 503 GAQQRLLEAVY-AANPKVALVLMNAGPLGVTWA--HDHVPAILSAWYPGELGGAAIARTL 559

Query: 584 FGKFNPGGRLPITWY-NGDYVQMLPLTSMPLRPVD-SLGYPGRTYKFYNGPTLYPFGYGL 641
           FG  NPGG LP T Y N D V        P    D S GY   TY+++ G  LYPFG+GL
Sbjct: 560 FGLNNPGGHLPYTVYANLDGVP-------PQNEYDVSRGY---TYQYFKGVPLYPFGHGL 609

Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
           SYT F Y+ L  T+T                                    D+    V F
Sbjct: 610 SYTHFDYSKLKVTQT----------------------------------SGDHANVTVSF 635

Query: 702 --QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
              N G + G++V  +YS          ++ + GF+RV ++ G +K +
Sbjct: 636 TLTNTGQSAGAEVTQLYSHQVKSSEVQPLRTLRGFERVTLQPGESKAV 683


>gi|5690010|emb|CAB51937.1| Family 3 Glycoside Hydrolase [Ruminococcus flavefaciens]
          Length = 690

 Score =  344 bits (883), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 237/752 (31%), Positives = 363/752 (48%), Gaps = 113/752 (15%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D +L    R +D+  R++ +EK +Q    A    RLG   Y WWSE LHGV+  G   
Sbjct: 6   YLDEALSDLERAEDITDRLSTEEKAEQQKYDAPAEERLGKDAYNWWSEGLHGVARAGT-- 63

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   A F++    + G+  S EARA YN   A        GLT W
Sbjct: 64  --------ATMFPQTIGMAAMFDDEAVHRAGETTSREARAKYNEYSAHDDRDIYKGLTLW 115

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPN+N+ RDPRWGR  ET GEDP++     V Y +GLQ           + + L+ ++C 
Sbjct: 116 SPNVNIFRDPRWGRGQETYGEDPYLTSCLGVAYAKGLQG----------DGKVLRTAACA 165

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A   V +     R+ FDA+   +DM ET++  FE  VK+    SVM +YNRVNG P+
Sbjct: 166 KHFA---VHSGPEATRHEFDAKANMKDMTETYIAAFEALVKDAKVESVMGAYNRVNGEPA 222

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           CA   ++N+    EW   G+ V+DC +I+    NH     + E A A  LK G DL+CG 
Sbjct: 223 CASDFVMNKLE--EWGFDGHFVSDCWAIRDFHTNHGVTKTAPESA-ALALKKGCDLNCGN 279

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
            Y +    A  +G + E D+ +S   L    +RLG FD S +Y  L    +  DE+ E +
Sbjct: 280 TYLHLLA-AFNEGLINEEDLRRSCIKLMRTRVRLGMFDKSTEYDGLDYDIVACDEHKEFS 338

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
              +   +VLLKN+   LPL+ +K KT+ V+GP+A++  A+ GNY G    Y++ ++G  
Sbjct: 339 LRCSERSMVLLKNN-GILPLDGSKYKTIGVIGPNADSVPALEGNYNGKADEYITFLSGIR 397

Query: 466 ---------GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE--- 513
                       +  YK  C  +A   ++ +  A    +T   +  L  LD ++E E   
Sbjct: 398 EAHDGRVLYTEGSHLYKDRCMGLAL-PDDRLSEAEIITRTLRCSGSLCWLDATIEGEEGD 456

Query: 514 ------SLDREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAETNTNIKAIL 566
                 S D+ DL LP  Q +L+  V  +AKG PVI+V  +   +++       +  A++
Sbjct: 457 TGNEFSSGDKNDLRLPESQRKLVKTV--MAKGKPVIIVTAAGSAINV-----EADCDALI 509

Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTY 626
            A YPG+ GGRA+A+++FGK +P G+LP+T+Y  D  ++   +   ++         RTY
Sbjct: 510 QAWYPGQLGGRALANILFGKVSPSGKLPVTFYE-DASKLPDFSDYSMK--------NRTY 560

Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV 686
           ++  G  L+PFGYGL+Y++ + + LSF   +                             
Sbjct: 561 RYSEGNILFPFGYGLTYSETECSELSFENGVAT--------------------------- 593

Query: 687 NDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKR 746
                       V   N GS    DVV +Y K  +E A      + GF+RV + AG ++ 
Sbjct: 594 ------------VKVTNTGSRFTEDVVQIYIKGYSENAVPN-HSLCGFKRVALDAGESRI 640

Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           ++      ++   V+     +    E T++ G
Sbjct: 641 VQITLPE-RAFMAVNEKGEWIKEGSEFTLYAG 671


>gi|325679939|ref|ZP_08159508.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
           albus 8]
 gi|324108377|gb|EGC02624.1| glycosyl hydrolase family 3 C-terminal domain protein [Ruminococcus
           albus 8]
          Length = 691

 Score =  344 bits (883), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 222/628 (35%), Positives = 330/628 (52%), Gaps = 74/628 (11%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D SL    R + L   MT +E+  QL   A  + RLG+P Y WW+E +HG++  G   
Sbjct: 4   YLDESLSAEERAEALTDEMTTEEQASQLRYDAPAIERLGIPAYNWWNEGIHGLARSGV-- 61

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   A F++ L K+  +  S EARA YN            GLT W
Sbjct: 62  --------ATMFPQAIGLAAMFDDELTKRTAEITSEEARAKYNAYTVEGDRDIYKGLTLW 113

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN+ RDPRWGR  ET GEDP++  +     VRGLQ           + + +K ++C 
Sbjct: 114 APNINIFRDPRWGRGHETFGEDPYLTAQNGKAVVRGLQG----------DGKVMKAAACA 163

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A   V +     R+ FDA+   +DMEET+L  FE  VKE    SVM +YNRVNG P+
Sbjct: 164 KHFA---VHSGPEALRHSFDAKADAKDMEETYLPAFEALVKEAKVESVMGAYNRVNGEPA 220

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           CA   L+ +    EW+  GY V+DC +I+   ++H   A++ E A A  LKAG D++CG 
Sbjct: 221 CASDYLMEKL--KEWEFDGYFVSDCWAIRDFHEHHMVTANAVESA-AMALKAGCDVNCGC 277

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELA 405
            Y N    A+ +G + +  I  +  +L    +RLG FD    +  +    +   E+  ++
Sbjct: 278 TYQNLLA-ALDKGLITKEQIRTACVHLMRTRIRLGMFDKHTDFDDIPYSKVACAEHKAVS 336

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG-- 463
            E A + +VLLKN+   LPL+  K KT+AV+GP+A++  A+ GNY G+  RY + + G  
Sbjct: 337 LECAEKSLVLLKNN-GILPLDDKKYKTIAVIGPNADSRTALEGNYNGLSDRYTTFLNGIQ 395

Query: 464 --FSGYANVTYKTGCDDVACKSNNSIFAASE-------AAKTADATIILAGLDLSVEAE- 513
             F G   V +  GC  +  KS + +  A +       AAK AD  I+  GLD ++E E 
Sbjct: 396 DRFEG--RVIFAEGC-HLYKKSISGLAQAGDRYAEAVAAAKNADLVIMCVGLDATIEGEE 452

Query: 514 --------SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
                   S D+  L LP  Q  L+ ++  V K PV+ V+ +   ++     T +   A+
Sbjct: 453 GDTGNEFSSGDKNGLTLPPPQKILVEKIMSVGK-PVVTVVCAGSAIN-----TESQPDAL 506

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           + A YPG EG +A+A+V+FG  +P G+LP+T+Y  D  ++   T   ++        GRT
Sbjct: 507 IHAFYPGAEGSKALAEVLFGDVSPSGKLPVTFYE-DTDKLPEFTDYSMK--------GRT 557

Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSF 653
           Y++     L+PFGYGL+Y   K N + +
Sbjct: 558 YRYTTDNILFPFGYGLTYGGVKVNAVEY 585


>gi|359473580|ref|XP_003631325.1| PREDICTED: protein BRASSINOSTEROID INSENSITIVE 1-like [Vitis
           vinifera]
          Length = 785

 Score =  344 bits (883), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 166/285 (58%), Positives = 212/285 (74%), Gaps = 2/285 (0%)

Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETD 364
           YIV+DC  ++V+VDN  +L +SK DAVA+TL+AGLDL+CG YYT+    +V  GKV + +
Sbjct: 10  YIVSDCYGLEVIVDNQNYLNESKVDAVAKTLQAGLDLECGHYYTDALNESVLTGKVSQYE 69

Query: 365 IDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLP 424
           +D++LK +Y +LMR+G+FDG P Y SLG +DIC+ ++IELA EAAR+GIVLLKND   LP
Sbjct: 70  LDRALKNIYVLLMRVGYFDGIPAYESLGLKDICAADHIELAREAARQGIVLLKNDYEVLP 129

Query: 425 LNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSN 484
           L   K   + +VGPHANAT  MIGNYAG+P +Y+SP+  FS   NVTY TGC D +C ++
Sbjct: 130 LKPGK--KLVLVGPHANATEVMIGNYAGLPYKYVSPLEAFSAIGNVTYATGCLDASCSND 187

Query: 485 NSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
                A EAAK A+ TII  G DLS+EAE +DR D  LPG QT+LI QVAEV+ GPVILV
Sbjct: 188 TYFSEAKEAAKFAEVTIIFVGTDLSIEAEFVDRVDFLLPGNQTELIKQVAEVSSGPVILV 247

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
           ++S   +DI FA+ N  I AILW G+PGE+GG AIADVVFGK+NP
Sbjct: 248 VLSGSNIDITFAKNNPRISAILWVGFPGEQGGHAIADVVFGKYNP 292


>gi|147826476|emb|CAN72807.1| hypothetical protein VITISV_033721 [Vitis vinifera]
          Length = 236

 Score =  343 bits (881), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 157/216 (72%), Positives = 177/216 (81%)

Query: 34  VFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGL 93
            +VCD  R++ LGL M SF FCD SL Y  R KDLVSRMTL EKV Q    A GV RLGL
Sbjct: 16  TYVCDESRYALLGLDMKSFAFCDKSLSYEERAKDLVSRMTLQEKVMQSVHTASGVRRLGL 75

Query: 94  PQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA 153
           P+Y WWSEALHG+SN+GPG  FD+ IPGATSFPTVIL+TA+FN++LWK +G+ VSTE RA
Sbjct: 76  PEYSWWSEALHGISNLGPGVFFDETIPGATSFPTVILSTAAFNQTLWKTLGRVVSTEGRA 135

Query: 154 MYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
           MYNLG AGLT+WSPNINV RD RWGR  ET GEDPF+VG +AVNYVRGLQDVEG EN TD
Sbjct: 136 MYNLGHAGLTFWSPNINVVRDTRWGRTQETSGEDPFIVGEFAVNYVRGLQDVEGTENVTD 195

Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVT 249
           LNSRPLKVSSCCKHYAAYD+D+W  VDR+ FDARV+
Sbjct: 196 LNSRPLKVSSCCKHYAAYDIDSWLNVDRHTFDARVS 231


>gi|224068504|ref|XP_002302759.1| predicted protein [Populus trichocarpa]
 gi|222844485|gb|EEE82032.1| predicted protein [Populus trichocarpa]
          Length = 273

 Score =  342 bits (878), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 163/261 (62%), Positives = 191/261 (73%), Gaps = 14/261 (5%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
           F CDP   +       +F FC   LP   RV DL+ RMTL EKV  L + A  VPRLG+ 
Sbjct: 27  FACDPEDGTS-----RNFPFCQVKLPIQSRVSDLIGRMTLQEKVGLLVNDAAAVPRLGIK 81

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            YEWWSEALHGVSNVGPGT F    PGATSFP VI T ASFN +LW+ IG+ VS EARAM
Sbjct: 82  GYEWWSEALHGVSNVGPGTQFGGAFPGATSFPQVITTAASFNATLWEAIGRVVSDEARAM 141

Query: 155 YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
           +N G AGLTYWSPN+N+ RDPRWGR  ETPGEDP V G+YA +YVRGLQ  +G       
Sbjct: 142 FNGGVAGLTYWSPNVNIFRDPRWGRGQETPGEDPVVAGKYAASYVRGLQGNDGDR----- 196

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
               LKV++CCKH+ AYD+DNW GVDR+HF+A+V++QDME+TF  PF MCVKEG  +SVM
Sbjct: 197 ----LKVAACCKHFTAYDLDNWNGVDRFHFNAQVSKQDMEDTFDVPFRMCVKEGKVASVM 252

Query: 275 CSYNRVNGIPSCADPKLLNQT 295
           CSYN+VNGIP+CADPKLL +T
Sbjct: 253 CSYNQVNGIPTCADPKLLKKT 273


>gi|6573772|gb|AAF17692.1|AC009243_19 F28K19.27 [Arabidopsis thaliana]
          Length = 696

 Score =  341 bits (874), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 197/493 (39%), Positives = 292/493 (59%), Gaps = 30/493 (6%)

Query: 309 DCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKS 368
           DCD++ ++ D   + A S EDAVA  LKAG+D++CG Y    T +A+QQ KV ETDID++
Sbjct: 221 DCDAVSIIYDAQGY-AKSPEDAVADVLKAGMDVNCGSYLQKHTKSALQQKKVSETDIDRA 279

Query: 369 LKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
           L  L++V +RLG F+G P    Y ++   ++CS  +  LA +AAR GIVLLKN+   LP 
Sbjct: 280 LLNLFSVRIRLGLFNGDPTKLPYGNISPNEVCSPAHQALALDAARNGIVLLKNNLKLLPF 339

Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSN 484
           +   V ++AV+GP+A+    ++GNYAG PC+ ++P+     Y  N  Y  GCD VAC SN
Sbjct: 340 SKRSVSSLAVIGPNAHVVKTLLGNYAGPPCKTVTPLDALRSYVKNAVYHQGCDSVAC-SN 398

Query: 485 NSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
            +I  A   AK AD  +++ GLD + E E  DR DL LPG Q +LI  VA  AK PV+LV
Sbjct: 399 AAIDQAVAIAKNADHVVLIMGLDQTQEKEDFDRVDLSLPGKQQELITSVANAAKKPVVLV 458

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
           ++  G VDI+FA  N  I +I+WAGYPGE GG AI++++FG  NPGGRLP+TWY   +V 
Sbjct: 459 LICGGPVDISFAANNNKIGSIIWAGYPGEAGGIAISEIIFGDHNPGGRLPVTWYPQSFVN 518

Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT-IQVNLNK 663
            + +T M +R   + GYPGRTYKFY GP +Y FG+GLSY+ + Y   +  +T + +N +K
Sbjct: 519 -IQMTDMRMR--SATGYPGRTYKFYKGPKVYEFGHGLSYSAYSYRFKTLAETNLYLNQSK 575

Query: 664 LQ-HCRNLNYT--SDASKTRCPGVLVNDLRCDDYFEFK--VDFQNVGSTDGSDVVIVYSK 718
            Q +  ++ YT  S+  K  C           D  + K  V+ +N G   G   V+++++
Sbjct: 576 AQTNSDSVRYTLVSEMGKEGC-----------DVAKTKVTVEVENQGEMAGKHPVLMFAR 624

Query: 719 PP--AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
                E      KQ++GF+ + +  G    ++F    C+ L+  +     +L  G++ + 
Sbjct: 625 HERGGEDGKRAEKQLVGFKSIVLSNGEKAEMEFEIGLCEHLSRANEFGVMVLEEGKYFLT 684

Query: 777 VGNGGVSFPIHLN 789
           VG+     P+ +N
Sbjct: 685 VGDS--ELPLIVN 695



 Score =  233 bits (593), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 113/215 (52%), Positives = 143/215 (66%), Gaps = 14/215 (6%)

Query: 30  SSSPVFVCDPGR-FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGV 88
           S+ P   CDP    +KL      + FC + LP   R +DLVSR+T+DEK+ QL + A G+
Sbjct: 19  SAPPPHSCDPSNPTTKL------YQFCRTDLPIGKRARDLVSRLTIDEKISQLVNTAPGI 72

Query: 89  PRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVS 148
           PRLG+P YEWWSEALHGV+  GPG  F+  +  ATSFP VILT ASF+   W +I Q + 
Sbjct: 73  PRLGVPAYEWWSEALHGVAYAGPGIRFNGTVKAATSFPQVILTAASFDSYEWFRIAQVIG 132

Query: 149 TEARAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ--DV 205
            EAR +YN G+A G+T+W+PNIN+ RDPRWGR  ETPGEDP + G YAV YVRGLQ    
Sbjct: 133 KEARGVYNAGQANGMTFWAPNINIFRDPRWGRGQETPGEDPMMTGTYAVAYVRGLQGDSF 192

Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
           +G +      S  L+ S+CCKH+ AYD+D WK  D
Sbjct: 193 DGRKTL----SNHLQASACCKHFTAYDLDRWKDCD 223


>gi|323344407|ref|ZP_08084632.1| beta-glucosidase [Prevotella oralis ATCC 33269]
 gi|323094534|gb|EFZ37110.1| beta-glucosidase [Prevotella oralis ATCC 33269]
          Length = 722

 Score =  340 bits (873), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 249/766 (32%), Positives = 378/766 (49%), Gaps = 106/766 (13%)

Query: 42  FSKLGLQMSSFLFCDSSLPYSI--RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWW 99
           F  + L   +F F  S     +  + K ++S++TLDEK+ QL   A G+ RLG+  Y W 
Sbjct: 10  FISVALVSVTFTFAQSKKEKEMIQKAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWL 69

Query: 100 SEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR 159
           +EALHGV   G           AT FP  I   A+F+  + ++IG A++TE RA + + +
Sbjct: 70  NEALHGVGRDGR----------ATVFPQPISLGATFDPEIVQQIGDAIATEGRAKFIVAQ 119

Query: 160 --------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
                   AGLT+W+PN+N+ RDPRWGR  ET GEDPF+ G     +V+G+Q        
Sbjct: 120 RQKNYSMYAGLTFWAPNVNIFRDPRWGRGMETYGEDPFLTGVLGTAFVKGMQ-------- 171

Query: 212 TDLNSRP--LKVSSCCKHYAAYDVDNWKGVDRYHFDARV--TEQDMEETFLRPFEMCVKE 267
               + P  LK ++C KH+A +      G +R    A V  T+ D+ ET+L  F+M V++
Sbjct: 172 ---GNDPFYLKAAACGKHFAVHS-----GPERTRHTANVEPTKHDLYETYLPAFKMLVQQ 223

Query: 268 GDASSVMCSYNRVNGIPSCADPK-LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
           G   S+M +Y R+ G  SC+  K LL   +R +W   G++V+DC ++  M + HK L  S
Sbjct: 224 GKVESIMGAYQRLYG-ESCSGSKYLLTDILRKDWGFKGHVVSDCGAVTDMYEGHK-LVKS 281

Query: 327 KEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF--DG 384
           + +AVA  +KAGL+L+CG        +A++Q  + E D+DK+L  L    ++LG    D 
Sbjct: 282 EAEAVAFAIKAGLNLECGNSMRTMK-DALKQKLITEKDLDKALLPLMMTRLKLGILQPDV 340

Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
           +  Y    +  I S +N  +A  AA E +VLLKND   LP+ +  ++T+ V GP A    
Sbjct: 341 ACPYNEFPESVIGSIDNRNIAQRAAEESMVLLKND-GVLPI-AKDIRTLFVTGPGATDAY 398

Query: 445 AMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
            ++GNY G+  RY + + G  G      +V YK G   V    N+  ++ SE ++ A+ +
Sbjct: 399 YLMGNYFGLSDRYSTYLEGIVGKVSNGTSVNYKQGFMQVFKNLNDVNWSVSE-SRGAEVS 457

Query: 501 IILAGLDLSVE---------AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
           II+ G   + E         +E  DR DL LP  Q Q + +V++     +++V+   GG 
Sbjct: 458 IIIMGNSGNTEGEEGDAIASSERGDRVDLRLPEPQMQYLREVSKDRTNKLVVVL--TGGS 515

Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSM 611
            I   E      A++ A YPG+EGG A+A+++FG  N  GRLP+T+         P T+ 
Sbjct: 516 PIDVKEITELADAVVMAWYPGQEGGVALANLLFGDANFSGRLPVTF---------PETTD 566

Query: 612 PLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN 671
            L   D     GRTYK+     LYPFGYGLSY +  Y   + TK                
Sbjct: 567 KLPSFDDYSMKGRTYKYMTDNILYPFGYGLSYGKVAYGNATVTKLP-------------- 612

Query: 672 YTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQV 731
            T  +S T                   VD  N G+    +VV VY   P+    + I+ +
Sbjct: 613 -TKHSSMT-----------------VSVDLSNDGNMPVDEVVQVYLSTPSAGVTSPIESL 654

Query: 732 IGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
           + F+RV +         F     + L  V     + L  GE+ + +
Sbjct: 655 VAFKRVKIAPHATVTTDFEI-PVERLETVQEDGTSKLLKGEYRVMI 699


>gi|405968899|gb|EKC33925.1| Putative beta-D-xylosidase 5 [Crassostrea gigas]
          Length = 748

 Score =  340 bits (872), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 241/732 (32%), Positives = 367/732 (50%), Gaps = 95/732 (12%)

Query: 42  FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL--------GDFAHGVPRLGL 93
           F+   L  S+F F + SL +S RV DLV R+TLD+ VQQL        G  A  +  LG+
Sbjct: 14  FALTPLASSNFPFQNVSLSWSERVDDLVGRLTLDQIVQQLARGGAGLNGGPAPAIENLGI 73

Query: 94  PQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA 153
             Y+W +E L G    G           ATSFP  I   A+F++ L   + +A +TE RA
Sbjct: 74  GPYQWNTECLRGDVEAG----------NATSFPQAIGLAAAFSKDLIFNVSKAAATEVRA 123

Query: 154 MYN--------LGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV 205
            +N            GL+ +SP +N+ R P WGR  ET GEDP++ G YA  +V+GLQ  
Sbjct: 124 KHNDFVKRGIFTDHTGLSCFSPVVNIMRHPLWGRNQETYGEDPYLSGTYASYFVQGLQG- 182

Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
             H+       R ++ ++ CKH+ A+         R  FDA+V+ +D+  TFL  F+ CV
Sbjct: 183 -DHD-------RYIQANAGCKHFDAHGGPEDIPESRMGFDAKVSMRDLRLTFLPAFQKCV 234

Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
           + G A S+MCSYN +NG+P+C++  L+   +RGEW+  GY+V+D  +I+  +  H +  +
Sbjct: 235 QAG-AYSLMCSYNSINGVPACSNKLLMMDILRGEWNFTGYVVSDEGAIENQISFHHYYNN 293

Query: 326 SKEDAVAQTLKAGLDLDCGQYYTN----FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
           S EDA A ++ AG +L+     T       G+AV+ GK++E+ +   +K L+   MRLG 
Sbjct: 294 S-EDAAAGSVNAGCNLELSGNLTEPVFMKIGDAVKSGKLEESVVRNRVKPLFYTRMRLGE 352

Query: 382 FDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLP---LNSAKVKTVAV 435
           FD  P+   Y S+    I S+E+  L+  AA + +VLLK          +     + +AV
Sbjct: 353 FD-PPEMNPYSSVNLSVIQSEEHRNLSLTAAAKSLVLLKRPSKFSKRHLIGGFPSERMAV 411

Query: 436 VGPHANATVAMIGNYAGI--PCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASE 492
           +GP AN T  + G+Y+    P    +P+ G +    ++ Y  GC D     N S      
Sbjct: 412 IGPMANNTDQIFGDYSPTTDPRFVKTPLKGLTELNFSMNYAAGCVDGTRCLNYSQDDVKT 471

Query: 493 AAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVD 552
           A   AD  ++  G    +E+E++DR+D+ LPG Q QL+  V  +    V L++ SAG V+
Sbjct: 472 ALVGADLVVVCLGTGKDLESENVDRKDMMLPGKQLQLLQDVVSMTNKAVYLLVFSAGPVN 531

Query: 553 IAFAETNTNIKAILWAGYPGEEGGRAIADVVF---GKFNPGGRLPITWYNGDYVQMLPLT 609
           I +A+ +  +  IL   YP +  G AI   +    G+FNP GRLP TWY   Y + +P  
Sbjct: 532 ITWAQESERVLIILQCFYPAQSAGDAITQALIMRDGRFNPAGRLPYTWYR--YTEQIP-- 587

Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
                 +       +TY+++ G  LYPFGYGLSY+ F ++ L F   +            
Sbjct: 588 -----EMTDYSMARKTYRYFTGVPLYPFGYGLSYSTFVFSKLYFLPKVNAG--------- 633

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
                       P V+            +V   N G  DG +V+ VY K  +        
Sbjct: 634 -----------DPNVV------------QVRVFNEGPFDGDEVLQVYIKWMSTKERMPRV 670

Query: 730 QVIGFQRVFVRA 741
           Q++ F+RVF+R+
Sbjct: 671 QLVAFERVFIRS 682


>gi|429738050|ref|ZP_19271875.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
           F0055]
 gi|429161155|gb|EKY03583.1| glycosyl hydrolase family 3 protein [Prevotella saccharolytica
           F0055]
          Length = 722

 Score =  340 bits (871), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 235/702 (33%), Positives = 360/702 (51%), Gaps = 99/702 (14%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           + K ++S++TLDEK+ QL   A G+ RLG+  Y W +EALHGV   G           AT
Sbjct: 34  KAKSIISQLTLDEKISQLTQDAKGIDRLGIKPYYWLNEALHGVGRDGR----------AT 83

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYWSPNINVARDP 175
            FP  I   A+F+  +  +IG A++TE RA + + +        AGLT+W+PN+N+ RDP
Sbjct: 84  VFPQPINLGATFDPKIVHQIGDAIATEGRAKFIVAQRQKNYSMYAGLTFWAPNVNIFRDP 143

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDPF+ G     +V+G+Q  +            LK ++C KH+A +    
Sbjct: 144 RWGRGMETYGEDPFLTGTLGTAFVKGMQGDDPFY---------LKAAACGKHFAVHS--- 191

Query: 236 WKGVDRYHFDARV--TEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK-LL 292
             G +R    A V  T++D+ ET+L  F+M V++G   S+M +Y R+ G  SC+  K LL
Sbjct: 192 --GPERTRHTANVEPTKRDLYETYLPAFKMLVQKGKVESIMGAYQRLYG-ESCSGSKYLL 248

Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
              +R +W   G++V+DC ++  M + HK L  S+ +AVA  +KAGL+L+CG        
Sbjct: 249 TDILRKDWGFKGHVVSDCGAVTDMYEGHK-LVKSEAEAVAFAIKAGLNLECGNSMRTMK- 306

Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFF--DGSPQYVSLGKQDICSDENIELAAEAAR 410
           +A+QQ  + E D+DK+L  L    ++LG    D +  Y    +  I S+ N ++A +AA 
Sbjct: 307 DAIQQKLITEKDLDKALLPLMMTRLKLGILQPDAACPYNEFPESVIGSEANRKIAEQAAE 366

Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY--- 467
           E +VLLKN+   LP+ +  ++T+ V GP A     ++GNY G+  RY + + G  G    
Sbjct: 367 ESMVLLKNN-GVLPI-AKDIRTLFVTGPGATDAYYLMGNYFGLSNRYSTYLEGIVGKVSN 424

Query: 468 -ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE---------AESLDR 517
             +V YK G   V    N+  ++ SE ++ A+ +I++ G   + E         AE  DR
Sbjct: 425 GTSVNYKQGFMQVFKNLNDVNWSVSE-SRGAEVSILIMGNSGNTEGEEGDAIASAERGDR 483

Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
            +L LP  Q + + +V++     +++V+   GG  I   E      A++ A YPG+EGG 
Sbjct: 484 VNLRLPDSQMEYLREVSKDRTNKLVVVL--TGGSPIDVKEITELADAVVMAWYPGQEGGV 541

Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
           A+A+++FG  N  GRLP+T+         P ++  L   D     GRTYK+     LYPF
Sbjct: 542 ALANLLFGDANFSGRLPVTF---------PESADRLPAFDDYSMKGRTYKYMTDNILYPF 592

Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
           GYGLSY++  Y                         S+A+ T+ P               
Sbjct: 593 GYGLSYSKVTY-------------------------SNAAVTKMPTKTTP-------MTV 620

Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
            VD  N G     +VV VY   P     + I+ +IGF+RV +
Sbjct: 621 YVDVTNNGDMPVDEVVQVYLSTPGAGNTSPIESLIGFKRVKI 662


>gi|390630430|ref|ZP_10258413.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
 gi|390484359|emb|CCF30761.1| Beta-xylosidase B [Weissella confusa LBAE C39-2]
          Length = 674

 Score =  338 bits (867), Expect = 7e-90,   Method: Compositional matrix adjust.
 Identities = 221/687 (32%), Positives = 350/687 (50%), Gaps = 99/687 (14%)

Query: 91  LGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE 150
           + +P+Y +W+EALHGV+  G           AT FP  I   A+F++ L  +I   + TE
Sbjct: 1   MNIPEYNYWNEALHGVARAGV----------ATVFPQAIGLAATFDDHLINEIADVIGTE 50

Query: 151 ARAMYNLGRA--------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGL 202
            RA YN            GLT+WSPN+N+ RDPRWGR  ET GEDPF+  ++ V +++GL
Sbjct: 51  GRAKYNEFTKHDDRDIYKGLTFWSPNVNIFRDPRWGRGHETYGEDPFLTSKFGVAFIKGL 110

Query: 203 QDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
           Q            ++ LK+++  KH+A +     +G+ R+ FDA V+++D+ ET+L  F+
Sbjct: 111 QG----------QAKYLKLAATAKHFAVHS--GPEGL-RHGFDAVVSDKDLYETYLPAFK 157

Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
             V+E D  S+M +YN V+G+P+     LL   +  +W   G++V+D  + + + +NHK+
Sbjct: 158 AAVEEADVESIMTAYNAVDGVPASVSEMLLKDILHDKWSFEGHVVSDYMAPEDVHENHKY 217

Query: 323 LADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
             D+ E  +   +KAGL+L  G    +    A+ +G V E +I  ++  LY   +RLG F
Sbjct: 218 TKDAAE-TMGLAIKAGLNLVAGHIEQSLH-EALDRGLVTEEEITNAVISLYATRVRLGMF 275

Query: 383 DGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
               +Y ++  +   +  +  L+  AA +  VLLKND   LPL    ++ +AVVGP+A++
Sbjct: 276 ATDNEYDAIPYEANDTKAHNNLSEIAAEKSFVLLKND-GVLPLRKETMEAIAVVGPNAHS 334

Query: 443 TVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGC----DDVA---CKSNNSIFAAS 491
            +A++GNY G P R  + + G          V Y  G     D  A    K++     A 
Sbjct: 335 EIALLGNYFGTPSRSYTILEGIQERLGDDVRVHYSIGSGLFQDHAAEPLAKADERESEAV 394

Query: 492 EAAKTADATIILAGLDLSVEAE---------SLDREDLWLPGYQTQLINQVAEVAKGPVI 542
            AA+ +D  + + GLD ++E E         + D+ +L LPG Q QL+ ++  V K PV+
Sbjct: 395 IAAEHSDVVVAVLGLDSTIEGEEGDAGNSQGAGDKPNLSLPGRQRQLLERLLAVGK-PVV 453

Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
           +++ S   + +   E + N++AI+   YPG  GG A+ADV+FG  +P G+LP+T+Y    
Sbjct: 454 VLLASGSSLQLDGLENHPNLRAIMQIWYPGARGGLAVADVLFGAVSPSGKLPVTFYKN-- 511

Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
           V  LP         +     GRTY++     LYPFGYGL+Y+              V L+
Sbjct: 512 VDNLP-------AFEDYNMAGRTYRYMTDEALYPFGYGLTYS-------------SVELS 551

Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
            LQ  ++   T+  + T                      QN G+ D  +VV VY K    
Sbjct: 552 DLQ-VKSYEDTATVTAT---------------------IQNTGNFDTDEVVQVYVKDLGS 589

Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKF 749
             A    Q+ GF+RV++  G  + I F
Sbjct: 590 EFAVPNAQLKGFKRVYLGKGAKQTITF 616


>gi|413925161|gb|AFW65093.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 323

 Score =  338 bits (866), Expect = 9e-90,   Method: Compositional matrix adjust.
 Identities = 161/305 (52%), Positives = 209/305 (68%), Gaps = 14/305 (4%)

Query: 33  PVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
           P F C P              FCD +L  + R  DLVSR+T  EK+ QLGD A GVPRLG
Sbjct: 29  PPFSCGPSSAEA----SEGLAFCDVTLAPAQRAADLVSRLTAAEKIAQLGDQAPGVPRLG 84

Query: 93  LPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR 152
           +P Y+WW+EALHG++  G G HFD  +  ATSFP V+LT A+F++ LW +IGQA+  EAR
Sbjct: 85  VPGYKWWNEALHGLATSGKGLHFDAAVRAATSFPQVLLTAAAFDDDLWLRIGQAIGREAR 144

Query: 153 AMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
           A++N+G+A GLT WSPN+N+ RDPRWGR  ETPGEDP V  RYAV +VRG+Q        
Sbjct: 145 ALFNVGQAEGLTIWSPNVNIFRDPRWGRGQETPGEDPAVASRYAVAFVRGIQG------- 197

Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
            + +S  L+ S+CCKH  AYD+++W GV RY F ARVTEQD+E+TF  PF  CV E  AS
Sbjct: 198 -NSSSSLLQTSACCKHATAYDLEDWNGVARYSFVARVTEQDLEDTFNPPFRSCVVEAKAS 256

Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
            VMC+Y  +NG+P+CA+  LL  TVRG+W L GY+ +DCD++ +M D  ++ A + EDAV
Sbjct: 257 CVMCAYTAINGVPACANSDLLTGTVRGDWGLDGYVASDCDAVAIMRDAQRY-APTPEDAV 315

Query: 332 AQTLK 336
           A +LK
Sbjct: 316 AVSLK 320


>gi|372209036|ref|ZP_09496838.1| glycoside hydrolase [Flavobacteriaceae bacterium S85]
          Length = 859

 Score =  337 bits (865), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 241/751 (32%), Positives = 371/751 (49%), Gaps = 94/751 (12%)

Query: 52  FLFCDSSLPYSI---------RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEA 102
           FLF  SS+  +          RV DL++ MTL+EK+   G     + RLG+P +EW+ EA
Sbjct: 14  FLFSFSSIAQTWKNPNASIEDRVNDLLANMTLEEKISYCGSRIPEIKRLGIPYFEWYGEA 73

Query: 103 LHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGL 162
           LHG+           +    T FP  I   A++N  L   +  A+S EARA+ N G+  +
Sbjct: 74  LHGI-----------ISWNCTQFPQNIAMGATWNPDLMFDVATAISNEARALKNAGKKEV 122

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
             +SP +N+ARDPRWGR  E   EDP ++   A  YVRG+Q   G++       + +K  
Sbjct: 123 MMFSPTVNMARDPRWGRNGECYAEDPHLMSEMARMYVRGMQ---GND------PKYVKTV 173

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           +  KHY A +V+      R    + + ++D+ E +   ++ C+ + +A+ +M + N +NG
Sbjct: 174 TTVKHYVANNVE----TKREWIHSNIGKKDLYEYYFPAYKTCIVDEEATGIMTALNGLNG 229

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           IP  A   L+N  +R EW   GY++AD  ++Q +    K+ + S+  A A  +KAG+D +
Sbjct: 230 IPCSAHDWLVNGVLRNEWGFKGYVIADWAAVQGLEKRMKYAS-SQAQAAAMAIKAGVDQE 288

Query: 343 CGQY------YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQ 394
           C +             +A+QQG + E ++D ++K L  +    G FD      Y ++   
Sbjct: 289 CFRNKVRQAPMVQALPDALQQGLITEKELDVTVKRLLRLRFMTGDFDDPSLNPYSAIPTS 348

Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
            +  D + +LA +AA + IVLLKND   LPL    +K++A++GP A+     +G Y+G P
Sbjct: 349 VLECDAHKQLALKAAEQSIVLLKNDA-VLPLKK-DLKSIAMIGPFADR--CWMGIYSGHP 404

Query: 455 CRYMSPIAGFSGYAN--VTYKTGCDDVACKSNNSIFAASEA-AKTADATIILAGLDLSVE 511
              +SP+ G   Y N  V++  GC+  A + +    A + A AK ++  I++ G D +  
Sbjct: 405 KSKVSPLDGIKAYTNAKVSFAQGCEVTAKEDDEQKIAEAVALAKKSEQVILVVGNDETTS 464

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
            E+ DR+ + LPG Q QLI  V  V K  VILV++ +G   + + +   NI  I+ A   
Sbjct: 465 TENTDRKSIKLPGNQHQLIKAVQAVNKN-VILVLVPSGPTAVTWEQ--KNIPGIVCAWPN 521

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+E G A+A V+FG  NPGG+L  TWY  D         +P      +    RTY ++ G
Sbjct: 522 GQEQGTALAKVLFGDVNPGGKLNATWYQSD-------KDLPNFHDYKMAGGNRTYMYFKG 574

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
             LYPFGYGLSYT F  + +S  K                                 L+ 
Sbjct: 575 KPLYPFGYGLSYTNFTISDVSINK-------------------------------KTLQA 603

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNK--RIKF 749
           ++Y   K    N G+  G +VV VY +       T +K + GFQR+ V AG +K   IK 
Sbjct: 604 NEYVTVKAKVNNTGAVAGDEVVQVYIRDVKSKEKTPLKALKGFQRISVAAGASKWVEIKI 663

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
            + A    N    A   ++  GE  I VGN 
Sbjct: 664 PYEAFSHYNTKKEA--LMVAKGEFEILVGNA 692


>gi|443692971|gb|ELT94448.1| hypothetical protein CAPTEDRAFT_221920 [Capitella teleta]
          Length = 757

 Score =  337 bits (864), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 256/764 (33%), Positives = 365/764 (47%), Gaps = 103/764 (13%)

Query: 47  LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL-----GDFAHGVPRLGLPQYEWWSE 101
           +Q   F F D SL +  R  DLV+R+TL+E   Q      G     + RLG+  Y W +E
Sbjct: 15  VQSYDFPFQDPSLSWDDRADDLVARLTLEEIAPQTQASYGGQHTPAIERLGIKPYVWITE 74

Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA- 160
            L G  N             AT++P  I   ASF+E L   + + +S E RA +N  RA 
Sbjct: 75  CLAGQVNTN-----------ATAYPQPIGMAASFSEELLFNVSRDISYEVRAHWNANRAV 123

Query: 161 -------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
                  GL+ +SP IN+ R P WGR  ET GEDP + G  A ++VRGLQ  +       
Sbjct: 124 GKYSTKVGLSCFSPVINIMRHPLWGRNQETYGEDPLLSGTLAQSFVRGLQGDD------- 176

Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
              R L+ ++ CKH+  +       V R+ FDA+V  +D   TFL  F+MCV  G + S+
Sbjct: 177 --PRYLRANAGCKHFDVHGGPEDIPVSRFSFDAKVNMRDWRMTFLPQFKMCVDAG-SYSL 233

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           MCSYNR+NGIP+CA+ +LL    R EW  HGYIV+D  +I  + + H +  +S    V  
Sbjct: 234 MCSYNRINGIPACANKQLLTDITRDEWGFHGYIVSDSGAISNIKEQHHY-TNSTVATVVA 292

Query: 334 TLKAGLDLDCG----QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
            +KAG +L+ G     YY     +A++QG + E +I  +++ L    +RLG FD      
Sbjct: 293 AIKAGTNLELGGGSNMYYPKQL-DAMKQGLLTEKEIRDNVRPLLYTRLRLGEFDPEAMVD 351

Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
           Y  +G   I S E+ E A +AA  G VLLKN  N LP+     K +A+VGP  NAT  + 
Sbjct: 352 YNKIGVDVIQSPEHREQAVKAAYMGFVLLKNHNNLLPIKKQYSK-LAIVGPFTNATSELF 410

Query: 448 GNYAG-IPCRY-------MSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADA 499
           G Y+  +  ++       +SP+ G +  AN     GC + AC S         A   AD 
Sbjct: 411 GTYSSEVNLKFTSTIFEGLSPLGGSTRSAN-----GCTNSAC-SGYVRDDVETAVAGADL 464

Query: 500 TIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAET 558
            I+  G     E+E  DR  L L G+Q  ++      + G PVILV+++AG +DI +A+ 
Sbjct: 465 VIVALGSGQRFESEGNDRAYLDLHGHQLDILKDAVFFSNGAPVILVLINAGPLDITWAKL 524

Query: 559 NTNIKAILWAGYPGEEGGRAIADVVF---GKFNPGGRLPITWYNGDYVQMLPLTSMPLRP 615
           +  + AIL  GYP +  G A+   +     +  P GRL  TW         PL    +  
Sbjct: 525 DPGVTAILSCGYPAQSTGEALRRSLTMSEPQAAPAGRLQATW---------PLNLDQVPK 575

Query: 616 VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
           +      GRTY++Y G  LYPFG+GLSYT F Y  LS +                     
Sbjct: 576 ITDYTMQGRTYRYYVGEPLYPFGFGLSYTSFSYTRLSIS--------------------- 614

Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
                 P V+       D    +V  +N GS D  +VV VY   P          +  F 
Sbjct: 615 ------PSVITQ----GDNVTVEVCLKNTGSYDSDEVVQVYMSWPQTPFPLPKWTLAAFA 664

Query: 736 RVFVRAGRNKRIKFVFNACK-SLNIVDYAANTLLPAGEHTIFVG 778
           R F+ AG+   +K V  A + ++ + D A    +P G  T++ G
Sbjct: 665 RPFISAGQTICVKSVIRADQMAVWLSDDAGFGFVP-GVMTVYAG 707


>gi|413925165|gb|AFW65097.1| putative O-Glycosyl hydrolase superfamily protein [Zea mays]
          Length = 412

 Score =  335 bits (858), Expect = 7e-89,   Method: Compositional matrix adjust.
 Identities = 171/310 (55%), Positives = 208/310 (67%), Gaps = 14/310 (4%)

Query: 33  PVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLG 92
           P F C  G    LGL      FC++ LP + R  DLVSRMT  EK  QLGD A+GVPRLG
Sbjct: 84  PPFSCGGG--PSLGLP-----FCNTKLPAAQRAADLVSRMTPAEKASQLGDVANGVPRLG 136

Query: 93  LPQYEWWSEALHGVSNVGPGTHFD-DVIPGATSFPTVILTTASFNESLWKKIGQAVSTEA 151
           +P Y+WW+EALHGV+  G G H D   +  ATSFP V+LT ASFN++LW +IGQA   EA
Sbjct: 137 VPSYKWWNEALHGVAISGKGIHMDRGAVRSATSFPQVLLTAASFNDNLWFRIGQATGKEA 196

Query: 152 RAMYNLGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
           RA YN+G+A GLT WSPN+N+ RDPRWGR  ETPGEDP V  RYA  +VRGLQ   G  +
Sbjct: 197 RAFYNIGQAEGLTMWSPNVNIFRDPRWGRGQETPGEDPAVASRYAAAFVRGLQ---GSSS 253

Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
            T      L  S+CCKH  AYD+++WKGV RY F A VT QD+ +TF  PF  CV +G A
Sbjct: 254 NTKSVPPVLLTSACCKHATAYDLEDWKGVTRYSFRATVTVQDLADTFNPPFRSCVVDGKA 313

Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG-YIVADCDSIQVMVDNHKFLADSKED 329
           S VMC+Y  VNG+PSCA+  LL +T RG W L G Y+ ADCD++ +M  N +F   + ED
Sbjct: 314 SCVMCAYTSVNGVPSCANADLLTKTFRGSWGLDGRYVAADCDAVSIM-RNSQFYRPTAED 372

Query: 330 AVAQTLKAGL 339
            VA TLKAG+
Sbjct: 373 TVATTLKAGM 382


>gi|157676888|emb|CAP07659.1| beta-xylosidase [uncultured rumen bacterium]
          Length = 761

 Score =  335 bits (858), Expect = 8e-89,   Method: Compositional matrix adjust.
 Identities = 244/768 (31%), Positives = 365/768 (47%), Gaps = 147/768 (19%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D S P  +R K L+ +++L+EK   +   +  V RLG+  Y WWSEALHGV+  G   
Sbjct: 31  YTDKSQPAELRAKALLPKLSLEEKAGLVQYNSPAVERLGIKAYNWWSEALHGVARNG--- 87

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL----GR----AGLTYW 165
                   AT FP  I   ASF+    + +  AVS EAR    +    GR    AGL++W
Sbjct: 88  -------SATVFPQPIGMAASFDVEKIETVFTAVSDEARVKNRIAAEDGRVYQYAGLSFW 140

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN+ RDPRWGR  ET GEDP+++G+  +  VRGLQ         D ++  LK  +C 
Sbjct: 141 TPNINIFRDPRWGRGMETYGEDPYLMGQLGMAVVRGLQ--------GDPDADVLKTHACA 192

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA   V +    +R+ FDA+V+E+D+ ET+L  F+  V +     VM +YNR  G P 
Sbjct: 193 KHYA---VHSGLESNRHRFDAQVSERDLRETYLPAFKDLVTKAGVKEVMTAYNRFRGYPC 249

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLDC 343
            A   L+ + +R EW   G +V+DC +I    +   H F+A + E+A A  +  GLD++C
Sbjct: 250 AASEYLVQKILREEWGYKGLVVSDCWAIPDFFEPGRHGFVA-TGEEAAALAVANGLDVEC 308

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE 403
           G  ++     A+ QG +KE D+D++L  + T   RLG  DG   +  L    +   E+  
Sbjct: 309 GSTFSKIPA-AIDQGLLKEEDLDRNLLRVLTERFRLGEMDGESPWDDLDPAIVEGPEHRA 367

Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
           L+ + ARE +VLL+N+   LPL +   + +A++GP+A+      GNY  +P   ++ +  
Sbjct: 368 LSLDIARETMVLLRNN-GVLPLKAG--EKIALIGPNADDAQMQWGNYNPVPKSTITLLQA 424

Query: 464 F------------------------SGYANVT-------------YKTGCDDVAC----- 481
                                    S YAN+              Y    +D+       
Sbjct: 425 MQARVPGLVYDRACGILDAEYAPQGSAYANLIGASEAQLEAAARRYAVSVNDIKNYIRRD 484

Query: 482 --KSNNSIFAASEAA-----KTADATIILAGLDLSVEAESL----------DREDLWLPG 524
             +  + + A  EAA     +  D  +   G+   +E E +          DR D+ LPG
Sbjct: 485 EEQRRSFMPALDEAAVLKKLEGVDVVVFAGGISPRLEGEEMRVQVPGFSGGDRTDIELPG 544

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q +L+  + +  K  V+LV  S  G  I       +  AIL A YPG+EGG AIADV+F
Sbjct: 545 VQRRLLKALHDAGK-KVVLVNFS--GCAIGLVPETESCDAILQAWYPGQEGGTAIADVLF 601

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYT 644
           G  NP G+LP+T+Y    V  LP        V+     G TY+++ G  LYPFGYGLSYT
Sbjct: 602 GDVNPSGKLPVTFYKN--VDQLP-------DVEDYNMEGHTYRYFRGEPLYPFGYGLSYT 652

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
            F +                                 P V   +L        ++D  N 
Sbjct: 653 SFAFGE-------------------------------PKVKGKNL--------EIDVTNT 673

Query: 705 GSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFN 752
           GS  G++VV +Y + P + A   +K +  F+RV V AG+  ++    +
Sbjct: 674 GSVAGTEVVQLYVRKPDDTAGP-VKTLRAFRRVSVPAGQTVKVSIPLD 720


>gi|348688508|gb|EGZ28322.1| family 3 glycoside hydrolase [Phytophthora sojae]
          Length = 701

 Score =  331 bits (849), Expect = 8e-88,   Method: Compositional matrix adjust.
 Identities = 241/753 (32%), Positives = 370/753 (49%), Gaps = 133/753 (17%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR-----LGLPQYEWWSEALHGVSN 108
           FC++SLP S RV+DL++R+ LDEK   L   A   PR     +GLP+Y W +  +HGV +
Sbjct: 34  FCNTSLPVSARVEDLLARLPLDEKAILLT--ARASPRGNMSSIGLPEYNWGANCVHGVRS 91

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
              GT+        TSFP  +      N S+ ++                          
Sbjct: 92  TC-GTNC------PTSFPNPV------NLSIHRR-------------------------- 112

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
               RDPRWGR TETP EDP V  +Y V Y +GLQ+ + HE+      R L+     KHY
Sbjct: 113 ----RDPRWGRNTETPSEDPLVNSKYGVAYTKGLQEGK-HED-----PRYLQAVVTLKHY 162

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
            AY  +N+ G +R  F+A V+  D  +T+   F   + +G+A  VMCSYN VNG+P+CA+
Sbjct: 163 VAYSYENYGGGNRKTFNAIVSPYDFADTYFPAFRSSIVDGNAKGVMCSYNSVNGVPACAN 222

Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ--Y 346
            +L N+ +RG     GYI +D  +I+ + D   ++  ++ +A    + AG D++ G+   
Sbjct: 223 NELENKLLRGMLGFDGYITSDSGAIEAISDWLHYVP-TRCEAARLAILAGTDVNSGRGFG 281

Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIEL 404
           Y       V+  ++    +D  L++   +   LG FD      Y  +   D+ +D   +L
Sbjct: 282 YMACLKELVESNQLDVKVVDDVLRHTLKLRFELGLFDPIEDQPYWKVTPNDVNTDAAKKL 341

Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR-------- 456
           + + AR+ IVLL+N+Q  LPL    VK +AVVGPHA A  A++GNY G  C         
Sbjct: 342 SLDLARKSIVLLQNNQPVLPLRRG-VK-LAVVGPHAQAKRALLGNYLGQMCHGDYNEVGC 399

Query: 457 YMSPIAGFS---GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE 513
             +P    S   G ++ TY  GC +V   S      A +A + A+A ++  G+D SVEAE
Sbjct: 400 IKTPFEAVSASNGDSSTTYALGC-NVTGNSTAGFVEAVKAVQGAEAVVLFLGIDKSVEAE 458

Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
             DR ++ LP  Q QL+ +V  V K P ++V+M+ GGV  A  +      A++ A YPG 
Sbjct: 459 VRDRNNIDLPAIQVQLLQRVRAVGK-PTVVVLMN-GGVLTA-EDIIGQTDALVEAFYPGF 515

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
            G +A+ D++FG  NPGG+LP+T Y  DYV  + + SM     +   YPGR+Y+++ G  
Sbjct: 516 FGAQAMTDILFGDANPGGKLPVTMYRSDYVNTVDMKSM-----NVTAYPGRSYRYFKGEP 570

Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
           ++PFG+GLSYT F                            DA+ T              
Sbjct: 571 VFPFGWGLSYTSFSLK-----------------------ADDATATTA------------ 595

Query: 694 YFEFKVDFQNVGSTDGSDVVIVYS-----KPPAEIAATYI-KQVIGFQRVFVRAGRNKRI 747
                   ++V +T  + + +V++     K  A   AT + KQ+  ++RV ++   + R+
Sbjct: 596 --------KSVSATMNTTISVVFAYFRPIKTDASGPATLLNKQLFDYRRVTLKPSESTRL 647

Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
            F      +L +VD   N +   G + I + NG
Sbjct: 648 SFEVQR-STLALVDEEGNLVSFPGSYDIIITNG 679


>gi|449489074|ref|XP_002195511.2| PREDICTED: beta-xylosidase/alpha-L-arabinofuranosidase 2-like
           [Taeniopygia guttata]
          Length = 685

 Score =  330 bits (846), Expect = 2e-87,   Method: Compositional matrix adjust.
 Identities = 241/718 (33%), Positives = 365/718 (50%), Gaps = 99/718 (13%)

Query: 88  VPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQA 146
           +PRLG+  Y W +E L G          D   PG AT+FP  +   A+F+  L  ++  A
Sbjct: 9   IPRLGIAPYNWNTECLRG----------DGEAPGWATAFPQALGLAAAFSPELIYRVANA 58

Query: 147 VSTEARAMYN----LGR----AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNY 198
            +TE RA +N     GR     GL+ +SP +N+ R P WGR  ET GEDPF+ G  A ++
Sbjct: 59  TATEVRAKHNSFAAAGRYSDHTGLSCFSPVLNIMRHPLWGRNQETYGEDPFLSGELARSF 118

Query: 199 VRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFL 258
           V+GLQ           + R +K S+ CKH++ +     + +  Y     V E+D   TFL
Sbjct: 119 VQGLQGP---------HPRYVKASAGCKHFSVHG--GHENILLYLLT--VLERDWRMTFL 165

Query: 259 RPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD 318
             F+ CV+ G + S MCSYNR+NG+P+CA+ KLL   +RGEW   GY+V+D  ++++++ 
Sbjct: 166 PQFQACVRAG-SYSFMCSYNRINGVPACANKKLLTDILRGEWGFDGYVVSDEGAVELIML 224

Query: 319 NHKFLADSKEDAVAQTLKAG--LDLDCGQYYTNFT--GNAVQQGKVKETDIDKSLKYLYT 374
            H +     E AVA ++ AG  L+L  G     F     A+  G +    +   ++ L+ 
Sbjct: 225 GHHYTRSFLETAVA-SVNAGCNLELSYGMRNNVFMRIPEALAMGNITLQMLRDRVRPLFY 283

Query: 375 VLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKT 432
             MRLG FD      Y SL    + S E+  L+ EAA +  VLLKN + TLPL +  + +
Sbjct: 284 TRMRLGEFDPPAMNPYSSLDLSVVQSPEHRNLSLEAAVKSFVLLKNVRGTLPLKAQDLSS 343

Query: 433 --VAVVGPHANATVAMIGNYAGIP-CRYM-SPIAGFSGY-ANVTYKTGCDDVACKSNNSI 487
             +AVVGP A+    + G+YA +P  RY+ +P  G     ANV++  GC +  C+     
Sbjct: 344 QHLAVVGPFADNPRVLFGDYAPVPEPRYIYTPRRGLEMLGANVSFAAGCSEPRCQR---- 399

Query: 488 FAASEAAK---TADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG-PVIL 543
           ++ +E  K    AD  ++  G  + VE E+ DR DL LPG+Q +L+    + A G PVIL
Sbjct: 400 YSRAELVKVVGAADVVLVCLGTGVDVETEAKDRSDLSLPGHQLELLQDAVQAAAGRPVIL 459

Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK--FNPGGRLPITWYNGD 601
           ++ +AG +D+++A+ +  + AIL   +P +  G AIA V+ G+   +P GRLP TW  G 
Sbjct: 460 LLFNAGPLDVSWAQAHDGVGAILACFFPAQATGLAIARVLLGEAGASPAGRLPATWPAG- 518

Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT-LYPFGYGLSYTQFKYNLLSFTKTIQVN 660
            +  +P       P+++    GRTY++Y     LYPFGYGLSYT F+Y  L  +  +   
Sbjct: 519 -MHQVP-------PMENYTMEGRTYRYYGQEAPLYPFGYGLSYTTFRYRDLVLSPPV--- 567

Query: 661 LNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP 720
              L  C NL+ +                         V  +N G  D  +VV +Y +  
Sbjct: 568 ---LPLCANLSVS-------------------------VVLENTGLRDSEEVVQLYLRWE 599

Query: 721 AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                    Q++ F+RV V AGR  ++ F   A +      +A +  L  G  T+F G
Sbjct: 600 HSSVPVPRWQLVAFRRVAVPAGREAKLSFQVLAEQR---AVWAQHWHLEPGTFTLFAG 654


>gi|85813774|emb|CAJ65923.1| xylan 1,4-beta-xylosidase [Populus tremula x Populus alba]
          Length = 704

 Score =  329 bits (844), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 184/483 (38%), Positives = 287/483 (59%), Gaps = 28/483 (5%)

Query: 309 DCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKS 368
           DCD++ V+    K+ A + EDAVA  LK+G+      Y  N+T +AV++ KV  ++ID++
Sbjct: 229 DCDAVNVLHVEQKY-AKTPEDAVADALKSGIS-----YLRNYTKSAVEKKKVTVSEIDRA 282

Query: 369 LKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
           L  L++  MRLG F+G P    Y  +G   +CS E+  LA EAA +GIVLLKN    LPL
Sbjct: 283 LHNLFSTRMRLGLFNGDPTKQLYSDIGPDQVCSQEHQALALEAALDGIVLLKNADRLLPL 342

Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY-ANVTYKTGCDDVACKSN 484
           + + + ++AV+GP+A+ +  ++GNY G  C+ ++ + G   Y ++ +Y+ GC++V+C S 
Sbjct: 343 SKSGISSLAVIGPNAHNSTNLLGNYFGPACKNVTILEGLRNYVSSASYEKGCNNVSCTSA 402

Query: 485 NSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
            +     E A+T D  I++ GLD S E E LDR DL LPG Q  LI  VA+ AK P++LV
Sbjct: 403 -AKKKPVEMAQTEDQVILVMGLDQSQEKERLDRMDLVLPGKQPTLITAVAKAAKRPIVLV 461

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP---GGRLPITWYNGD 601
           ++    +D+ FA+ N  I +ILWAGYPG+ G  A+A ++FG+ NP   GGRLP+TWY  D
Sbjct: 462 LLGGSPMDVTFAKNNRKIGSILWAGYPGQAGATALAQIIFGEHNPGNAGGRLPMTWYPQD 521

Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
           + + +P+T M +RP  S G PGRTY+FY G  ++ FGYGLSY+ + Y   S      V  
Sbjct: 522 FTK-VPMTDMRMRPQPSTGNPGRTYRFYEGEKVFEFGYGLSYSDYSYTFAS------VAQ 574

Query: 662 NKLQHCRNLNYTSDASKTRCPGV-LVNDL---RCDDY-FEFKVDFQNVGSTDGSDVVIVY 716
           N+L    + N   + S+T  PG  LV+D+   +C++  F+  V  +N G   G   V+++
Sbjct: 575 NQLNVKDSSNQQPENSET--PGYKLVSDIGEEQCENIKFKVTVSVKNEGQMAGKHPVLLF 632

Query: 717 SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
           ++         IK+++GFQ V + AG    I++  + C+ L+  +     ++  G   + 
Sbjct: 633 ARHAKPGKGRPIKKLVGFQTVKLGAGEKTEIEYELSPCEHLSSANEDGVMVMEEGSQILL 692

Query: 777 VGN 779
           VG+
Sbjct: 693 VGD 695



 Score =  215 bits (548), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 111/232 (47%), Positives = 147/232 (63%), Gaps = 22/232 (9%)

Query: 17  LLVFSTNAVDANG-----SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSR 71
           +L+F+      NG     +S P + CD    S       ++ FC ++LP S R +DLVSR
Sbjct: 7   VLLFARQTKQGNGRPRKQASQPPYSCDSSDPS-----TKTYDFCKTTLPISRRAEDLVSR 61

Query: 72  MTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV---SNVGPG-THFDDVIPGATSFPT 127
           +T +EK  QL D +  +PRLG+P YEWWSE LHG+   + V  G + F+  I  ATSFP 
Sbjct: 62  LTFEEKATQLVDTSPAIPRLGIPAYEWWSEGLHGIGFLTRVQQGISFFNRTIQHATSFPQ 121

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGR-AGLTYWSPNINVARDPRWGRITETPGE 186
           VILT ASF+  +W +IGQ V  EARA+YN G+  GL +W+PN+N+ RDPRWGR  ETPGE
Sbjct: 122 VILTAASFDAHIWYRIGQ-VGKEARALYNAGQVTGLGFWAPNVNIFRDPRWGRGQETPGE 180

Query: 187 DPFVVGRYAVNYVRGLQ--DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNW 236
           DP VVG+Y  ++VRG+Q    EG     D     L+ S+CCKHY A+D+DNW
Sbjct: 181 DPLVVGKYGASFVRGVQGDSFEGESTLGDH----LQASACCKHYTAHDLDNW 228


>gi|405955586|gb|EKC22647.1| Putative beta-D-xylosidase 2 [Crassostrea gigas]
          Length = 745

 Score =  329 bits (843), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 233/726 (32%), Positives = 359/726 (49%), Gaps = 96/726 (13%)

Query: 47  LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG--------VPRLGLPQYEW 98
           L +  + F ++SLP+  RVKDLV R+T++E V Q+     G        VPRLG+  + W
Sbjct: 21  LHVQDYPFRNTSLPWDARVKDLVDRLTIEEIVVQMSRGGSGPRASPAPAVPRLGVGPFSW 80

Query: 99  WSEALHGVSNVGPGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
            +E L G           DV  G ATSFP  +   A+F+  +   +  A S E RA +N 
Sbjct: 81  NTECLRG-----------DVYAGNATSFPQALGLAATFSTEVICDVASATSIEVRAKFND 129

Query: 158 --------GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
                      G++ +SP IN+ R P WGR  ET GEDPF+ G  A  +V+ LQ   G +
Sbjct: 130 YQRRKIYGDHKGISCFSPVINIMRHPLWGRNQETYGEDPFLSGELAAIFVKCLQ---GDD 186

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
                    ++ ++ CKH+  +       V R+ FDA+V+E+D   TFL  F+ CV+ G 
Sbjct: 187 PTY------IRANAGCKHFDVHGGPENIPVSRFSFDAKVSERDWRLTFLPAFKRCVQAG- 239

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           + S+MCS+NR+NG+P+C + +LL   +R EW   GY+V+D ++I+ ++  H +  +S  D
Sbjct: 240 SYSLMCSFNRINGVPACGNKRLLTDILRTEWGFTGYVVSDQEAIENIMTYHHYTNNSV-D 298

Query: 330 AVAQTLKAGLDLDCGQYYTN----FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
             A  +KAG +L+           +  +A++ GK+ + D+ KS+  L+   MRLG FD  
Sbjct: 299 TAALCVKAGCNLELSTNEVKPTYFYIIDALKAGKLDKEDLVKSVSPLFYTRMRLGEFDPP 358

Query: 386 PQ--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
               Y  +    I S+E+  ++  AA +  VLLKN    LP+ +    T++V+GP A+  
Sbjct: 359 DHNPYNFIDLSVIQSEEHRAISLNAAMKSFVLLKNKGGFLPI-TKLFDTISVLGPMADNK 417

Query: 444 VAMIGNYAG--IPCRYMSPIAGFSGYAN-VTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
              IG+YA   +P    +P+ G S  +  V Y  GC+D AC   N       A  ++D  
Sbjct: 418 YQQIGSYAPDVMPSYTTTPLQGLSKLSKRVQYAAGCNDNACSKYNRT-EIQRAVNSSDIF 476

Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLI-NQVAEVAKG-PVILVIMSAGGVDIAFAET 558
            +  G    +E E  DR  + LPG Q QL+ + +   AKG P++L++ + G V+I +A+ 
Sbjct: 477 FVCLGTGPMIENEDHDRASMELPGQQAQLLKDAIMFSAKGVPIVLLLFNGGPVNITWADR 536

Query: 559 NTNIKAILWAGYPGEEGGRAIADVVF---GKFNPGGRLPITWYNGDYVQMLPLTSMPLRP 615
           +  + AI+   +P +E G A+  VV       NP GRLP TW    Y   +P        
Sbjct: 537 SDRVVAIMECFFPAQETGEAVLRVVTNTGNSSNPAGRLPYTW--PKYQDQIP-------S 587

Query: 616 VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
           + +    GRTY++++G  LYPFGYGLSY+ F +        I    +             
Sbjct: 588 MVNYSMEGRTYRYFHGDPLYPFGYGLSYSTFNFTNAWMNPIISQGQD------------- 634

Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
                                 +V+  N G TDG +V+ VY K         I Q++GF+
Sbjct: 635 -------------------LTVRVEVCNEGPTDGDEVIQVYLKWLDTNETMPIHQLVGFE 675

Query: 736 RVFVRA 741
           RV +RA
Sbjct: 676 RVSLRA 681


>gi|285016879|ref|YP_003374590.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
 gi|283472097|emb|CBA14604.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
           PC73]
          Length = 914

 Score =  328 bits (842), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 192/455 (42%), Positives = 265/455 (58%), Gaps = 33/455 (7%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + DS   ++ R  DLV+RMTL+EKV Q+ + A  +PRLG+P Y+WW+E LHGV+  G   
Sbjct: 34  YLDSQRTFAQRADDLVARMTLEEKVAQMQNAAPAIPRLGVPAYDWWNEGLHGVARAG--- 90

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-------LGR-AGLTYW 165
                  GAT FP  I   A+F+  L  ++  A+S EARA ++        GR  GLT+W
Sbjct: 91  -------GATVFPQAIGLAATFDLPLMHEVSTAISDEARAKHHEALRRGEHGRYQGLTFW 143

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD--VEGHENATDLNSRPLKVSS 223
           SPNIN+ RDPRWGR  ET GEDPF+  R  V +V+G+Q    +  +NA     R  K+ +
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGMQGEGADAPKNAQGETYR--KLDA 201

Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
             KH+A   V +    +R+HFDAR +++D+ ET+L  FE  VKEG   +VM +YNR+ G 
Sbjct: 202 TAKHFA---VHSGPESERHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRLFGE 258

Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
            + A   LL   +R  W  HGY+V+DC +I  +  NHK +A ++E A A  +K G  L+C
Sbjct: 259 SASASKFLLRDVLRERWGFHGYVVSDCWAIVDIWKNHKIVA-TREQAAALAVKNGTQLEC 317

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDEN 401
           GQ Y      AVQQG + ETDID +L+ L T  MRLG FD  G  ++  L      S E+
Sbjct: 318 GQEYATLPA-AVQQGLIGETDIDAALRTLMTARMRLGMFDPPGQLRWAQLPISVNQSPEH 376

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
             LA   ARE +VLLKND   LPL+ AK K +AV+GP A+ T+A++GNY G P   ++ +
Sbjct: 377 DALARRTARESLVLLKND-GLLPLSRAKHKRIAVIGPTADDTMALLGNYYGTPATPVTIL 435

Query: 462 AGFSGY---ANVTYKTGCDDVACKSNNSIFAASEA 493
            G       A+V Y  G D V  +S+ +     EA
Sbjct: 436 QGIRAAAPDADVLYARGADLVEGRSDPAATPLIEA 470



 Score =  143 bits (361), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 99/313 (31%), Positives = 146/313 (46%), Gaps = 54/313 (17%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A + A+ AD  + + GL   VE E +          DR DL LP  Q +L+  ++   K 
Sbjct: 628 ALDTARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLQALSATGK- 686

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV+ V+ +   + I +A+ +  + AIL A YPG+ GG A+ADV+FG  NPGGRLP+T+Y 
Sbjct: 687 PVVAVLTTGSALAIDWAQEH--VPAILLAWYPGQRGGSAVADVLFGDTNPGGRLPVTFYK 744

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
                     S  L   D     GRTY+++ G  LYPFG+GLSYTQF Y+ L        
Sbjct: 745 A---------SETLPAFDDYAMRGRTYRYFAGTPLYPFGHGLSYTQFAYSDLRL------ 789

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                          D  K    G L   L+            N G+  G +VV +Y  P
Sbjct: 790 ---------------DRRKVAADGQLSATLKV----------TNTGTRAGDEVVQLYLHP 824

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA-ANTLLPAGEHTIFVG 778
            A   A  IK++ GFQR+ +  G ++ + F  +    L I D A  + ++  G++ + VG
Sbjct: 825 LAPTRARAIKELRGFQRIALAPGESRDVHFTISPQTDLRIYDEAQKHYVVDPGDYELQVG 884

Query: 779 NGGVSFPIHLNFN 791
                  +   F+
Sbjct: 885 ASSADVRVRERFS 897


>gi|194700280|gb|ACF84224.1| unknown [Zea mays]
          Length = 452

 Score =  328 bits (841), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 179/451 (39%), Positives = 262/451 (58%), Gaps = 20/451 (4%)

Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQD 395
           +D++CG Y  +   +A+QQGK+ E DI+++L  L+ V MRLG F+G P+   Y  +G   
Sbjct: 1   MDVNCGSYVQDHGASALQQGKITEQDINRALHNLFAVRMRLGLFNGDPRRNLYGDIGPDQ 60

Query: 396 ICSDENIELAAEAAREGIVLLKND--QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
           +C+ E+ +LA EAA++GIVLLKND     LPL+   V ++AV+G +AN  + + GNY G 
Sbjct: 61  VCTQEHQDLALEAAQDGIVLLKNDGGAGALPLSKPNVASLAVIGFNANDAIRLRGNYFGP 120

Query: 454 PCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
           PC  ++P+    GY  + ++  GC+  AC    +I  A +AA +AD+ ++  GLD   E 
Sbjct: 121 PCVTVTPLQVLQGYVKDTSFVAGCNSAACNVT-TIPEAVQAASSADSVVLFMGLDQDQER 179

Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
           E +DR DL LPG Q  LI  VA  AK PVILV++  G VD++FA+TN  I AILWAGYPG
Sbjct: 180 EEVDRLDLTLPGQQQTLIESVANAAKKPVILVLLCGGPVDVSFAKTNPKIGAILWAGYPG 239

Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
           E GG AIA V+FG+ NPGGRLP+TWY  D+ + +P+T M +R   + GYPGRTY+FY GP
Sbjct: 240 EAGGIAIAQVLFGEHNPGGRLPVTWYPQDFTR-VPMTDMRMRADPATGYPGRTYRFYRGP 298

Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQ--VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
           T++ FGYGLSY+++ +   +          L  ++       + D          +    
Sbjct: 299 TVFNFGYGLSYSKYSHRFATKPPPTSNVAGLKAVEATAGGMASYDVEA-------IGSET 351

Query: 691 CDDY-FEFKVDFQNVGSTDGSDVVIVYSKPP--AEIAATYIKQVIGFQRVFVRAGRNKRI 747
           CD   F   V  QN G  DG   V+V+ + P   + +     Q+IGFQ + +RA +   +
Sbjct: 352 CDRLKFPAVVRVQNHGPMDGKHSVLVFMRWPNATDGSGRPASQLIGFQSLHLRATQTAHV 411

Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           +F  + CK  +        ++  G H + VG
Sbjct: 412 EFEVSPCKHFSRATEDGRKVIDQGSHFVMVG 442


>gi|323451996|gb|EGB07871.1| hypothetical protein AURANDRAFT_71699 [Aureococcus anophagefferens]
          Length = 1202

 Score =  327 bits (839), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 234/665 (35%), Positives = 340/665 (51%), Gaps = 88/665 (13%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV-SN 108
           +++ +CD +LP   RV DL +R T++E + Q+G  A  VPRLGLP   +  EALHGV S 
Sbjct: 339 AAYPYCDRALPIRARVADLAARFTVNETISQMGTMAAAVPRLGLPALNYGGEALHGVWST 398

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL----------- 157
              G          T FP      ASF+  LW+ +G A   EARA++             
Sbjct: 399 CAAGRC-------PTQFPAPHAMGASFDRDLWRAVGAASGLEARALFRWNQRHNASDCAR 451

Query: 158 ---GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
              G  GLT+++PN+N+ARDPRWGRI E P EDP + G Y   +VRG Q    +  A   
Sbjct: 452 SLEGCLGLTFYAPNVNLARDPRWGRIEEVPSEDPLLNGVYGAEFVRGFQGDGAYRVA--- 508

Query: 215 NSRPLKVSSCCKHYAAYDVD---------NWKGV-------DRYHFDARVTEQDMEETFL 258
                  ++  KH+A Y+++         +W G        DR+ FDARV+ +D EET++
Sbjct: 509 -------NAVVKHFAVYNLEVDVEDTPPADWCGSAACAPPNDRHSFDARVSPRDFEETYV 561

Query: 259 RPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD 318
            PF        A++ MCSYN VNG P+C D  LL   +RG  +  G +  DC +++  V 
Sbjct: 562 GPFVA-PVAAGAAAAMCSYNAVNGEPACTDGALLRGALRGALNFTGVLATDCGALEDAVA 620

Query: 319 NHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMR 378
            HK  A ++ +A A  + AG+D +CG+  T+    A+  G V+   +   L+ L    +R
Sbjct: 621 RHKRYA-TEAEAAAAAIAAGVDSNCGKVLTSALPEALAAGLVRPDALRPPLERLLEARLR 679

Query: 379 LGFFDGSPQYVSLGKQD---ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAV 435
           LG  D       + + D   + S  +  LA  AAREG+VLL+N    LPL+     T+AV
Sbjct: 680 LGLLDDWDADAPVPRPDVDAVDSPAHRALALRAAREGLVLLQNPNQILPLDGR--GTLAV 737

Query: 436 VGPHANATVAMIGNYAGIPCRYM--SPIAGFSGY---ANVTYKTGCDDVACKSNNSIFAA 490
           +GP+ANA++ ++  Y G P   +  SP+           V Y  GC + +  +  ++  A
Sbjct: 738 IGPNANASMNLLSGYHGTPPPDLLRSPLQELEARWRGGKVVYAVGC-NASGAATAALDEA 796

Query: 491 SEAAKTADATIILAGL------------DLSV----EAESLDREDLWLPGYQTQLINQVA 534
            + AKTAD  ++  GL            D +     EAES+DR  L LPG Q  L +++ 
Sbjct: 797 VDLAKTADVVVLGLGLCGDNYGGGPPKEDATCFSIDEAESVDRTSLKLPGAQEALFSKIW 856

Query: 535 EVAKGPVILV-IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRL 593
            + K   + V ++SAG VD +FA+   +  A+L AGY GE GG A+AD + G +NPGG L
Sbjct: 857 ALGKPVAVAVFLVSAGAVDASFAK---DKAALLLAGYGGEFGGVAVADALLGAYNPGGAL 913

Query: 594 PITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP---FGYGLSYTQFKYNL 650
             T        + P   M +RP  S   PGRTY+F +   + P   FG+GLSYT F  +L
Sbjct: 914 TATMLPD--AGLPPFRDMAMRP--SAASPGRTYRFLDERRVAPLWRFGFGLSYTAFAVSL 969

Query: 651 LSFTK 655
              T+
Sbjct: 970 AGPTR 974


>gi|389636381|ref|XP_003715843.1| beta-xylosidase [Magnaporthe oryzae 70-15]
 gi|351648176|gb|EHA56036.1| beta-xylosidase [Magnaporthe oryzae 70-15]
 gi|440480767|gb|ELQ61414.1| beta-xylosidase [Magnaporthe oryzae P131]
          Length = 517

 Score =  324 bits (830), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 192/496 (38%), Positives = 276/496 (55%), Gaps = 24/496 (4%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +S+   CD +L    R   LV  ++++EK+Q L   + G PR+GLP Y WWSEALHGV+ 
Sbjct: 35  LSTNNVCDRTLSPPERAAALVEALSIEEKLQNLVSKSQGAPRIGLPAYNWWSEALHGVA- 93

Query: 109 VGPGTHFDD---VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
             PGT+F         +TS+P  +L  A F+++L +KIG A+  EARA  N G AG  YW
Sbjct: 94  YAPGTYFPQGNVEFNSSTSYPMPLLMAAGFDDNLIEKIGTAIGIEARAWGNSGWAGFDYW 153

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PN+N  +DPRWGR +ETPGED   + RYA    RGL     +E          ++ S C
Sbjct: 154 TPNVNAFKDPRWGRGSETPGEDVLRIKRYAEYITRGLDGPVPNEQR--------RIISTC 205

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA  D ++W G  R+ F+A++T QD+ E +L+PF+ C ++    S+MC+YN VNG+PS
Sbjct: 206 KHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQCARDSKVGSIMCAYNAVNGVPS 265

Query: 286 CADPKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           CA+  LL   +R  W   + + Y+ +DC+++  +  NH + A +     A   +AG+D  
Sbjct: 266 CANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANHHY-APTNAAGTAICFEAGMDTS 324

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDEN 401
           C    ++    A  QG +KE  +D++L  LY  L+R G+FDG    Y  L  Q + S E 
Sbjct: 325 CEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGEEAMYADLDWQHVNSAEA 384

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP- 460
             LA +AA EG+VLLKN+  TLPL+      +A++G  A+A   + G Y+G      SP 
Sbjct: 385 QSLALQAAVEGMVLLKNN-GTLPLDLDPSHKIAMIGFWADAPEKLQGGYSGRAHHLYSPA 443

Query: 461 IAGFSGYANVTYKTG---CDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR 517
            A      ++T  +G    D+ A  S+N    A EAA  AD  +   GLD S   E+LDR
Sbjct: 444 FAARQLGLDITVASGPVLQDNNA--SDNWTTNALEAASGADYILYFGGLDTSAAGETLDR 501

Query: 518 EDLWLPGYQTQLINQV 533
            DL  P  Q  L+  V
Sbjct: 502 TDLDWPEAQLTLVKVV 517


>gi|365118446|ref|ZP_09337032.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363649697|gb|EHL88801.1| hypothetical protein HMPREF1033_00378 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 1283

 Score =  323 bits (828), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 238/748 (31%), Positives = 373/748 (49%), Gaps = 108/748 (14%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF--AHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + + ++P   R+ DL+ R+TL+EKV QL D   + G+ RL +P     +E LHG S    
Sbjct: 72  YLNPNIPIEERIDDLLPRLTLEEKVIQLSDSWGSKGIARLKIPAM-LKTEGLHGQS---- 126

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
                    G+T FP  I   ++F+  L +++G+A + EA+A  NL       WSP ++V
Sbjct: 127 ------YATGSTIFPHGINMGSTFDTELIQEVGKATAIEAKAA-NL----RVSWSPVLDV 175

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
           ARD RWGR+ ET GEDP++VGR  V +++G Q                 + +C KH+A +
Sbjct: 176 ARDARWGRVEETYGEDPYLVGRIGVAWIKGFQGEH--------------MFACPKHFAGH 221

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
                 G D +  D  ++++ M    L PF   +KE +A  VM +Y   NG+P     +L
Sbjct: 222 G-QPVGGRDSH--DYGLSDRVMRNIHLAPFRDVIKEANAFGVMAAYGLWNGVPDNGSKEL 278

Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
           L + +R EW   G++V+DC   +  +   + +  + E+A A  ++AG+D++CG  Y    
Sbjct: 279 LQKILREEWGFEGFVVSDCSGPE-NIQRKQSVVGTMEEAAAMAVRAGVDIECGSAYKKAL 337

Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC---SDENIELAAEA 408
            +AV++G +KE+++D +L+ ++   MRLG FD  P   ++    +    + E+  LA + 
Sbjct: 338 ASAVKKGIIKESELDANLRRVFRAKMRLGLFD-RPSIENMVWNKLPEYDTPEHRALARKV 396

Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYMSPIAGFSG 466
           A +  VLLKN+ N LPL+   +KT+AV+GP  NA     G+Y+    P + +S + G   
Sbjct: 397 AVKSTVLLKNENNLLPLDK-NIKTIAVIGP--NADQGQTGDYSAKYAPGQIISVLEGVKN 453

Query: 467 YAN----VTYKTGCDDVACKSNNSIFA-ASEAAKTADATIILAGLD---------LSVEA 512
           + +    V Y  GC  +   +    FA A   AK ADA I++ G +          S   
Sbjct: 454 HVSPSTKVLYAQGCTQLDMDTTG--FAEAVNIAKQADAVILVVGDNSNRHENGNKKSTTG 511

Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
           E++D   L +PG Q QLI  V    K PV+LV+++  G        + NI++IL   YPG
Sbjct: 512 ENVDGATLEIPGVQRQLIKAVEATGK-PVVLVLVN--GKPFTLTWEDENIESILETWYPG 568

Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
           EEGG A AD++FG  NP GRLPI++    +   LPL         +    GR Y +Y+ P
Sbjct: 569 EEGGNATADIIFGDENPSGRLPISFPR--HPGQLPLWY-------NYETSGRNYDYYDMP 619

Query: 633 --TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
              LY FG+GLSYT F+Y+ L  T                      +K+  PG       
Sbjct: 620 FTPLYRFGHGLSYTTFRYSNLKAT----------------------TKSGDPG------- 650

Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
              +    VD +N G   G +V  +Y         T +  + GF+RVF++ G  K + F 
Sbjct: 651 ---FVTVSVDIENTGKRPGEEVAQLYITDLVASVNTAVIDLKGFKRVFLKPGEKKTVTFE 707

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVG 778
            N    L++++     +L AG+  + VG
Sbjct: 708 LNPY-LLSLLNPDMKRVLEAGKFRMHVG 734


>gi|440476402|gb|ELQ45004.1| beta-xylosidase, partial [Magnaporthe oryzae Y34]
          Length = 515

 Score =  323 bits (828), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 191/493 (38%), Positives = 275/493 (55%), Gaps = 24/493 (4%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +S+   CD +L    R   LV  ++++EK+Q L   + G PR+GLP Y WWSEALHGV+ 
Sbjct: 35  LSTNNVCDRTLSPPERAAALVEALSIEEKLQNLVSKSQGAPRIGLPAYNWWSEALHGVA- 93

Query: 109 VGPGTHFDD---VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
             PGT+F         +TS+P  +L  A F+++L +KIG A+  EARA  N G AG  YW
Sbjct: 94  YAPGTYFPQGNVEFNSSTSYPMPLLMAAGFDDNLIEKIGTAIGIEARAWGNSGWAGFDYW 153

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PN+N  +DPRWGR +ETPGED   + RYA    RGL     +E          ++ S C
Sbjct: 154 TPNVNAFKDPRWGRGSETPGEDVLRIKRYAEYITRGLDGPVPNEQR--------RIISTC 205

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA  D ++W G  R+ F+A++T QD+ E +L+PF+ C ++    S+MC+YN VNG+PS
Sbjct: 206 KHYAGNDFEDWNGTTRHDFNAKITMQDLAEYYLKPFQQCARDSKVGSIMCAYNAVNGVPS 265

Query: 286 CADPKLLNQTVRGEW---DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           CA+  LL   +R  W   + + Y+ +DC+++  +  NH + A +     A   +AG+D  
Sbjct: 266 CANKYLLQTILRDHWKWTEHNNYVTSDCEAVLDVSANHHY-APTNAAGTAICFEAGMDTS 324

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDEN 401
           C    ++    A  QG +KE  +D++L  LY  L+R G+FDG    Y  L  Q + S E 
Sbjct: 325 CEYTGSSDIPGAWSQGLLKEETVDRALLRLYEGLVRAGYFDGEEAMYADLDWQHVNSAEA 384

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP- 460
             LA +AA EG+VLLKN+  TLPL+      +A++G  A+A   + G Y+G      SP 
Sbjct: 385 QSLALQAAVEGMVLLKNN-GTLPLDLDPSHKIAMIGFWADAPEKLQGGYSGRAHHLYSPA 443

Query: 461 IAGFSGYANVTYKTG---CDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR 517
            A      ++T  +G    D+ A  S+N    A EAA  AD  +   GLD S   E+LDR
Sbjct: 444 FAARQLGLDITVASGPVLQDNNA--SDNWTTNALEAASGADYILYFGGLDTSAAGETLDR 501

Query: 518 EDLWLPGYQTQLI 530
            DL  P  Q  L+
Sbjct: 502 TDLDWPEAQLTLV 514


>gi|361127339|gb|EHK99311.1| putative exo-1,4-beta-xylosidase bxlB [Glarea lozoyensis 74030]
          Length = 569

 Score =  323 bits (827), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 193/551 (35%), Positives = 284/551 (51%), Gaps = 57/551 (10%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           + S   CD++ P + R   LV  M   EK+Q +   + GV RLGLP Y WWSEALHGV+ 
Sbjct: 59  LKSNKVCDTTAPPADRAAALVKAMQSSEKLQNIISKSAGVSRLGLPPYNWWSEALHGVAG 118

Query: 109 VGPGTHFDDVIPG--ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS 166
             PG  F    P   ATS P  IL  A+F++ L +K+G  + TEARA  N   +G+ +W+
Sbjct: 119 A-PGIQFSSSSPWNYATSLPMPILMAAAFDDDLIEKVGTLIGTEARAFGNGNHSGIDFWT 177

Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCK 226
           PNIN  +DPRWGR +ETPGED   +  Y    +RGL+           N    ++ + CK
Sbjct: 178 PNINPFKDPRWGRGSETPGEDTLRLKGYVAALLRGLEG----------NKAQRRIIATCK 227

Query: 227 HYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
           HYAA D+++W GV R+ FDA+++ QD+ E +L+PF+ C ++    S MCSYN VNG+P+C
Sbjct: 228 HYAANDLESWNGVTRHDFDAKISMQDLAEYYLQPFQQCARDSKVGSFMCSYNSVNGVPAC 287

Query: 287 ADPKLLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           A+  LL   +R  W+    + Y+ +DC+++Q +  NH + A +     A    AG D  C
Sbjct: 288 ANKYLLQTILRDHWNWTSENQYVTSDCEAVQDISLNHHY-ASTNAAGTALAFNAGTDSSC 346

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENI 402
                                               G+FDGS   Y SLG  D+ + +  
Sbjct: 347 ----------------------------------EAGYFDGSKALYSSLGWSDVNTPQAQ 372

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI- 461
           +LA +A  +GIV+LKND  TLPL       VA++G  A+ +  + G Y+G      +P+ 
Sbjct: 373 QLALQATVDGIVMLKND-GTLPLKLDSKSKVAMIGFWASDSSKLQGGYSGKAPYLRTPVY 431

Query: 462 -AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
            A   G+            A  ++N    A  AA  +D  +   GLD S  AE +DR  L
Sbjct: 432 AAQQLGFTPNVATGPVQQSASATDNWTTNALAAASKSDYILYFGGLDTSAAAEGVDRTSL 491

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
             P  Q  LI +++ + K P+I +I     +D     TN  + +ILWA +PG++GG A+ 
Sbjct: 492 EWPSAQLALIKKLSALGK-PLI-IIQEGDQMDNTPLLTNKGVSSILWASWPGQDGGPAVM 549

Query: 581 DVVFGKFNPGG 591
            ++ G  +P G
Sbjct: 550 QIISGAKSPAG 560


>gi|308208211|gb|ADO20356.1| putative beta-D-xylosidase/alpha-L-arabinosidase [uncultured rumen
           bacterium]
          Length = 780

 Score =  321 bits (822), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 241/773 (31%), Positives = 351/773 (45%), Gaps = 149/773 (19%)

Query: 47  LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV 106
           L +S+  + D SLP   R KDLVSR+TL+EK       +  V  LG+  Y WWSEALHGV
Sbjct: 39  LSLSAQPYKDRSLPPEERAKDLVSRLTLEEKASLSMHPSAPVEALGIKAYNWWSEALHGV 98

Query: 107 SNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------ 160
           +  G           AT FP  I   ASF+E L  ++  AVS EAR  Y + +       
Sbjct: 99  ARNG----------AATVFPQPIGMAASFDEPLLYEVFTAVSDEARVKYKIAKESGHIGQ 148

Query: 161 --GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
             G+T+W+PNIN+ RDPRWGR  ET GEDP++ G+  +  VRGLQ           +S  
Sbjct: 149 YQGVTFWTPNINIFRDPRWGRGMETYGEDPYLTGQMGMAVVRGLQGPS--------DSPV 200

Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
           LK  +C KHYA +    W   +R+ +DA V+E+D+ ET+L  F+  V + +   VM +YN
Sbjct: 201 LKAHACAKHYAVHSGPEW---NRHSYDAEVSERDLRETYLPAFKDLVTKANVQEVMTAYN 257

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLK 336
           R  G P  A   L+N  +RGEW   G I +DC +++   +   H +  D    A A    
Sbjct: 258 RFRGEPCGASDYLINTILRGEWGYKGLITSDCWAVEDFYVQGRHGYSPDVASAAAAAVHA 317

Query: 337 AGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDI 396
             +D +CGQ Y +    AV++G + E D+D++L  L+T   +LG  D    +  L    +
Sbjct: 318 G-VDTECGQAYRHIP-EAVERGLLDEKDLDRNLIRLFTARYQLGEMDDISLWDDLPASIL 375

Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
              E++ L+ + A+E +VLL+N    LPL  A    VA+VGP+ +      GNY  +P R
Sbjct: 376 EGPEHLALSRKMAQESMVLLQNKGGILPL--APDVRVALVGPNGDDREMQWGNYNPVPGR 433

Query: 457 YMSPIAGF-SGYANVTYKTGCDDVACK------SNNSIFAASEAAKTA------------ 497
            ++        +  + Y  GC  V  +       NN +  A   ++              
Sbjct: 434 TVTLYDALKERFPGIKYVRGCGIVGAEFAPKPDPNNPLSQALGKSREEMEAIARQYAIGV 493

Query: 498 ---------------------------------DATIILAGLDLSVEAESL--------- 515
                                            D  I   G+    E E +         
Sbjct: 494 QDILNYVRRQERMQASFLPELDVQSVLKELEGIDVVIFAGGISPRFEGEEMPVNLPGFKG 553

Query: 516 -DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
            DR D+ LP  Q  L+  + +  K  VILV  S  G  I       +  AIL A YPGEE
Sbjct: 554 GDRTDIQLPQVQRDLMKALHDAGKK-VILVNFS--GCAIGLVPETESCDAILQAWYPGEE 610

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
           GG AI DV+FG  NP G+LP+T+Y    V+ LP         ++    G TY+++ G  L
Sbjct: 611 GGLAITDVLFGDVNPSGKLPVTFYRS--VEDLP-------DFENYDMKGHTYRYFKGKPL 661

Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
           +PFGYGLSY+ F+Y      K  +V  N L                              
Sbjct: 662 FPFGYGLSYSTFRY------KRAKVRNNSL------------------------------ 685

Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
               +  +N G  + ++VV VY +   +     +K +  F+RV + AG+  ++
Sbjct: 686 ---IIPVKNTGKREATEVVQVYVRRKGDPDGP-VKTLRAFRRVTIPAGKTVKV 734


>gi|198425898|ref|XP_002119549.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 754

 Score =  320 bits (819), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 245/743 (32%), Positives = 360/743 (48%), Gaps = 104/743 (13%)

Query: 41  RFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF-------AHGVPRLGL 93
            F+   +    F F + SLP   R++DLV+R+T++E + QL          A  + RLG+
Sbjct: 14  HFASSKVTSEEFPFRNFSLPIEERLEDLVNRLTIEEVILQLSRGGVRDNGPAPAITRLGI 73

Query: 94  PQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA 153
             Y+W +E L G +  G           AT FP  I   A+F++ L  K+ + V+ EARA
Sbjct: 74  GPYQWNTECLRGYAMNG----------DATCFPQPIGLAATFDQGLIYKMAKTVALEARA 123

Query: 154 MYN-------LG-RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV 205
            +N        G   GL+ +SP IN+ R P WGR  ET GEDP +    A  YV GLQ  
Sbjct: 124 KHNNFTKNGNFGDHTGLSCFSPVINILRHPLWGRNQETYGEDPVLTSLMARAYVTGLQGD 183

Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
           E +          L  ++ CKH+ AY         R+ F A V++ D+  TF   F  CV
Sbjct: 184 EIY----------LPATAVCKHFVAYGGPENIPTTRFSFSANVSDHDIGTTFYPAFRECV 233

Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
             G A  VMCSYN +NG+PSCA+P +L  T+R ++   GY+V+D ++++  +D +     
Sbjct: 234 HAG-AQGVMCSYNAINGVPSCANP-MLETTLRKKFHFDGYVVSDENALE-NIDLYFNFTK 290

Query: 326 SKEDAVAQTLKAGLDLDCGQY-YTN---FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
           SK +  A  L AG+DL+   +  TN       AV+QG V E  + +S K L+   M LG 
Sbjct: 291 SKLETAAVALNAGVDLELTGFGKTNRYSLLNQAVEQGLVTEAALRRSAKRLFRTRMALGE 350

Query: 382 FDGSPQY---VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGP 438
           FD  P++   +++    + S  + + A E A +  VLLKND   LPL     K V++VGP
Sbjct: 351 FD-PPEFNHWLNVPIDVVQSLAHRKQAVEVAAKSFVLLKND-GILPLKQLYDK-VSIVGP 407

Query: 439 HANATVAMIGNY-AGIPCRYMS-PIAG---FSGYANVTYKTGC------DDVACKSNNSI 487
             N + A+ G+Y A    +Y S P+      S      + TGC      +   C + NS 
Sbjct: 408 FINNSEALTGDYPAEFNLKYFSSPLFAANSLSSSGVARFTTGCVGTNNQNLPICATYNST 467

Query: 488 FAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMS 547
               E    +D  ++  G    VEAES DR D+ LPG Q QLI  V + A GPVI+V+ +
Sbjct: 468 -NVKEVVTGSDIVLVTLGTGRGVEAESNDRRDINLPGKQLQLIQDVVKYANGPVIVVLFN 526

Query: 548 AGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP 607
           AG +D+++   NT   A++   +  +  G A+ +V+ G  NP GRLP TW         P
Sbjct: 527 AGPLDVSWVMGNT--AAVIACHFSAQMTGEAMLEVLTGVVNPAGRLPNTW---------P 575

Query: 608 LTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKY-NLLSFTKTIQVNLNKLQH 666
            +   + P+       RTY++     L+PFGYGLSYT+F Y + +    TIQ        
Sbjct: 576 ASMEQVPPMTDYSMHERTYRYSTSSPLFPFGYGLSYTKFWYLDAVVEPTTIQ-------- 627

Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
                        RC   +V           +V  QN G  DG +VV +Y     +    
Sbjct: 628 -------------RCQIPVV-----------RVLIQNTGHLDGEEVVQIYMTSKKKRDRE 663

Query: 727 YIKQVIGFQRVFVRAGRNKRIKF 749
            ++Q++ FQRV ++AG    I  
Sbjct: 664 LLRQLVAFQRVPIKAGEEVSISL 686


>gi|167524198|ref|XP_001746435.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775197|gb|EDQ88822.1| predicted protein [Monosiga brevicollis MX1]
          Length = 834

 Score =  318 bits (815), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 233/734 (31%), Positives = 358/734 (48%), Gaps = 102/734 (13%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVP-----RLGLPQYEWWSEALH 104
           + F +  LP++ R+ DLV R+TL+EK+QQL  G  A   P     RLG+  + W SE + 
Sbjct: 34  YPFRNPDLPWAARLDDLVGRLTLEEKLQQLQHGGAAQMTPAPAVERLGIGPFVWGSECVT 93

Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA---- 160
           G+     GT  +D  P  T+FP  +   A+F+ +L K+    ++ E RA  N  R     
Sbjct: 94  GL-----GTDGND--PHGTAFPQPLGMAATFDPALLKRAAGTIALELRAQRNFDRENGVV 146

Query: 161 ----GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
               GL+ WSP +N+ R P WGR  ET GE P +    A ++V G+Q           ++
Sbjct: 147 KFHHGLSCWSPVVNINRHPLWGRNDETFGECPVLSSFMARSFVEGIQGN---------HT 197

Query: 217 RPLKVSSCCKHYAAYDVDNWKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
           R    ++ CKH     +D + G D  RY FDA V++ D+  TFL  FE C   G     M
Sbjct: 198 RYYAAAAACKH-----LDVYGGPDNLRYVFDADVSQADLTGTFLMAFEECAAAG-VMGYM 251

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           CSYN + G+P+CA+ + +    R +W   GY+V+D  ++  + ++H + A+    AVA  
Sbjct: 252 CSYNSIRGVPACANYRTMTFFAREQWGFEGYVVSDQGAVFRITESHNYTANQTLGAVA-A 310

Query: 335 LKAGLDL---DCGQYYTNFTGNAVQQGKVKE-TDIDKSLKYLYTVLMRLGFFDGSPQ--- 387
           L AG D+   D  Q+   +  +     K+ +   ID S+  L+ V MRLG FD  P+   
Sbjct: 311 LNAGCDMEDSDDAQHVAYYNLSLALDLKLTDMATIDASVSRLFYVRMRLGEFD-PPENDP 369

Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSA-KVKTVAVVGPHANATVAM 446
           + SL    + S  ++E+A + A   IVLLKN   TLPL++A K  +  ++GP A+    M
Sbjct: 370 WRSLNMSIVSSPAHVEMARDVATASIVLLKNQNETLPLSAAAKNASYCLLGPFADNADLM 429

Query: 447 IGNYA-----GIPCRYMSPIAGF----SGYANVTYKTGCDDVACKSNNSIFAASEAAKTA 497
           +G Y+      +   Y + +A      S  A+  Y  GC    C   ++    +   +  
Sbjct: 430 MGKYSPHGSTNVTVTYRAGLAAALQNASQTASFQYLEGCTGPFCDGLDTAAVTTFIQQGC 489

Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEV--AKGPVILVIMSAGGVDIAF 555
           D  ++  G    VE+ESLDR ++  PG Q  L+  V E    K  ++L++ +AG VD+A 
Sbjct: 490 DTVLLAVGTSYHVESESLDRSNMSFPGAQPTLVQTVLEALGTKQRLVLLVSTAGPVDLAA 549

Query: 556 AETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP 615
            E +T + AIL   Y G+  G A+AD++ G+ +P GRLP +W N        ++ +P  P
Sbjct: 550 LEQDTRVAAILDLIYLGQTAGTALADILLGETSPSGRLPFSWPN-------KVSDVP--P 600

Query: 616 VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
           +D     GRTY+F     L+PFGYGLSYTQF  + L+    + V       C+ L     
Sbjct: 601 IDDYTMQGRTYRFAQADVLFPFGYGLSYTQFNLSHLAAPYILPV-------CQAL----- 648

Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
                                  V+  N G   G+  + VY + P  +    I+Q+    
Sbjct: 649 --------------------RLSVNVTNTGRLSGAIPLQVYVEWPNAVGGP-IRQLATTT 687

Query: 736 RVFVRAGRNKRIKF 749
           RVFV A  +K ++ 
Sbjct: 688 RVFVDAASSKTVQL 701


>gi|346726970|ref|YP_004853639.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346651717|gb|AEO44341.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 902

 Score =  318 bits (815), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 182/445 (40%), Positives = 256/445 (57%), Gaps = 31/445 (6%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+   +  R  DLVSRMTL+EK  Q+ + A  +PRLG+P Y+WW+EALHGV+  G   
Sbjct: 35  YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 91

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
                  GAT FP  I   A+F+  L  ++  A+S EARA ++            GLT+W
Sbjct: 92  -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 144

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL-KVSSC 224
           SPNIN+ RDPRWGR  ET GEDPF+  R  V +V+GLQ  EG +   +    P  K+ + 
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGADAPKNAQGEPYRKLDAT 203

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
            KH+A   V +    DR+HFDAR +++D+ ET+L  FE  VK+G   +VM +YNRV G  
Sbjct: 204 AKHFA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 260

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
           + A   LL   +R +W   GY+V+DC +I  +  +HK +A ++E A A  +K G +L+CG
Sbjct: 261 ASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGTELECG 319

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENI 402
           + Y+     AV QG + E  ID +LK L T  MRLG FD  G   + ++      S  + 
Sbjct: 320 EEYSTLPA-AVHQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASVNQSPAHD 378

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLLKND   LPL+ AK+K +AV+GP A+ T+A++GNY G P   ++ + 
Sbjct: 379 ALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 437

Query: 463 GFSGY---ANVTYKTGCDDVACKSN 484
           G       A V Y  G D V  + +
Sbjct: 438 GIRAAAPNAQVLYARGADLVEGRDD 462



 Score =  135 bits (340), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 89/282 (31%), Positives = 129/282 (45%), Gaps = 53/282 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A + A++AD  + + GL   VE E +          DR DL LP  Q  L+  +    K 
Sbjct: 629 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 687

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV+ V+ +   + I +A+ +  + AIL A YPG+ GG A+AD +FG  NPGGRLP+T+Y 
Sbjct: 688 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
                     S  L   D     GRTY+++ G  LYPFG+GLSYTQF Y+ L   +T   
Sbjct: 746 ---------ESETLPAFDDYAMRGRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTT-- 794

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                        +  D      V  +N G   G +VV +Y  P
Sbjct: 795 -----------------------------IAADGSLTATVTVKNTGQRAGDEVVQLYLHP 825

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
                    K++ GFQR+ ++ G  + + F  +A  +L I D
Sbjct: 826 LTPQRERAGKELHGFQRIALQPGEQRALHFTLDAKNALRIYD 867


>gi|433677589|ref|ZP_20509555.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
 gi|430817300|emb|CCP39963.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
          Length = 913

 Score =  317 bits (812), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 182/447 (40%), Positives = 262/447 (58%), Gaps = 35/447 (7%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+   +  R  DLV+RMTL+EK  Q+ + A  +PRLG+P Y+WW+EALHGV+  G   
Sbjct: 37  YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG--------RAGLTYW 165
                  GAT FP  I   A+F+  L  ++  A+S EARA ++            GLT+W
Sbjct: 94  -------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ--DVEGHENATDLNSRPLKVSS 223
           SPNIN+ RDPRWGR  ET GEDPF+  R  V +V+GLQ  DV+  +NA     R  K+ +
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEDVDVPKNAQGEAYR--KLDA 204

Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
             KH+A   V +    DR+HFDA  +++D+ ET+L  FE  VKEG   +VM +YNRV G 
Sbjct: 205 TAKHFA---VHSGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGE 261

Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
            + A   LL   +R  W   GY+V+DC +I  +  NHK +A ++E+A A  +K G +L+C
Sbjct: 262 SASASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVA-TREEAAALAVKHGTELEC 320

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE 403
           G  Y+     AV++G + E D+D +L+ L    MRLG FD  P+ ++  +  + ++++ E
Sbjct: 321 GAEYSTLP-TAVRKGLISEADVDNALQKLMYSRMRLGMFD-PPEKLAWAQIPLSANQSPE 378

Query: 404 ---LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
              LA   ARE +VLLKND   LPL+ AK+K +AVVGP A+ T+A++GNY G P   ++ 
Sbjct: 379 HDALARRTARESLVLLKND-GVLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAAPVTV 437

Query: 461 IAGFSGY---ANVTYKTGCDDVACKSN 484
           + G       A V Y  G D V  + +
Sbjct: 438 LQGIREAAPDAEVLYARGADLVEGRDD 464



 Score =  138 bits (347), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 96/300 (32%), Positives = 141/300 (47%), Gaps = 54/300 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A +AA+ AD  + + GL   VE E +          DR DL LP  Q  L+  +    K 
Sbjct: 631 ALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGTGK- 689

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV+ V+ +   + I +A+ +  + AIL A YPG+ GG A+ADV+FG  NPGGRLP+T+Y 
Sbjct: 690 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVTFYK 747

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
                     S  L   D     GRTY+++ G  LYPFG+GLSYTQF Y+ L        
Sbjct: 748 ---------ESETLPAFDDYAMRGRTYRYFAGTALYPFGHGLSYTQFAYSDLRL------ 792

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                          D SK    G L   L+           +N G   G +VV +Y +P
Sbjct: 793 ---------------DRSKLAADGRLHATLKV----------KNTGQRAGDEVVQLYLQP 827

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT-LLPAGEHTIFVG 778
            +       K + GFQR+ ++ G  + ++F  +    L + D A    ++  G++ + VG
Sbjct: 828 LSPQRERASKDLRGFQRIALQPGETREVRFAISPQSDLRLYDEARKAYVVDPGDYELQVG 887


>gi|381170979|ref|ZP_09880130.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
 gi|380688543|emb|CCG36617.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
          Length = 901

 Score =  317 bits (811), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 183/451 (40%), Positives = 257/451 (56%), Gaps = 31/451 (6%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           Q ++  + D+   +  R  DLVSRMTL+EK  Q+ + A  +PRL +P Y+WW+EALHGV+
Sbjct: 28  QAATPPYLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVA 87

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGR 159
             G          GAT FP  I   A+F+  L  ++  A+S EARA ++           
Sbjct: 88  RAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARY 137

Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT+WSPNIN+ RDPRWGR  ET GEDPF+  R  V +V+GLQ  EG     +    P 
Sbjct: 138 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGAAAPKNAQGEPY 196

Query: 220 -KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
            K+ +  KH+A   V +    DR+HFDAR +++D+ ET+L  FE  VKEG   +VM +YN
Sbjct: 197 RKLDATAKHFA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAYN 253

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
           RV G  + A   LL   +R +W   GY+V+DC +I  +  +HK +A ++E A A  +K G
Sbjct: 254 RVYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHG 312

Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDI 396
            +L+CG+ Y      AV+QG + E  ID +LK L T  MRLG FD  G   + ++     
Sbjct: 313 TELECGEEYATLPA-AVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASVN 371

Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
            S  +  LA   ARE +VLLKND   LPL+ AK+K +AV+GP A+ T+A++GNY G P  
Sbjct: 372 QSPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAA 430

Query: 457 YMSPIAGFSGY---ANVTYKTGCDDVACKSN 484
            ++ + G       A V Y  G D V  + +
Sbjct: 431 PVTVLQGIRAAAPNAQVLYARGADLVEGRDD 461



 Score =  136 bits (343), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 89/282 (31%), Positives = 130/282 (46%), Gaps = 53/282 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A + A++AD  + + GL   VE E +          DR DL LP  Q  L+  +    + 
Sbjct: 628 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGR- 686

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV+ V+ +   + I +A+ +  + AIL A YPG+ GG A+AD +FG  NPGGRLP+T+Y 
Sbjct: 687 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 744

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
                     S  L   D     GRTY+++ G  LYPFG+GLSYTQF Y+ L   +T   
Sbjct: 745 ---------ESETLPAFDDYAMRGRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTT-- 793

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                        +  D      V  +N G   G +VV +Y  P
Sbjct: 794 -----------------------------IATDGSLTATVTVKNTGQRAGDEVVQLYLHP 824

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
            A       K++ GFQR+ ++ G  + + F  NA  +L + D
Sbjct: 825 LAPQRERAGKELHGFQRIALQPGEQRELGFTINAKDALRLYD 866


>gi|440731995|ref|ZP_20911965.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
 gi|440370332|gb|ELQ07251.1| glucan 1,4-beta-glucosidase [Xanthomonas translucens DAR61454]
          Length = 913

 Score =  316 bits (810), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 181/447 (40%), Positives = 262/447 (58%), Gaps = 35/447 (7%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+   +  R  DLV+RMTL+EK  Q+ + A  +PRLG+P Y+WW+EALHGV+  G   
Sbjct: 37  YLDTQRSFEQRAADLVARMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG--------RAGLTYW 165
                  GAT FP  I   A+F+  L  ++  A+S EARA ++            GLT+W
Sbjct: 94  -------GATVFPQAIGMAATFDVPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD--VEGHENATDLNSRPLKVSS 223
           SPNIN+ RDPRWGR  ET GEDPF+  R  V +V+GLQ    +  +NA     R  K+ +
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGEAYR--KLDA 204

Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
             KH+A   V +    DR+HFDA  +++D+ ET+L  FE  VKEG   +VM +YNRV G 
Sbjct: 205 TAKHFA---VHSGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGE 261

Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
            + A   LL   +R  W   GY+V+DC +I  +  NHK +A ++E+A A  +K G +L+C
Sbjct: 262 SASASKFLLRDVLRDRWGFDGYVVSDCWAIVDIWKNHKIVA-TREEAAALAVKHGTELEC 320

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE 403
           G  Y+    +AV++G + E D+DK+L+ L    MRLG FD  P+ ++  +  + ++++ E
Sbjct: 321 GAEYSTLP-SAVRKGLISEADVDKALQKLMYSRMRLGMFD-PPEKLAWAQIPLSANQSPE 378

Query: 404 ---LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
              LA   ARE +VLLKND   LPL+ AK+K +AVVGP A+ T+A++GNY G P   ++ 
Sbjct: 379 HDALARRTARESLVLLKND-GVLPLSRAKIKRIAVVGPTADDTMALLGNYYGTPAAPVTV 437

Query: 461 IAGFSGY---ANVTYKTGCDDVACKSN 484
           + G       A V Y  G D V  + +
Sbjct: 438 LQGIREAAPDAEVLYARGADLVEGRDD 464



 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 97/313 (30%), Positives = 144/313 (46%), Gaps = 54/313 (17%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A +AA+ AD  + + GL   VE E +          DR DL LP  Q  L+  +    K 
Sbjct: 631 ALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRALLEALHGTGK- 689

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV+ V+ +   + I +A+ +  + AIL A YPG+ GG A+ADV+FG  NPGGRLP+T+Y 
Sbjct: 690 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVTFYK 747

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
                     S  L   D     GRTY+++ G  LYPFG+GLSYTQF Y+ L        
Sbjct: 748 ---------ESETLPAFDDYAMRGRTYRYFAGTPLYPFGHGLSYTQFAYSDLRL------ 792

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                          D SK    G L   L+           +N G   G +VV +Y +P
Sbjct: 793 ---------------DRSKLAADGRLHATLKV----------KNTGQRAGDEVVQLYLQP 827

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT-LLPAGEHTIFVG 778
            +       K + GFQR+ ++ G  + ++F  +    L + D A    ++  G++ + VG
Sbjct: 828 LSPQRERASKDLRGFQRIALQPGETREVRFAISPQSDLRLYDEARKGYVVDPGDYELQVG 887

Query: 779 NGGVSFPIHLNFN 791
                  +   F+
Sbjct: 888 ASSSDVRVRQRFS 900


>gi|443717728|gb|ELU08656.1| hypothetical protein CAPTEDRAFT_228276 [Capitella teleta]
          Length = 731

 Score =  316 bits (809), Expect = 4e-83,   Method: Compositional matrix adjust.
 Identities = 211/638 (33%), Positives = 329/638 (51%), Gaps = 61/638 (9%)

Query: 46  GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDE----KVQQLGDFAHGVPRLGLPQYEWWSE 101
           G+  + F F D +L +  RV DLV R+T++E     V Q G     V RLG+  Y++ +E
Sbjct: 14  GVANAKFPFEDVTLSWDKRVDDLVQRLTIEEVVNISVAQYGKSTIPVDRLGVKPYQFINE 73

Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL---- 157
            + GV               +T+FP  I   ASF+  L   + QA++ E R  YN     
Sbjct: 74  CITGVR-----------WENSTAFPQAIGLGASFSPDLAFNMSQAIARELRGFYNTEVKS 122

Query: 158 ---GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
              G  G+  ++P IN+ R P WGR  ET GEDP++ G+ +V +V+GLQ           
Sbjct: 123 QIYGHRGVNCFTPVINIMRHPLWGRNQETYGEDPWLSGQLSVGFVKGLQGD--------- 173

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
           + R ++ S  CKH+  ++      V R+ FDA+V+E+D   TFL  F+ CV+ G + ++M
Sbjct: 174 HPRYIQASGGCKHFDVHNGPENIPVSRFGFDAKVSERDWRMTFLPQFKTCVEAG-SINIM 232

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           CSYNR+NG+P+CA+ KLL   +R EW  +GY+++D  +I+ +V +HK+     E A A +
Sbjct: 233 CSYNRINGVPACANKKLLTDILRKEWGFNGYVISDSGAIENIVYHHKYTKTLAE-AAADS 291

Query: 335 LKAGLDLD------CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ- 387
           +KAG +++       G  Y N   NAV+Q  + E ++ ++LK      MR G FD     
Sbjct: 292 VKAGCNVELTGATGSGVAYFNLL-NAVKQNLISEEELRENLKKPMYSRMRQGEFDPVDMN 350

Query: 388 -YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
            +  +    + S E+ +LA +A+    VL+KN    LPL   +   +A++GP A+    +
Sbjct: 351 PFTKIDMSVVLSQEHQDLAVKASAMSFVLMKNLNRVLPLKK-RFDRLAIIGPFADNAETL 409

Query: 447 IGNYAG--IPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
            G+Y     P    +P  G      +V Y +GCDD +C +N    A  +A K A    + 
Sbjct: 410 FGDYIPNWDPKFVSTPYEGLKSLGDDVRYASGCDDPSC-TNYDPKAIEKAVKGAQFVFVC 468

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAK-GPVILVIMSAGGVDIAFAETNTNI 562
            G+  ++E E  DR DL LPGYQ Q++      ++  P++LV+ +AG VD+ + + +  +
Sbjct: 469 LGVGSNLEREGHDRADLDLPGYQLQILKDAEFFSREAPLVLVLFNAGPVDLTWPKLSPEV 528

Query: 563 KAILWAGYPGEEGGRAIADVVFGKFN---PGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
             I+   YP    G+A+  VV    +   P  RLP TW         P     +  +   
Sbjct: 529 DGIIECFYPAMGTGKALYQVVTATGDDGVPAARLPSTW---------PAQLHQVPSITDY 579

Query: 620 GYPGRTYKFYN-GPTLYPFGYGLSYTQFKYNLLSFTKT 656
              G TY++++ G  LYPFGYGLSYT F Y  +S + T
Sbjct: 580 NMTGHTYRYFDGGDPLYPFGYGLSYTSFHYQTVSVSPT 617


>gi|418518550|ref|ZP_13084692.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
 gi|418522850|ref|ZP_13088880.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410700720|gb|EKQ59264.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410703176|gb|EKQ61671.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
          Length = 901

 Score =  315 bits (808), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 180/450 (40%), Positives = 254/450 (56%), Gaps = 29/450 (6%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           Q ++  + D+   +  R  DLVSRMTL+EK  Q+ + A  +PRL +P Y+WW+EALHGV+
Sbjct: 28  QTATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVA 87

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGR 159
             G          GAT FP  I   A+F+  L  ++  A+S EARA ++           
Sbjct: 88  RAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARY 137

Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT+WSPNIN+ RDPRWGR  ET GEDPF+  R  V +V+GLQ             R  
Sbjct: 138 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGERYR 197

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           K+ +  KH+A   V +    DR+HFDAR +++D+ ET+L  FE  VK+G   +VM +YNR
Sbjct: 198 KLDATAKHFA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNR 254

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
           V G  + A   LL   +R +W   GY+V+DC +I  +  +HK +A ++E A A  +K G 
Sbjct: 255 VYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGT 313

Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDIC 397
           +L+CG+ Y      AV+QG + E  ID +LK L T  MRLG FD  G   + ++      
Sbjct: 314 ELECGEEYATLPA-AVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASVNQ 372

Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
           S  +  LA   ARE +VLLKND   LPL+ AK+K +AV+GP A+ T+A++GNY G P   
Sbjct: 373 SPAHDALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAP 431

Query: 458 MSPIAGFSGY---ANVTYKTGCDDVACKSN 484
           ++ + G       A V Y  G D V  + +
Sbjct: 432 VTVLQGIRAAAPNAQVLYARGADLVEGRDD 461



 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 96/317 (30%), Positives = 141/317 (44%), Gaps = 60/317 (18%)

Query: 462 AGFSGYANVTYKTGCDDVACK-------SNNSIFAASEAAKTADATIILAGLDLSVEAES 514
           AG +    + Y  G  D A +       +   +  A + A++AD  + + GL   VE E 
Sbjct: 593 AGRAYEVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEGEE 652

Query: 515 L----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           +          DR DL LP  Q  L+  +    K PV+ V+ +   + I +A+ +  + A
Sbjct: 653 MKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK-PVVAVLTAGSALAIDWAQQH--LPA 709

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           IL A YPG+ GG A+AD +FG  NPGGRLP+T+Y           S  L   D     GR
Sbjct: 710 ILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK---------ESETLPAFDDYAMRGR 760

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           TY+++ G  LYPFG+GLSYTQF Y+ L   +T                            
Sbjct: 761 TYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTT--------------------------- 793

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
               +  D      V  +N G   G +VV +Y  P A       K++ GFQR+ ++ G  
Sbjct: 794 ----IATDGSLTATVTVKNTGQRAGDEVVQLYLHPLAPQRERAGKELHGFQRIALQPGEQ 849

Query: 745 KRIKFVFNACKSLNIVD 761
           + + F  NA  +L + D
Sbjct: 850 RELGFTINAKDALRLYD 866


>gi|116181370|ref|XP_001220534.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
 gi|88185610|gb|EAQ93078.1| hypothetical protein CHGG_01313 [Chaetomium globosum CBS 148.51]
          Length = 549

 Score =  315 bits (807), Expect = 6e-83,   Method: Compositional matrix adjust.
 Identities = 201/512 (39%), Positives = 283/512 (55%), Gaps = 40/512 (7%)

Query: 55  CDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTH 114
           CD       R   LV  + ++EK+Q L D + G  RLGLP Y WWSEALHGV+   PG  
Sbjct: 39  CDPKATPPERAAALVKALNIEEKLQNLVDMSKGAERLGLPAYAWWSEALHGVA-ASPGVR 97

Query: 115 FDDVIPG----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNIN 170
           F+    G    ATSF   I  +A+F++ L  K+   +STEARA  N G AGL YW+PNIN
Sbjct: 98  FNRTAGGRFSSATSFANSITLSAAFDDELVYKVADTISTEARAFANAGLAGLDYWTPNIN 157

Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
             +DPRWGR  ETPGEDP  +  Y    + GL+         D + R  KV + CKHYAA
Sbjct: 158 PYKDPRWGRGHETPGEDPVRIKGYVKALLAGLE-------GDDPSIR--KVVATCKHYAA 208

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
           YD++ W+G  R+ FDA V+ QD+ E +L PF+ C ++    S MCSYN +NG P+CA   
Sbjct: 209 YDLERWQGTTRHRFDAVVSLQDLSEYYLPPFQQCARDSKVGSFMCSYNALNGTPACASTY 268

Query: 291 LLNQTVRGEW---DLHGYIVADCDSIQVMVDN---HKFLADSKE-DAVAQTLKAGLDLDC 343
           L++  +R  W   + + YI +DC++IQ  +     H F +   E +A A   +AG D  C
Sbjct: 269 LMDDILRKHWGWTEHNNYITSDCNAIQDFLPGPKWHNFSSTQTEAEAAAVAYQAGTDTVC 328

Query: 344 G----QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVSLGKQDI 396
                  YT+  G A  Q  + E  ID +LK LY  L+R+G+FD   GSP Y S+G +D+
Sbjct: 329 EVPGWPPYTDVIG-AYNQTLLSEEVIDTALKRLYEGLVRVGYFDPASGSP-YRSIGWEDV 386

Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA--MIGNYAGIP 454
            + E  ELA ++  +G+VLLKND  TLPLN  + KTVA++G  AN+T    ++G Y+G P
Sbjct: 387 NTPEAQELALQSGTDGLVLLKND-GTLPLN-LEDKTVALIGFWANSTNGGRILGGYSGFP 444

Query: 455 CRYMSPIAGFSGYANVTYKTGCDDVA-----CKSNNSIFAASEAAKTADATIILAGLDLS 509
               SP+       N+TY      +A        ++ +  A E AK ++  +   G D S
Sbjct: 445 PYIHSPVDAAEKL-NLTYHYASGPLAENITQAAIDDWVAKALEPAKKSNVILYFGGTDTS 503

Query: 510 VEAESLDREDLWLPGYQTQLINQVAEVAKGPV 541
           + AE LDR+ +  P  Q  +I  ++ + + P 
Sbjct: 504 IAAEDLDRDSIAWPEIQLAVIEALSALRQAPA 535


>gi|118489157|gb|ABK96385.1| unknown [Populus trichocarpa x Populus deltoides]
          Length = 343

 Score =  315 bits (807), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 150/335 (44%), Positives = 216/335 (64%), Gaps = 9/335 (2%)

Query: 446 MIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG 505
           MIGNYAG+ C Y +P+ G   YA   + +GC+DV C  N    AA  AA+ ADATI++ G
Sbjct: 1   MIGNYAGVACGYTTPLQGIRRYAKTVHLSGCNDVFCNGNQQFNAAEVAARHADATILVMG 60

Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
           LD S+EAE  DR+ L LPGYQ +L+++VA  ++GP ILV+MS G +D++FA+ +  I AI
Sbjct: 61  LDQSIEAEFRDRKGLLLPGYQQELVSRVARASRGPTILVLMSGGPIDVSFAKNDPRIGAI 120

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           LW GYPG+ GG AIADV+FG  NPGG+LP+TWY  DY+  +P+T+M +R   S GYPGRT
Sbjct: 121 LWVGYPGQAGGAAIADVLFGTANPGGKLPMTWYPHDYLAKVPMTNMGMRADPSRGYPGRT 180

Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVL 685
           Y+FY GP ++PFG+G+SYT F ++L+   + + V L  L   RN    S+A       + 
Sbjct: 181 YRFYKGPVVFPFGHGMSYTTFAHSLVQAPREVSVPLASLHVSRNTTGASNA-------IR 233

Query: 686 VNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
           V+   C+       +D +N G  DG+  ++V+S PP    +T  KQ+IGF++V +  G  
Sbjct: 234 VSHANCEALALGVHIDVKNTGDMDGTHTLLVFSSPPGGKWSTQ-KQLIGFEKVHLVTGSQ 292

Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           KR+K   + CK L++VD      +P GEH +++G+
Sbjct: 293 KRVKIDIHVCKHLSVVDRFGIRRIPNGEHYLYIGD 327


>gi|390991557|ref|ZP_10261819.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
 gi|372553724|emb|CCF68794.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
          Length = 901

 Score =  315 bits (807), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 181/445 (40%), Positives = 254/445 (57%), Gaps = 31/445 (6%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+   +  R  DLVSRMTL+EK  Q+ + A  +PRL +P Y+WW+EALHGV+  G   
Sbjct: 34  YLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG--- 90

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
                  GAT FP  I   A+F+  L  ++  A+S EARA ++            GLT+W
Sbjct: 91  -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 143

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL-KVSSC 224
           SPNIN+ RDPRWGR  ET GEDPF+  R  V +V+GLQ  EG     +    P  K+ + 
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGAAAPKNAQGEPYRKLDAT 202

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
            KH+A   V +    DR+HFDAR +++D+ ET+L  FE  VK+G   +VM +YNRV G  
Sbjct: 203 AKHFA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 259

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
           + A   LL   +R +W   GY+V+DC +I  +  +HK +A ++E A A  +K G +L+CG
Sbjct: 260 ASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGTELECG 318

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENI 402
           + Y      AV+QG + E  ID +LK L T  MRLG FD  G   + ++      S  + 
Sbjct: 319 EEYATLPA-AVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASVNQSPAHD 377

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLLKND   LPL+ AK+K +AV+GP A+ T+A++GNY G P   ++ + 
Sbjct: 378 ALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 436

Query: 463 GFSG---YANVTYKTGCDDVACKSN 484
           G       A V Y  G D V  + +
Sbjct: 437 GIRAAAPKAQVLYARGADLVEGRDD 461



 Score =  137 bits (344), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 95/317 (29%), Positives = 141/317 (44%), Gaps = 60/317 (18%)

Query: 462 AGFSGYANVTYKTGCDDVACK-------SNNSIFAASEAAKTADATIILAGLDLSVEAES 514
           AG +    + Y  G  D A +       +   +  A + A++AD  + + GL   VE E 
Sbjct: 593 AGRAYEVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEGEE 652

Query: 515 L----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           +          DR DL LP  Q  L+  +    + PV+ V+ +   + I +A+ +  + A
Sbjct: 653 MKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGR-PVVAVLTTGSALAIDWAQQH--LPA 709

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           IL A YPG+ GG A+AD +FG  NPGGRLP+T+Y           S  L   D     GR
Sbjct: 710 ILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK---------ESETLPAFDDYAMRGR 760

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           TY+++ G  LYPFG+GLSYTQF Y+ L   +T                            
Sbjct: 761 TYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTT--------------------------- 793

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
               +  D      V  +N G   G +VV +Y  P A       K++ GFQR+ ++ G  
Sbjct: 794 ----IATDGSLTATVTVKNTGQRAGDEVVQLYLHPLAPQRERAGKELHGFQRIALQPGEQ 849

Query: 745 KRIKFVFNACKSLNIVD 761
           + + F  NA  +L + D
Sbjct: 850 RELGFTINAKDALRLYD 866


>gi|78049893|ref|YP_366068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
 gi|78038323|emb|CAJ26068.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
          Length = 902

 Score =  315 bits (806), Expect = 8e-83,   Method: Compositional matrix adjust.
 Identities = 180/445 (40%), Positives = 256/445 (57%), Gaps = 31/445 (6%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+   +  R  DLVSRMTL+EK  Q+ + A  +PRLG+P Y+WW+EALHGV+  G   
Sbjct: 35  YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 91

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
                  GAT FP  I   A+F+  L  ++  A+S EARA ++            GLT+W
Sbjct: 92  -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 144

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL-KVSSC 224
           SPNIN+ RDPRWGR  ET GEDPF+  R  V +V+GL+  EG +   +    P  K+ + 
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLRG-EGADAPKNAQGEPYRKLDAT 203

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
            KH+A   V +    DR+HFDAR +++D+ ET+L  FE  VK+G   +VM +YNRV G  
Sbjct: 204 AKHFA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 260

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
           + A   LL   +R +W   GY+V+DC +I  +  +HK +A ++E A A  +K G +L+CG
Sbjct: 261 ASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGTELECG 319

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENI 402
           + Y+     AV+QG + E  ID +L  L T  MRLG FD  G   + ++      S  + 
Sbjct: 320 EEYSTLPA-AVRQGLIDEAQIDTALTTLMTARMRLGMFDPPGQLPWSTIPASVNQSPAHD 378

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLLKND   LPL+ AK+K +AV+GP A+ T+A++GNY G P   ++ + 
Sbjct: 379 ALARRTARESLVLLKND-GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 437

Query: 463 GFSGY---ANVTYKTGCDDVACKSN 484
           G       A V Y  G D V  + +
Sbjct: 438 GIRAAAPNAQVLYARGADLVEGRDD 462



 Score =  137 bits (345), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 90/282 (31%), Positives = 130/282 (46%), Gaps = 53/282 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A + A +AD  + + GL   VE E +          DR DL LP  Q  L+  +    K 
Sbjct: 629 ALDVASSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 687

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV+ V+ +   + I +A+ +  + AIL A YPG+ GG A+AD +FG  NPGGRLP+T+Y 
Sbjct: 688 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
                     S  L   D     GRTY+++ G  LYPFG+GLSYTQF Y+ L   +T   
Sbjct: 746 ---------ESETLPAFDDYAMRGRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTT-- 794

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                        +  D      V  +N G   G +VV +Y  P
Sbjct: 795 -----------------------------IAADGSLTATVTVKNTGQRAGDEVVQLYLHP 825

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
                    K++ GFQR+ ++AG  + + F+ +A  +L I D
Sbjct: 826 LTPQRERAGKELHGFQRITLQAGEQRALHFILDAKNALRIYD 867


>gi|294667502|ref|ZP_06732718.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292602731|gb|EFF46166.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 901

 Score =  314 bits (804), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 179/450 (39%), Positives = 254/450 (56%), Gaps = 29/450 (6%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           Q ++  + D+   +  R  DLVSRMTL+EK  Q+ + A  +PRL +P Y+WW+EALHGV+
Sbjct: 28  QAATPPYLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVA 87

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL--------GR 159
             G          GAT FP  I   A+F+  L  ++  A+S EARA ++           
Sbjct: 88  RAG----------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERY 137

Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT+WSPNIN+ RDPRWGR  ET GEDPF+  R  V +V+GLQ   G         R  
Sbjct: 138 QGLTFWSPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGGDAPKNAQGERYR 197

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           K+ +  KH+A   V +    DR+HFDA  +++D+ ET+L  FE  VK+G   +VM +YNR
Sbjct: 198 KLDATAKHFA---VHSGPEADRHHFDAHPSQRDLYETYLPAFEALVKDGKVDAVMGAYNR 254

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
           V G  + A   LL   +R +W   GY+V+DC +I  +  +HK +A ++E A A  +K G 
Sbjct: 255 VYGESASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGT 313

Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDIC 397
           +L+CG+ Y+     AV+QG + E  ID +LK L T  MRLG FD  G   +  +      
Sbjct: 314 ELECGEEYSTLPA-AVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSQIPASVNQ 372

Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
           S  +  LA   ARE +VLLKND   LPL+ A++K +AV+GP A+ T+A++GNY G P   
Sbjct: 373 SPAHDALARRTARESLVLLKND-GLLPLSRARLKRIAVIGPTADDTMALLGNYYGTPAAP 431

Query: 458 MSPIAGFSGY---ANVTYKTGCDDVACKSN 484
           ++ + G       A V Y  G D V  + +
Sbjct: 432 VTVLQGIRAAAPNAQVLYARGADLVEGRDD 461



 Score =  133 bits (335), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 94/317 (29%), Positives = 139/317 (43%), Gaps = 60/317 (18%)

Query: 462 AGFSGYANVTYKTGCDDVACK-------SNNSIFAASEAAKTADATIILAGLDLSVEAES 514
           AG +    + Y  G  D A +       +   +  A + A++A+  + + GL   VE E 
Sbjct: 593 AGRAYEVRLEYFEGERDAAVRLAWRQPGAKPPLQEALDVARSAEVVVFVGGLTGDVEGEE 652

Query: 515 L----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           +          DR DL LP  Q  L+  +    K PV+ V+ +   + I +A+ +  + A
Sbjct: 653 MKVNYPGFAGGDRTDLRLPKPQRDLLEALHATGK-PVVAVLTTGSALAIDWAQQH--LPA 709

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           IL A YPG+ GG A+AD +FG  NPGGRLP+T+Y           S  L   D     GR
Sbjct: 710 ILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK---------ESETLPAFDDYAMRGR 760

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           TY+++ G  LYPFG+GLSYTQF Y+ L   +T                            
Sbjct: 761 TYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTT--------------------------- 793

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
               +  D      V  +N G   G +VV +Y  P         K++ GFQR+ +  G  
Sbjct: 794 ----IATDGSLTATVTVKNTGQRAGDEVVQLYLHPLTPQRERAGKELHGFQRIALTPGEQ 849

Query: 745 KRIKFVFNACKSLNIVD 761
           + + F  NA  +L + D
Sbjct: 850 RELGFTINAKDALRLYD 866


>gi|21244948|ref|NP_644530.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21110666|gb|AAM39066.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
          Length = 901

 Score =  314 bits (804), Expect = 1e-82,   Method: Compositional matrix adjust.
 Identities = 182/445 (40%), Positives = 252/445 (56%), Gaps = 31/445 (6%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+   +  R  DLVSRMTL+EK  Q+ + A  +PRL +P Y+WW+EALHGV+  G   
Sbjct: 34  YLDTQRSFEARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG--- 90

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
                  GAT FP  I   A+F+  L  ++  A+S EARA ++            GLT+W
Sbjct: 91  -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 143

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL-KVSSC 224
           SPNIN+ RDPRWGR  ET GEDPF+  R  V +V+GLQ  EG     +    P  K+ + 
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGAAAPKNAQGEPYRKLDAT 202

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
            KH A   V +    DR+HFDAR +++D+ ET+L  FE  VKEG   +VM +YNRV G  
Sbjct: 203 AKHLA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGES 259

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
           + A   LL   +R +W   GY+V+DC +I  +  +HK +A ++E A A  +K G +L+CG
Sbjct: 260 ASASKFLLQDVLRDQWGFRGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGTELECG 318

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENI 402
           + Y      AV+QG + E  ID +LK L T  MRLG FD  G   + ++      S  + 
Sbjct: 319 EEYATLPA-AVRQGLIDEAQIDTALKTLMTARMRLGMFDPPGQLPWSTIPASVNQSPAHD 377

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLLKND   LPL+ AK K +AV+GP A+ T+A++GNY G P   ++ + 
Sbjct: 378 ALARRTARESLVLLKND-GLLPLSRAKFKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 436

Query: 463 GFSGY---ANVTYKTGCDDVACKSN 484
           G       A V Y  G D V  + +
Sbjct: 437 GIRAAAPNAQVLYARGADLVEGRDD 461



 Score =  137 bits (344), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 95/317 (29%), Positives = 141/317 (44%), Gaps = 60/317 (18%)

Query: 462 AGFSGYANVTYKTGCDDVACK-------SNNSIFAASEAAKTADATIILAGLDLSVEAES 514
           AG +    + Y  G  D A +       +   +  A + A++AD  + + GL   VE E 
Sbjct: 593 AGRAYEVRLEYFEGERDAAVRLAWRQPGARPPLQEALDVARSADVVVFVGGLTGDVEGEE 652

Query: 515 L----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           +          DR DL LP  Q  L+  +    + PV+ V+ +   + I +A+ +  + A
Sbjct: 653 MKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGR-PVVAVLTTGSALAIDWAQQH--LPA 709

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           IL A YPG+ GG A+AD +FG  NPGGRLP+T+Y           S  L   D     GR
Sbjct: 710 ILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK---------ESETLPAFDDYAMRGR 760

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           TY+++ G  LYPFG+GLSYTQF Y+ L   +T                            
Sbjct: 761 TYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTT--------------------------- 793

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
               +  D      V  +N G   G +VV +Y  P A       K++ GFQR+ ++ G  
Sbjct: 794 ----IATDGSLAATVTVKNTGQRAGDEVVQLYLHPLAPQRERAGKELHGFQRIALQPGEQ 849

Query: 745 KRIKFVFNACKSLNIVD 761
           + + F  NA  +L + D
Sbjct: 850 RELGFTINAKDALRLYD 866


>gi|116621778|ref|YP_823934.1| glycoside hydrolase family 3 protein [Candidatus Solibacter
           usitatus Ellin6076]
 gi|116224940|gb|ABJ83649.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
           usitatus Ellin6076]
          Length = 850

 Score =  313 bits (802), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 183/464 (39%), Positives = 264/464 (56%), Gaps = 42/464 (9%)

Query: 32  SPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL 91
           S +F+      + +G   S   F D  L    R  DLV+RMTLDEKV Q+ + A  +PRL
Sbjct: 4   SGIFLALAASPALIGQTTSQLPFMDPDLSAERRAADLVARMTLDEKVLQMQNSAPAIPRL 63

Query: 92  GLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEA 151
           G+P Y+WW+EALHGV+  G           AT FP  I   A+++ +L  +I + +STEA
Sbjct: 64  GIPAYDWWNEALHGVARAG----------LATVFPQAIGLAATWDATLMHRIAETISTEA 113

Query: 152 RAMYNLG--------RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
           RA YN            GLT+WSPNIN+ RDPRWGR  ET GEDPF+  R AV +++G+Q
Sbjct: 114 RAKYNEAIRNDDHSRYRGLTFWSPNINIFRDPRWGRGQETYGEDPFLTSRMAVAFIKGMQ 173

Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
             + H           KV +  KHYA   V +     R+ FD + + +D+ +T+L  F  
Sbjct: 174 GEDPHY---------YKVIATAKHYA---VHSGPESSRHQFDVKPSPRDLADTYLPAFRA 221

Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
            + E  A S+MC+YNRV+GIP+CA   LL + +RGEW   G++V+DC ++  +   H + 
Sbjct: 222 SIVEARADSLMCAYNRVDGIPACASTDLLEKRLRGEWGFQGFVVSDCGAVSDIFRGHHYQ 281

Query: 324 ADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
            D+   A A  +KAG DL CG  Y     +AV+ G + E +I++SL+ L+    +LG FD
Sbjct: 282 PDAAS-ASAVAVKAGTDLTCGNEYRALV-DAVKTGLITEPEINRSLERLFVARFKLGMFD 339

Query: 384 GSPQYVSLGK---QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
             P+ V        ++ S  + ++A EAAR+ IVLLKND  TLPL S+ +K +AV+GP A
Sbjct: 340 -PPERVPFSNIPYSEVDSAGHRKIALEAARKSIVLLKND-GTLPLKSS-IKKIAVIGPAA 396

Query: 441 NATVAMIGNYAGIPCRYMSPIAG----FSGYANVTYKTGCDDVA 480
           +   A++GNY G     ++P+AG    ++G A V Y  G +  A
Sbjct: 397 DDAEALLGNYNGFSSLQVTPLAGIEHQWAGKAEVRYALGANYTA 440



 Score =  114 bits (284), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 90/274 (32%), Positives = 131/274 (47%), Gaps = 60/274 (21%)

Query: 487 IFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEV 536
           + AA EA   AD T+   GL+ S+E E +          DR +L LP  Q +LI   A +
Sbjct: 594 LAAAIEAVSNADVTLAFVGLNPSLEGEEMPVSVPGFQGGDRTNLELPEPQEKLIE--AAI 651

Query: 537 AKG-PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPI 595
           A G PV++V+ S   V + FA  + +  A+L   Y GEE G AIAD + G  NP GRLP+
Sbjct: 652 ATGKPVVVVLASGSAVAMNFAAQHAS--ALLETWYNGEETGTAIADTLAGINNPSGRLPV 709

Query: 596 TWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTK 655
           T+Y    V  LP       P +     GRTY+++NG  LY FG+GLSY++F+Y       
Sbjct: 710 TFYRS--VDQLP-------PFEEYAMKGRTYRYFNGDALYSFGFGLSYSKFQY------- 753

Query: 656 TIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIV 715
                 + L+  R  + T  AS+ R                      N  S +G +VV +
Sbjct: 754 ------SALKTRRAGSGTIVASRVR----------------------NASSIEGDEVVQL 785

Query: 716 YSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
           Y    +      I+ + GFQR+ +R G ++ + F
Sbjct: 786 YVN-GSGADGDPIRSLRGFQRIHLRPGESREVHF 818


>gi|289668505|ref|ZP_06489580.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
           NCPPB 4381]
          Length = 902

 Score =  313 bits (801), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 182/446 (40%), Positives = 255/446 (57%), Gaps = 33/446 (7%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+   +  R  DLVSRMTL+EK  Q+ + A  +PRLG+  Y+WW+EALHGV+  G   
Sbjct: 35  YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG--- 91

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL--------GRAGLTYW 165
                  GAT FP  I   A+F+  L  ++  A+S EARA ++            GLT+W
Sbjct: 92  -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTFW 144

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH--ENATDLNSRPLKVSS 223
           SPNIN+ RDPRWGR  ET GEDPF+  R  V +VRGLQ   G   +NA   + R  K+ +
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYR--KLDA 202

Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
             KH+A   V +    DR+HFDAR +++D+ ET+L  FE  VK+G   +VM +YNRV G 
Sbjct: 203 TAKHFA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGE 259

Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
            + A   LL   +R +W   GY+V+DC +I  +  +HK +A ++E A A  +K G +L+C
Sbjct: 260 SASASKFLLQDVLRQQWGFKGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGTELEC 318

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDEN 401
           G+ Y+     AV QG ++E  ID SL+ L T  MRLG FD  G   +  +      S  +
Sbjct: 319 GEEYSTLPA-AVHQGLIEEAQIDTSLQTLMTARMRLGMFDPPGQLPWSKIPASVNQSPAH 377

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
             LA   ARE +VLLKND   LPL+  K+K +AV+GP A+ T+A++GNY G P   ++ +
Sbjct: 378 DALARRTARESLVLLKND-GLLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVL 436

Query: 462 AGFSGY---ANVTYKTGCDDVACKSN 484
            G       A V Y  G D V  + +
Sbjct: 437 QGIRAAAPNAQVLYARGADLVEGRDD 462



 Score =  139 bits (350), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 91/287 (31%), Positives = 132/287 (45%), Gaps = 53/287 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A + A++A+  + + GL   VE E +          DR DL LP  Q +L+  +    K 
Sbjct: 629 ALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQATGK- 687

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV+ V+ +   + I +A+ +  + AIL A YPG+ GG A+AD +FG  NPGGRLP+T+Y 
Sbjct: 688 PVVAVLTAGSALAIDWAQQH--VPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
                     S  L   D     GRTY+++ G  LYPFG+GLSYTQF Y+ L   +    
Sbjct: 746 ---------ESEALPAFDDYAMHGRTYRYFGGTPLYPFGHGLSYTQFAYSDLRLDR---- 792

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                      N +  D  F   V  +N G   G +V  +Y  P
Sbjct: 793 ---------------------------NTVAADGSFTATVTVKNTGQRAGDEVAQLYLHP 825

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT 766
                    K++ GFQRV +  G  + ++F  NA ++L I D    T
Sbjct: 826 LTPQRERAGKELRGFQRVALHPGEQRELRFPINAKEALRIYDEQRKT 872


>gi|289666226|ref|ZP_06487807.1| beta-glucosidase precursor [Xanthomonas campestris pv. vasculorum
           NCPPB 702]
          Length = 902

 Score =  313 bits (801), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 182/446 (40%), Positives = 255/446 (57%), Gaps = 33/446 (7%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+   +  R  DLVSRMTL+EK  Q+ + A  +PRLG+  Y+WW+EALHGV+  G   
Sbjct: 35  YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVAAYDWWNEALHGVARAG--- 91

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL--------GRAGLTYW 165
                  GAT FP  I   A+F+  L  ++  A+S EARA ++            GLT+W
Sbjct: 92  -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHERYQGLTFW 144

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH--ENATDLNSRPLKVSS 223
           SPNIN+ RDPRWGR  ET GEDPF+  R  V +VRGLQ   G   +NA   + R  K+ +
Sbjct: 145 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVRGLQGEGGDAPKNAQGESYR--KLDA 202

Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
             KH+A   V +    DR+HFDAR +++D+ ET+L  FE  VK+G   +VM +YNRV G 
Sbjct: 203 TAKHFA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGE 259

Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
            + A   LL   +R +W   GY+V+DC +I  +  +HK +A ++E A A  +K G +L+C
Sbjct: 260 SASASKFLLQDLLRQQWGFKGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGTELEC 318

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDEN 401
           G+ Y+     AV QG ++E  ID SL+ L T  MRLG FD  G   +  +      S  +
Sbjct: 319 GEEYSTLPA-AVHQGLIEEAQIDTSLQTLMTARMRLGMFDPPGQLPWSKIPASVNQSPAH 377

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
             LA   ARE +VLLKND   LPL+  K+K +AV+GP A+ T+A++GNY G P   ++ +
Sbjct: 378 DALARRTARESLVLLKND-GLLPLSRTKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVL 436

Query: 462 AGFSGY---ANVTYKTGCDDVACKSN 484
            G       A V Y  G D V  + +
Sbjct: 437 QGIRAAAPNAQVLYARGADLVEGRDD 462



 Score =  138 bits (347), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 91/287 (31%), Positives = 131/287 (45%), Gaps = 53/287 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A + A++A+  + + GL   VE E +          DR DL LP  Q +L+  +    K 
Sbjct: 629 ALDVARSAEVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQATGK- 687

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV+ V+ +   + I +A+ +  + AIL A YPG+ GG A+AD +FG  NPGGRLP+T+Y 
Sbjct: 688 PVVAVLTAGSALAIDWAQQH--VPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 745

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
                     S  L   D     GRTY+++ G  LYPFG+GLSYTQF Y+ L   +    
Sbjct: 746 ---------ESEALPAFDDYAMHGRTYRYFGGTPLYPFGHGLSYTQFAYSDLRLDR---- 792

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                      N +  D  F   V  +N G   G +V  +Y  P
Sbjct: 793 ---------------------------NTVAADGSFTATVTVKNTGQRAGDEVAQLYLHP 825

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT 766
                    K++ GFQRV +  G  + + F  NA ++L I D    T
Sbjct: 826 LTPQRERAGKELRGFQRVALHPGEQRELSFPINAKEALRIYDEQRKT 872


>gi|424796589|ref|ZP_18222299.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
 gi|422794891|gb|EKU23686.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
          Length = 913

 Score =  311 bits (796), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 179/447 (40%), Positives = 257/447 (57%), Gaps = 35/447 (7%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+   +  R  DLVSRMTL+EK  Q+ + A  +PRLG+P Y+WW+EALHGV+  G   
Sbjct: 37  YLDTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG--------RAGLTYW 165
                  GAT FP  I   A+F+  L  ++  A+S EARA ++            GLT+W
Sbjct: 94  -------GATVFPQAIGMAATFDLPLMHEVSTAISDEARAKHHEALRHDQHARYQGLTFW 146

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD--VEGHENATDLNSRPLKVSS 223
           SPNIN+ RDPRWGR  ET GEDPF+  R  V +V+GLQ    +  +NA     R  K+ +
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGADAPKNAQGDAYR--KLDA 204

Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
             KH+A   V +    DR+HFDA  +++D+ ET+L  FE  VKEG   +VM +YNRV G 
Sbjct: 205 TAKHFA---VHSGPEADRHHFDAHPSQRDLYETYLPAFEALVKEGKVDAVMGAYNRVYGE 261

Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
            + A   LL   +R  W   GY+V+DC +I  +  NHK +A ++E A A  +  G +L+C
Sbjct: 262 SASASKFLLRDVLRDTWGFDGYVVSDCWAIVDIWKNHKIVA-TREQAAALAVNNGTELEC 320

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE 403
           G+ Y+     AV++G + E D+DK+L+ L    MRLG FD  P  +   +  + ++++ E
Sbjct: 321 GEEYSTLPA-AVRKGLISEADVDKALQKLMYSRMRLGMFD-PPDTLRWAQIPLSANQSPE 378

Query: 404 ---LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
              LA   ARE +VLLKND   LPL+  K+K +AV+GP A+ T+A++GNY G P   ++ 
Sbjct: 379 HDALARRTARESLVLLKND-GVLPLSRGKIKRIAVIGPTADDTMALLGNYYGTPAAPVTV 437

Query: 461 IAGFSGY---ANVTYKTGCDDVACKSN 484
           + G       A V Y  G D V  + +
Sbjct: 438 LQGIREAAPDAEVLYARGADLVEGRDD 464



 Score =  138 bits (348), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 96/300 (32%), Positives = 141/300 (47%), Gaps = 54/300 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A +AA+ AD  + + GL   VE E +          DR DL LP  Q +L+  +    K 
Sbjct: 631 ALDAARRADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRELLEALQGTGK- 689

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV+ V+ +   + I +A+ +  + AIL A YPG+ GG A+ADV+FG  NPGGRLP+T+Y 
Sbjct: 690 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGSAVADVLFGDANPGGRLPVTFYK 747

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
                     S  L   D     GRTY+++ G  LYPFG+GLSYTQF Y+ L        
Sbjct: 748 ---------ESEKLPAFDDYAMRGRTYRYFAGTALYPFGHGLSYTQFAYSDLRL------ 792

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                          D SK    G L   L+           +N G   G +VV +Y  P
Sbjct: 793 ---------------DRSKLATDGSLHATLKV----------KNTGQRAGDEVVQLYLHP 827

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT-LLPAGEHTIFVG 778
            +       K++ GFQR+ ++ G  + + F  +    L + D A    ++  G++ + VG
Sbjct: 828 LSPQRERARKELRGFQRIALQPGETREVSFAISPQTDLRLYDEARKAYVVDPGDYELQVG 887


>gi|390340546|ref|XP_001186857.2| PREDICTED: probable beta-D-xylosidase 2-like [Strongylocentrotus
           purpuratus]
          Length = 623

 Score =  311 bits (796), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 210/610 (34%), Positives = 321/610 (52%), Gaps = 61/610 (10%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF-------AHGVPRLGLPQYEWWS 100
           Q S   F + SLP+  R+ DL+SR+ +D+   QL          A  + RL + +Y W +
Sbjct: 26  QKSQLPFWNQSLPWDQRLDDLLSRLKVDDMTYQLARGGADPNGPAPAIGRLQIGKYVWNT 85

Query: 101 EALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL--- 157
           E L G +  G           AT+FP  +  +A+F+  L  ++  A   E RA YN    
Sbjct: 86  ECLRGDAQAG----------NATAFPQALGLSAAFSRDLLFEVANATGYEVRAKYNYYLQ 135

Query: 158 -----GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
                   GL  +SP IN+ R P WGR  ET GEDP++ G  A ++V GLQ         
Sbjct: 136 KGDFNNHQGLNCFSPVINIMRHPYWGRNQETYGEDPYLTGELAKSFVWGLQGN------- 188

Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
             + R L  ++ CKH+AAY         R+ FDA+V+++D++ TF   F+ C+K G   S
Sbjct: 189 --HPRYLLTNAGCKHFAAYSGPENYPSSRFSFDAKVSDKDLQVTFFPAFKECIKAG-TYS 245

Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
           VMCSYN VNGIP+CA+  LLN  +R EW   GY+V+D  ++++    H +   S  D   
Sbjct: 246 VMCSYNSVNGIPACANSYLLNDVLRTEWGFKGYVVSDQRALELEELAHNY-TTSYLDTAI 304

Query: 333 QTLKAGLDLDCGQYYT---NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
           ++LKAG +LD G       ++   AV+ G +   D+  S+  L+   +RLG FD      
Sbjct: 305 KSLKAGCNLDLGTTKPAVYDYLAEAVELGMLTAQDLRDSIAPLFYTRLRLGEFDPPDHNP 364

Query: 388 YVSLG-KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
           YV L   Q + S E+ E+A +AA +  VL+KND +TLP+    + T+AVVGP AN +  +
Sbjct: 365 YVKLNVDQVVESPEHQEIALKAALKSFVLVKNDGSTLPI-EGTIHTLAVVGPFANNSKLL 423

Query: 447 IGNYAGIP-CRYMSPI-AGFSGYANVT-YKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
            G+YA  P  R+++ +  G S  A  T + +GC    C + +       A   AD  ++ 
Sbjct: 424 FGDYAPNPDPRFVTTVLEGLSPMATKTRHASGCPSPKCVTYDQQ-GVLNAVTGADVVVVC 482

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG-PVILVIMSAGGVDIAFAETNTNI 562
            G  + +E+E  DR D+ LPG Q QL+   A  A G PVIL++ +AG ++I +A ++ ++
Sbjct: 483 LGTGIELESEGNDRRDMLLPGKQEQLLQDAARYAAGKPVILLLFNAGPLNITWALSSPSV 542

Query: 563 KAILWAGYPGEEGGRAIADVVFGK---FNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
           +AI+   +P +  G A+  ++F      NPGGRLP TW         P T   + P+++ 
Sbjct: 543 QAIVECFFPAQATGVAL-RMMFQNAPGANPGGRLPSTW---------PATVAQIPPMENY 592

Query: 620 GYPGRTYKFY 629
              GRTY+++
Sbjct: 593 SMDGRTYRYF 602


>gi|389794400|ref|ZP_10197553.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
 gi|388432423|gb|EIL89432.1| beta-glucosidase-related glycosidase [Rhodanobacter fulvus Jip2]
          Length = 902

 Score =  308 bits (790), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 188/498 (37%), Positives = 272/498 (54%), Gaps = 54/498 (10%)

Query: 9   LCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDL 68
           +   +++ L VF ++A  A+  ++P    +P             ++ D S  +  R  DL
Sbjct: 17  VALGMALVLPVFPSHAEGAD--AAPSAASEP-------------VYRDLSRSFHDRAADL 61

Query: 69  VSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTV 128
           V+ MTL+EK  Q+ + A  +PRLG+  Y+WW+E LHGV+  G           AT FP  
Sbjct: 62  VAHMTLEEKAAQMQNTAPAIPRLGVAAYDWWNEGLHGVARAGQ----------ATVFPQA 111

Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYN-------LGR-AGLTYWSPNINVARDPRWGRI 180
           I   A+F+  L  ++  A+S EARA YN        GR  GLTYWSPNIN+ RDPRWGR 
Sbjct: 112 IGLAATFDVPLMHEVATAISDEARAKYNEFQRKGSHGRYEGLTYWSPNINIFRDPRWGRG 171

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            ET GEDP++  R  V +V GLQ           N    K+ +  KH+A   V +    D
Sbjct: 172 QETYGEDPYLTERMGVAFVTGLQGD---------NPTYRKLDATAKHFA---VHSGPEAD 219

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
           R+HFD   +E+D+ ET+L  F+  V+E D  +VM +YNRVNG P+   P+LL Q +R +W
Sbjct: 220 RHHFDVHPSERDLYETYLPAFQTLVQEADVDAVMSAYNRVNGEPATGSPRLLGQILRKDW 279

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
              GY+V+DC +++ +  +HK + D+ E A A  +K G+DLDCG  Y      AV  G +
Sbjct: 280 GFKGYVVSDCGAVEDIYKHHKVV-DTVEAASALAVKNGVDLDCGTEYAALV-KAVHDGLI 337

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLLKN 418
           KE++ID +L  L    MRLG FD + +  +  +      S ++  LA  AARE +VLLKN
Sbjct: 338 KESEIDAALTRLMQARMRLGMFDPASKVPWSDVPYSVNQSPQHDALARRAARESMVLLKN 397

Query: 419 DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF---SGYANVTYKTG 475
           D   LPL S  +K +AV+GP A+  +A++GNY G P   ++ + G    +  A V Y  G
Sbjct: 398 D-GVLPL-SKDIKHIAVIGPTADDVMALVGNYHGTPADPVTILRGIREAAPQAKVVYARG 455

Query: 476 CDDVACKSNNSIFAASEA 493
            D V  +S+ +     EA
Sbjct: 456 VDLVEGRSDPTGMPLVEA 473



 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 89/289 (30%), Positives = 129/289 (44%), Gaps = 54/289 (18%)

Query: 501 IILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
           +   GL   VE E +          DR DL LP  Q +L+  +    K PV+LV+ S   
Sbjct: 642 VFAGGLTSDVEGEEMKVNYPGFAGGDRTDLRLPATQRKLLEALQATGK-PVVLVLTSGSA 700

Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
           + + +A  N ++ A+L A YPG+ GG A+ADV+FGK +P GRLP+T+Y           S
Sbjct: 701 LAVDWA--NQHLPAVLLAWYPGQRGGNAVADVLFGKADPAGRLPVTFYK---------AS 749

Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
             L   D     GRTY+++ G  LYPFGYGLSYT+F Y  L           KL H    
Sbjct: 750 EKLPAFDDYRMDGRTYRYFKGEPLYPFGYGLSYTKFTYADL-----------KLDH---- 794

Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
                           N +  +D     V   N G   G +VV +Y +          K 
Sbjct: 795 ----------------NKIGKNDKLHVTVKVHNAGKRAGDEVVQLYLRGVGTPHERSNKD 838

Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVD-YAANTLLPAGEHTIFVG 778
           + G QR+ ++ G+ + + F  +    L   D   A   + AG + + +G
Sbjct: 839 LRGIQRITLQPGQTRDVSFDVSPATDLRYYDTKKAAYAVDAGRYEVQIG 887


>gi|325916103|ref|ZP_08178390.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325537647|gb|EGD09356.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 896

 Score =  308 bits (788), Expect = 9e-81,   Method: Compositional matrix adjust.
 Identities = 185/454 (40%), Positives = 255/454 (56%), Gaps = 42/454 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+ LP+  R  DLVSRMTL+EK  Q+ + A  +PRL +P Y+WW+EALHGV+  G   
Sbjct: 40  YLDTQLPFETRAADLVSRMTLEEKAAQMQNAAPAIPRLRVPAYDWWNEALHGVARAG--- 96

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGR------AGLTYW 165
                  GAT FP  I   A+F+  L  ++  A+S EARA ++  L R       GLT+W
Sbjct: 97  -------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARDEHKRYQGLTFW 149

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPNIN+ RDPRWGR  ET GEDPF+  R  V +V+GLQ  +G            K+ +  
Sbjct: 150 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYR---------KLDATA 200

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA   V +    DR+HFD   +E+D+ ET+L  F+  V+EG  ++VM +YNRVNG  +
Sbjct: 201 KHYA---VHSGPEADRHHFDVHPSERDLHETYLPAFQALVQEGHVAAVMGAYNRVNGESA 257

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
            A  + L   +R +W   GYIV+DC +I+ +  NHK +  + E A A  +K G DLDCG 
Sbjct: 258 SASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHGTDLDCGD 315

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC---SDENI 402
            Y      AV+ G + E  ID SLK L T  MRLG FD  P  V+  +       S ++ 
Sbjct: 316 TYAALP-KAVRAGLIDEATIDTSLKRLMTTRMRLGMFD-PPAKVAWAQIPASVNQSPQHD 373

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLLKND   LPL    +K +AVVGP A+  ++++GNY G P   ++ + 
Sbjct: 374 ALARRTARESLVLLKND-GLLPLKPT-LKRIAVVGPTADDPMSLLGNYYGTPAAPVTILQ 431

Query: 463 GF---SGYANVTYKTGCDDVACKSNNSIFAASEA 493
           G    +  A V Y  G D V  + + +  A  +A
Sbjct: 432 GIRDAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465



 Score =  134 bits (336), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 88/287 (30%), Positives = 131/287 (45%), Gaps = 53/287 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLD----------REDLWLPGYQTQLINQVAEVAKG 539
           A +AA+ A+  + + GL   VE E +D          R D  LP  Q +L+ Q  +    
Sbjct: 623 AVDAARNAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELL-QALQATGT 681

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV+ V+ +   + + +A+ +  + AIL A YPG+ GG A+ DV+FG+ +PGGRLPIT+Y 
Sbjct: 682 PVVAVLTTGSALAVDWAQQH--VPAILLAWYPGQRGGSAVGDVLFGQASPGGRLPITFYK 739

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
               + LP         D     GRTY+++ G  LYPFG+GLSYTQF Y+ L   +T   
Sbjct: 740 --EAERLPA-------FDDYAMRGRTYRYFTGTALYPFGHGLSYTQFAYSDLRLDRTT-- 788

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                        L  D      +  +N G   G +VV +Y  P
Sbjct: 789 -----------------------------LGADGTLRATLKVRNTGKRAGDEVVQLYLHP 819

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT 766
                    K++ GFQR+ ++ G  + + F   A  +L I D    T
Sbjct: 820 LDPKRERAGKELRGFQRMTLQPGEQREVAFTLKAADALRIYDEQRKT 866


>gi|188574621|ref|YP_001911550.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
 gi|188519073|gb|ACD57018.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
          Length = 904

 Score =  307 bits (787), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 176/444 (39%), Positives = 249/444 (56%), Gaps = 29/444 (6%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           +  +   +  R  DLVSRMTL+EK  Q+ + A  +PRLG+P Y+WW+EALHGV+  G   
Sbjct: 37  YLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
                  GAT FP  I   A+F+  L  ++  A+S EARA ++            GLT+W
Sbjct: 94  -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHARYQGLTFW 146

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPNIN+ RDPRWGR  ET GEDPF+  R  V +V+GLQ             R  K+ +  
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRKLDATA 206

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A   V +    DR+HFDAR +++D+ ET+L  FE  VK+G   +VM +YNRV G  +
Sbjct: 207 KHFA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESA 263

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
            A   LL   +R +W   GY+V+DC +I  +  +HK +A ++E A A  +  G +L+CG+
Sbjct: 264 SASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVA-TREQAAALAVTHGTELECGE 322

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIE 403
            Y+     AV QG + E  ID +L+ L T  MRLG FD  G   +  +      S  +  
Sbjct: 323 EYSTLPA-AVHQGLIDEAQIDTALQTLMTARMRLGMFDPPGQLPWSKIPASVNQSPAHDA 381

Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
           LA   ARE +VLLKND   LPL+ A +K +AV+GP A+ T+A++GNY G P   ++ + G
Sbjct: 382 LARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQG 440

Query: 464 FSGY---ANVTYKTGCDDVACKSN 484
                  A V Y  G D V  +++
Sbjct: 441 IRAAAPNAQVLYARGADLVEGRND 464



 Score =  139 bits (351), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 97/317 (30%), Positives = 143/317 (45%), Gaps = 60/317 (18%)

Query: 462 AGFSGYANVTYKTGCDDVACK-------SNNSIFAASEAAKTADATIILAGLDLSVEAES 514
           AG S    + Y  G  D A +       +   +  A + A++AD  + + GL   VE E 
Sbjct: 596 AGRSYDLRLDYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEGEE 655

Query: 515 L----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           +          DR DL LP  Q +L+  +    K PV+ V+ +   + I +A+ +  + A
Sbjct: 656 MKVSYPGFAGGDRTDLRLPKPQRELLEALQATGK-PVVAVLTAGSALAIDWAQQH--VPA 712

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           IL A YPG+ GG A+AD +FG  NPGGRLP+T+Y           S  L   D     GR
Sbjct: 713 ILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK---------ESETLPAFDDYAMHGR 763

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           TY+++ G  LYPFG+GLSYTQF Y+ L   ++                            
Sbjct: 764 TYRYFGGTPLYPFGHGLSYTQFAYSDLRLDRST--------------------------- 796

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
               L  D      V  +N G   G +VV +Y  P         K++ GFQR+ ++ G+ 
Sbjct: 797 ----LTADGALTATVAVKNTGQRAGDEVVQLYLHPLKPQRERAGKELRGFQRLALQPGQQ 852

Query: 745 KRIKFVFNACKSLNIVD 761
           + ++F  NA  +L I D
Sbjct: 853 RELRFTINAKDALRIYD 869


>gi|58584046|ref|YP_203062.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|84625823|ref|YP_453195.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|58428640|gb|AAW77677.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|84369763|dbj|BAE70921.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
          Length = 904

 Score =  307 bits (787), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 176/444 (39%), Positives = 249/444 (56%), Gaps = 29/444 (6%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           +  +   +  R  DLVSRMTL+EK  Q+ + A  +PRLG+P Y+WW+EALHGV+  G   
Sbjct: 37  YLQTQRSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG--- 93

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
                  GAT FP  I   A+F+  L  ++  A+S EARA ++            GLT+W
Sbjct: 94  -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHRFLRQHQHARYQGLTFW 146

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPNIN+ RDPRWGR  ET GEDPF+  R  V +V+GLQ             R  K+ +  
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQGEGSDAPKNAQGERYRKLDATA 206

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A   V +    DR+HFDAR +++D+ ET+L  FE  VK+G   +VM +YNRV G  +
Sbjct: 207 KHFA---VHSGPEADRHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESA 263

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
            A   LL   +R +W   GY+V+DC +I  +  +HK +A ++E A A  +  G +L+CG+
Sbjct: 264 SASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVA-TREQAAALAVTHGTELECGE 322

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIE 403
            Y+     AV QG + E  ID +L+ L T  MRLG FD  G   +  +      S  +  
Sbjct: 323 EYSTLPA-AVHQGLIDEAQIDTALQTLMTARMRLGMFDPPGQLPWSKIPASVNQSPAHDA 381

Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
           LA   ARE +VLLKND   LPL+ A +K +AV+GP A+ T+A++GNY G P   ++ + G
Sbjct: 382 LARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQG 440

Query: 464 FSGY---ANVTYKTGCDDVACKSN 484
                  A V Y  G D V  +++
Sbjct: 441 IRAAAPNAQVLYARGADLVEGRND 464



 Score =  139 bits (349), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 96/317 (30%), Positives = 143/317 (45%), Gaps = 60/317 (18%)

Query: 462 AGFSGYANVTYKTGCDDVACK-------SNNSIFAASEAAKTADATIILAGLDLSVEAES 514
           AG S    + Y  G  D A +       +   +  A + A++AD  + + GL   VE E 
Sbjct: 596 AGRSYDLRLDYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEGEE 655

Query: 515 L----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           +          DR DL LP  Q +L+  +    K PV+ V+ +   + + +A+ +  + A
Sbjct: 656 MKVSYPGFAGGDRTDLRLPKPQRELLEALQATGK-PVVAVLTAGSALAVDWAQQH--VPA 712

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           IL A YPG+ GG A+AD +FG  NPGGRLP+T+Y           S  L   D     GR
Sbjct: 713 ILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK---------ESETLPAFDDYAMHGR 763

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           TY+++ G  LYPFG+GLSYTQF Y+ L   ++                            
Sbjct: 764 TYRYFGGTPLYPFGHGLSYTQFAYSDLRLDRST--------------------------- 796

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
               L  D      V  +N G   G +VV +Y  P         K++ GFQR+ ++ G+ 
Sbjct: 797 ----LTADGALTATVAVKNTGQRAGDEVVQLYLHPLKPQRERAGKELRGFQRLALQPGQQ 852

Query: 745 KRIKFVFNACKSLNIVD 761
           + ++F  NA  +L I D
Sbjct: 853 RELRFTINAKDALRIYD 869


>gi|397642422|gb|EJK75223.1| hypothetical protein THAOC_03061, partial [Thalassiosira oceanica]
          Length = 534

 Score =  307 bits (786), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 198/599 (33%), Positives = 313/599 (52%), Gaps = 101/599 (16%)

Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCV---------- 265
           RP ++++ CKH AAY ++     DR++F A  +   D E T+L  F+ CV          
Sbjct: 7   RP-RIAATCKHLAAYSLE----TDRFNFSADGIDRTDWEGTYLPAFDACVHAERFLLEHY 61

Query: 266 ---------KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM 316
                    ++  A  VMCSYN ++G+P+CADP LL   +R +W+  G +V+DC ++  +
Sbjct: 62  NASGGGGGGQDRGALGVMCSYNAIDGVPACADPALLKDMLRRDWNFTGLVVSDCWAVDNI 121

Query: 317 VDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVL 376
             NH+F+A S E+AV   L++G+DLDCG  + +F   A  +  + E DID++L  L+ VL
Sbjct: 122 HSNHRFVA-SYEEAVGLALRSGVDLDCGNTFQDFGRLAYDESLLDEDDIDEALSRLFRVL 180

Query: 377 MRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNT-----LPLNSAKVK 431
           M LG+FD + +  +    D    E+ +LA EAA + IVLLKN  N      LPL+ AK K
Sbjct: 181 MDLGYFDETDEPDAKSSDDEM--EHDQLALEAALQSIVLLKNGINEDEPGPLPLSLAKHK 238

Query: 432 TVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAAS 491
            +A+ GP A+    ++GNY G+P   ++P+ G +       K G + VA +   S+    
Sbjct: 239 EIALFGPLADNQTVLLGNYHGLPSTIVTPLMGLA-------KMGVE-VAFRQRASVCDFH 290

Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG---PVILVIMSA 548
             +    ATI++ GLD S+EAE  DR  L LP  Q  LI  ++  +K    PV+LV++S 
Sbjct: 291 GES----ATILVVGLDQSLEAEDQDRTTLLLPVEQRDLIKTISRCSKVRDLPVVLVVVSG 346

Query: 549 GGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL 608
           G VD++  + +++I A++   YPG+ GG A+A V++G +NP G+L  T Y   Y+  + L
Sbjct: 347 GMVDLSRYKNSSDIDAMIHMSYPGQNGGSALAQVLYGAYNPSGKLVGTMYPESYLNEVSL 406

Query: 609 TSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
             M +RP     +PGRT+++Y G  +YPFGYGLSYT F+Y +     T++V ++      
Sbjct: 407 HDMRMRPDGK--FPGRTHRYYRGDVIYPFGYGLSYTSFRYAMEFLGGTVKVTVS------ 458

Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGS-DVVIVYSKPPAEIAATY 727
                                             N GS DGS  V++ +S P A      
Sbjct: 459 ----------------------------------NSGSMDGSVAVLLFHSAPQAGNEQEP 484

Query: 728 IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPI 786
            + +IGF++++V  G ++ +   F+  K +N  +        AG HT  + N  +   +
Sbjct: 485 FRSLIGFEKIYVSVGDSQLVS--FDVSKRMNPGE--------AGSHTFRIENESIDVEV 533


>gi|384421334|ref|YP_005630694.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
 gi|353464247|gb|AEQ98526.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 904

 Score =  307 bits (786), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 177/445 (39%), Positives = 253/445 (56%), Gaps = 31/445 (6%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D++  +  R  DLVSRMTL+EK  Q+ + A  +PRL +P Y+WW+EALHGV+  G   
Sbjct: 37  YLDTARSFEQRAADLVSRMTLEEKAAQMQNAAPAIPRLQVPAYDWWNEALHGVARAG--- 93

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
                  GAT FP  I   A+F+  L  ++  A+S EARA ++            GLT+W
Sbjct: 94  -------GATVFPQAIGMAATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFW 146

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL-KVSSC 224
           SPNIN+ RDPRWGR  ET GEDPF+  R  V +V+GLQ  EG     +    P  K+ + 
Sbjct: 147 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQG-EGAAAPKNAQGEPYRKLDAT 205

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
            KH+A   V +    +R+HFDAR +++D+ ET+L  FE  VK+G   +VM +YNRV G  
Sbjct: 206 AKHFA---VHSGPEAERHHFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGES 262

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
           + A   LL   +R +W   GY+V+DC +I  +  +HK +A ++E A A  +  G +L+CG
Sbjct: 263 ASASKFLLQDVLRQQWGFKGYVVSDCWAIVDVWKHHKIVA-TREQAAALAVTHGTELECG 321

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENI 402
           + Y+     AV QG + E  ID +L+ L T  MRLG FD  G   +  +      S  + 
Sbjct: 322 EEYSTLPA-AVHQGLIDEAQIDTALQTLMTARMRLGMFDPPGQLPWSKIPASVNQSPAHD 380

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLLKND   LPL+ A +K +AV+GP A+ T+A++GNY G P   ++ + 
Sbjct: 381 ALARRTARESLVLLKND-GLLPLSRATLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQ 439

Query: 463 GFSGY---ANVTYKTGCDDVACKSN 484
           G       A V Y  G D V  +++
Sbjct: 440 GIRAAAPNAQVLYARGADLVEGRND 464



 Score =  139 bits (350), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 97/317 (30%), Positives = 142/317 (44%), Gaps = 60/317 (18%)

Query: 462 AGFSGYANVTYKTGCDDVACK-------SNNSIFAASEAAKTADATIILAGLDLSVEAES 514
           AG S    + Y  G  D A +       +   +  A + A++AD  + + GL   VE E 
Sbjct: 596 AGRSYDLRLDYFEGERDAAVRLAWRQPGAKPPLQEALDVARSADVVVFVGGLTGDVEGEE 655

Query: 515 L----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           +          DR DL LP  Q +L+  +    K PV+ V+ +   + I +A+ +  + A
Sbjct: 656 MKVNYPGFAGGDRTDLRLPKPQRELLEALQATGK-PVVAVLTAGSALAIDWAQQH--VPA 712

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           IL A YPG+ GG A+AD +FG  NPGGRLP+T+Y           S  L   D     GR
Sbjct: 713 ILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK---------ESETLPAFDDYTMHGR 763

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           TY+++ G  LYPFG+GLSYTQF Y+ L   ++                            
Sbjct: 764 TYRYFGGTPLYPFGHGLSYTQFAYSDLRLDRST--------------------------- 796

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
               L  D      V  +N G   G +VV +Y  P         K++ GFQR+ ++ G  
Sbjct: 797 ----LTADGALTATVAVKNTGQRAGDEVVQLYLHPLKPQRERAGKELRGFQRLALQPGEQ 852

Query: 745 KRIKFVFNACKSLNIVD 761
           + ++F  NA  +L I D
Sbjct: 853 RELRFTINATDALRIYD 869


>gi|255572557|ref|XP_002527212.1| beta-glucosidase, putative [Ricinus communis]
 gi|223533388|gb|EEF35138.1| beta-glucosidase, putative [Ricinus communis]
          Length = 349

 Score =  305 bits (782), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 145/308 (47%), Positives = 198/308 (64%), Gaps = 20/308 (6%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
           +S+ FC+ SL    R   L+S +TL+EK++QL D A G+PR G+P YEWWSE+LHG++  
Sbjct: 38  NSYTFCNQSLSVPTRAHSLISLLTLEEKIKQLSDNASGIPRFGIPPYEWWSESLHGIAIN 97

Query: 110 GPGTHFD-DVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPN 168
           GPG  F    +  AT FP VI++ A+FN +LW  IG A++ EARAM+N+G++GLT+W+PN
Sbjct: 98  GPGVSFTIGPVSAATGFPQVIISAAAFNRTLWFLIGSAIAIEARAMHNVGQSGLTFWAPN 157

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ-----------------DVEGHENA 211
           +N+ RDPRWGR  ETPGEDP +   YA+ +V+G Q                   E     
Sbjct: 158 VNIFRDPRWGRGQETPGEDPMLTSAYAIEFVKGFQGGNWKSGVSGSGSGRYGFGEKRMLR 217

Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
            D     L +S+CCKH  AYD++ W    RY F+A VTEQD+E+T+  PF  C++EG AS
Sbjct: 218 DDDGDDGLMLSACCKHLTAYDLEKWGNFSRYSFNAVVTEQDLEDTYQPPFRSCIEEGKAS 277

Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
            +MCSYN VNG+P+CA   LL Q  R EW   GYIV+DCD++  + +   + + S EDAV
Sbjct: 278 CLMCSYNEVNGVPACAREDLL-QKAREEWGFEGYIVSDCDAVATIFEYQNY-SKSAEDAV 335

Query: 332 AQTLKAGL 339
           A  LKAG+
Sbjct: 336 AIALKAGM 343


>gi|371777036|ref|ZP_09483358.1| glycoside hydrolase [Anaerophaga sp. HS1]
          Length = 890

 Score =  303 bits (777), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 180/438 (41%), Positives = 247/438 (56%), Gaps = 41/438 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D +LP+  R  DLVS+MTL+EKV Q+   A  + RLG+P+Y WW+E LHGV   G   
Sbjct: 40  YLDPTLPFEERAADLVSKMTLEEKVSQMQHAAPAIERLGIPEYNWWNECLHGVGRAGI-- 97

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA-MYNLGR-------AGLTYW 165
                   AT FP  I   A +++    +I  AVS EARA  ++  R        GLT+W
Sbjct: 98  --------ATVFPQAIGMAAMWDDEEMYRIATAVSDEARAKHHDFARRGKRGIYQGLTFW 149

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN+ RDPRWGR  ET GEDPF+ G  AV+Y++GLQ  +          R LK+ +  
Sbjct: 150 TPNINIFRDPRWGRGMETYGEDPFLTGELAVDYIKGLQGDD---------DRYLKLVATS 200

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+    V +    DR+HFDAR + +D   T+   F+  ++E    SVMC+YNR NG+P 
Sbjct: 201 KHFL---VHSGPEPDRHHFDARTSARDSLMTYTPHFKKTIQEAGVYSVMCAYNRYNGLPC 257

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSI-QVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
           C   K +   +R EW   GYIV+DC ++       H  +  + E+A A  +KAG DL+CG
Sbjct: 258 CGS-KPVENLLRNEWGFKGYIVSDCWAVADFYKKGHHEVVPTVEEAAAMAVKAGTDLNCG 316

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDEN 401
             Y     +AV+QG V E +ID  +K L    +RLG FD  P+   Y ++    + S E+
Sbjct: 317 NSYPALV-DAVKQGLVSEEEIDVLVKRLMEARLRLGMFD-PPEMVPYTNIPYSVVDSKEH 374

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
            ELA  AAR+ +VLLKND NTLPL+   VK VAV+GP+AN    ++ NY G P   ++P+
Sbjct: 375 RELALIAARKSMVLLKNDNNTLPLDK-NVKNVAVIGPNANNLDVLLANYNGYPSNPVTPL 433

Query: 462 AGFSGY---ANVTYKTGC 476
            G       ANV Y  GC
Sbjct: 434 DGIRQKLPNANVQYALGC 451



 Score =  142 bits (357), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 94/299 (31%), Positives = 144/299 (48%), Gaps = 56/299 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A + A  +D  ++  GL  ++E E +          DR D+ LP  QT L+  +  + K 
Sbjct: 610 AIQIAAASDVVLMFMGLSPNLEGEEMPVNVPGFSGGDRVDIKLPQIQTDLVKAIMSLGK- 668

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV+LV+++   + I +   N  + AIL A YPG+ GG AIADV+FG +NP GRLP+T+Y 
Sbjct: 669 PVVLVLLNGSALAINWEAEN--VPAILEAWYPGQAGGTAIADVLFGDYNPAGRLPVTFYK 726

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
                   +T +P  P +     GRTY+++ G  L+PFGYGLSYT FKY+ L        
Sbjct: 727 S-------VTQLP--PFEDYSMDGRTYQYFKGEALFPFGYGLSYTSFKYDNL-------- 769

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                   V+ + L         VD  N G+ DG +VV +Y   
Sbjct: 770 ------------------------VVPDKLEAGKEVTVHVDVTNTGNRDGDEVVQLYVSH 805

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           P ++ +  I+ + GF R+ ++AG  K + F     + L +       ++PAG   + VG
Sbjct: 806 P-DVESAPIRSLQGFDRIALKAGETKTVSFTLKP-EQLAVYQPQNGLVVPAGNLKLSVG 862


>gi|374313710|ref|YP_005060140.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
 gi|358755720|gb|AEU39110.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
          Length = 883

 Score =  303 bits (777), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 169/424 (39%), Positives = 245/424 (57%), Gaps = 37/424 (8%)

Query: 43  SKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEA 102
           S  G +     + D++LP   R  DLV R+TLDEK  QL   A G+PRLG+P Y++WSE 
Sbjct: 26  SPAGTRTPLLPYQDTTLPAEQRAADLVGRLTLDEKAAQLVTSAPGIPRLGVPAYDFWSEG 85

Query: 103 LHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-- 160
           LHG++  G           AT FP  +   A+F+E L  +IG+ +STEARA YN   A  
Sbjct: 86  LHGIARSG----------YATLFPQAVGMAATFDEPLLHQIGEVISTEARAKYNDAVAHD 135

Query: 161 ------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
                 GLT WSPNIN+ RDPRWGR  ET GEDPF+  R    +V GLQ         D 
Sbjct: 136 LRSIFYGLTIWSPNINIFRDPRWGRGQETYGEDPFLTARLGTAFVEGLQ-------GDDP 188

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
           N    +     KH+A   V +    +R+ F+A  +  D+ +T+L  F   + EG A S+M
Sbjct: 189 NY--YRAIGTPKHFA---VHSGPESERHRFNADPSPHDLWDTYLPAFRATIVEGKAGSIM 243

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV--DNHKFLADSKEDAVA 332
           C+YN + G P+CA   LL++ +R +W   G++ +DC +I      D H +  D+ E A  
Sbjct: 244 CAYNAIEGKPACASDLLLDEVLRKDWAFKGFVTSDCGAIDNFFEKDGHHYSKDA-EQASV 302

Query: 333 QTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVS 390
             ++AG D +CG  Y N   +AV++G ++E+++D  L+ L+    +LG FD   Q  Y S
Sbjct: 303 DGIRAGTDTNCGGTYRNL-ASAVRKGMIQESELDVPLRRLFLARFKLGLFDPPSQVKYAS 361

Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
           +   +  S  + ELA +AARE +VLLKN+ +TLPL+ A+VKT+AV+GP+A++ +++ GNY
Sbjct: 362 MPITENMSSSHTELALQAAREAVVLLKNEHHTLPLD-ARVKTIAVIGPNASSLISLEGNY 420

Query: 451 AGIP 454
             IP
Sbjct: 421 NAIP 424



 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 90/301 (29%), Positives = 142/301 (47%), Gaps = 55/301 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLD----------REDLWLPGYQTQLINQVAEVAKG 539
           A EA K ADA +   GL   +E E +D          R DL LP  Q QL+ + A+ +  
Sbjct: 606 AMEAVKQADAVVAFVGLSPELEGEEMDVHIPGFSGGDRTDLVLPAAQQQLL-EAAKASGK 664

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           P+++V+++   + + +A+ + +  AIL A YPG+ G +AIA+ + GK NP GRLP+T+Y 
Sbjct: 665 PLVVVLLNGSALAVNWAQEHAD--AILEAWYPGQAGAQAIAETLSGKNNPSGRLPVTFYR 722

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
              V  LP       P        RTY+++ G  LY FGYGLSY+ F Y+          
Sbjct: 723 S--VNDLP-------PFTDYAMANRTYRYFKGKPLYEFGYGLSYSTFSYS---------- 763

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                         +  SK R        L   D    + D +N  +  G +V  +Y  P
Sbjct: 764 -------------NAHLSKER--------LDAGDTLRVEADVKNTSTLAGDEVAELYLTP 802

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           P +     ++ + GF+ V +  G++K + F  +  + L+ VD      + AG +++ VG 
Sbjct: 803 P-QNGVYPLRSLEGFEHVHLLPGQSKHVSFTLDP-RQLSEVDEKGIRAVRAGVYSVTVGG 860

Query: 780 G 780
           G
Sbjct: 861 G 861


>gi|325919363|ref|ZP_08181395.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
 gi|325550152|gb|EGD20974.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
          Length = 876

 Score =  303 bits (776), Expect = 2e-79,   Method: Compositional matrix adjust.
 Identities = 177/445 (39%), Positives = 252/445 (56%), Gaps = 42/445 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+  P+  R  DLV+RMTL+EK  Q+ + A  +PRL +P+Y+WW+EALHGV+  G   
Sbjct: 20  YLDTQRPFDARAADLVARMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG--- 76

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGRA------GLTYW 165
                  GAT FP  I   A+F+  L  ++  A+S EARA ++  L R       GLT+W
Sbjct: 77  -------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEYKRYQGLTFW 129

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPNIN+ RDPRWGR  ET GEDPF+  R  V +V+GLQ  +G            K+ +  
Sbjct: 130 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYR---------KLDATA 180

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A   V +    DR+HFD   +E+D+ ET+L  F+  V+EG  ++VM +YNRVNG  +
Sbjct: 181 KHFA---VHSGPEADRHHFDVHPSERDLHETYLPAFQALVQEGKVAAVMGAYNRVNGESA 237

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
            A  + L   +R +W   GYIV+DC +I+ +  NHK +  + E A A  +K G DLDCG 
Sbjct: 238 SASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHGTDLDCGD 295

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE-- 403
            Y      AV+ G + E  ID +LK L T  MRLG FD  P  V   +    ++++ +  
Sbjct: 296 TYAALPA-AVRAGLIDEATIDTALKRLMTTRMRLGMFD-PPAKVPWAQIPASANQSPQHD 353

Query: 404 -LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLLKND   LPL    +K +AV+GP A+  ++++GNY G P   ++ + 
Sbjct: 354 ALARRTARESLVLLKND-GVLPLKPT-LKRIAVIGPTADDPMSLLGNYYGTPAAPVTILQ 411

Query: 463 GF---SGYANVTYKTGCDDVACKSN 484
           G    +  A V Y  G D V  + +
Sbjct: 412 GIRDAAPQAQVIYARGSDLVEGRED 436



 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 86/282 (30%), Positives = 132/282 (46%), Gaps = 53/282 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLD----------REDLWLPGYQTQLINQVAEVAKG 539
           A +AA+ A+  + + GL   VE E +D          R D  LP  Q +L+ Q  +    
Sbjct: 603 AVDAARDAEVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELL-QALQATGT 661

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV+ V+ +   + I +A+ +  + AIL A YPG+ GG A+ DV+FG+ +PGGRLP+T+Y 
Sbjct: 662 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGSAVGDVLFGQASPGGRLPVTFYK 719

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
               + LP         D     GRTY+++ G  LYPFG+GLSYTQF Y+ L   +T   
Sbjct: 720 --EAERLPA-------FDDYAMRGRTYRYFQGKPLYPFGHGLSYTQFAYSDLRLDRTT-- 768

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                        +  D      V  +N G   G +VV +Y  P
Sbjct: 769 -----------------------------VAADGTLTATVTLKNTGQRAGDEVVQLYLHP 799

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
                   +K++ G QR+ ++ G  ++++F   A  +L I D
Sbjct: 800 LKPQRERALKELHGLQRITLQPGEQRQLRFTIKAQDALRIYD 841


>gi|229580225|ref|YP_002838625.1| glycoside hydrolase family protein [Sulfolobus islandicus
           Y.G.57.14]
 gi|229581131|ref|YP_002839530.1| glycoside hydrolase family protein [Sulfolobus islandicus
           Y.N.15.51]
 gi|228010941|gb|ACP46703.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           Y.G.57.14]
 gi|228011847|gb|ACP47608.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           Y.N.15.51]
          Length = 754

 Score =  302 bits (774), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 221/700 (31%), Positives = 347/700 (49%), Gaps = 120/700 (17%)

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
           +T+FP  I   +++N  L   I   + ++ R +      G+    SP ++V +DPRWGR 
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGRLV------GVNQCLSPVLDVCKDPRWGRC 154

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGV 239
            ET GEDP++V    + Y+ GLQ     +N         ++ +  KH+AA+   +  + +
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQG----DN---------QLVATAKHFAAHGFPEGGRNI 201

Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
            + H    V  +++ ETFL PFE+ VK G   S+M +Y+ ++GIP   +P+LL   +R E
Sbjct: 202 AQVH----VGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257

Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNA 354
           W   G +V+D D I+ +   H+ +A +K +A    L++G+D+     DC   Y     NA
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHR-VASNKMEAAILALESGVDIEFPTIDC---YGEPLVNA 313

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
           +++G V E+ ID++++ +  +  RLG  D      +   + +   ++ ELA + ARE IV
Sbjct: 314 LKEGLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIV 373

Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY---------AGIP-CRYMSPIAGF 464
           LLKN+ N LPL S  V  +AV+GP+AN    M+G+Y         +GI     +  I   
Sbjct: 374 LLKNENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKK 432

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL----AGLDLS----------- 509
            G + V Y  GCD +A +S      A E A+ AD  I +    +GL LS           
Sbjct: 433 VGESKVLYAKGCD-IASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSEEEFK 491

Query: 510 ----VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
               V  E  DR  L LPG Q +L+ ++ +  K P+ILV+++  G  +  +     +KA+
Sbjct: 492 KYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN--GRPLVLSSIINYVKAV 548

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---RPVDSLG 620
           + A +PGEEGG AIADV+FG +NPGGRLPIT+         P+ +  +PL   R   S  
Sbjct: 549 IEAWFPGEEGGNAIADVIFGDYNPGGRLPITF---------PMDTGQIPLYYNRKPSSF- 598

Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKT 679
              R Y       L+ FGYGLSYTQF+Y+ L  T K I  N N                 
Sbjct: 599 ---RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEIGPNSN----------------- 638

Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
                              +D +NVG  +G DVV +Y        A  +K++ GF ++ +
Sbjct: 639 ---------------IAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHL 683

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           + G  +R+KF+    ++L   D     ++  GE+ + +GN
Sbjct: 684 KPGEKRRVKFIL-PTEALAFYDSFMRLVVEKGEYQLLIGN 722


>gi|297736784|emb|CBI25985.3| unnamed protein product [Vitis vinifera]
          Length = 241

 Score =  302 bits (773), Expect = 5e-79,   Method: Compositional matrix adjust.
 Identities = 142/214 (66%), Positives = 168/214 (78%)

Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
           GEDPF V  YAV+YVRGLQDVEG EN TDLNSRPLKVSS  KH+AAYD+DNW  VDR HF
Sbjct: 9   GEDPFTVSVYAVSYVRGLQDVEGTENTTDLNSRPLKVSSSGKHFAAYDLDNWLNVDRNHF 68

Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
           +ARV+EQDM ETFLRPFE CV+EGD S VMCS+N +NGIP CADP+L   T+R EW+LHG
Sbjct: 69  NARVSEQDMAETFLRPFEACVREGDVSGVMCSFNNINGIPPCADPRLFKGTIRDEWNLHG 128

Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETD 364
           YIV+DC SI+ +V++ KFL  + E+AVA  LKAGLDL+CG YY +   +AV  G+V + D
Sbjct: 129 YIVSDCWSIETIVEDQKFLDVTGEEAVALNLKAGLDLECGHYYNDSPASAVMAGRVGQHD 188

Query: 365 IDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICS 398
           +D+SL  LY VLMRLGFFDG P   SLGK DI +
Sbjct: 189 LDQSLSNLYVVLMRLGFFDGIPALASLGKDDISA 222


>gi|325929067|ref|ZP_08190221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
 gi|325540562|gb|EGD12150.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
          Length = 850

 Score =  301 bits (772), Expect = 7e-79,   Method: Compositional matrix adjust.
 Identities = 174/427 (40%), Positives = 246/427 (57%), Gaps = 31/427 (7%)

Query: 72  MTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILT 131
           MTL+EK  Q+ + A  +PRLG+P Y+WW+EALHGV+  G          GAT FP  I  
Sbjct: 1   MTLEEKAAQMQNAAPAIPRLGVPAYDWWNEALHGVARAG----------GATVFPQAIGM 50

Query: 132 TASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYWSPNINVARDPRWGRITET 183
            A+F+  L  ++  A+S EARA ++            GLT+WSPNIN+ RDPRWGR  ET
Sbjct: 51  AATFDLPLMHEVATAISDEARAKHHQFLRQNQHARYQGLTFWSPNINIFRDPRWGRGQET 110

Query: 184 PGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL-KVSSCCKHYAAYDVDNWKGVDRY 242
            GEDPF+  R  V +V+GLQ  EG +   +    P  K+ +  KH+A   V +    DR+
Sbjct: 111 YGEDPFLTARMGVTFVQGLQG-EGADAPKNAQGEPYRKLDATAKHFA---VHSGPEADRH 166

Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
           HFDAR +++D+ ET+L  FE  VK+G   +VM +YNRV G  + A   LL   +R +W  
Sbjct: 167 HFDARPSQRDLYETYLPAFEALVKDGKVDAVMGAYNRVYGESASASKFLLQDVLRQQWGF 226

Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKE 362
            GY+V+DC +I  +  +HK +A ++E A A  +K G +L+CG+ Y+     AV+QG + E
Sbjct: 227 KGYVVSDCWAIVDIWKHHKIVA-TREQAAALAVKHGTELECGEEYSTLPA-AVRQGLIDE 284

Query: 363 TDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
             ID +L  L T  MRLG FD  G   + ++      S  +  LA   ARE +VLLKND 
Sbjct: 285 AQIDTALTTLMTARMRLGMFDPPGQLPWSTIPASVNQSPAHDALARRTARESLVLLKND- 343

Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKTGCD 477
             LPL+ AK+K +AV+GP A+ T+A++GNY G P   ++ + G       A V Y  G D
Sbjct: 344 GLLPLSRAKLKRIAVIGPTADDTMALLGNYYGTPAAPVTVLQGIRAAAPNAQVLYARGAD 403

Query: 478 DVACKSN 484
            V  + +
Sbjct: 404 LVEGRDD 410



 Score =  135 bits (340), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 89/282 (31%), Positives = 129/282 (45%), Gaps = 53/282 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A + A++AD  + + GL   VE E +          DR DL LP  Q  L+  +    K 
Sbjct: 577 ALDVARSADVVVFVGGLTGDVEGEEMKVNYPGFAGGDRTDLRLPKPQRDLLEALQATGK- 635

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV+ V+ +   + I +A+ +  + AIL A YPG+ GG A+AD +FG  NPGGRLP+T+Y 
Sbjct: 636 PVVAVLTTGSALAIDWAQQH--LPAILLAWYPGQRGGTAVADTLFGDANPGGRLPVTFYK 693

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
                     S  L   D     GRTY+++ G  LYPFG+GLSYTQF Y+ L   +T   
Sbjct: 694 ---------ESETLPAFDDYAMRGRTYRYFGGTPLYPFGHGLSYTQFAYSGLRLDRTT-- 742

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                        +  D      V  +N G   G +VV +Y  P
Sbjct: 743 -----------------------------IAADGSLTATVTVKNTGQRAGDEVVQLYLHP 773

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
                    K++ GFQR+ ++ G  + + F  +A  +L I D
Sbjct: 774 LTPQRERAGKELHGFQRIALQPGEQRALHFTLDAKNALRIYD 815


>gi|384430040|ref|YP_005639401.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
           756C]
 gi|341939144|gb|AEL09283.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. raphani
           756C]
          Length = 896

 Score =  301 bits (771), Expect = 9e-79,   Method: Compositional matrix adjust.
 Identities = 180/454 (39%), Positives = 254/454 (55%), Gaps = 42/454 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D + P   R  DLVSRMTL+EK  Q+ + A  +PRL +P+Y+WW+EALHGV+  G   
Sbjct: 40  YLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG--- 96

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGRA------GLTYW 165
                  GAT FP  I   A+F+  L  ++  A+S EARA ++  L R       GLT+W
Sbjct: 97  -------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLARGEHKRYQGLTFW 149

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPNIN+ RDPRWGR  ET GEDPF+  R  V +V+GLQ  +G            K+ +  
Sbjct: 150 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYR---------KLDATA 200

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA   V +    DR+HFD   +E+D+ ET+L  F+  V+EG  ++VM +YNRVNG  +
Sbjct: 201 KHYA---VHSGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNRVNGESA 257

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
            A  + L   +R +W   GYIV+DC +I+ +  NHK +  + E A A  +K G DLDCG 
Sbjct: 258 SASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHGTDLDCGD 315

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE-- 403
            Y      AV+ G + E  ID+SL  L    +RLG FD  P  V   +    ++++ +  
Sbjct: 316 TYAALPA-AVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQTPASANQSPQHD 373

Query: 404 -LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLLKND   LPL    +K +AVVGP A+  ++++GNY G P   ++ + 
Sbjct: 374 ALARRTARESLVLLKND-GLLPLKPT-LKRIAVVGPTADDPMSLLGNYYGTPAAPVTILQ 431

Query: 463 GF---SGYANVTYKTGCDDVACKSNNSIFAASEA 493
           G    +  A V Y  G D V  + + +  A  +A
Sbjct: 432 GIRDAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465



 Score =  134 bits (338), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 88/282 (31%), Positives = 133/282 (47%), Gaps = 53/282 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLD----------REDLWLPGYQTQLINQVAEVAKG 539
           A +AA+ AD  + + GL   VE E +D          R D  LP  Q +L+ Q  +    
Sbjct: 623 AVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELL-QALQATGT 681

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV+ V+ +   + I +A+ +  + AIL A YPG+ GG A+ DV+FG+ +PGGRLPIT+Y 
Sbjct: 682 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPITFYK 739

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
            D  + LP         D     GRTY++++G  LYPFG+GL+YTQF Y+ L   +T   
Sbjct: 740 ED--ERLPA-------FDDYAMRGRTYRYFDGKPLYPFGHGLAYTQFAYSNLRLDRTT-- 788

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                        +  D      V  +N G   G +VV +Y  P
Sbjct: 789 -----------------------------VAADGTLRATVWVKNTGQRAGDEVVQLYLHP 819

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
                    K++ GFQR+ ++ G ++ + F     ++L I D
Sbjct: 820 LNPQRERARKELRGFQRITLQPGEHREVSFTITPREALRIYD 861


>gi|21243803|ref|NP_643385.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
 gi|21109396|gb|AAM37921.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. citri str.
           306]
          Length = 886

 Score =  301 bits (770), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 186/462 (40%), Positives = 251/462 (54%), Gaps = 43/462 (9%)

Query: 32  SPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSI---RVKDLVSRMTLDEKVQQLGDFAHGV 88
           S VFV        LGL +        + P      R   LV++M+ +EKV Q  + A  +
Sbjct: 2   SSVFVSRLAMAVGLGLTLPCLALATPAKPAGSPEQRAAALVAQMSREEKVAQAMNDAPAI 61

Query: 89  PRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVS 148
           PRLG+P YEWWSE LHG++  G           AT FP  I   AS+N SL +++G  VS
Sbjct: 62  PRLGIPAYEWWSEGLHGIARNG----------YATVFPQSIGLAASWNTSLMQQVGTVVS 111

Query: 149 TEARAMYNLGR---------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYV 199
           TEARA +N            AGLT WSPNIN+ RDPRWGR  ET GEDPF+ G+ AV ++
Sbjct: 112 TEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFI 171

Query: 200 RGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLR 259
           RGLQ         DLN  P  +++  KH A   V +     R+ FD  V+  D+E T+  
Sbjct: 172 RGLQ-------GEDLN-HPRTIATP-KHIA---VHSGPEPGRHGFDVDVSPHDVEATYTP 219

Query: 260 PFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN 319
            F   + EG A SVMC+YN ++G P CA   LLN  VRG+W   G++V+DCD++  M   
Sbjct: 220 AFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQF 279

Query: 320 HKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRL 379
           H F  D+   + A  LKAG DL+CG  Y    G A+ +G+V E  +D+SL  L+    RL
Sbjct: 280 HYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTAIARGEVDEALLDQSLVRLFAARYRL 337

Query: 380 GFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG 437
           G  +   +  Y  LG +D+ +  +  LA +AA E IVLLKND NTLPL +     +AV+G
Sbjct: 338 GELEAPRKDPYARLGAKDVDNAAHRALALQAAAESIVLLKNDANTLPLRAG--TRLAVIG 395

Query: 438 PHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYKTGC 476
           P+A+A  A+  NY G     ++P+ G     G   V+Y  G 
Sbjct: 396 PNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQVSYAQGA 437



 Score =  134 bits (336), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 89/286 (31%), Positives = 143/286 (50%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 623 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 681

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+ + +  AI+ A YPG+ GG AIA ++ G  NPGGRLP+T+Y          ++  L 
Sbjct: 682 WAKMHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 730

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
           P  S    GRTY+++ G  L+PFGYGLSYT+F Y+                         
Sbjct: 731 PYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYD------------------------- 765

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                  P +    L+  +  +     +N G+  G +V  VY + P +   + ++ ++GF
Sbjct: 766 ------APQLSTTTLQAGNPLQVTATVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 818

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV + AG  + + F  +A ++L+ VD +    + AG++T+FVG G
Sbjct: 819 QRVHLAAGEQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGG 863


>gi|289670678|ref|ZP_06491753.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. musacearum
           NCPPB 4381]
          Length = 886

 Score =  301 bits (770), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 177/427 (41%), Positives = 242/427 (56%), Gaps = 40/427 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R   LV++M+ +EKV Q  + A  +PRLG+P YEWWSE LHG++  G           AT
Sbjct: 37  RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------HAT 86

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
            FP  I   AS+N +L +++G  VSTEARA +N            AGLT WSPNIN+ RD
Sbjct: 87  VFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 146

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ET GEDPF+ G+ AV ++RGLQ         DLN  P  +++  KH A   V 
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLN-HPRTIATP-KHLA---VH 194

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +     R+ FD  V+  D+E T+   F   + +G A SVMC+YN ++G P+CA   LLN 
Sbjct: 195 SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHGTPACAADWLLNG 254

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
            VRG+W   G++V+DCD++  M   H F  D+   + A  LKAG DL+CG  Y    G A
Sbjct: 255 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 312

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
           +++G V E  +D+SL  L+    RLG  +   +  Y  LG +D+ +  +  LA +AA E 
Sbjct: 313 IERGDVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALALQAAAES 372

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
           IVLLKND NTLPLN+     +AV+GP+A+A  A+  NY G     ++P+ G     G   
Sbjct: 373 IVLLKNDANTLPLNAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 430

Query: 470 VTYKTGC 476
           V Y  G 
Sbjct: 431 VRYAQGA 437



 Score =  136 bits (342), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 91/294 (30%), Positives = 146/294 (49%), Gaps = 55/294 (18%)

Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
           +DA +   GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+M
Sbjct: 615 SDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLM 673

Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           S   V + +A+T+ +  AI+ A YPG+ GG AIA ++ G  NPGGRLP+T+Y        
Sbjct: 674 SGSAVALNWAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR------- 724

Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
             ++  L    S    GRTY+++ G  L+PFGYGLSYT+F Y+    + T          
Sbjct: 725 --STKDLPAYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYDAPQLSSTT--------- 773

Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
                                 L+  +  +     +N G+  G +V  VY + P +   +
Sbjct: 774 ----------------------LQAGNPLQVTTTVRNTGTHAGDEVAQVYLQYP-DRPQS 810

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
            ++ ++GFQRV + AG  + + F  +A ++L+ VD +    + AG +T+FVG G
Sbjct: 811 PLRSLVGFQRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 863


>gi|289664871|ref|ZP_06486452.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. vasculorum
           NCPPB 702]
          Length = 886

 Score =  300 bits (769), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 177/427 (41%), Positives = 241/427 (56%), Gaps = 40/427 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R   LV+ M+ +EKV Q  + A  +PRLG+P YEWWSE LHG++  G           AT
Sbjct: 37  RAAALVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------HAT 86

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
            FP  I   AS+N +L +++G  VSTEARA +N            AGLT WSPNIN+ RD
Sbjct: 87  VFPQAIGLAASWNTNLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 146

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ET GEDPF+ G+ AV ++RGLQ         DLN  P  +++  KH A   V 
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLN-HPRTIATP-KHLA---VH 194

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +     R+ FD  V+  D+E T+   F   + +G A SVMC+YN ++G P+CA   LLN 
Sbjct: 195 SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVQGQAGSVMCAYNSLHGTPACAADWLLNG 254

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
            VRG+W   G++V+DCD++  M   H F  D+   + A  LKAG DL+CG  Y    G A
Sbjct: 255 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 312

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
           +++G V E  +D+SL  L+    RLG  +   +  Y  LG +D+ +  +  LA +AA E 
Sbjct: 313 IERGDVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALALQAAAES 372

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
           IVLLKND NTLPLN+     +AV+GP+A+A  A+  NY G     ++P+ G     G   
Sbjct: 373 IVLLKNDANTLPLNAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 430

Query: 470 VTYKTGC 476
           V Y  G 
Sbjct: 431 VRYAQGA 437



 Score =  136 bits (342), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 91/294 (30%), Positives = 146/294 (49%), Gaps = 55/294 (18%)

Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
           +DA +   GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+M
Sbjct: 615 SDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLM 673

Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           S   V + +A+T+ +  AI+ A YPG+ GG AIA ++ G  NPGGRLP+T+Y        
Sbjct: 674 SGSAVALNWAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR------- 724

Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
             ++  L    S    GRTY+++ G  L+PFGYGLSYT+F Y+                 
Sbjct: 725 --STKDLPAYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYD----------------- 765

Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
                          P +    L+  +  +     +N G+  G +V  VY + P +   +
Sbjct: 766 --------------APQLSTTALQAGNPLQVTTTVRNTGTRAGDEVAQVYLQYP-DRPQS 810

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
            ++ ++GFQRV + AG  + + F  +A ++L+ VD +    + AG +T+FVG G
Sbjct: 811 PLRSLVGFQRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 863


>gi|385774250|ref|YP_005646817.1| glycoside hydrolase family protein [Sulfolobus islandicus HVE10/4]
 gi|323478365|gb|ADX83603.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           HVE10/4]
          Length = 754

 Score =  300 bits (769), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 220/700 (31%), Positives = 347/700 (49%), Gaps = 120/700 (17%)

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
           +T+FP  I   +++N  L   I   + ++ R +      G+    SP ++V +DPRWGR 
Sbjct: 101 STAFPQAIGLASTWNLELVMDIASVIRSQGRLV------GVNQCLSPVLDVCKDPRWGRC 154

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGV 239
            ET GEDP++V    + Y+ GLQ     +N         ++ +  KH+AA+   +  + +
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQG----DN---------QLVATAKHFAAHGFPEGGRNI 201

Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
            + H    V  +++ ETFL PFE+ VK G   S+M +Y+ ++GIP   +P+LL   +R E
Sbjct: 202 AQVH----VGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257

Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNA 354
           W   G +V+D D I+ +   H+ +A +K +A    L++G+D+     DC   Y+    NA
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHR-VASNKMEAAILALESGVDIEFPTIDC---YSEPLVNA 313

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
           + +G V E+ ID++++ +  +  RLG  D      +   + +   ++ ELA + ARE IV
Sbjct: 314 LTEGLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIV 373

Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY---------AGIP-CRYMSPIAGF 464
           LLKN+ N LPL S  V  +AV+GP+AN    M+G+Y         +GI     +  +   
Sbjct: 374 LLKNENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGVVKK 432

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL----AGLDLS----------- 509
            G + V Y  GCD +A +S      A E A+ AD  I +    +GL LS           
Sbjct: 433 VGESKVLYAKGCD-IASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTDIPSEEEFK 491

Query: 510 ----VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
               V  E  DR  L LPG Q +L+ ++ +  K P+ILV+++  G  +  +     +KA+
Sbjct: 492 KYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN--GRPLVLSPIINYVKAV 548

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---RPVDSLG 620
           + A +PGEEGG AIADV+FG +NPGGRLPIT+         P+ +  +PL   R   S  
Sbjct: 549 IEAWFPGEEGGNAIADVIFGDYNPGGRLPITF---------PMDTGQIPLYYNRKPSSF- 598

Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKT 679
              R Y       L+ FGYGLSYTQF+Y+ L  T K I  N N                 
Sbjct: 599 ---RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEIGPNSN----------------- 638

Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
                              +D +NVG  +G DVV +Y        A  +K++ GF ++ +
Sbjct: 639 ---------------IAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHL 683

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           + G  +R+KF+    ++L   D     ++  GE+ + +GN
Sbjct: 684 KPGEKRRVKFIL-PTEALAFYDSFMRLVVEKGEYQLLIGN 722


>gi|21233528|ref|NP_639445.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|66770493|ref|YP_245255.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
 gi|21115383|gb|AAM43327.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|66575825|gb|AAY51235.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
          Length = 896

 Score =  300 bits (768), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 179/454 (39%), Positives = 253/454 (55%), Gaps = 42/454 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D + P   R  DLVSRMTL+EK  Q+ + A  +PRL +P+Y+WW+EALHGV+  G   
Sbjct: 40  YLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG--- 96

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                  GAT FP  I   A+F+  L  ++  A+S EARA ++   A        GLT+W
Sbjct: 97  -------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEHKRYQGLTFW 149

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPNIN+ RDPRWGR  ET GEDPF+  R  V +V+GLQ  +G            K+ +  
Sbjct: 150 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYR---------KLDATA 200

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA   V +    DR+HFD   +E+D+ ET+L  F+  V+EG  ++VM +YNRVNG  +
Sbjct: 201 KHYA---VHSGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNRVNGESA 257

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
            A  + L   +R +W   GYIV+DC +I+ +  NHK +  + E A A  +K G DLDCG 
Sbjct: 258 SASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHGTDLDCGD 315

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE-- 403
            Y      AV+ G + E  ID+SL  L    +RLG FD  P  V   +    ++++ +  
Sbjct: 316 TYAALPA-AVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQTPASANQSPQHD 373

Query: 404 -LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLLKND   LPL    +K +AVVGP A+  ++++GNY G P   ++ + 
Sbjct: 374 ALARRTARESLVLLKND-GLLPLKPT-LKRIAVVGPTADDPMSLLGNYYGTPAAPVTILQ 431

Query: 463 GF---SGYANVTYKTGCDDVACKSNNSIFAASEA 493
           G    +  A V Y  G D V  + + +  A  +A
Sbjct: 432 GIRDAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465



 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 88/282 (31%), Positives = 133/282 (47%), Gaps = 53/282 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLD----------REDLWLPGYQTQLINQVAEVAKG 539
           A +AA+ AD  + + GL   VE E +D          R D  LP  Q +L+ Q  +    
Sbjct: 623 AVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELL-QALQATGT 681

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV+ V+ +   + I +A+ +  + AIL A YPG+ GG A+ DV+FG+ +PGGRLPIT+Y 
Sbjct: 682 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPITFYK 739

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
            D  + LP         D     GRTY++++G  LYPFG+GL+YTQF Y+ L   +T   
Sbjct: 740 ED--ERLPA-------FDDYAMRGRTYRYFDGKPLYPFGHGLAYTQFAYSNLRLDRTT-- 788

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                        +  D      V  +N G   G +VV +Y  P
Sbjct: 789 -----------------------------VAADGTLRATVSVKNTGQRAGDEVVQLYLHP 819

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
                    K++ GFQR+ ++ G ++ + F     ++L I D
Sbjct: 820 LNPQRERARKELRGFQRITLQPGEHREVSFNITPREALRIYD 861


>gi|78048767|ref|YP_364942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
 gi|78037197|emb|CAJ24942.1| beta-glucosidase precursor [Xanthomonas campestris pv. vesicatoria
           str. 85-10]
          Length = 889

 Score =  300 bits (768), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 177/427 (41%), Positives = 242/427 (56%), Gaps = 40/427 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R   LV++M+ +EKV Q  + A  +PRLG+P YEWWSE LHG++  G           AT
Sbjct: 40  RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 89

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
            FP  I   AS+N  L +++G  VSTEARA +N            AGLT WSPNIN+ RD
Sbjct: 90  VFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 149

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ET GEDPF+ G+ AV ++RGLQ         DLN  P  +++  KH A   V 
Sbjct: 150 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLN-HPRTIATP-KHIA---VH 197

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +     R+ FD  V+ +D+E T+   F   + EG A SVMC+YN ++G P+CA   LLN 
Sbjct: 198 SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACAADWLLNG 257

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
            VRG+W   G++V+DCD++  M   H F  D+   + A  LKAG DL+CG  Y    G A
Sbjct: 258 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 315

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
           + +G+V E  +D+SL  L+    RLG  +   +  Y  LG +D+ +  +  LA +AA E 
Sbjct: 316 IARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALALQAAAES 375

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
           IVLLKND NTLPL +     +AV+GP+A+A  A+  NY G     ++P+ G     G   
Sbjct: 376 IVLLKNDANTLPLKAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 433

Query: 470 VTYKTGC 476
           V+Y  G 
Sbjct: 434 VSYAQGA 440



 Score =  129 bits (324), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 91/286 (31%), Positives = 144/286 (50%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+T+ +  AI+ A YPG+ GG AIA ++ G  NPGGRLP+T+Y          ++  L 
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 733

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
              S    GRTY+++ G  L+ FGYGLSYT+F Y+        Q++   LQ   +L  T+
Sbjct: 734 AYVSYDMKGRTYRYFKGEPLFAFGYGLSYTRFAYD------APQLSTTTLQAGSSLQVTT 787

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                                      +N G+  G +V  VY + P +   + ++ ++GF
Sbjct: 788 -------------------------TVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 821

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV + AG  + + F  +A ++L+ VD +    + AG +T+FVG G
Sbjct: 822 QRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 866


>gi|325925754|ref|ZP_08187127.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
 gi|325543811|gb|EGD15221.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas perforans
           91-118]
          Length = 874

 Score =  300 bits (768), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 177/427 (41%), Positives = 242/427 (56%), Gaps = 40/427 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R   LV++M+ +EKV Q  + A  +PRLG+P YEWWSE LHG++  G           AT
Sbjct: 25  RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 74

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
            FP  I   AS+N  L +++G  VSTEARA +N            AGLT WSPNIN+ RD
Sbjct: 75  VFPQAIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 134

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ET GEDPF+ G+ AV ++RGLQ         DLN  P  +++  KH A   V 
Sbjct: 135 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLN-HPRTIATP-KHIA---VH 182

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +     R+ FD  V+ +D+E T+   F   + EG A SVMC+YN ++G P+CA   LLN 
Sbjct: 183 SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACAADWLLNG 242

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
            VRG+W   G++V+DCD++  M   H F  D+   + A  LKAG DL+CG  Y    G A
Sbjct: 243 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 300

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
           + +G+V E  +D+SL  L+    RLG  +   +  Y  LG +D+ +  +  LA +AA E 
Sbjct: 301 IARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALALQAAAES 360

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
           IVLLKND NTLPL +     +AV+GP+A+A  A+  NY G     ++P+ G     G   
Sbjct: 361 IVLLKNDANTLPLKAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 418

Query: 470 VTYKTGC 476
           V+Y  G 
Sbjct: 419 VSYAQGA 425



 Score =  128 bits (322), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 91/286 (31%), Positives = 144/286 (50%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 611 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 669

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+T+ +  AI+ A YPG+ GG AIA ++ G  NPGGRLP+T+Y          ++  L 
Sbjct: 670 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 718

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
              S    GRTY+++ G  L+ FGYGLSYT+F Y+        Q++   LQ   +L  T+
Sbjct: 719 AYVSYDMKGRTYRYFKGEPLFAFGYGLSYTRFAYD------APQLSTTTLQAGSSLQVTT 772

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                                      +N G+  G +V  VY + P +   + ++ ++GF
Sbjct: 773 -------------------------TVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 806

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV + AG  + + F  +A ++L+ VD +    + AG +T+FVG G
Sbjct: 807 QRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 851


>gi|418518029|ref|ZP_13084183.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
 gi|410705279|gb|EKQ63755.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB1386]
          Length = 886

 Score =  300 bits (768), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 180/427 (42%), Positives = 242/427 (56%), Gaps = 40/427 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R   LV++M+ +EKV Q  + A  +PRLG+P YEWWSE LHG++  G           AT
Sbjct: 37  RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 86

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
            FP  I   AS+N SL +++G  VSTEARA +N            AGLT WSPNIN+ RD
Sbjct: 87  VFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFRD 146

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ET GEDPF+ G+ AV ++RGLQ         DLN  P  +++  KH A   V 
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLN-HPRTIATP-KHIA---VH 194

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +     R+ FD  V+  D+E T+   F   + EG A SVMC+YN ++G P CA   LLN 
Sbjct: 195 SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNG 254

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
            VRG+W   G++V+DCD+I  M   H F  D+   +VA  LKAG DL+CG  Y    G A
Sbjct: 255 RVRGDWGFKGFVVSDCDAIDDMTQFHYFRPDNAGSSVA-ALKAGHDLNCGHAYREL-GTA 312

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
           + +G+V E  +D+SL  L+    RLG  +   +  Y  LG +D+ +  +  LA +AA E 
Sbjct: 313 IARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNVAHRALALQAAAES 372

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
           IVLLKND NTLPL +     +AV+GP+A+A  A+  NY G     ++P+ G     G   
Sbjct: 373 IVLLKNDANTLPLRAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 430

Query: 470 VTYKTGC 476
           V+Y  G 
Sbjct: 431 VSYAQGA 437



 Score =  132 bits (332), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 92/286 (32%), Positives = 145/286 (50%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 623 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 681

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+T+ +  AI+ A YPG+ GG AIA ++ G  NPGGRLP+T+Y          ++  L 
Sbjct: 682 WAKTHAD--AIMAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 730

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
              S    GRTY+++ G  L+PFGYGLSYT+F Y+    + T     N LQ         
Sbjct: 731 AYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYDAPQLSTTTLQAGNPLQ--------- 781

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                     ++  +R            N G+  G +V  VY + P +   + ++ ++GF
Sbjct: 782 ----------VIATVR------------NTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 818

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV + AG  + + F  +A ++L+ VD +    + AG++T+FVG G
Sbjct: 819 QRVHLAAGEQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGG 863


>gi|284998833|ref|YP_003420601.1| glycoside hydrolase family protein [Sulfolobus islandicus L.D.8.5]
 gi|284446729|gb|ADB88231.1| glycoside hydrolase, family 3 domain protein [Sulfolobus islandicus
           L.D.8.5]
          Length = 754

 Score =  300 bits (767), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 220/700 (31%), Positives = 346/700 (49%), Gaps = 120/700 (17%)

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
           +T+FP  I   +++N  L   I   + ++ R +      G+    SP ++V +DPRWGR 
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGRLV------GVNQCLSPVLDVCKDPRWGRC 154

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGV 239
            ET GEDP++V    + Y+ GLQ     +N         ++ +  KH+AA+   +  + +
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQG----DN---------QLVATAKHFAAHGFPEGGRNI 201

Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
            + H    V  +++ ETFL PFE+ VK G   S+M +Y+ ++GIP   +P+LL   +R E
Sbjct: 202 AQVH----VGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257

Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNA 354
           W   G +V+D D I+ +   H+ +A +K +A    L++G+D+     DC   Y     NA
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHR-VASNKMEAAILALESGVDIEFPTIDC---YGEPLVNA 313

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
           +++G V E+ ID++++ +  +  RLG  D      +   + +   ++ ELA + ARE IV
Sbjct: 314 LKEGLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIV 373

Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY---------AGIP-CRYMSPIAGF 464
           LLKN+ N LPL S  V  +AV+GP+AN    M+G+Y         +GI     +  I   
Sbjct: 374 LLKNENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKK 432

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL----AGLDLS----------- 509
            G + V Y  GCD +A +S      A E A+ AD  I +    +GL LS           
Sbjct: 433 VGESKVLYAKGCD-IASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSEEEFK 491

Query: 510 ----VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
               V  E  DR  L LPG Q +L+ ++ +  K P+ILV+++  G  +  +     +KA+
Sbjct: 492 KYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN--GRPLVLSSIINYVKAV 548

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---RPVDSLG 620
           + A +PGEEGG AIADV+FG +NP GRLPIT+         P+ +  +PL   R   S  
Sbjct: 549 IEAWFPGEEGGNAIADVIFGDYNPSGRLPITF---------PMDTGQIPLYYNRKPSSF- 598

Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKT 679
              R Y       L+ FGYGLSYTQF+Y+ L  T K I  N N                 
Sbjct: 599 ---RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEIGPNSN----------------- 638

Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
                              +D +NVG  +G DVV +Y        A  +K++ GF ++ +
Sbjct: 639 ---------------IAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHL 683

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           + G  +R+KF+    ++L   D     ++  GE+ + +GN
Sbjct: 684 KPGEKRRVKFIL-PTEALAFYDSFMRLVVEKGEYQLLIGN 722


>gi|58581402|ref|YP_200418.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
 gi|58425996|gb|AAW75033.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae KACC
           10331]
          Length = 889

 Score =  300 bits (767), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 176/427 (41%), Positives = 245/427 (57%), Gaps = 40/427 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R  DLV+ M+ +EKV Q  + A  +PRLG+P YEWWSE LHG++  G           AT
Sbjct: 40  RAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 89

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYN-LGR--------AGLTYWSPNINVARD 174
            FP  I   AS+N  L +++G  VSTEARA +N  GR        AGLT WSPNIN+ RD
Sbjct: 90  VFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGNDHKRYAGLTIWSPNINIFRD 149

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ET GEDPF+ G+ AV ++ GLQ         DL+  P  +++  KH A   V 
Sbjct: 150 PRWGRGMETYGEDPFLTGQMAVGFIHGLQ-------GEDLD-HPRTIATP-KHLA---VH 197

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +     R+ FD  V+ +D+E T+   F   + EG A +VMC+YN ++G P+CA   L+N 
Sbjct: 198 SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACAADWLING 257

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
            VRG+W   G++V+DCD++  M   H F  D+   + A  LKAG DL+CG  Y    G A
Sbjct: 258 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 315

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
           + +G+V E  +D+SL  L+    RLG  +   +  Y  LG +D+ + ++  LA +AA E 
Sbjct: 316 IARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAQHRALALQAAAES 375

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
           IVLLKN+ NTLPLN+     +AV+GP+A+A  A+  NY G     ++P+ G     G   
Sbjct: 376 IVLLKNNANTLPLNAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 433

Query: 470 VTYKTGC 476
           V+Y  G 
Sbjct: 434 VSYAQGA 440



 Score =  130 bits (327), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 88/286 (30%), Positives = 141/286 (49%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+T+ +  AI+ A YPG+ GG AIA ++ G  NPGGRLP+T+Y          ++  L 
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 733

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
              S    GRTY+++ G  L+PFGYGLSYT+F Y+    + T                  
Sbjct: 734 AYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYDAPQLSSTA----------------- 776

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                         ++     +     +N G+  G +V  VY + P +   + ++ ++GF
Sbjct: 777 --------------VQAGSTLQVTTTVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 821

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV + AG  + + F  +A ++L+ VD +    + AG +T+FVG G
Sbjct: 822 QRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 866


>gi|346725879|ref|YP_004852548.1| beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346650626|gb|AEO43250.1| Beta-glucosidase-related glycosidase [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 889

 Score =  300 bits (767), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 177/427 (41%), Positives = 242/427 (56%), Gaps = 40/427 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R   LV++M+ +EKV Q  + A  +PRLG+P YEWWSE LHG++  G           AT
Sbjct: 40  RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 89

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
            FP  I   AS+N  L +++G  VSTEARA +N            AGLT WSPNIN+ RD
Sbjct: 90  VFPQSIGLAASWNTRLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 149

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ET GEDPF+ G+ AV ++RGLQ         DLN  P  +++  KH A   V 
Sbjct: 150 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLN-HPRTIATP-KHIA---VH 197

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +     R+ FD  V+ +D+E T+   F   + EG A SVMC+YN ++G P+CA   LLN 
Sbjct: 198 SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGSVMCAYNSLHGTPACAADWLLNG 257

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
            VRG+W   G++V+DCD++  M   H F  D+   + A  LKAG DL+CG  Y    G A
Sbjct: 258 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 315

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
           + +G+V E  +D+SL  L+    RLG  +   +  Y  LG +D+ +  +  LA +AA E 
Sbjct: 316 IARGEVDEALLDQSLVRLFATRYRLGELEAPRKDPYARLGAKDVDNAAHRALALQAAAES 375

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
           IVLLKND NTLPL +     +AV+GP+A+A  A+  NY G     ++P+ G     G   
Sbjct: 376 IVLLKNDANTLPLKAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 433

Query: 470 VTYKTGC 476
           V+Y  G 
Sbjct: 434 VSYAQGA 440



 Score =  129 bits (323), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 91/286 (31%), Positives = 144/286 (50%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+T+ +  AI+ A YPG+ GG AIA ++ G  NPGGRLP+T+Y          ++  L 
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 733

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
              S    GRTY+++ G  L+ FGYGLSYT+F Y+        Q++   LQ   +L  T+
Sbjct: 734 AYVSYDMKGRTYRYFKGEPLFAFGYGLSYTRFAYD------APQLSTTTLQAGSSLQVTT 787

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                                      +N G+  G +V  VY + P +   + ++ ++GF
Sbjct: 788 -------------------------TVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 821

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV + AG  + + F  +A ++L+ VD +    + AG +T+FVG G
Sbjct: 822 QRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 866


>gi|188993706|ref|YP_001905716.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
 gi|167735466|emb|CAP53681.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
          Length = 896

 Score =  299 bits (766), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 179/454 (39%), Positives = 253/454 (55%), Gaps = 42/454 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D + P   R  DLVSRMTL+EK  Q+ + A  +PRL +P+Y+WW+EALHGV+  G   
Sbjct: 40  YLDPTQPLQARAADLVSRMTLEEKAAQMQNAAPAIPRLQVPEYDWWNEALHGVARAG--- 96

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                  GAT FP  I   A+F+  L  ++  A+S EARA ++   A        GLT+W
Sbjct: 97  -------GATVFPQAIGLAATFDTPLMAEVATAISDEARAKHHAFLAGGEHKRYQGLTFW 149

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPNIN+ RDPRWGR  ET GEDPF+  R  V +V+GLQ  +G            K+ +  
Sbjct: 150 SPNINIFRDPRWGRGQETYGEDPFLTARMGVTFVQGLQAQQGPYR---------KLDATA 200

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA   V +    DR+HFD   +E+D+ ET+L  F+  V+EG  ++VM +YNRVNG  +
Sbjct: 201 KHYA---VHSGPEADRHHFDVHPSERDLYETYLPAFQALVQEGHVAAVMGAYNRVNGESA 257

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
            A  + L   +R +W   GYIV+DC +I+ +  NHK +  + E A A  +K G DLDCG 
Sbjct: 258 SASTR-LEGILRRDWGFDGYIVSDCAAIRDIWQNHKIVP-TPEAAAALGVKHGTDLDCGD 315

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE-- 403
            Y      AV+ G + E  ID+SL  L    +RLG FD  P  V   +    ++++ +  
Sbjct: 316 TYAALPA-AVRAGLIDEATIDRSLTRLMAARLRLGMFD-PPAKVPWAQIPASANQSPQHD 373

Query: 404 -LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLLKND   LPL    +K +AVVGP A+  ++++GNY G P   ++ + 
Sbjct: 374 ALARRTARESLVLLKND-GLLPLKPT-LKRIAVVGPTADDPMSLLGNYYGTPAAPVTILQ 431

Query: 463 GF---SGYANVTYKTGCDDVACKSNNSIFAASEA 493
           G    +  A V Y  G D V  + + +  A  +A
Sbjct: 432 GIRDAAPQAEVVYARGSDLVEGREDPNAAAPIDA 465



 Score =  135 bits (339), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 88/282 (31%), Positives = 133/282 (47%), Gaps = 53/282 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLD----------REDLWLPGYQTQLINQVAEVAKG 539
           A +AA+ AD  + + GL   VE E +D          R D  LP  Q +L+ Q  +    
Sbjct: 623 AVDAARNADVVVFVGGLTGDVEGEEMDVNYPGFAGGDRTDTRLPKPQRELL-QALQATGT 681

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV+ V+ +   + I +A+ +  + AIL A YPG+ GG A+ DV+FG+ +PGGRLPIT+Y 
Sbjct: 682 PVVAVLTTGSALAIDWAQQH--VPAILLAWYPGQRGGTAVGDVLFGQASPGGRLPITFYK 739

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
            D  + LP         D     GRTY++++G  LYPFG+GL+YTQF Y+ L   +T   
Sbjct: 740 ED--ERLPA-------FDDYAMRGRTYRYFDGKPLYPFGHGLAYTQFAYSNLRLDRTT-- 788

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                        +  D      V  +N G   G +VV +Y  P
Sbjct: 789 -----------------------------VAADGTLRATVSVKNTGQRAGDEVVQLYLHP 819

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
                    K++ GFQR+ ++ G ++ + F     ++L I D
Sbjct: 820 LNPQRERARKELRGFQRITLQPGEHREVSFNITPREALRIYD 861


>gi|227831319|ref|YP_002833099.1| glycoside hydrolase family protein [Sulfolobus islandicus L.S.2.15]
 gi|227457767|gb|ACP36454.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           L.S.2.15]
          Length = 754

 Score =  299 bits (766), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 220/700 (31%), Positives = 346/700 (49%), Gaps = 120/700 (17%)

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
           +T+FP  I   +++N  L   I   + ++ R +      G+    SP ++V +DPRWGR 
Sbjct: 101 STAFPQAIGLASTWNPELVMDIASVIRSQGRLV------GVNQCLSPVLDVCKDPRWGRC 154

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGV 239
            ET GEDP++V    + Y+ GLQ     +N         ++ +  KH+AA+   +  + +
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQG----DN---------QLVATAKHFAAHGFPEGGRNI 201

Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
            + H    V  +++ ETFL PFE+ VK G   S+M +Y+ ++GIP   +P+LL   +R E
Sbjct: 202 AQVH----VGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257

Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNA 354
           W   G +V+D D I+ +   H+ +A +K +A    L++G+D+     DC   Y     NA
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHR-VASNKMEAAILALESGVDIEFPTIDC---YGEPLVNA 313

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
           +++G V E+ ID++++ +  +  RLG  D      +   + +   ++ ELA + ARE IV
Sbjct: 314 LKEGLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIV 373

Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY---------AGIP-CRYMSPIAGF 464
           LLKN+ N LPL S  V  +AV+GP+AN    M+G+Y         +GI     +  I   
Sbjct: 374 LLKNENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIVKK 432

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL----AGLDLS----------- 509
            G + V Y  GCD +A +S      A E A+ AD  I +    +GL LS           
Sbjct: 433 VGESKVLYAKGCD-IASESKEGFAEAIEIARQADVIIAIMGEKSGLPLSWMDIPSKEEFK 491

Query: 510 ----VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
               V  E  DR  L LPG Q +L+ ++ +  K P+ILV+++  G  +  +     +KA+
Sbjct: 492 KYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN--GRPLVLSSIINYVKAV 548

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---RPVDSLG 620
           + A +PGEEGG AIADV+FG +NP GRLPIT+         P+ +  +PL   R   S  
Sbjct: 549 IEAWFPGEEGGNAIADVIFGDYNPSGRLPITF---------PMDTGQIPLYYNRKPSSF- 598

Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKT 679
              R Y       L+ FGYGLSYTQF+Y+ L  T K I  N N                 
Sbjct: 599 ---RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEIGPNSN----------------- 638

Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
                              +D +NVG  +G DVV +Y        A  +K++ GF ++ +
Sbjct: 639 ---------------IAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHL 683

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           + G  +R+KF+    ++L   D     ++  GE+ + +GN
Sbjct: 684 KPGEKRRVKFIL-PTEALAFYDSFMRLVVEKGEYQLLIGN 722


>gi|84623339|ref|YP_450711.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|188577358|ref|YP_001914287.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
 gi|84367279|dbj|BAE68437.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae MAFF
           311018]
 gi|188521810|gb|ACD59755.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzae PXO99A]
          Length = 889

 Score =  299 bits (766), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 176/427 (41%), Positives = 245/427 (57%), Gaps = 40/427 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R  DLV+ M+ +EKV Q  + A  +PRLG+P YEWWSE LHG++  G           AT
Sbjct: 40  RAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 89

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYN-LGR--------AGLTYWSPNINVARD 174
            FP  I   AS+N  L +++G  VSTEARA +N  GR        AGLT WSPNIN+ RD
Sbjct: 90  VFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWSPNINIFRD 149

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ET GEDPF+ G+ AV ++ GLQ         DL+  P  +++  KH A   V 
Sbjct: 150 PRWGRGMETYGEDPFLTGQMAVGFIHGLQ-------GDDLD-HPRTIATP-KHLA---VH 197

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +     R+ FD  V+ +D+E T+   F   + EG A +VMC+YN ++G P+CA   L+N 
Sbjct: 198 SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACAADWLING 257

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
            VRG+W   G++V+DCD++  M   H F  D+   + A  LKAG DL+CG  Y    G A
Sbjct: 258 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 315

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
           + +G+V E  +D+SL  L+    RLG  +   +  Y  LG +D+ + ++  LA +AA E 
Sbjct: 316 IARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAQHRALALQAAAES 375

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
           IVLLKN+ NTLPLN+     +AV+GP+A+A  A+  NY G     ++P+ G     G   
Sbjct: 376 IVLLKNNANTLPLNAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 433

Query: 470 VTYKTGC 476
           V+Y  G 
Sbjct: 434 VSYAQGA 440



 Score =  130 bits (327), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 88/286 (30%), Positives = 141/286 (49%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+T+ +  AI+ A YPG+ GG AIA ++ G  NPGGRLP+T+Y          ++  L 
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 733

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
              S    GRTY+++ G  L+PFGYGLSYT+F Y+    + T                  
Sbjct: 734 AYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYDAPQLSSTA----------------- 776

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                         ++     +     +N G+  G +V  VY + P +   + ++ ++GF
Sbjct: 777 --------------VQAGSTLQVTTTVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 821

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV + AG  + + F  +A ++L+ VD +    + AG +T+FVG G
Sbjct: 822 QRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 866


>gi|385776908|ref|YP_005649476.1| glycoside hydrolase family protein [Sulfolobus islandicus REY15A]
 gi|323475656|gb|ADX86262.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           REY15A]
          Length = 754

 Score =  299 bits (766), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 220/700 (31%), Positives = 347/700 (49%), Gaps = 120/700 (17%)

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
           +T+FP  I   +++N  L   I   + ++AR +      G+    SP ++V +DPRWGR 
Sbjct: 101 STAFPQAIGLASTWNLELVMDIASVIRSQARLV------GVNQCLSPVLDVCKDPRWGRC 154

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGV 239
            ET GEDP++V    + Y+ GLQ     +N         ++ +  KH+AA+   +  + +
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQG----DN---------QLVATAKHFAAHGFPEGGRNI 201

Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
            + H    V  +++ ETFL PFE+ VK G   S+M +Y+ ++GIP   +P+LL   +R E
Sbjct: 202 AQVH----VGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGIPCHGNPQLLTNILRQE 257

Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNA 354
           W   G +V+D D I+ +   H+ +A +K +A    L++G+D+     DC   Y+    NA
Sbjct: 258 WGFDGIVVSDYDGIRQLETIHR-VASNKMEAAILALESGVDIEFPTIDC---YSEPLVNA 313

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
           + +G V E+ ID++++ +  +  RLG  D      +   + +   ++ ELA + ARE IV
Sbjct: 314 LTEGLVPESLIDRAVERVLRIKDRLGLLDNPFVNENSVPEKLDDHKSRELALKTARESIV 373

Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY---------AGIP-CRYMSPIAGF 464
           LLKN+ N LPL S  V  +AV+GP+AN    M+G+Y         +GI     +  +   
Sbjct: 374 LLKNENNILPL-SKNVNKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGVVKK 432

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL----AGLDLS----------- 509
            G + V Y  GCD +A +S      A E A+ AD  I +    +GL LS           
Sbjct: 433 VGESKVLYAKGCD-IASESKEGFAEAIEIARQADVIIAVMGEKSGLPLSWTDIPSEEEFK 491

Query: 510 ----VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
               V  E  DR  L LPG Q +L+ ++ +  K P+ILV+++  G  +  +     +KA+
Sbjct: 492 KYQAVTGEGNDRSSLRLPGVQEELLKELYKTGK-PIILVLIN--GRPLVLSPIINYVKAV 548

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---RPVDSLG 620
           + A +PGEEGG AIADV+FG +NP GRLPIT+         P+ +  +PL   R   S  
Sbjct: 549 IEAWFPGEEGGNAIADVIFGDYNPSGRLPITF---------PMDTGQIPLYYNRKPSSF- 598

Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKT 679
              R Y       L+ FGYGLSYTQF+Y+ L  T K I  N N                 
Sbjct: 599 ---RPYVMLRSSPLFTFGYGLSYTQFEYSNLEVTPKEIGPNSN----------------- 638

Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
                              +D +NVG  +G DVV +Y        A  +K++ GF ++ +
Sbjct: 639 ---------------IAISIDVKNVGKMEGDDVVQLYVSKTFSSVARPVKELKGFAKIHL 683

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           + G  +R+KF+    ++L   D     ++  GE+ + +GN
Sbjct: 684 KPGEKRRVKFIL-PTEALAFYDSFMRLVVEKGEYQLLIGN 722


>gi|381169747|ref|ZP_09878910.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
 gi|380689765|emb|CCG35397.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           citri pv. mangiferaeindicae LMG 941]
          Length = 874

 Score =  299 bits (765), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 182/449 (40%), Positives = 247/449 (55%), Gaps = 43/449 (9%)

Query: 45  LGLQMSSFLFCDSSLPYSI---RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE 101
           LGL +        + P      R   LV++M+ +EKV Q  + A  +PRLG+P YEWWSE
Sbjct: 3   LGLTLPCLALAPPAKPAGSPEQRAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSE 62

Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR-- 159
            LHG++  G           AT FP  I   AS+N SL +++G  VSTEARA +N     
Sbjct: 63  GLHGIARNG----------YATVFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGP 112

Query: 160 -------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
                  AGLT WSPNIN+ RDPRWGR  ET GEDPF+ G+ AV ++RGLQ         
Sbjct: 113 GKDHQRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GE 165

Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
           DLN  P  +++  KH A   V +     R+ FD  V+  D+E T+   F   + EG A S
Sbjct: 166 DLN-HPRTIATP-KHIA---VHSGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGS 220

Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
           VMC+YN ++G P CA   LLN  VRG+W   G++V+DCD++  M   H F  D+   + A
Sbjct: 221 VMCAYNALHGTPVCAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA 280

Query: 333 QTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVS 390
             LKAG DL+CG  Y    G A+ +G+V E  +D+SL  L+    RLG  +   +  Y  
Sbjct: 281 -ALKAGHDLNCGHAYREL-GTAIARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYAR 338

Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
           LG +D+ +  +  LA +AA E IVLLKND NTLPL +     +AV+GP+A+A  A+  NY
Sbjct: 339 LGAKDVDNAAHRALALQAAAESIVLLKNDANTLPLRAG--TRLAVIGPNADALAALEANY 396

Query: 451 AGIPCRYMSPIAGFS---GYANVTYKTGC 476
            G     ++P+ G     G   V+Y  G 
Sbjct: 397 QGTSSAPVTPLLGLRQRFGAQQVSYAQGA 425



 Score =  133 bits (335), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 89/286 (31%), Positives = 143/286 (50%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 611 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 669

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+ + +  AI+ A YPG+ GG AIA ++ G  NPGGRLP+T+Y          ++  L 
Sbjct: 670 WAKMHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 718

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
           P  S    GRTY+++ G  L+PFGYGLSYT+F Y+                         
Sbjct: 719 PYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYD------------------------- 753

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                  P +    L+  +  +     +N G+  G +V  VY + P +   + ++ ++GF
Sbjct: 754 ------APQLSTTTLQAGNPLQVTATVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 806

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV + AG  + + F  +A ++L+ VD +    + AG++T+FVG G
Sbjct: 807 QRVHLAAGEQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGG 851


>gi|294665226|ref|ZP_06730524.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292605014|gb|EFF48367.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 886

 Score =  299 bits (765), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 177/427 (41%), Positives = 243/427 (56%), Gaps = 40/427 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R   LV++M+ +EKV Q  + A  +PRLG+P YEWWSE LHG++  G           AT
Sbjct: 37  RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 86

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
            FP  I   AS+N SL +++G  VSTEARA +N            AGLT WSPNIN+ RD
Sbjct: 87  VFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFRD 146

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ET GEDPF+ G+ AV ++RGLQ         DL+  P  +++  KH A   V 
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLD-HPRTIATP-KHIA---VH 194

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +     R+ FD  V+  D+E T+   F   + EG A SVMC+YN ++G P+CA   LLN 
Sbjct: 195 SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNG 254

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
            VRG+W   G++V+DCD++  M   H F  D+   + A  LKAG DL+CG  Y +  G A
Sbjct: 255 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYRDL-GTA 312

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
           +++G V E  +D+SL  L+    RLG  +   +  Y  LG +D+ +  +  LA +AA E 
Sbjct: 313 IERGDVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALALQAAAES 372

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
           IVLLKND NTLPL +     +AV+GP+A+A  A+  NY G     ++P+ G     G   
Sbjct: 373 IVLLKNDANTLPLKAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 430

Query: 470 VTYKTGC 476
           V+Y  G 
Sbjct: 431 VSYAQGA 437



 Score =  131 bits (330), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 88/286 (30%), Positives = 143/286 (50%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 623 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 681

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+T+ +  AI+ A YPG+ GG A+A ++ G  NPGGRLP+T+Y          ++  L 
Sbjct: 682 WAKTHAD--AIVAAWYPGQSGGTAMARMLAGDDNPGGRLPVTFYR---------STKDLP 730

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
              S    GRTY+++ G  L+PFGYGLSYT+F Y+                         
Sbjct: 731 AYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYD------------------------- 765

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                  P +    L+  +  +     +N G+  G +V  VY + P +   + ++ ++GF
Sbjct: 766 ------APQLSTTTLQAGNPLQVTTTVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 818

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV + AG  + + F  +A ++L+ VD +    + AG++T+FVG G
Sbjct: 819 QRVHLAAGEQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGG 863


>gi|294627323|ref|ZP_06705909.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
 gi|292598405|gb|EFF42556.1| glucan 1,4-beta-glucosidase [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
          Length = 886

 Score =  298 bits (764), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 177/427 (41%), Positives = 243/427 (56%), Gaps = 40/427 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R   LV++M+ +EKV Q  + A  +PRLG+P YEWWSE LHG++  G           AT
Sbjct: 37  RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 86

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
            FP  I   AS+N SL +++G  VSTEARA +N            AGLT WSPNIN+ RD
Sbjct: 87  VFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFRD 146

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ET GEDPF+ G+ AV ++RGLQ         DL+  P  +++  KH A   V 
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLD-HPRTIATP-KHIA---VH 194

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +     R+ FD  V+  D+E T+   F   + EG A SVMC+YN ++G P+CA   LLN 
Sbjct: 195 SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNG 254

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
            VRG+W   G++V+DCD++  M   H F  D+   + A  LKAG DL+CG  Y +  G A
Sbjct: 255 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYRDL-GTA 312

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
           +++G V E  +D+SL  L+    RLG  +   +  Y  LG +D+ +  +  LA +AA E 
Sbjct: 313 IERGDVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALALQAAAES 372

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
           IVLLKND NTLPL +     +AV+GP+A+A  A+  NY G     ++P+ G     G   
Sbjct: 373 IVLLKNDANTLPLKAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 430

Query: 470 VTYKTGC 476
           V+Y  G 
Sbjct: 431 VSYAQGA 437



 Score =  132 bits (332), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 89/286 (31%), Positives = 143/286 (50%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 623 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 681

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+T+ +  AI+ A YPG+ GG AIA ++ G  NPGGRLP+T+Y          ++  L 
Sbjct: 682 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 730

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
              S    GRTY+++ G  L+PFGYGLSYT+F Y+                         
Sbjct: 731 AYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYD------------------------- 765

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                  P +    L+  +  +     +N G+  G +V  VY + P +   + ++ ++GF
Sbjct: 766 ------APQLSTTTLQAGNPLQVTTTVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 818

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV + AG  + + F  +A ++L+ VD +    + AG++T+FVG G
Sbjct: 819 QRVHLAAGEQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGG 863


>gi|390992294|ref|ZP_10262532.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
 gi|372552957|emb|CCF69507.1| glycosyl hydrolase family 3 N terminal domain protein [Xanthomonas
           axonopodis pv. punicae str. LMG 859]
          Length = 886

 Score =  298 bits (764), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 178/427 (41%), Positives = 241/427 (56%), Gaps = 40/427 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R   LV++M+ +EKV Q  + A  +PRLG+P YEWWSE LHG++  G           AT
Sbjct: 37  RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNGY----------AT 86

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
            FP  I   AS+N SL +++G  VSTEARA +N            AGLT WSPNIN+ RD
Sbjct: 87  VFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFRD 146

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ET GEDPF+ G+ AV ++RGLQ         DLN  P  +++  KH A   V 
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLN-HPRTIATP-KHIA---VH 194

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +     R+ FD  V+  D+E T+   F   + EG A SVMC+YN ++G P CA   LLN 
Sbjct: 195 SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNG 254

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
            VRG+W   G++V+DCD++  M   H F  D+   + A  LKAG DL+CG  Y    G A
Sbjct: 255 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 312

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
           + +G+V E  +D+SL  L+    RLG  +   +  Y  LG +D+ +  +  LA +AA E 
Sbjct: 313 IARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALALQAAAES 372

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
           IVLLKND NTLPL +     +AV+GP+A+A  A+  NY G     ++P+ G     G   
Sbjct: 373 IVLLKNDANTLPLRAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 430

Query: 470 VTYKTGC 476
           V+Y  G 
Sbjct: 431 VSYAQGA 437



 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 89/286 (31%), Positives = 142/286 (49%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 623 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQTLLER-AKASGKPLVVVLMSGSAVALN 681

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+T+ +  AI+ A YPG+ GG AIA ++ G  NPGGRLP+T+Y          ++  L 
Sbjct: 682 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 730

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
              S    GRTY+++ G  L+PFGYGLSYT+F Y+                         
Sbjct: 731 AYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYD------------------------- 765

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                  P +    L+  +  +     +N G+  G +V  VY + P +   + ++ ++GF
Sbjct: 766 ------APQLSTTTLQAGNPLQVTATVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 818

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV + AG  + + F  +A ++L+ VD +    + AG +T+FVG G
Sbjct: 819 QRVHLAAGEQRTLTFNLDA-RALSDVDRSGQRAVEAGNYTLFVGGG 863


>gi|418519424|ref|ZP_13085476.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
 gi|410704868|gb|EKQ63347.1| glucan 1,4-beta-glucosidase [Xanthomonas axonopodis pv. malvacearum
           str. GSPB2388]
          Length = 886

 Score =  298 bits (764), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 178/427 (41%), Positives = 241/427 (56%), Gaps = 40/427 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R   LV++M+ +EKV Q  + A  +PRLG+P YEWWSE LHG++  G           AT
Sbjct: 37  RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 86

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
            FP  I   AS+N SL +++G  VSTEARA +N            AGLT WSPNIN+ RD
Sbjct: 87  VFPQSIGLAASWNTSLMQQVGTVVSTEARAKFNQAGGPGKDHQRYAGLTIWSPNINIFRD 146

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ET GEDPF+ G+ AV ++RGLQ         DLN  P  +++  KH A   V 
Sbjct: 147 PRWGRGMETYGEDPFLTGQMAVGFIRGLQ-------GEDLN-HPRTIATP-KHIA---VH 194

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +     R+ FD  V+  D+E T+   F   + EG A SVMC+YN ++G P CA   LLN 
Sbjct: 195 SGPEPGRHGFDVDVSPHDVEATYTPAFRAALVEGQAGSVMCAYNALHGTPVCAADWLLNG 254

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
            VRG+W   G++V+DCD++  M   H F  D+   + A  LKAG DL+CG  Y    G A
Sbjct: 255 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 312

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
           + +G+V E  +D+SL  L+    RLG  +   +  Y  LG +D+ +  +  LA +AA E 
Sbjct: 313 IARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAAHRALALQAAAES 372

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
           IVLLKND NTLPL +     +AV+GP+A+A  A+  NY G     ++P+ G     G   
Sbjct: 373 IVLLKNDANTLPLRAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 430

Query: 470 VTYKTGC 476
           V+Y  G 
Sbjct: 431 VSYAQGA 437



 Score =  132 bits (333), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 89/286 (31%), Positives = 143/286 (50%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 623 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQTLLER-AKASGKPLVVVLMSGSAVALN 681

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+T+ +  AI+ A YPG+ GG AIA ++ G  NPGGRLP+T+Y          ++  L 
Sbjct: 682 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 730

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
              S    GRTY+++ G  L+PFGYGLSYT+F Y+                         
Sbjct: 731 AYVSYDMKGRTYRYFKGEPLFPFGYGLSYTRFAYD------------------------- 765

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                  P +    L+  +  +     +N G+  G +V  VY + P +   + ++ ++GF
Sbjct: 766 ------APQLSTTTLQAGNPLQVTATVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 818

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV + AG  + + F  +A ++L+ VD +    + AG++T+FVG G
Sbjct: 819 QRVHLAAGEQRTLTFHLDA-RALSDVDRSGQRAVEAGDYTLFVGGG 863


>gi|325922365|ref|ZP_08184139.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
 gi|325547147|gb|EGD18227.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas gardneri ATCC
           19865]
          Length = 889

 Score =  298 bits (762), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 176/427 (41%), Positives = 245/427 (57%), Gaps = 40/427 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R   LV++M+ +EKV Q  + A  +PRLG+P YEWWSE LHG++  G           AT
Sbjct: 40  RAAALVAQMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 89

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
            FP  I   AS+N  L +++G  VSTEARA +N            AGLT WSPNIN+ RD
Sbjct: 90  VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 149

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ET GEDPF+ G+ AV ++RGLQ         DL+  P  +++  KH A   V 
Sbjct: 150 PRWGRGMETYGEDPFLTGQLAVGFIRGLQ-------GDDLD-HPRTIATP-KHIA---VH 197

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +     R+ FD  V+ +D+E T+   F   + +G A SVMC+YN ++G P+CA   LLN 
Sbjct: 198 SGPEPGRHSFDVDVSPRDVEATYTPAFRAALIDGQAGSVMCAYNSLHGTPACAADWLLNG 257

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
            VRG+W   G++V+DCD++  M   H F  D+   + A +LKAG DL+CG  Y    G A
Sbjct: 258 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-SLKAGHDLNCGYAYRAL-GTA 315

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
           +++G+V E  +D+SL  L+    RLG  +   +  Y +LG +DI +  N  LA +AA + 
Sbjct: 316 IERGEVDEALLDQSLVRLFAARYRLGELEAPHKDPYATLGAKDIDNTANRALALKAAAQS 375

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
           IVLLKND NTLPL +     +AV+GP+A+A  A+  NY G     ++P+ G     G   
Sbjct: 376 IVLLKNDANTLPLKAG--ARLAVIGPNADALAALEANYQGTSSTPVTPLLGLRQRFGVHQ 433

Query: 470 VTYKTGC 476
           V+Y  G 
Sbjct: 434 VSYAQGA 440



 Score =  134 bits (338), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 89/286 (31%), Positives = 140/286 (48%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+T+ +  AI+ A YPG+ GG AIA ++ G  NPGGRLP+T+Y          ++  L 
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 733

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
           P  S    GRTY+++ G  L+PFGYGLSYT F Y     + T                  
Sbjct: 734 PYVSYDMKGRTYRYFKGEPLFPFGYGLSYTSFAYGAPQLSSTT----------------- 776

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                         L+     +     +N G+  G +V  VY + P +   + ++ ++GF
Sbjct: 777 --------------LQAGSTLQVTTTVRNTGTRAGDEVAQVYLQYP-DRPQSPLRSLVGF 821

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV ++ G  + + F  +A ++L+ VD      + AG++T+FVG G
Sbjct: 822 QRVHLKPGEQRTLTFTLDA-RALSDVDRTGQRAVEAGDYTLFVGGG 866


>gi|284174578|ref|ZP_06388547.1| Beta-xylosidase [Sulfolobus solfataricus 98/2]
 gi|356934752|gb|AET42953.1| beta-xylosidase-like protein [Sulfolobus solfataricus 98/2]
          Length = 754

 Score =  297 bits (760), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 218/699 (31%), Positives = 342/699 (48%), Gaps = 118/699 (16%)

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
           +T+FP  I   +++N  L   +   + ++ R +      G+    SP ++V RDPRWGR 
Sbjct: 101 STAFPQAIGLASTWNPELLTNVASTIRSQGRLI------GVNQCLSPVLDVCRDPRWGRC 154

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGV 239
            ET GEDP++V    + Y+ GLQ                ++ +  KH+AA+   +  + +
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQG-------------ETQLVATAKHFAAHGFPEGGRNI 201

Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
            + H    V  +++ ETFL PFE+ VK G   S+M +Y+ ++G+P   +P+LL   +R E
Sbjct: 202 AQVH----VGNRELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLTNILRQE 257

Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD-----LDCGQYYTNFTGNA 354
           W   G +V+D D I+ +   HK +A +K +A    L++G+D     +DC   Y      A
Sbjct: 258 WGFDGIVVSDYDGIRQLEAIHK-VASNKMEAAILALESGVDIEFPTIDC---YGEPLVTA 313

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
           +++G V E  ID++++ +  +  RLG  D      S   + +   ++ ELA +AARE IV
Sbjct: 314 IKEGLVSEAIIDRAVERVLRIKERLGLLDNPFVDESAVPERLDDRKSRELALKAARESIV 373

Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY---------AGIP-CRYMSPIAGF 464
           LLKN+ N LPL S  +  +AV+GP+AN    M+G+Y         +GI     +  IA  
Sbjct: 374 LLKNENNMLPL-SKNINKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIAKK 432

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL----AGLDLS----------- 509
            G   V Y  GC D+A +S      A E AK AD  I +    +GL LS           
Sbjct: 433 VGEGKVLYAKGC-DIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSWTDIPSEEEFK 491

Query: 510 ----VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
               V  E  DR  L L G Q +L+ ++ +  K P+ILV+++  G  +  +     +KAI
Sbjct: 492 KYQAVTGEGNDRASLRLLGVQEELLKELYKTGK-PIILVLIN--GRPLVLSPIINYVKAI 548

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---RPVDSLG 620
           + A +PGEEGG AIAD++FG +NP GRLPIT+         P+ +  +PL   R   S  
Sbjct: 549 IEAWFPGEEGGNAIADIIFGDYNPSGRLPITF---------PMDTGQIPLYYSRKPSSF- 598

Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
              R Y   +   L+ FGYGLSYTQF+Y+ L  T                          
Sbjct: 599 ---RPYVMLHSSPLFTFGYGLSYTQFEYSNLEVTP------------------------- 630

Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
                  ++    Y    +D +NVG+ +G +VV +Y        A  +K++ GF +V ++
Sbjct: 631 ------KEVGPLSYITILLDVKNVGNMEGDEVVQLYISKSFSSVARPVKELKGFAKVHLK 684

Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            G  +R+KF     ++L   D     ++  GE+ I +GN
Sbjct: 685 PGEKRRVKFAL-PMEALAFYDNFMRLVVEKGEYQILIGN 722


>gi|15899739|ref|NP_344344.1| Beta-xylosidase [Sulfolobus solfataricus P2]
 gi|13816430|gb|AAK43134.1| Beta-xylosidase [Sulfolobus solfataricus P2]
          Length = 754

 Score =  297 bits (760), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 218/699 (31%), Positives = 341/699 (48%), Gaps = 118/699 (16%)

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
           +T+FP  I   +++N  L   +   + ++ R +      G+    SP ++V RDPRWGR 
Sbjct: 101 STAFPQAIGLASTWNPELLTNVASTIRSQGRLI------GVNQCLSPVLDVCRDPRWGRC 154

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGV 239
            ET GEDP++V    + Y+ GLQ                ++ +  KH+AA+   +  + +
Sbjct: 155 EETYGEDPYLVASMGLAYITGLQG-------------ETQLVATAKHFAAHGFPEGGRNI 201

Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
            + H   R    ++ ETFL PFE+ VK G   S+M +Y+ ++G+P   +P+LL   +R E
Sbjct: 202 AQVHVGNR----ELRETFLFPFEVAVKIGKVMSIMPAYHEIDGVPCHGNPQLLTNILRQE 257

Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD-----LDCGQYYTNFTGNA 354
           W   G +V+D D I+ +   HK +A +K +A    L++G+D     +DC   Y      A
Sbjct: 258 WGFDGIVVSDYDGIRQLEAIHK-VASNKMEAAILALESGVDIEFPTIDC---YGEPLVTA 313

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
           +++G V E  ID++++ +  +  RLG  D      S   + +   ++ ELA +AARE IV
Sbjct: 314 IKEGLVSEAIIDRAVERVLRIKERLGLLDNPFVDESAVPERLDDRKSRELALKAARESIV 373

Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY---------AGIP-CRYMSPIAGF 464
           LLKN+ N LPL S  +  +AV+GP+AN    M+G+Y         +GI     +  IA  
Sbjct: 374 LLKNENNMLPL-SKNINKIAVIGPNANDPRNMLGDYTYTGHLNIDSGIEIVTVLQGIAKK 432

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL----AGLDLS----------- 509
            G   V Y  GC D+A +S      A E AK AD  I +    +GL LS           
Sbjct: 433 VGEGKVLYAKGC-DIAGESKEGFSEAIEIAKQADVIIAVMGEKSGLPLSWTDIPSEEEFK 491

Query: 510 ----VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
               V  E  DR  L L G Q +L+ ++ +  K P+ILV+++  G  +  +     +KAI
Sbjct: 492 KYQAVTGEGNDRASLRLLGVQEELLKELYKTGK-PIILVLIN--GRPLVLSPIINYVKAI 548

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---RPVDSLG 620
           + A +PGEEGG AIAD++FG +NP GRLPIT+         P+ +  +PL   R   S  
Sbjct: 549 IEAWFPGEEGGNAIADIIFGDYNPSGRLPITF---------PMDTGQIPLYYSRKPSSF- 598

Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
              R Y   +   L+ FGYGLSYTQF+Y+ L  T                          
Sbjct: 599 ---RPYVMLHSSPLFTFGYGLSYTQFEYSNLEVTP------------------------- 630

Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
                  ++    Y    +D +NVG+ +G +VV +Y        A  +K++ GF +V ++
Sbjct: 631 ------KEVGPLSYITILLDVKNVGNMEGDEVVQLYISKSFSSVARPVKELKGFAKVHLK 684

Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            G  +R+KF     ++L   D     ++  GE+ I +GN
Sbjct: 685 PGEKRRVKFAL-PMEALAFYDNFMRLVVEKGEYQILIGN 722


>gi|254445290|ref|ZP_05058766.1| Glycosyl hydrolase family 3 C terminal domain protein
           [Verrucomicrobiae bacterium DG1235]
 gi|198259598|gb|EDY83906.1| Glycosyl hydrolase family 3 C terminal domain protein
           [Verrucomicrobiae bacterium DG1235]
          Length = 730

 Score =  297 bits (760), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 233/720 (32%), Positives = 341/720 (47%), Gaps = 95/720 (13%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + F D  LP   R+ DL++ MTL+EKV  +G F  G+PRL + +Y   SE  HGV+  GP
Sbjct: 27  YPFQDPDLPNEERIDDLITCMTLEEKVDLMG-FVPGIPRLDV-KYTRISEGYHGVAQGGP 84

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN---LGRAGLTYWSPN 168
                      T FP      A+++ +L  ++    +TE R +Y      R+GL   +PN
Sbjct: 85  SNWGKRNPTPTTQFPQAYGLAATWDPALISRVSANQATEVRYLYQSPKYQRSGLVVMAPN 144

Query: 169 INVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY 228
            ++ARDPRWGR  E  GEDPF+ G  A  +  GL        A D + R LK +S  KH+
Sbjct: 145 ADLARDPRWGRTEEVYGEDPFLTGTLAAAFASGL--------AGD-HPRYLKATSLLKHF 195

Query: 229 AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCAD 288
            A    N    DR+   +   E+   E + +PFEM +++G A S+M +YN +NG P+   
Sbjct: 196 LA----NSNEDDRFFSSSDFDERLWREYYAKPFEMAIRDGGARSMMAAYNAINGTPAHVH 251

Query: 289 PKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYT 348
           P +L   V GEW L G I  D   +  +V+ HK   D    A A  +KAG++L    + T
Sbjct: 252 P-MLRDIVMGEWGLDGTICTDGGGLAHLVNQHKTYPDLPT-ATAACIKAGINLFLDNH-T 308

Query: 349 NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSD----EN 401
               +AV+Q  V E +ID  ++    + + LG  D  P+   Y ++G +         E 
Sbjct: 309 QAALDAVEQSLVTEAEIDDVIRGRIRLFLDLGLLD-PPELVPYSNIGHEPGLEPWELPET 367

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
                E  R+ IVLLKN+ N LPL+ +K+ +VA+VGP AN T  ++  Y+G P   + P 
Sbjct: 368 HAFVREVTRKSIVLLKNENNILPLDPSKINSVAIVGPLANTT--LLDWYSGTPPYAIPPR 425

Query: 462 AGFSGYANV-----TYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---- 512
            G  GYAN        K G + VA  S+ ++    E A + D  I++ G      A    
Sbjct: 426 DGIEGYANSGPFPSPAKFGSNWVADMSDTAL----EVAASRDVAIVVVGNHPESNAGWGV 481

Query: 513 --------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
                   E++DR+++ L   Q + I +V   A  P  +V++       A      N  A
Sbjct: 482 VTSPSEGKEAVDRQEIILQPDQEEFIQKV--YAANPNTIVVL-VSNFPYAMPWAAENAPA 538

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           I+   +  +E G A+ADV+FG +NPGG+   TW      Q+ P+    +R        GR
Sbjct: 539 IVHITHASQEQGNALADVLFGDYNPGGKTVQTWPKS-LDQLPPMMDYDIRR-------GR 590

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           TY +      YPFGYGLSYT F+ + L   K +                +DA+ T     
Sbjct: 591 TYMYSQHEPQYPFGYGLSYTTFELSKLKAPKKL---------------KADATAT----- 630

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
                        KV   N G  DG +VV +Y + P        KQ+ GFQRV V AG++
Sbjct: 631 ------------IKVRVANTGERDGDEVVQLYVRYPNSKVERPSKQLKGFQRVTVPAGKS 678


>gi|121308314|dbj|BAF43576.1| arabinofuranosidase/xylosidase homolog [Prunus persica]
          Length = 349

 Score =  297 bits (760), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 149/341 (43%), Positives = 212/341 (62%), Gaps = 10/341 (2%)

Query: 441 NATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
           + TV MIGNYAG+ C Y +P+ G   Y    ++ GC DV C  N    AA  AA+ ADAT
Sbjct: 1   DVTVTMIGNYAGVACGYTTPLQGIGRYTRTIHQAGCTDVHCNGNQLFGAAEAAARQADAT 60

Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
           +++ GLD S+EAE +DR  L LPG+Q +L+++VA  ++GP ILV+MS G +D+ FA+ + 
Sbjct: 61  VLVMGLDQSIEAEFVDRAGLLLPGHQQELVSRVARASRGPTILVLMSGGPIDVTFAKNDP 120

Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
            I AI+W GYPG+ GG AIADV+FG  NPGG+LP+TWY  +YV  LP+T M +R   + G
Sbjct: 121 RISAIIWVGYPGQAGGTAIADVLFGTTNPGGKLPMTWYPQNYVTHLPMTDMAMRADPARG 180

Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
           YPGRTY+FY GP ++PFG GLSYT F +NL      + V L  L+   N    S A    
Sbjct: 181 YPGRTYRFYRGPVVFPFGLGLSYTTFAHNLAHGPTLVSVPLTSLKATANSTMLSKA---- 236

Query: 681 CPGVLVNDLRCDDY--FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
              V V+   C+     +  VD +N GS DG+  ++V++ PP    A+  KQ++GF ++ 
Sbjct: 237 ---VRVSHADCNALSPLDVHVDVKNTGSMDGTHTLLVFTSPPDGKWASS-KQLMGFHKIH 292

Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           + AG  KR++   + CK L++VD      +P GEH + +G+
Sbjct: 293 IAAGSEKRVRIAVHVCKHLSVVDRFGIRRIPLGEHKLQIGD 333


>gi|384420163|ref|YP_005629523.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
 gi|353463076|gb|AEQ97355.1| glucan 1,4-beta-glucosidase [Xanthomonas oryzae pv. oryzicola
           BLS256]
          Length = 889

 Score =  297 bits (760), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 175/427 (40%), Positives = 244/427 (57%), Gaps = 40/427 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R  DLV+ M+ +EKV Q  + A  +PRLG+P YEWWSE LHG++  G           AT
Sbjct: 40  RAADLVAHMSREEKVAQAMNDAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 89

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYN-LGR--------AGLTYWSPNINVARD 174
            FP  I   AS+N  L +++G  VSTEARA +N  GR        AGLT WSPNIN+ RD
Sbjct: 90  VFPQAIGLAASWNTHLMQQVGTVVSTEARAKFNQAGRPGKDHKRYAGLTIWSPNINIFRD 149

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ET GEDPF+ G+ AV ++ GLQ         DL+  P  +++  KH A   V 
Sbjct: 150 PRWGRGMETYGEDPFLTGQMAVGFIHGLQ-------GDDLD-HPRTIATP-KHLA---VH 197

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +     R+ FD  V+ +D+E T+   F   + EG A +VMC+YN ++G P+CA   L+N 
Sbjct: 198 SGPEPGRHGFDVDVSPRDVEATYTPAFRAAIVEGQAGAVMCAYNSLHGTPACAADWLING 257

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
            VRG+W   G++V+DCD++  M   H F  D+   + A  LKAG DL+CG  Y    G A
Sbjct: 258 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGHAYREL-GTA 315

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREG 412
           + +G+V E  +D+SL  L+    RLG  +   +  Y  LG +D+ + ++  LA +AA E 
Sbjct: 316 IARGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYARLGAKDVDNAQHRALALQAAAES 375

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
           IVLLKN+ NTLPL +     +AV+GP+A+A  A+  NY G     ++P+ G     G   
Sbjct: 376 IVLLKNNANTLPLKAG--TRLAVIGPNADALAALEANYQGTSSAPVTPLLGLRQRFGAQQ 433

Query: 470 VTYKTGC 476
           V+Y  G 
Sbjct: 434 VSYAQGA 440



 Score =  128 bits (321), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 88/286 (30%), Positives = 140/286 (48%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 626 GLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLMSGSAVALN 684

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+T+ +  AI+ A YPG+ GG AIA ++ G  NPGGRLP+T+Y          ++  L 
Sbjct: 685 WAKTHAD--AIVAAWYPGQSGGTAIARMLAGDDNPGGRLPVTFYR---------STKDLP 733

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
              S    GRTY+++ G  L+PFGYGLSYT F Y+    + T                  
Sbjct: 734 AYVSYDMKGRTYRYFKGEPLFPFGYGLSYTCFAYDAPQLSSTA----------------- 776

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                         ++     +     +N G+  G +V  VY + P +   + ++ ++GF
Sbjct: 777 --------------VQAGSTLQVTTTVRNTGARAGDEVAQVYLQYP-DRPQSPLRSLVGF 821

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV + AG  + + F  +A ++L+ VD +    + AG +T+FVG G
Sbjct: 822 QRVHLAAGEQRTLTFNLDA-RALSDVDPSGQRAVEAGNYTLFVGGG 866


>gi|392537607|ref|ZP_10284744.1| Beta-glucosidase [Pseudoalteromonas marina mano4]
          Length = 870

 Score =  296 bits (757), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 172/443 (38%), Positives = 254/443 (57%), Gaps = 46/443 (10%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ + S     RV DLV+R+TL+EKV QL D +  + RL +P+Y WW+EALHGV+  G  
Sbjct: 33  LYLNESASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNIPEYNWWNEALHGVARAGK- 91

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTY 164
                    AT FP  I   A+F+E L  ++G A+S E RA ++   A        GLTY
Sbjct: 92  ---------ATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLAENNRSMYTGLTY 142

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           WSPNIN+ RDPRWGR  ET GEDP++  R AVN++ GLQ           N+  LK  + 
Sbjct: 143 WSPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQGD---------NTEYLKSVAT 193

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
            KHYA   V +   V R+  D   +++D+ ET+L  F+  + +   +SVMC+YN VNG P
Sbjct: 194 LKHYA---VHSGPEVSRHSDDYTASKKDLAETYLPAFKDVIAQTKVASVMCAYNSVNGTP 250

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
           +C + +L+   +R E++  GYIV+DC +I    D  +H  + +++  A A  LK G DL+
Sbjct: 251 ACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHN-IVNTEAKAAAMALKTGTDLN 309

Query: 343 CGQYYTN---FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDI--- 396
           CG ++ N   +   AV++G V+E D+DK+LK L     +LG FD +P+ V      I   
Sbjct: 310 CGDHHGNTYSYLSQAVKEGLVEEKDVDKALKRLMYARFKLGMFD-NPENVPYSDTSIDIV 368

Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
            S++++ L  EAA++ +VLLKN+Q  LPL     + VA++GP+A+    ++GNY G+P  
Sbjct: 369 GSNKHLALTQEAAKKSLVLLKNEQ-VLPLKGN--EKVALIGPNADNEAILLGNYNGMPIV 425

Query: 457 YMSPIAGFS---GYANVTYKTGC 476
            ++P        G  N+TY  G 
Sbjct: 426 PITPKLALEQRLGKNNLTYTAGS 448



 Score =  113 bits (283), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 90/303 (29%), Positives = 132/303 (43%), Gaps = 57/303 (18%)

Query: 494 AKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVIL 543
           A  AD  + + G+  ++E E +          DR ++ LP  Q  L+ ++ +  K P++L
Sbjct: 603 ANEADVIVFVGGISANLEGEEMPLQIDGFSHGDRTNINLPKSQLNLLKKLKQTGK-PIVL 661

Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYV 603
           V MS  G  +A    N NI AI+   YPGE  G A+  +++G+++P G+LPIT+Y    V
Sbjct: 662 VNMS--GSAMALNWENENIDAIIQGFYPGEAAGSALVSLLYGEYSPSGKLPITFYKS--V 717

Query: 604 QMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNK 663
             LP                RTYK+Y G  LYPFG+GLSY  FKY               
Sbjct: 718 SDLP-------DFKDYSMKNRTYKYYEGEVLYPFGFGLSYADFKY--------------- 755

Query: 664 LQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEI 723
               +N  ++ DA           DL             N  S    DVV VY   P   
Sbjct: 756 ----KNTRHSIDAGS--------GDLN------LTTTITNQSSFSADDVVQVYVSMPDAP 797

Query: 724 AATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG-GV 782
             T  KQ++GF+ + ++      IKF     K L+ ++     +   G   I VG+G G+
Sbjct: 798 IKTPNKQLVGFKHITLKNESKNDIKFTIPKNK-LSYINEQGIAVAYKGRLIITVGSGQGI 856

Query: 783 SFP 785
             P
Sbjct: 857 KIP 859


>gi|90021134|ref|YP_526961.1| Beta-glucosidase [Saccharophagus degradans 2-40]
 gi|89950734|gb|ABD80749.1| b-xylosidase-like protein [Saccharophagus degradans 2-40]
          Length = 893

 Score =  295 bits (756), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 173/449 (38%), Positives = 259/449 (57%), Gaps = 42/449 (9%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
           +++ F D+SL    RV DLVSR+T  EK+ Q+ +    + RLG+P Y WW+E+LHGV+  
Sbjct: 41  ATYPFRDASLSVDARVDDLVSRLTTTEKIAQMFNDTPAIERLGIPAYNWWNESLHGVARA 100

Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGR------AG 161
           G           AT +P  I   ++F+E L  ++  ++S E RA Y+  L +       G
Sbjct: 101 GK----------ATVYPQAIGLASTFDEDLMLRVATSISDEGRAKYHDFLSKDVRTIYGG 150

Query: 162 LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
           LT+WSPNIN+ RDPRWGR  ET GEDPF+ GR A+N+V+G+Q         + NS  LK 
Sbjct: 151 LTFWSPNINIFRDPRWGRGQETYGEDPFLTGRMAINFVKGIQ-------GENDNSDYLKA 203

Query: 222 SSCCKHYAAYD-VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
            +  KHYA +   +  +  D YH     T +D+ ET+L  F M + E +  S+MC+YNRV
Sbjct: 204 VATIKHYAVHSGPEKTRHSDDYH----PTRKDLFETYLPAFRMAIAETNVQSLMCAYNRV 259

Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNH-KFLADSKEDAVAQTLKAGL 339
           +G P+C + +L+ + +RG+   +GY+V+DC +I    ++    + DS  +A A  +K+G 
Sbjct: 260 DGAPACGNNELMQEILRGDMGFNGYVVSDCGAIADFYESRSHHVVDSPAEAAAWAVKSGT 319

Query: 340 DLDCGQYYTNFTGN---AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQ 394
           DL+CG  + N   N   A+QQG + E  ID ++K L+   ++LG FD   +  Y  +G  
Sbjct: 320 DLNCGDSHGNTYTNLHYALQQGLITEDYIDIAVKRLFKARIKLGMFDEQDRVPYSEIGMD 379

Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
            + S +++ L  EAA + IVLLKN+   LPL  A VK VAV+GP+A     ++GNY G+P
Sbjct: 380 VVGSPKHLALTQEAAEKSIVLLKNN-GVLPL-KAGVK-VAVIGPNAVDEDVLVGNYHGVP 436

Query: 455 CRYMSPIAGF---SGYANVTYKTGCDDVA 480
            + + P+ G     G ANV Y  G   +A
Sbjct: 437 VKPVLPLEGIVNRVGEANVFYAPGSAQIA 465



 Score =  112 bits (280), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 91/301 (30%), Positives = 126/301 (41%), Gaps = 55/301 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A  AA+ AD  I + G+D  +E E +          DR  + LP  QT L+ Q+    K 
Sbjct: 620 ALAAARKADVIIFMGGIDAHLEGEEMPLELDGFTHGDRTHINLPKVQTNLLKQLKATGK- 678

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV++V  S  G  +A    +  + AIL A YPGE  G A+A++++G  +P GRLP+T+Y 
Sbjct: 679 PVVMVNFS--GSAMALNWESEKLDAILQAFYPGEATGTALANILWGDVSPSGRLPVTFYK 736

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
           G  V  LP         +      RTYKFY G  LY FG+GL Y  F YN L        
Sbjct: 737 G--VDDLP-------AFNDYHMENRTYKFYRGEPLYAFGHGLGYVDFAYNNL-------- 779

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                   V+ N           V   N G     DV  VY   
Sbjct: 780 ------------------------VVANTAEAGKALPIAVSVTNTGKMQAEDVAQVYISL 815

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
               A T I+ +  F+R  + AG +  ++F   A + L  +D    T    G   + VG+
Sbjct: 816 LDAPANTPIRDLKAFKRTKLAAGESTELEFNLPA-RVLTYIDDNGKTQTYTGRVEVTVGS 874

Query: 780 G 780
           G
Sbjct: 875 G 875


>gi|182415162|ref|YP_001820228.1| glycoside hydrolase family 3 [Opitutus terrae PB90-1]
 gi|177842376|gb|ACB76628.1| glycoside hydrolase family 3 domain protein [Opitutus terrae
           PB90-1]
          Length = 747

 Score =  295 bits (756), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 223/726 (30%), Positives = 347/726 (47%), Gaps = 76/726 (10%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           F D  LP   R+ DL+ RMTL+EK+  +   A  VPRLG+ +     E  HGV+  GP  
Sbjct: 34  FQDPELPAEQRIDDLIGRMTLEEKIDCMAMRA-AVPRLGV-KGSRHIEGYHGVAQGGPSN 91

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN---LGRAGLTYWSPNIN 170
                    T FP      A+++  L +++    + EAR ++      RAGL   +PN +
Sbjct: 92  WGRRNPTATTQFPQAYGLGATWDPELIRQVAAQEAEEARYLFQSPRYDRAGLIVRAPNAD 151

Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
           +ARDPRWGR  E  GEDPF  G  A  +VRGLQ           + R  K  S  KH+ A
Sbjct: 152 LARDPRWGRTEEVYGEDPFHAGTLATAFVRGLQGD---------DPRYFKAVSLVKHFLA 202

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
              ++ +     +F    +E+   E + +PFEM + +G A ++M +YN VNG P+   P 
Sbjct: 203 NSNEDGRESSSSNF----SERQWREYYAKPFEMAIVDGGAPALMAAYNAVNGTPAHVHP- 257

Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF 350
           +L   V  EW L+G +  D   ++++V+ H    D    A A  +KAG++    ++    
Sbjct: 258 MLRDIVMAEWKLNGILCTDGGGLRLLVEKHHAFPDLP-SAAAACVKAGINHFLDRHKDAV 316

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGK----QDICSDENIEL 404
           T  AV +G + E D+D +L+ L+ V ++LG  D   +  Y ++G+    +     +   L
Sbjct: 317 T-EAVARGSITERDLDAALRGLFRVSLKLGLLDPDERVPYAAIGRNGEAEPWLRPDTQAL 375

Query: 405 AAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF 464
             +  +  IVLLKN    LPL+  KVKTVA+VGP  N    +   Y G P   + P  G 
Sbjct: 376 VRKVTQRSIVLLKNSGALLPLDRTKVKTVALVGPLVNTV--LPDWYGGTPPYTVPPSIGV 433

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD------------LSVEA 512
              A    K G   +A   +    AA E A+T++  I+  G D             S   
Sbjct: 434 EKVAGEGVKVGW--LADMGD----AAVELARTSEIAIVCVGNDPISAGGWELVRTPSEGK 487

Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
           E++DR+DL LP  Q + I +V  +A  P  +V++ +     A      ++ AI+   +  
Sbjct: 488 EAVDRKDLALPRDQEKFIRRV--LAANPRTIVVLIS-NFPYAMPWVVKHVPAIVHLTHAS 544

Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
           +E G A+ DV++G+ NP G+L  TW      Q+ P+    L         GRTY+++ G 
Sbjct: 545 QELGHALGDVLWGEVNPDGKLAQTWPK-SLKQLPPMMDYDL-------THGRTYQYFKGE 596

Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT---SDASKTRCPGVLVNDL 689
             +PFG+GLSYT F  +       ++V L+  +H      T   S A +T  P  +++  
Sbjct: 597 PQFPFGFGLSYTTFNLS------NLRVGLDVARHVGAGAETPAESPAPRTFAPNAILS-- 648

Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
                    V+  N G+  G +VV VY++ P    +  +KQ+ GFQR+ V AG    ++ 
Sbjct: 649 -------IAVEVTNTGTRAGDEVVQVYARYPHSKVSRPLKQLCGFQRISVAAGETAHVRL 701

Query: 750 VFNACK 755
              A +
Sbjct: 702 QLPASR 707


>gi|295135996|ref|YP_003586672.1| beta-glucosidase [Zunongwangia profunda SM-A87]
 gi|294984011|gb|ADF54476.1| putative beta-glucosidase [Zunongwangia profunda SM-A87]
          Length = 796

 Score =  295 bits (755), Expect = 6e-77,   Method: Compositional matrix adjust.
 Identities = 224/738 (30%), Positives = 352/738 (47%), Gaps = 114/738 (15%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P      EA+HG   VG            T FPT I   +++N  L KK+   ++ 
Sbjct: 126 RLGIPLL-LEEEAMHGHMAVG-----------TTVFPTAIGQASTWNPDLIKKMAHVIAK 173

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E RA     +   T + P I++AR+PRW R+ ET GEDP+++     + V G Q    HE
Sbjct: 174 EIRA-----QGSNTAYGPIIDIAREPRWSRVEETFGEDPYLIAEMGKSMVTGFQG--SHE 226

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDA--RVTEQDMEETFLRPFEMCVKE 267
             +DL S    V++  KH+AAY V      +  H  A   + ++D+ + ++ P +  V  
Sbjct: 227 --SDLKSNE-HVAATLKHFAAYGVS-----EGGHNGAAVHIGQRDLFQNYMYPVKEAVDN 278

Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSK 327
           G   SVM +Y+ ++G+PS A   LL   ++ +W   G++++D  SI+ ++ +H  + D++
Sbjct: 279 G-VMSVMTAYSSIDGVPSTAHKNLLTNILKEKWGFKGFVISDLASIEGLLGDH-HIVDTE 336

Query: 328 EDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
           EDA A  + AG+D+D G   Y +   +AV  GKV E  ID++++ + TV  +LG F+   
Sbjct: 337 EDAAAMAMNAGVDVDLGGNGYDDALIDAVNAGKVAEERIDEAVRRILTVKFKLGLFENPY 396

Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
                 ++ + + E+IELA E AR+ I +LKN+ N LPLN  +++ +AV+G +A+     
Sbjct: 397 ANEKQAEKIVRNSEHIELAREVARQSITMLKNEDNILPLNK-ELQNIAVIGSNADMQYNQ 455

Query: 447 IGNYAGIPCRYMSPIAGFSGY------ANVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
           +G+Y   P    + I    G       AN+ Y  G   V   +  +I AA EAAK A+  
Sbjct: 456 LGDYTA-PQSEENIITVLEGIQHKMPNANIEYVKGT-AVRDTTQTNIPAAVEAAKNAEVA 513

Query: 501 IILAG----LDLSVE----------------------AESLDREDLWLPGYQTQLINQVA 534
           I++ G     D   E                       E  DR  L L G Q +L+  V 
Sbjct: 514 IVVLGGSSARDFKTEYLETGAATISSKEDQVLSDMESGEGYDRSTLNLMGKQLELLQAV- 572

Query: 535 EVAKG-PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRL 593
            VA G P +LV++   G  +       N+  IL A YPG+EGG AIADV+FG FNP GRL
Sbjct: 573 -VATGTPTVLVLIK--GRPLLLNWPAENVPVILDAWYPGQEGGSAIADVIFGDFNPAGRL 629

Query: 594 PITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT-YKFYNGPTLYPFGYGLSYTQFKYNLLS 652
           P++         +P +   +    +  +P R  Y   +   LYPFGYGLSY++FKY+ L 
Sbjct: 630 PVS---------VPKSLGQIPVYYNYWFPNRRDYVETDAKPLYPFGYGLSYSEFKYSDLK 680

Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDV 712
                                + + K R           +   E  +   N    DG +V
Sbjct: 681 --------------------VATSGKGR-----------NTKIEISLKISNTSKVDGDEV 709

Query: 713 VIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
           + +Y +       + +KQ+  F+RV ++AG  K ++F     K L++ D      + AGE
Sbjct: 710 IQLYIRDMVSTVLSPVKQLRAFERVSIKAGETKTVQFEL-LPKELSLFDTEMKQKVQAGE 768

Query: 773 HTIFVGNGGVSFPIHLNF 790
             + +G       +   F
Sbjct: 769 FKLMIGASSEDIRLETTF 786


>gi|423290405|ref|ZP_17269254.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
           CL02T12C04]
 gi|392665792|gb|EIY59315.1| hypothetical protein HMPREF1069_04297 [Bacteroides ovatus
           CL02T12C04]
          Length = 861

 Score =  294 bits (753), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 174/449 (38%), Positives = 247/449 (55%), Gaps = 39/449 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+SL    R +DL+ R+TL+EKV  + + +  +PRLG+ +YEWW+EALHGV   G   
Sbjct: 26  YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
                   AT FP  I   ASFN+SL  ++  A S EAR    + G +       GLT+W
Sbjct: 84  --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PN+N+ RDPRWGR  ET GEDP++ G+  +  VRGLQ  E        ++R  K+ +C 
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DARYDKLHACA 187

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ FDA  +  +D+ ET+L  F+  V++     VMC+YNR  G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +R EW   G +V+DC +I        H+   D KE A A  ++AG DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPD-KEHASADAVRAGTDLE 303

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
           CG  Y +   +AV+ G + E +ID SLK L T    LG  D  P +  +    + S E+ 
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWAEIPTSVLNSKEHQ 362

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLL+N  N LPLN+     VAV+GP+AN +V   GNY GIP   ++ + 
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420

Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
                     + Y+ GCD V  K+  S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449



 Score =  102 bits (255), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 85/297 (28%), Positives = 127/297 (42%), Gaps = 56/297 (18%)

Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
           AD  +   G+  S+E E +          DR D+ LP  Q    + +  + K    +V +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---DLLKALKKAGKKVVFI 654

Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           +  G  I      T  +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y    V  L
Sbjct: 655 NYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD--VNQL 712

Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
           P         +     GRTY++     L+PFG+GLSYT F Y     +K      N +  
Sbjct: 713 P-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTTFTYGEAKLSK------NTIAK 759

Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
             N+  T                         +   NVG  DG +VV VY + P +    
Sbjct: 760 GENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGDKEGP 794

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
               +  F+RV + AG+ + +       ++    D  +NT+ P  E T  +  GG S
Sbjct: 795 RYT-LRAFKRVHIPAGKTESVAIPLTG-ENFEWFDAESNTMRPL-EGTYELLYGGTS 848


>gi|319788503|ref|YP_004147978.1| glycoside hydrolase [Pseudoxanthomonas suwonensis 11-1]
 gi|317467015|gb|ADV28747.1| glycoside hydrolase family 3 domain protein [Pseudoxanthomonas
           suwonensis 11-1]
          Length = 916

 Score =  294 bits (753), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 175/445 (39%), Positives = 258/445 (57%), Gaps = 32/445 (7%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+SL +  R   LVSRMTL+EK  Q+ + +  + RLGLP Y+WW+EALHGV+  G   
Sbjct: 50  WLDTSLSFEERAAALVSRMTLEEKAAQMQNDSPAIERLGLPAYDWWNEALHGVARAG--- 106

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-------LGR-AGLTYW 165
                  GAT FP  I   ASF+  L  ++  A+S EARA ++        GR  GLT+W
Sbjct: 107 -------GATVFPQAIGMAASFDVPLMDQVSAAISDEARAKHHDFLRKGEHGRYQGLTFW 159

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPNIN+ RDPRWGR  ET GEDPF+  R  V++VRGLQ ++  +    L+ +  K+ +  
Sbjct: 160 SPNINIFRDPRWGRGQETYGEDPFLTTRMGVSFVRGLQGMD-PQTGQPLDPKYRKLDATA 218

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A   V +    DR+ FD   ++QD+ +T+L  FE  VKE D  +VM +YNRV G  +
Sbjct: 219 KHFA---VHSGPEADRHTFDVHPSKQDLYDTYLPAFESLVKEADVYAVMGAYNRVYGESA 275

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
                LL  T+R +W   GY+++DC +I  +  NHK + ++ E+A A  +K G +L+CG 
Sbjct: 276 SGSKFLLLDTLRRDWGFDGYVMSDCWAIVDIWKNHKIV-ETPEEAAALAVKNGTELNCGS 334

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE-- 403
            Y +    AV++G + E ++D +L  L+   M LG FD  P+ V   +     +++ E  
Sbjct: 335 TYADHLPVAVKKGLISEAELDDALTRLFVARMELGMFD-PPEQVRWAQVPYSVNQSAEHD 393

Query: 404 -LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA + A+E +VLLKND   LPL S  ++ +AVVGP A+ T+A++GNY G P   ++ + 
Sbjct: 394 ALARKMAQESLVLLKND-GVLPL-SKDIRRLAVVGPTADDTMALLGNYYGTPADPVTILR 451

Query: 463 GFSGYA---NVTYKTGCDDVACKSN 484
           G    A   +V Y  G D V  + +
Sbjct: 452 GIREAAPGVDVVYARGVDLVEGRDD 476



 Score =  137 bits (345), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 97/301 (32%), Positives = 142/301 (47%), Gaps = 56/301 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A EAA +ADA + + GL   VE E +          DR D+ LP  Q +L+  V    K 
Sbjct: 643 ALEAANSADAVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDIRLPATQQKLLEAVHATGK- 701

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV++V+ +   + I +A    N+  IL A YPG+ GG A+ + +FG +NPGGRLP+T+Y+
Sbjct: 702 PVVMVLTTGSALGIDWA--RRNVPGILVAWYPGQRGGTAVGEALFGDYNPGGRLPVTFYS 759

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
            D  + LP       P D      RTY+++ G  L+PFG+GLSYT F Y+ L        
Sbjct: 760 AD--EKLP-------PFDDYAMKERTYRYFTGQPLFPFGHGLSYTSFGYSGL-------- 802

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
              KL   R                        D     V  +N G   G +VV +Y  P
Sbjct: 803 ---KLDRKR--------------------AGAGDEVTVSVTVKNQGKRAGDEVVQLYLAP 839

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN--TLLPAGEHTIFV 777
                   +K++ GFQRV ++ G ++ + F     + L + D AA   T+ P G + + V
Sbjct: 840 VKPQRERALKELRGFQRVHLQPGESRTVTFSIVPERDLRVYDEAAGRYTVDP-GRYEVQV 898

Query: 778 G 778
           G
Sbjct: 899 G 899


>gi|21232323|ref|NP_638240.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
 gi|21114093|gb|AAM42164.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
          Length = 888

 Score =  294 bits (752), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 176/427 (41%), Positives = 240/427 (56%), Gaps = 40/427 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R   LV++M+ +EKV Q  + A  +PRLG+P YEWWSE LHG++  G           AT
Sbjct: 39  RAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 88

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
            FP  I   AS+N  L +++G  VSTEARA +N            AGLT WSPNIN+ RD
Sbjct: 89  VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 148

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ET GEDPF+ G+ AV ++RGLQ         DL   P  +++  KH A   V 
Sbjct: 149 PRWGRGMETYGEDPFLTGQLAVGFIRGLQ-------GDDLE-HPRTIATP-KHIA---VH 196

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +     R+ FD  V+ +D+E T+   F   + EG A SVMC+YN ++G P+CA   LLN 
Sbjct: 197 SGPEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNG 256

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
            VRG+W   G++V+DCD++  M   H F  D+   + A +LKAG DL+CG  Y    G A
Sbjct: 257 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-SLKAGHDLNCGTAYRAL-GTA 314

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDICSDENIELAAEAAREG 412
           +++G+V E  +D+SL  L+    RLG        +Y  LG +DI +  N  LA +AA E 
Sbjct: 315 IERGEVDEALLDQSLVRLFAARYRLGELQAPRKDRYARLGAKDIDNAGNRALALQAAAES 374

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
           IVLLKN   TLPL +     +AV+GP+A+A  A+  NY G   + ++P+ G     G   
Sbjct: 375 IVLLKNANATLPLKAG--TRLAVIGPNADALAALEANYQGTSSQPVTPLLGLRQRFGAQQ 432

Query: 470 VTYKTGC 476
           V Y  G 
Sbjct: 433 VRYAQGA 439



 Score =  132 bits (333), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 88/286 (30%), Positives = 139/286 (48%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 625 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 683

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+T+ +  AI+ A YPG+ GG AIA  + G  NPGGRLP+T+Y          ++  L 
Sbjct: 684 WAKTHAD--AIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFYR---------STKDLP 732

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
           P  S    GRTY+++ G  L+PFGYGLSYT F Y+    + T                  
Sbjct: 733 PYVSYDMKGRTYRYFKGEALFPFGYGLSYTSFAYDAPQLSSTT----------------- 775

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                         L+     +     +N G+  G +V  VY + P +   + ++ ++GF
Sbjct: 776 --------------LQAGSPLQVTTTVRNTGTRAGDEVAQVYLQYP-DRPQSPLRSLVGF 820

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV ++ G  + + F  +A ++L+ VD      + AG++ +FVG G
Sbjct: 821 QRVHLQPGEQRTLTFTLDA-RALSDVDRTGTRAVEAGDYRLFVGGG 865


>gi|325914134|ref|ZP_08176487.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325539637|gb|EGD11280.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 874

 Score =  294 bits (752), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 181/449 (40%), Positives = 247/449 (55%), Gaps = 43/449 (9%)

Query: 45  LGLQMSSFLFC---DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE 101
           LGL +    F    D S     R   LV++M+ DEKV Q  + A  +PRL +P YEWWSE
Sbjct: 3   LGLCLPCIAFAAPADRSGTPEQRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSE 62

Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR-- 159
            LHG++  G           AT FP  I   AS+N +L +++G  VSTEARA +N     
Sbjct: 63  GLHGIARNG----------YATVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGP 112

Query: 160 -------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
                  AGLT WSPNIN+ RDPRWGR  ET GEDPF+ G+ AV ++RGLQ         
Sbjct: 113 GKDHKRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQ-------GD 165

Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
           DLN  P  +++  KH A   V +     R+ FD  V+ +DME T+   F   + +G A S
Sbjct: 166 DLN-HPRTIATP-KHIA---VHSGPEPGRHGFDVDVSPRDMEATYTPAFRAALVDGQAWS 220

Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
           VMC+YN ++G P+CA   LLN  VRG+W   G++V+DCD++  M   H F  D+   + A
Sbjct: 221 VMCAYNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA 280

Query: 333 QTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVS 390
             LKAG DL+CG  Y    G A+++G+V E  +D+SL  L+    RLG  +   +  Y  
Sbjct: 281 -ALKAGHDLNCGHAYREL-GTAIERGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYAR 338

Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
           LG +D+ +  +  LA +AA E IVLLKN   TLPL +     +AV+GP+A+A  A+  NY
Sbjct: 339 LGAKDVDNAAHRALALQAAAESIVLLKNTATTLPLKAG--TRLAVIGPNADALAALEANY 396

Query: 451 AGIPCRYMSPIAGFS---GYANVTYKTGC 476
            G     ++P+ G     G   V Y  G 
Sbjct: 397 QGTSATPITPLLGLRQHFGAQQVRYAQGA 425



 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 91/294 (30%), Positives = 143/294 (48%), Gaps = 55/294 (18%)

Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
           +DA +   GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+M
Sbjct: 603 SDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAPQQALLER-AKASGKPLVVVLM 661

Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           S   V + +A+ N +  AI+ A YPG+ GG AIA  + G  NPGGRLP+T+Y        
Sbjct: 662 SGSAVALNWAKANAD--AIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFYR------- 712

Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
             ++  L    S    GRTY+++ G  L+PFGYGLSYT F Y+                 
Sbjct: 713 --STKDLPAYVSYDMKGRTYRYFKGEPLFPFGYGLSYTSFAYD----------------- 753

Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
                          P +    L+  +  +     +N GS  G +V  VY + P +   +
Sbjct: 754 --------------APRLSTRTLQAGNPLQVTTTVRNTGSRAGDEVAQVYLQYP-DRPQS 798

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
            ++ ++GFQRV ++ G  + + F  +A ++L+ VD +    + AGE+ +FVG G
Sbjct: 799 PLRSLVGFQRVHLKPGEQRELTFTLDA-RALSDVDRSGQRAVEAGEYRVFVGGG 851


>gi|66767544|ref|YP_242306.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
 gi|66572876|gb|AAY48286.1| glucan 1,4-beta-glucosidase [Xanthomonas campestris pv. campestris
           str. 8004]
          Length = 888

 Score =  293 bits (751), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 176/427 (41%), Positives = 240/427 (56%), Gaps = 40/427 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R   LV++M+ +EKV Q  + A  +PRLG+P YEWWSE LHG++  G           AT
Sbjct: 39  RAAALVAQMSREEKVAQSMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 88

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
            FP  I   AS+N  L +++G  VSTEARA +N            AGLT WSPNIN+ RD
Sbjct: 89  VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 148

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ET GEDPF+ G+ AV ++RGLQ         DL   P  +++  KH A   V 
Sbjct: 149 PRWGRGMETYGEDPFLTGQLAVGFIRGLQ-------GDDLE-HPRTIATP-KHIA---VH 196

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +     R+ FD  V+ +D+E T+   F   + EG A SVMC+YN ++G P+CA   LLN 
Sbjct: 197 SGPEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNG 256

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
            VRG+W   G++V+DCD++  M   H F  D+   + A +LKAG DL+CG  Y    G A
Sbjct: 257 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-SLKAGHDLNCGTAYRAL-GTA 314

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDICSDENIELAAEAAREG 412
           +++G+V E  +D+SL  L+    RLG        +Y  LG +DI +  N  LA +AA E 
Sbjct: 315 IERGEVDEALLDQSLVRLFAARYRLGELQAPRKDRYARLGAKDIDNAGNRALALQAAAES 374

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
           IVLLKN   TLPL +     +AV+GP+A+A  A+  NY G   + ++P+ G     G   
Sbjct: 375 IVLLKNANATLPLKAG--TRLAVIGPNADALAALEANYQGTSSQPVTPLLGLRQRFGAQQ 432

Query: 470 VTYKTGC 476
           V Y  G 
Sbjct: 433 VRYAQGA 439



 Score =  132 bits (333), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 88/286 (30%), Positives = 139/286 (48%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 625 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 683

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+T+ +  AI+ A YPG+ GG AIA  + G  NPGGRLP+T+Y          ++  L 
Sbjct: 684 WAKTHAD--AIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFYR---------STKDLP 732

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
           P  S    GRTY+++ G  L+PFGYGLSYT F Y+    + T                  
Sbjct: 733 PYVSYDMKGRTYRYFKGEALFPFGYGLSYTSFAYDAPQLSSTT----------------- 775

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                         L+     +     +N G+  G +V  VY + P +   + ++ ++GF
Sbjct: 776 --------------LQAGSPLQVTTTVRNTGTRAGDEVAQVYLQYP-DRPQSPLRSLVGF 820

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV ++ G  + + F  +A ++L+ VD      + AG++ +FVG G
Sbjct: 821 QRVHLQPGEQRTLTFTLDA-RALSDVDRTGTRAVEAGDYRLFVGGG 865


>gi|397691073|ref|YP_006528327.1| beta-glucosidase [Melioribacter roseus P3M]
 gi|395812565|gb|AFN75314.1| beta-glucosidase [Melioribacter roseus P3M]
          Length = 923

 Score =  293 bits (750), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 168/425 (39%), Positives = 249/425 (58%), Gaps = 37/425 (8%)

Query: 61  YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
           Y  R+ DL+S MT +EK++QL + A  +PRLGL  Y +W+E+LHGV           +  
Sbjct: 113 YKERLNDLISLMTTEEKIKQLTNQADSIPRLGLRAYNYWNESLHGV-----------LAE 161

Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRI 180
           GATSFP  I   A+++  L  ++  AVS EARA+  L   GLTYWSP IN+ARDPRWGR 
Sbjct: 162 GATSFPQAIALGATWDPRLVNRVATAVSDEARALNRLYGKGLTYWSPTINIARDPRWGRN 221

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            E+  EDP+++ R  V +++G+Q    +          LK  +  KH+ A    N +   
Sbjct: 222 EESYSEDPYLLSRMGVAFIKGMQGDHPYY---------LKTVATPKHFIA----NNEEER 268

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
           R+   + V  +++ E +L  F+  + E  A S+M +YN +N +PS A+  L+   +R +W
Sbjct: 269 RHTGSSDVDMRNLYEYYLPAFKSAIVEARAYSIMGAYNELNHVPSNANMFLMTDLLRRQW 328

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
              GY+V+DC +I  M+  HKF     E AVA+++ AG DL+CGQ Y  F  +A+ +G +
Sbjct: 329 GFEGYVVSDCGAIHDMLYGHKFFKTGAE-AVARSILAGCDLNCGQAYREFIKDALDEGLL 387

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLGKQDICSDENIELAAEAAREGIVLLK 417
           +E DID +L  + +   RLG FD  P+   Y S+GK  + S EN  LA +AAR+ IVLLK
Sbjct: 388 REKDIDSALFRVLSARFRLGEFD-PPELVPYSSIGKDKLDSKENRRLALDAARKSIVLLK 446

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYAN-----VTY 472
           N+ + LP++ +K+K++AV+GP  NA  A +G Y+G P   +SP+ G    A+     V Y
Sbjct: 447 NN-DILPIDKSKIKSIAVIGP--NAREAQLGIYSGFPNVLISPLEGIKNKADSLDIRVGY 503

Query: 473 KTGCD 477
             GCD
Sbjct: 504 VKGCD 508



 Score =  125 bits (314), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 88/290 (30%), Positives = 139/290 (47%), Gaps = 41/290 (14%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A + A   D  I++ G+   +  E LDR+++ LP  Q +L+ Q AEV   P I++++  G
Sbjct: 661 AKKIAAENDLVILVLGITPGISQEELDRKEIELPSVQRELVKQTAEV--NPNIVIVLVNG 718

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
           G  +A A      KAI+   Y GE GG+A+ADV+FG +NPGG+LP T+Y     + LP  
Sbjct: 719 G-PVALAGAEKYAKAIVENWYNGEFGGQALADVLFGDYNPGGKLPQTFYAS--TEQLP-- 773

Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
             P+   D +  P RTY + N   L+PFG+GLSYT FKY+ L                  
Sbjct: 774 --PMSDYDIINNP-RTYMYLNEQALFPFGHGLSYTTFKYDSLK----------------- 813

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
                         ++ N L   D    +    NVG+ +G +VV +Y+           K
Sbjct: 814 --------------IVSNTLNETDTLSLQFRLTNVGNRNGDEVVQIYASCKDAKFKVPRK 859

Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           Q+  F+R+ ++ G +K ++F     +      Y  + ++  G   I +G+
Sbjct: 860 QLKRFRRLTLQTGESKVLEFKIPVDELAFYSTYENDFVVEKGAWEILIGS 909


>gi|188990656|ref|YP_001902666.1| beta-glucosidase [Xanthomonas campestris pv. campestris str. B100]
 gi|167732416|emb|CAP50610.1| exported beta-glucosidase [Xanthomonas campestris pv. campestris]
          Length = 888

 Score =  293 bits (750), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 176/427 (41%), Positives = 240/427 (56%), Gaps = 40/427 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R   LV++M+ +EKV Q  + A  +PRLG+P YEWWSE LHG++  G           AT
Sbjct: 39  RAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWSEGLHGIARNG----------YAT 88

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
            FP  I   AS+N  L +++G  VSTEARA +N            AGLT WSPNIN+ RD
Sbjct: 89  VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 148

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ET GEDPF+ G+ AV ++RGLQ         DL   P  +++  KH A   V 
Sbjct: 149 PRWGRGMETYGEDPFLTGQLAVGFIRGLQ-------GDDLE-HPRTIATP-KHIA---VH 196

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +     R+ FD  V+ +D+E T+   F   + EG A SVMC+YN ++G P+CA   LLN 
Sbjct: 197 SGPEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNG 256

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
            VRG+W   G++V+DCD++  M   H F  D+   + A +LKAG DL+CG  Y    G A
Sbjct: 257 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-SLKAGHDLNCGTAYRAL-GTA 314

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDICSDENIELAAEAAREG 412
           +++G+V E  +D+SL  L+    RLG        +Y  LG +DI +  N  LA +AA E 
Sbjct: 315 IERGEVDEALLDQSLVRLFAARYRLGELQAPRKDRYARLGAKDIDNAGNRALALQAAAES 374

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
           IVLLKN   TLPL +     +AV+GP+A+A  A+  NY G   + ++P+ G     G   
Sbjct: 375 IVLLKNANATLPLKAG--TRLAVIGPNADALAALEANYQGTSSQPVTPLLGLRQRFGAQQ 432

Query: 470 VTYKTGC 476
           V Y  G 
Sbjct: 433 VRYAQGA 439



 Score =  134 bits (336), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 89/286 (31%), Positives = 140/286 (48%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 625 GLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 683

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+T+ +  AI+ A YPG+ GG AIA  + G  NPGGRLP+T+Y          ++  L 
Sbjct: 684 WAKTHAD--AIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFYR---------STKDLP 732

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
           P  S    GRTY+++ G  L+PFGYGLSYT+F Y                          
Sbjct: 733 PYVSYDMKGRTYRYFKGEALFPFGYGLSYTRFAYE------------------------- 767

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                  P + V  L+     +     +N G   G +V  VY + P +   + ++ ++GF
Sbjct: 768 ------TPRLSVTTLQAGSPLQVTTTVRNTGERAGDEVAQVYLQYP-DRPQSPLRSLVGF 820

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV ++ G  + + F  +A ++L+ VD     ++ AG++ +FVG G
Sbjct: 821 QRVHLQPGEQRTLTFTLDA-RALSDVDRTGTRVVEAGDYRLFVGGG 865


>gi|359450637|ref|ZP_09240068.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
 gi|358043611|dbj|GAA76317.1| beta-glucosidase [Pseudoalteromonas sp. BSi20480]
          Length = 468

 Score =  293 bits (749), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 171/441 (38%), Positives = 252/441 (57%), Gaps = 44/441 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ + S     RV DLV+R+TL+EKV QL D +  + RL +P+Y WW+EALHGV+  G  
Sbjct: 33  LYLNKSASIDERVNDLVTRLTLEEKVAQLFDKSPAIERLNMPEYNWWNEALHGVARAGK- 91

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-----GRA---GLTY 164
                    AT FP  I   A+F+E L  ++G A+S E RA ++       R+   GLTY
Sbjct: 92  ---------ATVFPQAIGLAATFDEDLMLRVGTAISDEGRAKHHAFLEENNRSMYTGLTY 142

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           WSPNIN+ RDPRWGR  ET GEDP++  R AVN++ GLQ           N+  LK  + 
Sbjct: 143 WSPNINIFRDPRWGRGQETYGEDPYLTTRIAVNFINGLQGD---------NAEYLKSVAT 193

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
            KHYA   V +   V R+  D   +E+D+ ET+L  F+  + +   +SVMC+YN VNG P
Sbjct: 194 LKHYA---VHSGPEVSRHSDDYTASEKDLAETYLPAFKDVIAQTKVASVMCAYNSVNGTP 250

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD-NHKFLADSKEDAVAQTLKAGLDLDC 343
           +C + +L+   +R E++  GYIV+DC +I    D     + ++   A A  LK G DL+C
Sbjct: 251 ACGNDELIQNKLRDEFNFDGYIVSDCGAIADFYDVKSHNIVNTGAKAAAMALKTGTDLNC 310

Query: 344 GQYYTN---FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDI---C 397
           G ++ N   +   AV++G V+E D+DK+LK L     +LG FD +P+ V      I    
Sbjct: 311 GDHHGNTYSYLTQAVKEGLVEEKDVDKALKRLMYARFKLGMFD-NPENVPYSDTSIDVVG 369

Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
           S++++ L  EAA++ +VLLKN+Q  LPL     + +A++GP+A+    ++GNY G+P   
Sbjct: 370 SNKHLALTQEAAQKSLVLLKNEQ-VLPLKGN--EKIALIGPNADNEAILLGNYNGMPIVP 426

Query: 458 MSPIAGFS---GYANVTYKTG 475
           ++P        G  N+TY  G
Sbjct: 427 ITPKLALEQRLGKNNLTYTAG 447


>gi|299147288|ref|ZP_07040353.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298514566|gb|EFI38450.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 861

 Score =  292 bits (748), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 174/449 (38%), Positives = 246/449 (54%), Gaps = 39/449 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+SL    R +DL+ R+TL+EKV  + + +  +PRLG+ +YEWW+EALHGV   G   
Sbjct: 26  YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
                   AT FP  I   ASFN+SL  ++  A S EAR    + G +       GLT+W
Sbjct: 84  --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PN+N+ RDPRWGR  ET GEDP++ G+  +  VRGLQ  E        ++R  K+ +C 
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DARYDKLHACA 187

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ FDA  +  +D+ ET+L  F+  V++     VMC+YNR  G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +R EW   G +V+DC +I        H    D KE A A  ++AG DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASADAVRAGTDLE 303

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
           CG  Y +   +AV+ G + E +ID SLK L T    LG  D  P +  +    + S E+ 
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWSEIPTSVLNSKEHQ 362

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLL+N  N LPLN+     VAV+GP+AN +V   GNY GIP   ++ + 
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420

Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
                     + Y+ GCD V  K+  S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449



 Score =  103 bits (258), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 86/297 (28%), Positives = 126/297 (42%), Gaps = 56/297 (18%)

Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
           AD  +   G+  S+E E +          DR D+ LP  Q    N +  + K    +V +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---NLLKALKKAGKKVVFI 654

Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           +  G  I      T  +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y    V  L
Sbjct: 655 NYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKN--VNQL 712

Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
           P         +     GRTY++     L+PFG+GLSYT F Y     +K      N +  
Sbjct: 713 P-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTTFTYGEAKLSK------NTIAK 759

Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
             N+  T                         +   NVG  DG +VV VY + P +    
Sbjct: 760 GENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGDKEGP 794

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
               +  F+RV + AG+ + +        +    D  +NT+ P  E T  +  GG S
Sbjct: 795 RYT-LRAFKRVHIPAGKTESVAIPLTGV-NFEWFDAESNTMRPL-EGTYELLYGGTS 848


>gi|423215029|ref|ZP_17201557.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692292|gb|EIY85530.1| hypothetical protein HMPREF1074_03089 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 861

 Score =  292 bits (748), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 174/449 (38%), Positives = 246/449 (54%), Gaps = 39/449 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+SL    R +DL+ R+TL+EKV  + + +  +PRLG+ +YEWW+EALHGV   G   
Sbjct: 26  YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
                   AT FP  I   ASFN+SL  ++  A S EAR    + G +       GLT+W
Sbjct: 84  --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PN+N+ RDPRWGR  ET GEDP++ G+  +  VRGLQ  E        ++R  K+ +C 
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DARYDKLHACA 187

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ FDA  +  +D+ ET+L  F+  V++     VMC+YNR  G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +R EW   G +V+DC +I        H    D KE A A  ++AG DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASADAVRAGTDLE 303

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
           CG  Y +   +AV+ G + E +ID SLK L T    LG  D  P +  +    + S E+ 
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWSEIPTSVLNSKEHQ 362

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLL+N  N LPLN+     VAV+GP+AN +V   GNY GIP   ++ + 
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420

Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
                     + Y+ GCD V  K+  S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449



 Score =  102 bits (254), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 85/297 (28%), Positives = 126/297 (42%), Gaps = 56/297 (18%)

Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
           AD  +   G+  S+E E +          DR D+ LP  Q    + +  + K    +V +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---DLLKALKKAGKKVVFI 654

Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           +  G  I      T  +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y    V  L
Sbjct: 655 NYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD--VNQL 712

Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
           P         +     GRTY++     L+PFG+GLSYT F Y     +K      N +  
Sbjct: 713 P-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSK------NTIAK 759

Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
             N+  T                         +   NVG  DG +VV VY + P +    
Sbjct: 760 GENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGDKEGP 794

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
               +  F+RV + AG+ + +        +    D  +NT+ P  E T  +  GG S
Sbjct: 795 RYT-LRAFKRVHIPAGKTESVAIPLTGV-NFEWFDVESNTMRPL-EGTYELLYGGTS 848


>gi|384428895|ref|YP_005638255.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
 gi|341937998|gb|AEL08137.1| beta-glucosidase [Xanthomonas campestris pv. raphani 756C]
          Length = 888

 Score =  292 bits (748), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 175/427 (40%), Positives = 240/427 (56%), Gaps = 40/427 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R   LV++M+ +EKV Q  + A  +PRLG+P YEWW+E LHG++  G           AT
Sbjct: 39  RAAALVAQMSREEKVAQAMNAAPAIPRLGIPAYEWWNEGLHGIARNG----------YAT 88

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARD 174
            FP  I   AS+N  L +++G  VSTEARA +N            AGLT WSPNIN+ RD
Sbjct: 89  VFPQAIGLAASWNTQLMQQVGTVVSTEARAKFNQAGGPGKDHKRYAGLTIWSPNINIFRD 148

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ET GEDPF+ G+ AV ++RGLQ         DL   P  +++  KH A   V 
Sbjct: 149 PRWGRGMETYGEDPFLTGQLAVGFIRGLQ-------GDDLE-HPRTIATP-KHIA---VH 196

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
           +     R+ FD  V+ +D+E T+   F   + EG A SVMC+YN ++G P+CA   LLN 
Sbjct: 197 SGPEPGRHGFDVDVSPRDVEATYTPAFRAALVEGQAGSVMCAYNSLHGTPACAADWLLNG 256

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
            VRG+W   G++V+DCD++  M   H F  D+   + A  LKAG DL+CG  Y    G A
Sbjct: 257 RVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA-ALKAGHDLNCGTAYRAL-GTA 314

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDICSDENIELAAEAAREG 412
           +++G+V E  +D+SL  L+    RLG        +Y  LG +DI +  N  LA +AA E 
Sbjct: 315 IERGEVDEALLDQSLVRLFAARYRLGELQAPRKDRYARLGAKDIDNAGNRALALQAAAES 374

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYAN 469
           IVLLKN   TLPL ++    +AV+GP+A+A  A+  NY G   + ++P+ G     G   
Sbjct: 375 IVLLKNANATLPLKAS--TRLAVIGPNADALAALEANYQGTSSQPVTPLLGLRQRFGAQQ 432

Query: 470 VTYKTGC 476
           V Y  G 
Sbjct: 433 VRYAQGA 439



 Score =  138 bits (348), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 94/294 (31%), Positives = 147/294 (50%), Gaps = 55/294 (18%)

Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
           +DA +   GL   VE E L          DR D+ LP  Q  L+ + A+ +  P+++V+M
Sbjct: 617 SDAVVAFVGLSPDVEGEELRIDVPGFDGGDRNDIALPAAQQALLER-AKASGKPLVVVLM 675

Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           S   V + +A+T+ +  AI+ A YPG+ GG AIA  + G  NPGGRLP+T+Y        
Sbjct: 676 SGSAVALNWAKTHAD--AIVAAWYPGQSGGTAIARALAGDDNPGGRLPVTFYR------- 726

Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
             ++  L P  S    GRTY+++ G  L+PFGYGLSYT+F Y      +T +++   LQ 
Sbjct: 727 --STKDLPPYVSYDMKGRTYRYFKGEALFPFGYGLSYTRFAY------ETPRLSATTLQA 778

Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
              L  T+                           +N G   G +V  VY + P E   +
Sbjct: 779 GSPLQVTT-------------------------TVRNTGERAGDEVAQVYLQYP-ERPQS 812

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
            ++ ++GFQRV ++ G  + + F  +A ++L+ VD      + AG++ +FVG G
Sbjct: 813 PLRSLVGFQRVHLQPGEQRTLTFTLDA-RALSDVDRTGTRAVEAGDYRLFVGGG 865


>gi|298481648|ref|ZP_06999839.1| beta-glucosidase [Bacteroides sp. D22]
 gi|298272189|gb|EFI13759.1| beta-glucosidase [Bacteroides sp. D22]
          Length = 861

 Score =  292 bits (747), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 174/449 (38%), Positives = 246/449 (54%), Gaps = 39/449 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+SL    R +DL+ R+TL+EKV  + + +  +PRLG+ +YEWW+EALHGV   G   
Sbjct: 26  YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
                   AT FP  I   ASFN+SL  ++  A S EAR    + G +       GLT+W
Sbjct: 84  --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PN+N+ RDPRWGR  ET GEDP++ G+  +  VRGLQ  E        ++R  K+ +C 
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DARYDKLHACA 187

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ FDA  +  +D+ ET+L  F+  V++     VMC+YNR  G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +R EW   G +V+DC +I        H    D KE A A  ++AG DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASADAVRAGTDLE 303

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
           CG  Y +   +AV+ G + E +ID SLK L T    LG  D  P +  +    + S E+ 
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWSEIPTSVLNSKEHQ 362

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLL+N  N LPLN+     VAV+GP+AN +V   GNY GIP   ++ + 
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420

Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
                     + Y+ GCD V  K+  S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449



 Score =  104 bits (259), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 86/297 (28%), Positives = 127/297 (42%), Gaps = 56/297 (18%)

Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
           AD  +   G+  S+E E +          DR D+ LP  Q    N +  + K    +V +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---NLLKALKKAGKKVVFI 654

Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           +  G  I      T  +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y    V  L
Sbjct: 655 NYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD--VNQL 712

Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
           P         +     GRTY++     L+PFG+GLSYT F Y     +K      N +  
Sbjct: 713 P-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSK------NTIAK 759

Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
             N+  T                         +   NVG  DG +VV VY + P +    
Sbjct: 760 GENVVLT-------------------------IPVSNVGQCDGEEVVQVYLRRPGDKEGP 794

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
               +  F+RV + AG+ + +       ++    D  +NT+ P  E T  +  GG S
Sbjct: 795 RYT-LRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMRPL-EGTYELLYGGTS 848


>gi|304406707|ref|ZP_07388362.1| glycoside hydrolase family 3 domain protein [Paenibacillus
           curdlanolyticus YK9]
 gi|304344240|gb|EFM10079.1| glycoside hydrolase family 3 domain protein [Paenibacillus
           curdlanolyticus YK9]
          Length = 733

 Score =  292 bits (747), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 230/760 (30%), Positives = 369/760 (48%), Gaps = 110/760 (14%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGV----PRLGLPQYEWWSEA-----------LHGVSN 108
           + + L+S+MTL++KV Q+  F  G     P  G  +++   E            L G + 
Sbjct: 23  QAEQLLSKMTLEDKVGQMTQFDWGYNPINPETGESEHDLIIELIRQGKVGSIFNLSGAAE 82

Query: 109 VG--------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARA 153
                           P     DVI G  T FP  +   A++N  + ++   A STEA  
Sbjct: 83  ANELQGLIEQHTELKIPMVIGRDVIHGYRTVFPIPLAMAAAWNPEVARQTSAAASTEALT 142

Query: 154 MYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
                  G+T+ ++P I+V+RDPRWGRI E+ GEDP++   Y   +V G Q   G   AT
Sbjct: 143 ------DGVTWVFAPMIDVSRDPRWGRIAESIGEDPYLTAAYGRAWVEGSQIDNGPGRAT 196

Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
                    +SC KH+A Y +    G D    D  ++++++ +  L PF+  V+ G A S
Sbjct: 197 ---------ASCPKHFAGYGMAE-AGRDYNTVD--LSDRELRDIILPPFQDAVEAG-ALS 243

Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
           +M S+N +NGIP+CA+  LL   +R EW   G + +D +++  ++ +   +A ++E+A  
Sbjct: 244 IMASFNEINGIPACANEYLLKTILRDEWGFEGVVASDYNALVELIVHG--VAANEEEACE 301

Query: 333 QTLKAGLDLDCGQ-YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV-- 389
            T+ AG D+D     +T      V+ G+V E+ +D S++ +  + ++LG  + S   V  
Sbjct: 302 MTVLAGCDMDMHSGIFTRQLPKLVRAGRVPESVVDDSVRRILAMKIKLGLLEQSKSDVSQ 361

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
           S   Q + S E +ELA EAAR+ IVLL+N +  LPL+ A   ++AV+GP A+     +G 
Sbjct: 362 SAATQPLKS-EYVELAREAARQSIVLLQNKEQVLPLSKAGA-SIAVIGPLADNATDPLGC 419

Query: 450 YA--GIPCRYMSPIAGFSGYA----NVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
           +A  G     ++ + G    A    ++ Y  GC D+   S     AA EAA+++D  ++L
Sbjct: 420 WALDGRSDEVVTALEGIRQAAAEGTSIRYAQGC-DIDSDSEEGFEAALEAARSSDVVVML 478

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
            G   ++  ES  R  L LPG Q  L+  VA++ K P++ VI+S  G  + FA       
Sbjct: 479 LGESATMSGESRSRAALDLPGKQRALVEAVAKLGK-PIVAVILS--GRPLTFAWLPEQAS 535

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW-YNGDYVQMLPLTSMPLRPVDSLGYP 622
           AI+ A + G + G AIADV+FG FNP GRLP+T+  N   + +        RP      P
Sbjct: 536 AIVQAWHLGVQSGNAIADVLFGDFNPSGRLPVTFPQNVGQIPIYHYRKKTGRP------P 589

Query: 623 GRTYKFY----NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
              Y  Y        LYPFGYGL+YT+F+Y  +  +K+                      
Sbjct: 590 AGAYSSYYIDSTTEPLYPFGYGLTYTEFEYGAIQTSKS---------------------- 627

Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
                     +  D+  +  V  +NVG+  G +VV  Y +         +K+++ F++V 
Sbjct: 628 ---------SIGADEQLDVTVSIRNVGNLAGEEVVQCYVRDEVASVTQPLKRLVAFRKVK 678

Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           V AG +  + F   A + L I+D      +  G+ T+++G
Sbjct: 679 VAAGESVDVTFTIGAAE-LAILDKHMKRTVEPGDFTLWIG 717


>gi|397690575|ref|YP_006527829.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
 gi|395812067|gb|AFN74816.1| glucan 1,4-beta-glucosidase [Melioribacter roseus P3M]
          Length = 860

 Score =  291 bits (744), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 161/438 (36%), Positives = 249/438 (56%), Gaps = 40/438 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + + +LP+  R +DL+ R++LDEK+  +   +  + RLG+P+Y WW+EALHGV+  G   
Sbjct: 23  YLNVNLPFEERAEDLLQRLSLDEKISLMVHQSPAIERLGIPEYNWWNEALHGVARNG--- 79

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   A+++  L  +I   +S EARA YN            G++ W
Sbjct: 80  -------RATVFPMPIGLAATWDRDLIYRIADVISNEARAKYNSALKKNQRGIYQGISLW 132

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN+ RDPRWGR  ET GEDP++ G  AV++++GLQ  +          + LK  +  
Sbjct: 133 APNINIFRDPRWGRGMETYGEDPYLTGELAVSFIKGLQGQD---------KKYLKTIATP 183

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH A   V +    +R+HF+A V+  D+ ET+L  F+  + +G A SVMC+YNR+ G   
Sbjct: 184 KHLA---VHSGPEPERHHFNALVSNYDLNETYLPHFKKSIMKGKAYSVMCAYNRLRGKAC 240

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           C    LL   +R +W   G +V+DC ++  + ++HK + DS E A A  + +G DL+CG 
Sbjct: 241 CGHDTLLTDILRNKWGFEGIVVSDCWAVYDIFNSHK-IVDSPEKAAALAVSSGTDLECGN 299

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD---ICSDENI 402
            + +   NA + G + E +ID +L+ +     +LG FD  P+ VS  + D   + +  N 
Sbjct: 300 TFLSLK-NAYRDGLITEKEIDSALRRVLLARFKLGMFD-PPEIVSYSQIDESYLDNSYNR 357

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
           E+A EAAR+ IVLLKND   LPL+S+ +  +AV+GP+A+   +++GNY G P  Y++P+ 
Sbjct: 358 EIALEAARKSIVLLKNDNKLLPLDSS-INKIAVIGPNADNLESLLGNYHGFPSEYITPLQ 416

Query: 463 GFSGY---ANVTYKTGCD 477
                     V Y+ GCD
Sbjct: 417 AIRRVLKNGEVFYEKGCD 434



 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 89/299 (29%), Positives = 140/299 (46%), Gaps = 56/299 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A + A  +DA I+  GL   +E E+L          DR  L LP  Q +LI ++    K 
Sbjct: 591 AYKTALKSDAVIMFMGLCPRMEGEALKIKLDGFKGGDRLKLSLPANQLKLIKKIHSTGK- 649

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PVILV+++ G +   +   + NI AIL A YPG+ GGRAI DV++GK+NP G+LP+T Y 
Sbjct: 650 PVILVLLNGGPISTVWE--SENIPAILEAWYPGQAGGRAITDVIWGKYNPSGKLPVTIYK 707

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
            +         +P  P ++    GRTY+++ G  LYPFG+GL+YT    + +  +     
Sbjct: 708 SE-------NDLP--PFENYDMEGRTYRYFKGEVLYPFGWGLNYTDITISNIELS----- 753

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                      N+++ +D     V  +N G+  G + V +Y+K 
Sbjct: 754 --------------------------ANEIKDNDTIRVVVKLKNNGNLAGEETVQLYTK- 786

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
            A      IK + GF+++ +  G    ++F  +       VD      +P G + I VG
Sbjct: 787 -ALKDNRTIKTLRGFEKIKLEPGTEGMVEFYLSKSDLAVWVDGLGFETMP-GVYEIIVG 843


>gi|423313768|ref|ZP_17291703.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
           CL09T03C04]
 gi|392684303|gb|EIY77631.1| hypothetical protein HMPREF1058_02315 [Bacteroides vulgatus
           CL09T03C04]
          Length = 788

 Score =  290 bits (743), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 238/817 (29%), Positives = 369/817 (45%), Gaps = 151/817 (18%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSN 108
           L+ +   P   RV+DL+S+MTL+EK  Q+    +G  R+    LPQ  W +E    G+ N
Sbjct: 42  LYENPKAPLEDRVQDLLSQMTLEEKTCQMATL-YGSGRVLKDALPQNNWKTEVWKDGIGN 100

Query: 109 VG---------------------------------------PGTHFDDVIPG-----ATS 124
           +                                        P    ++ I G     AT 
Sbjct: 101 IDEEHNGLGAFKSEYSFPYAKHVNAKHTIQRWFVEKTRLGIPVDFTNEGIRGLCHDRATY 160

Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
           FP      A++N+ L  +IG+  + EA A   LG   +  +SP +++A+DPRWGR  ET 
Sbjct: 161 FPAQCGQGATWNKKLIARIGEVEAKEAVA---LGYTNI--YSPILDIAQDPRWGRCVETY 215

Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
           GEDP++VG      +  LQ         +L + P       KH+A Y +       +   
Sbjct: 216 GEDPYLVGELGKQMITSLQKY-------NLVATP-------KHFAVYSIPIGGRDGKTRT 261

Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
           D  V  ++M   ++ PF M  +E  A  VM SYN  +G P       L + +R EW   G
Sbjct: 262 DPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 321

Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAV 355
           Y+V+D ++++ + + HK +AD+ ED +AQ + AGL++      T+FT           AV
Sbjct: 322 YVVSDSEAVEFISNKHK-VADTYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAV 375

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD---ICSDENIELAAEAAREG 412
             GK+ +  +DK +  +  +  RLG FD    Y   GKQ    + S E+  ++ EAAR+ 
Sbjct: 376 DNGKISQETLDKRVAEILRIKFRLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQS 433

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY--AGIPCRYM-SPIAGFSGYAN 469
           +VLLKN+ N LPL S  ++++AV+GP+AN    +I  Y  A  P + +   I     +A 
Sbjct: 434 LVLLKNETNLLPL-SKSIRSIAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPHAE 492

Query: 470 VTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATI-ILAGLDLSVEAES 514
           V YK GCD +                +    +  A  AAK A+  + +L G +L+V  E 
Sbjct: 493 VIYKKGCDIIDPHFPESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGGNELTVR-ED 551

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
             R  L LPG Q +L+  V    K PVILV++      I +A    ++ AIL A +PGE 
Sbjct: 552 RSRTSLNLPGRQEELLKAVCATGK-PVILVMLDGRASSINYAA--AHVPAILHAWFPGEF 608

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
            G+A+A+ +FG +NPGGRL +T+     V  +P  + P +P          Y       L
Sbjct: 609 CGQAVAEALFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDESSSTSVYG-----AL 660

Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
           YPFG+GLSYT F Y+ L  + + Q                              ++ D +
Sbjct: 661 YPFGHGLSYTTFTYSDLHISPSHQ-----------------------------GVQGDIH 691

Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
              K+  +N G   G +VV +Y +       TY K + GF+R+ ++AG  + + F     
Sbjct: 692 VSCKI--KNTGKIKGDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLRP- 748

Query: 755 KSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
           + L + D   N  +  G   + +G       +H  F 
Sbjct: 749 QDLGLWDKNMNFRVEPGSFKVMLGASSTDIRLHGQFE 785


>gi|383110854|ref|ZP_09931672.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
 gi|313694427|gb|EFS31262.1| hypothetical protein BSGG_1962 [Bacteroides sp. D2]
          Length = 861

 Score =  290 bits (742), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 180/467 (38%), Positives = 252/467 (53%), Gaps = 47/467 (10%)

Query: 45  LGLQMSSFLF-CDSSLPY-------SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQY 96
           LG+   S LF C   LPY         R +DL+ R+TL+EKV  + + +  +PRLG+ +Y
Sbjct: 9   LGVCSLSLLFSCAQKLPYQDTSLTAEQRAEDLLPRLTLEEKVALMQNASPAIPRLGIKEY 68

Query: 97  EWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN 156
           +WW+EALHGV   G           AT FP  I   ASFN+SL  ++  AVS EAR    
Sbjct: 69  DWWNEALHGVGRAGL----------ATVFPQSIGMGASFNDSLLYEVFDAVSDEARVKSR 118

Query: 157 -------LGR-AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
                  L R  GLT+W+PN+N+ RDPRWGR  ET GEDP++ G+  +  VRGLQ  E  
Sbjct: 119 IFSENGVLKRYQGLTFWTPNVNIFRDPRWGRGQETYGEDPYLTGQLGMAVVRGLQGPE-- 176

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKE 267
                 N +  K+ +C KH+A +    W   +R+ FDA  +T +D+ ET+L  F+  V++
Sbjct: 177 ------NGKYDKLHACAKHFAVHSGPEW---NRHSFDAENITPRDLWETYLPAFKDLVQK 227

Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV--DNHKFLAD 325
            D   VMC+YNR  G P C   +LL Q +R EW   G +V+DC +I        H    D
Sbjct: 228 ADVKEVMCAYNRFEGEPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD 287

Query: 326 SKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
            KE A A  + +G DL+CG  Y +   +AV+ G + E  ID SLK L T    LG  D  
Sbjct: 288 -KEHASAGAVLSGTDLECGGEYGSL-ADAVKAGLIDEKQIDVSLKRLLTARFELGEMDEQ 345

Query: 386 PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
           P +  +    + S E+ +LA   ARE +VLL+N  + LPLN+     VAV+GP+AN +V 
Sbjct: 346 PAWAEIPASTLNSKEHQDLALRMARESLVLLQNKNDILPLNTD--LKVAVMGPNANDSVM 403

Query: 446 MIGNYAGIPCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIFA 489
             GNY GIP   ++ +           V Y+ GCD  + ++  S+F+
Sbjct: 404 QWGNYNGIPGHTVTLLEAVRSKLPEGQVMYEPGCDRTSREALQSLFS 450



 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 82/290 (28%), Positives = 120/290 (41%), Gaps = 55/290 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A E  K AD  +   G+  S+E E +          DR D+ LP  Q    + +  + K 
Sbjct: 591 AVEKVKDADVVLFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPAVQR---DLLKALKKA 647

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
              +V ++  G  I     +   +AIL   YPG+ GG AI DV+FG +NP GRLP+T+Y 
Sbjct: 648 GKKVVFINYSGSAIGLVPESNTCEAILQGWYPGQAGGTAIVDVLFGDYNPAGRLPVTFYK 707

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
            D  Q+       ++        GRTY++     L+PFG+GLSYT F Y     +K    
Sbjct: 708 -DAGQLPDFEDYSMK--------GRTYRYMQQQPLFPFGHGLSYTTFTYGEADLSK---- 754

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                      N   D                       +   N G  DG +VV VY + 
Sbjct: 755 -----------NTIGDGGTVT----------------LTIPVSNAGQRDGDEVVQVYLRC 787

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
            A+    +   +  F+RV + AG  K++       +S    D A NT+ P
Sbjct: 788 MADKEGPHYT-LRAFKRVHIPAGETKQVTIPLT-YESFEWFDTATNTVHP 835


>gi|389736853|ref|ZP_10190363.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
 gi|388438821|gb|EIL95541.1| glucan 1,4-beta-glucosidase [Rhodanobacter sp. 115]
          Length = 868

 Score =  290 bits (742), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 171/426 (40%), Positives = 242/426 (56%), Gaps = 39/426 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R   LV++MTL EKV Q+ + A  +PRLG+P Y+WWSE LHG++  G           AT
Sbjct: 32  RAVALVAKMTLPEKVAQMQNDAPAIPRLGVPAYDWWSEGLHGIARNG----------YAT 81

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNL---GRA-----GLTYWSPNINVARDP 175
            FP  I   AS++ SL   +G  +STEARA +N    GRA     GLT WSPNIN+ RDP
Sbjct: 82  VFPQAIGLAASWDTSLLHAVGTVISTEARAKFNASGSGRAHGLFQGLTLWSPNINIFRDP 141

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGR  ET GEDP++ G+ AV +VRG+Q         D    P  +++  KH+ A+   +
Sbjct: 142 RWGRGQETYGEDPYLTGQLAVAFVRGIQG--------DDPQHPRAIATP-KHFVAH---S 189

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
                R  FD  V+  D+E+T+L  F   V +G A SVMC+YN ++G P+CA+  LL+  
Sbjct: 190 GPEAGRDSFDVDVSPHDLEDTYLPAFRTAVVDGHAGSVMCAYNALHGTPACANAGLLDTR 249

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAV 355
           +R +W   GY+V+DCD++  +   H F  D  + +VA  ++AG DLDCG  Y +    AV
Sbjct: 250 LRKDWGFAGYVVSDCDAVGDIASYHYFKPDDVQASVA-AVQAGTDLDCGHTYASLA-QAV 307

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIELAAEAAREGI 413
           +QG + E+ +D SL  L+T   RLG     G+  Y  +G   I S  + +LA +AA E +
Sbjct: 308 RQGDIAESALDASLVRLFTARYRLGELGSRGNDPYARIGADQIDSPAHRKLALQAALESL 367

Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANV 470
           VLLKN  +TLPL++     +AV+GP A+A   +  NY G     ++P+ G     G  +V
Sbjct: 368 VLLKNAHSTLPLHAG--MRLAVIGPDADALETLEANYHGTARHPVTPLQGLRARFGADHV 425

Query: 471 TYKTGC 476
            Y  G 
Sbjct: 426 AYAQGA 431



 Score =  142 bits (358), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 94/295 (31%), Positives = 144/295 (48%), Gaps = 57/295 (19%)

Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
           ADA +   GL   VE E L          DR D+ LP  Q  L+ + A  +  P+I+V++
Sbjct: 598 ADAVVAFIGLSPDVEGEQLRIDVPGFDGGDRTDIGLPAPQRALLER-ARASGKPLIVVLL 656

Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           S   V + +A+ + +  AIL A YPG+ GG AIA V+ G +NPGGRLP+T+Y        
Sbjct: 657 SGSAVALDWAQQHAD--AILAAWYPGQAGGTAIAQVLAGDYNPGGRLPVTFYR------- 707

Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
             ++  L P  S    GRTY++++G  LYPFGYGLSYT+F Y                  
Sbjct: 708 --STRDLPPYVSYAMQGRTYRYFDGRPLYPFGYGLSYTRFTYA----------------- 748

Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY-SKPPAEIAA 725
                          P +    L+     +   + +N G   G +VV VY   PP+ +A 
Sbjct: 749 --------------APTLSAATLKAGGTLQVSAEVRNAGQRAGDEVVQVYLDTPPSPLAP 794

Query: 726 TYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
            +   ++GF+R+ + AG  + ++F   A + L+ VD A    +  G++ +F+G G
Sbjct: 795 RH--ALVGFRRIHLAAGEQRLVRFTL-APRQLSSVDAAGARAVEPGQYRVFIGAG 846


>gi|150002739|ref|YP_001297483.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
 gi|294776994|ref|ZP_06742455.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
 gi|149931163|gb|ABR37861.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Bacteroides vulgatus ATCC 8482]
 gi|294449242|gb|EFG17781.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
          Length = 788

 Score =  290 bits (741), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 237/817 (29%), Positives = 368/817 (45%), Gaps = 151/817 (18%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSN 108
           L+ +   P   RV+DL+S+MTL+EK  Q+    +G  R+    LPQ  W +E    G+ N
Sbjct: 42  LYENPKAPLEDRVQDLLSQMTLEEKTCQMATL-YGSGRVLKDALPQNNWKTEVWKDGIGN 100

Query: 109 VG---------------------------------------PGTHFDDVIPG-----ATS 124
           +                                        P    ++ I G     AT 
Sbjct: 101 IDEEHNGLGAFKSEYSFPYAKHVNAKHTIQRWFVEKTRLGIPVDFTNEGIRGLCHDRATY 160

Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
           FP      A++N+ L  +IG+  + EA A   LG   +  +SP +++A+DPRWGR  ET 
Sbjct: 161 FPAQCGQGATWNKKLIARIGEVEAKEAVA---LGYTNI--YSPILDIAQDPRWGRCVETY 215

Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
           GEDP++VG      +  LQ         +L + P       KH+A Y +       +   
Sbjct: 216 GEDPYLVGELGKQMITSLQKY-------NLVATP-------KHFAVYSIPIGGRDGKTRT 261

Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
           D  V  ++M   ++ PF M  +E  A  VM SYN  +G P       L + +R EW   G
Sbjct: 262 DPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 321

Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAV 355
           Y+V+D ++++ + + HK +AD+ ED +AQ + AGL++      T+FT           AV
Sbjct: 322 YVVSDSEAVEFISNKHK-VADTYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAV 375

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD---ICSDENIELAAEAAREG 412
             GK+ +  +DK +  +  +  RLG FD    Y   GKQ    + S E+  ++ EAAR+ 
Sbjct: 376 DNGKISQETLDKRVAEILRIKFRLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQS 433

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY--AGIPCRYM-SPIAGFSGYAN 469
           +VLLKN+ N LPL S  ++++AV+GP+AN    +I  Y  A  P + +   I     +  
Sbjct: 434 LVLLKNETNLLPL-SKSIRSIAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPHTE 492

Query: 470 VTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATI-ILAGLDLSVEAES 514
           V YK GCD +                +    +  A  AAK A+  + +L G +L+V  E 
Sbjct: 493 VIYKKGCDIIDPHFPESEILDFPKTAEEVQLMEEAIRAAKQAEVVVMVLGGNELTVR-ED 551

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
             R  L LPG Q +L+  V    K P+ILV++      I +A    +I AIL A +PGE 
Sbjct: 552 RSRTSLNLPGRQEELLKAVCATGK-PIILVMLDGRASSINYAA--AHIPAILHAWFPGEF 608

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
            G+A+A+ +FG +NPGGRL +T+     V  +P  + P +P          Y       L
Sbjct: 609 CGQAVAEALFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDESSSTSVYG-----AL 660

Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
           YPFG+GLSYT F Y+ L  + + Q                              ++ D +
Sbjct: 661 YPFGHGLSYTTFTYSDLHISPSHQ-----------------------------GVQGDIH 691

Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
              K+  +N G   G +VV +Y +       TY K + GF+R+ ++AG  + + F     
Sbjct: 692 VSCKI--KNTGKIKGDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLRP- 748

Query: 755 KSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
           + L + D   N  +  G   + +G       +H  F 
Sbjct: 749 QDLGLWDKNMNFRVELGSFKVMLGASSTDIRLHGQFE 785


>gi|237719778|ref|ZP_04550259.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
 gi|229451047|gb|EEO56838.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
          Length = 861

 Score =  290 bits (741), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 173/449 (38%), Positives = 246/449 (54%), Gaps = 39/449 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+SL    R +DL+ R+TL+EKV  + + +  +PRLG+ +YEWW+EALHGV   G   
Sbjct: 26  YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
                   AT FP  I   ASFN+SL  ++  A S EAR    + G +       GLT+W
Sbjct: 84  --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PN+N+ RDPRWGR  ET GEDP++ G+  +  VRGLQ  E        ++R  K+ +C 
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DARYDKLHACA 187

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ FDA  +  +D+ ET+L  F+  V++     VMC+YNR  G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +R EW   G +V+DC +I        H+   D KE A A  ++AG DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETYPD-KEHASAGAVRAGTDLE 303

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
           CG  Y +   +AV+ G + E +ID SLK L T    LG  D    +  +    + S E+ 
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTSVLNSKEHQ 362

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLL+N  N LPLN+     VAV+GP+AN +V   GNY GIP   ++ + 
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420

Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
                     + Y+ GCD V  K+  S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449



 Score =  103 bits (258), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 88/301 (29%), Positives = 130/301 (43%), Gaps = 58/301 (19%)

Query: 495 KTADATIIL--AGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVI 542
           K  DA +IL   G+  S+E E +          DR D+ LP  Q    + +  + K    
Sbjct: 594 KVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---DLLKALKKAGKK 650

Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
           +V ++  G  I      T  +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y    
Sbjct: 651 VVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD-- 708

Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
           V  LP         +     GRTY++     L+PFG+GLSYT F Y     +K      N
Sbjct: 709 VNQLP-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSK------N 755

Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
            +    N+  T                         +   NVG  DG +VV VY + P +
Sbjct: 756 TIAKGENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGD 790

Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGV 782
                   +  F+RV + AG+ + +       ++    D  +NT+ P  E T  +  GG 
Sbjct: 791 KEGPRYT-LRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMCPL-EGTYELLYGGT 847

Query: 783 S 783
           S
Sbjct: 848 S 848


>gi|423294294|ref|ZP_17272421.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
           CL03T12C18]
 gi|392675485|gb|EIY68926.1| hypothetical protein HMPREF1070_01086 [Bacteroides ovatus
           CL03T12C18]
          Length = 861

 Score =  290 bits (741), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 172/449 (38%), Positives = 245/449 (54%), Gaps = 39/449 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+SL    R +DL+ R+TL+EKV  + + +  +PRLG+ +YEWW+EALHGV   G   
Sbjct: 26  YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
                   AT FP  I   ASFN+SL  ++  A S EAR    + G +       GLT+W
Sbjct: 84  --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGALKRYQGLTFW 135

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PN+N+ RDPRWGR  ET GEDP++ G+  +  VRGLQ  E        +++  K+ +C 
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DTKYDKLHACA 187

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ FDA  +  +D+ ET+L  F+  V++     VMC+YNR  G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +R EW   G +V+DC +I        H    D KE A A  ++ G DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASAAAVRTGTDLE 303

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
           CG  Y +   +AV+ G + E +ID SLK L T    LG  D  P +  +    + S E+ 
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWSEIPASVLNSKEHQ 362

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLL+N  N LPLN+     VAV+GP+AN +V   GNY GIP   ++ + 
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420

Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
                     + Y+ GCD V  K+  S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449



 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 86/297 (28%), Positives = 127/297 (42%), Gaps = 56/297 (18%)

Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
           AD  +   G+  S+E E +          DR D+ LP  Q    N +  + K    +V +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---NLLKALKKAGKKVVFI 654

Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           +  G  I      T  +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y    V  L
Sbjct: 655 NYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD--VNQL 712

Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
           P         +     GRTY++     L+PFG+GLSYT F Y     +K      N +  
Sbjct: 713 P-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTTFTYGEAKLSK------NTIAK 759

Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
             N+  T                         +   NVG  DG +VV VY + P +    
Sbjct: 760 GENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGDKEGP 794

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
               +  F+RV + AG+ + +       ++    D  +NT+ P  E T  +  GG S
Sbjct: 795 RYT-LRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMRPL-EGTYELLYGGTS 848


>gi|380509734|ref|ZP_09853141.1| beta-glucosidase-related glycosidase [Xanthomonas sacchari NCPPB
           4393]
          Length = 883

 Score =  290 bits (741), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 178/445 (40%), Positives = 253/445 (56%), Gaps = 41/445 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+S  +  R   LV++MTL+EK  Q+ + A  + RLG+P Y+WW+EALHGV+  G   
Sbjct: 24  WQDTSASFEARAAALVAQMTLEEKAAQMQNAAPAIERLGVPAYDWWNEALHGVARAGQ-- 81

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-------GR-AGLTYW 165
                   AT FP  I   A+F+  L  ++   +S EARA ++        GR  GLT+W
Sbjct: 82  --------ATVFPQAIGLAATFDVPLMGQVATTISDEARAKHHQFLREGAHGRYQGLTFW 133

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPNIN+ RDPRWGR  ET GEDP++  R  V +V+GLQ         D   R  K+ +  
Sbjct: 134 SPNINIFRDPRWGRGQETYGEDPYLTARMGVAFVQGLQ-------GDDPVYR--KLDATA 184

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A   V +    DR+HFDAR +++D+ +T+L  FE  VKEG   +VM +YNRV G  +
Sbjct: 185 KHFA---VHSGPEADRHHFDARPSKRDLYDTYLPAFEALVKEGKVDAVMGAYNRVYGESA 241

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
            A   LL   +R +W   GY+V+DC +I V +  H  LA S+E A A  +K G +L+CGQ
Sbjct: 242 SASQFLLRDVLRRDWGFTGYVVSDCWAI-VDIWKHHHLAPSREAAAALAVKNGTELECGQ 300

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE---NI 402
            Y      AV+QG + E +ID ++  L+T  MRLG FD  P+ V   +     ++   + 
Sbjct: 301 EYATLPA-AVRQGLIGEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQVPAHD 358

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA +AA+E +VLLKND   LPL S  +K +AVVGP A+ T+A++GNY G P   ++ + 
Sbjct: 359 ALALQAAQESLVLLKND-GVLPL-SRTLKRIAVVGPTADDTMALLGNYFGTPAAPVTILQ 416

Query: 463 GFSGYAN---VTYKTGCDDVACKSN 484
           G    A    V Y  G D V  + +
Sbjct: 417 GIRDAAKGIEVRYARGVDLVEGRDD 441



 Score =  135 bits (340), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 94/313 (30%), Positives = 142/313 (45%), Gaps = 54/313 (17%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A +AA+ AD  + + GL   VE E +          DR DL LP  Q  L+  +    K 
Sbjct: 608 ALDAARNADVVVFVGGLTGDVEGEEMKVDYPGFAGGDRTDLRLPAPQRALLEALHATGK- 666

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV++V+   GG  +A      ++ AIL + YPG+ GG A+   +FG+ NP GRLP+T+Y 
Sbjct: 667 PVVMVLT--GGSALAVDWAQAHLPAILMSWYPGQRGGTAVGQALFGEVNPAGRLPVTFYR 724

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
            D  Q LP         D     GRTY+++ G  LYPFG+GLSYT+F Y  L        
Sbjct: 725 AD--QALPA-------FDDYAMEGRTYRYFRGTPLYPFGHGLSYTRFDYGKLHL------ 769

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                          DA +      + +D R     + +V+  N G   G +V  +Y + 
Sbjct: 770 ---------------DAPR------IADDGR----LKLQVEVANTGKRAGDEVAQLYVRR 804

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT-LLPAGEHTIFVG 778
            A       + + GFQRV +  G  + + F  +A ++L   D A    ++PAG + + +G
Sbjct: 805 LAAAPGDAQQTLRGFQRVHLAPGERRTLTFELDAQQALRQYDDARGAYVVPAGRYEVRIG 864

Query: 779 NGGVSFPIHLNFN 791
                  +   F 
Sbjct: 865 GSSADARVRAGFT 877


>gi|94969405|ref|YP_591453.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
 gi|94551455|gb|ABF41379.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
          Length = 902

 Score =  289 bits (740), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 171/439 (38%), Positives = 244/439 (55%), Gaps = 41/439 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           ++ D++ P + R  DLV RMTLDEK  QL D+A  +PRLG+P Y+ WSEALHGV+  G  
Sbjct: 37  VYRDATRPANERAHDLVQRMTLDEKAAQLEDWATAIPRLGVPDYQTWSEALHGVARAG-- 94

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTY 164
                    AT FP  I   A+++  + K++G  +STEAR  YN  +         GLT+
Sbjct: 95  --------HATVFPQAIGMAATWDTEMVKQMGDVISTEARGKYNEAQREGNHRIFWGLTF 146

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           WSPNIN+ RDPRWGR  ET GEDPF+ G+  + ++ G+Q  +         + P K  + 
Sbjct: 147 WSPNINIFRDPRWGRGQETYGEDPFLTGKMGIAFIDGVQGPDA--------AHP-KAVAT 197

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
            KH+A   V +     R+ FD +V+ +D+EET+L  F   V +G   SVMC+YN V+G+ 
Sbjct: 198 SKHFA---VHSGPESLRHGFDVKVSPRDLEETYLAAFRATVTDGHVKSVMCAYNAVDGMG 254

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
           +CA+  LL + ++  W   G++V+DC +I  +   HK   D    A A +L AG DL C 
Sbjct: 255 ACANKMLLEEHLKQAWGFKGFVVSDCGAIMDVTQGHKNAPDIVH-AAAISLAAGTDLSCS 313

Query: 345 QYYTNFT--GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDE 400
            +   F    +AV++G V E  + ++ + LY     LG FD  GS     +    + S+E
Sbjct: 314 IWEPGFNTLADAVRKGLVTEDMVTRAAERLYAARFELGMFDEPGSNPNDKIDMSQVASEE 373

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
           +   A +AA E IVLLKND   LPL +A  KT+AV+GP A    ++ GNY G P R ++P
Sbjct: 374 HRAEALKAAEESIVLLKND-GLLPLKNA--KTIAVIGPTAELLASLEGNYNGQPVRPVTP 430

Query: 461 IAGFS---GYANVTYKTGC 476
           + G     G  NV Y  G 
Sbjct: 431 LDGIVKQFGAENVRYAQGS 449



 Score =  108 bits (270), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 86/281 (30%), Positives = 133/281 (47%), Gaps = 51/281 (18%)

Query: 503 LAGLDLSVEAESL---DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
           L G ++ ++ E     DR  + LP  Q +L+  +    K PV++V +S   V + +A  N
Sbjct: 645 LEGEEMPIKIEGFSGGDRTSIDLPATQEKLLEALGAAGK-PVVVVNLSGSAVALNWA--N 701

Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP-LTSMPLRPVDS 618
            +  AIL A YPG EGG AIA  + G+ NP GRLP+T+Y    VQ LP  T   ++    
Sbjct: 702 QHAGAILQAWYPGVEGGTAIAKTLAGESNPAGRLPVTFYAS--VQDLPAFTEYAMK---- 755

Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
                RTY++Y G  L+ FG+GLSY+ FKY  +    T                + DA K
Sbjct: 756 ----NRTYRYYAGKPLWGFGFGLSYSTFKYGEVKLAST----------------SVDAGK 795

Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
           +    V V                N     G +VV  Y K P +   ++   ++GFQRV 
Sbjct: 796 SLTATVTVT---------------NTSQVAGDEVVEAYLKTPQKGGPSH--SLVGFQRVP 838

Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           +  G ++ +    +  +SL+ VD +    + AGE+ + +G+
Sbjct: 839 LNPGESREVAIEVSP-RSLSAVDDSGKRSILAGEYRLSIGS 878


>gi|295086418|emb|CBK67941.1| Beta-glucosidase-related glycosidases [Bacteroides xylanisolvens
           XB1A]
          Length = 861

 Score =  289 bits (740), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 173/449 (38%), Positives = 246/449 (54%), Gaps = 39/449 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+SL    R +DL+ R+TL+EKV  + + +  +PRLG+ +YEWW+EALHGV   G   
Sbjct: 26  YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
                   AT FP  I   ASFN+SL  ++  A S EAR    + G +       GLT+W
Sbjct: 84  --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PN+N+ RDPRWGR  ET GEDP++ G+  +  VRGLQ  E        ++R  K+ +C 
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DARYDKLHACA 187

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ FDA  +  +D+ ET+L  F+  V++     VMC+YNR  G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +R EW   G +V+DC +I        H+   D KE A A  ++AG DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPD-KEHASAGAVRAGTDLE 303

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
           CG  Y +   +AV+ G + E +ID SLK L T    LG  D    +  +    + S E+ 
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTSVLNSKEHQ 362

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLL+N  N LPLN+     VAV+GP+AN +V   GNY GIP   ++ + 
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420

Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
                     + Y+ GCD V  K+  S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449



 Score =  103 bits (258), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 88/301 (29%), Positives = 130/301 (43%), Gaps = 58/301 (19%)

Query: 495 KTADATIIL--AGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVI 542
           K  DA +IL   G+  S+E E +          DR D+ LP  Q    + +  + K    
Sbjct: 594 KVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---DLLKALKKAGKK 650

Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
           +V ++  G  I      T  +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y    
Sbjct: 651 VVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD-- 708

Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
           V  LP         +     GRTY++     L+PFG+GLSYT F Y     +K      N
Sbjct: 709 VNQLP-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSK------N 755

Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
            +    N+  T                         +   NVG  DG +VV VY + P +
Sbjct: 756 TIAKGENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGD 790

Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGV 782
                   +  F+RV + AG+ + +       ++    D  +NT+ P  E T  +  GG 
Sbjct: 791 KEGPRYT-LRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMCPL-EGTYELLYGGT 847

Query: 783 S 783
           S
Sbjct: 848 S 848


>gi|336404627|ref|ZP_08585320.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
 gi|335941531|gb|EGN03384.1| hypothetical protein HMPREF0127_02633 [Bacteroides sp. 1_1_30]
          Length = 861

 Score =  289 bits (740), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 172/451 (38%), Positives = 246/451 (54%), Gaps = 43/451 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+SL    R +DL+ R+TL+EKV  + + +  +PRLG+ +YEWW+EALHGV   G   
Sbjct: 26  YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
                   AT FP  I   ASFN+SL  ++  A S EAR    + G +       GLT+W
Sbjct: 84  --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE--GHENATDLNSRPLKVSS 223
           +PN+N+ RDPRWGR  ET GEDP++ G+  +  VRGLQ  E  G++          K+ +
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDAGYD----------KLHA 185

Query: 224 CCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           C KH+A +    W   +R+ FDA  +  +D+ ET+L  F+  V++     VMC+YNR  G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIAPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLD 340
            P C   +LL Q +R EW   G +V+DC +I        H+   D KE A A  ++ G D
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHETHPD-KEHASAAAVRTGTD 301

Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
           L+CG  Y +   +AV+ G + E +ID SLK L T    LG  D  P +  +    + S E
Sbjct: 302 LECGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQPAWAEIPTSVLNSKE 360

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
           +  LA   ARE +VLL+N  N LPLN+     +AV+GP+AN +V   GNY GIP   ++ 
Sbjct: 361 HQALALRMARESLVLLQNKNNILPLNTN--LKIAVMGPNANDSVMQWGNYNGIPAHTVTL 418

Query: 461 IAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
           +           + Y+ GCD V  K+  S+F
Sbjct: 419 LEAVRAKLPEGQIIYEPGCDRVDRKTLQSLF 449



 Score =  103 bits (257), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 88/301 (29%), Positives = 130/301 (43%), Gaps = 58/301 (19%)

Query: 495 KTADATIIL--AGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVI 542
           K  DA +IL   G+  S+E E +          DR D+ LP  Q    + +  + K    
Sbjct: 594 KVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---DLLKALKKAGKK 650

Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
           +V ++  G  I      T  +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y    
Sbjct: 651 VVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD-- 708

Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
           V  LP         +     GRTY++     L+PFG+GLSYT F Y     +K      N
Sbjct: 709 VNQLP-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSK------N 755

Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
            +    N+  T                         +   NVG  DG +VV VY + P +
Sbjct: 756 TIAKGENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGD 790

Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGV 782
                   +  F+RV + AG+ + +       ++    D  +NT+ P  E T  +  GG 
Sbjct: 791 KEGPRYT-LRAFKRVHIPAGKTESVAIPLTG-ENFEWFDAESNTMRPL-EGTYELLYGGT 847

Query: 783 S 783
           S
Sbjct: 848 S 848


>gi|326427096|gb|EGD72666.1| hypothetical protein PTSG_04397 [Salpingoeca sp. ATCC 50818]
          Length = 614

 Score =  289 bits (740), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 201/630 (31%), Positives = 302/630 (47%), Gaps = 65/630 (10%)

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPNIN+ RDPRWGR  E P EDP + G +   Y  GLQ  E        +SR  KV    
Sbjct: 11  SPNININRDPRWGRNQEVPSEDPLLNGEFGKLYTMGLQQGE--------DSRYTKVVVTL 62

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+ AY +++  G  R++FDA+V+   + +T+   F   V EG+A  VMCSYN +NG P+
Sbjct: 63  KHWDAYSLEDSDGFTRHNFDAKVSNFALMDTYWPAFRKAVMEGNAKGVMCSYNALNGRPT 122

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
           C  P LL + +R  W   GY+ +D  +I+ +   H + A++     A       D+D G 
Sbjct: 123 CTHP-LLTKVLRDIWKFDGYVTSDTGAIEDIYAKHHYTANASAAVAAALRDGRCDMDSGA 181

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENIE 403
            Y +   +AV  G+    D+D++L     +   LG FD      Y  +    I +    +
Sbjct: 182 VYHDALLDAVNSGECSMDDVDRALYNTLKLRFELGLFDPIEDQPYWRINASSINTTYAQD 241

Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR------Y 457
           L  +   E ++LL+N  N LP    K + VAV+GPH NA  A++GNY G  C        
Sbjct: 242 LNMKITLESMILLQNHNNALPFK--KGRKVAVIGPHINAQEALVGNYLGQLCPDDSFDCI 299

Query: 458 MSPIA---GFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
            SP+A     +G +N     G   +AC ++ SI  A   AK AD  ++L G++ ++EAES
Sbjct: 300 TSPLAAIEAINGMSNTVSAMGSGVLAC-TDASIQEAVNVAKDADYVVLLIGINDTIEAES 358

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
            DR  + LP  Q +L   +A + K    ++I   GG+ +A  +    + AI+ AGYPG  
Sbjct: 359 NDRTSIDLPQCQHKLTAAIAHLNKTTAAVLI--NGGM-LAIEQEKKQLPAIIEAGYPGFY 415

Query: 575 GGRAIADVVFGKFNP-GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
           GG AIA  +FG  N  GG+LP T Y  DY+  + ++ M +        PGR+Y++Y G  
Sbjct: 416 GGAAIAKTIFGDNNHLGGKLPYTVYPADYIHKINMSDMEMT-----NSPGRSYRYYTGQP 470

Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
           L+PFG+GL+YT F                             + ++  P         + 
Sbjct: 471 LWPFGFGLAYTTF-----------------------------SVQSPGPSASTFATGSNT 501

Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPA--EIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
            F   V   N G   G  VV VY  P +    + +  KQ+I F+RV +   +   +    
Sbjct: 502 SFSLPVHVVNTGKRTGDTVVQVYMAPVSLPHRSFSLKKQLIAFERVHLTPNQRLGVTIPL 561

Query: 752 NACKSLNIVD-YAANTLLPAGEHTIFVGNG 780
           +A    N+VD    N +   G + + V +G
Sbjct: 562 SA-DVFNMVDPVTGNVVSTPGSYRLVVSDG 590


>gi|336415363|ref|ZP_08595703.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
           3_8_47FAA]
 gi|335940959|gb|EGN02821.1| hypothetical protein HMPREF1017_02811 [Bacteroides ovatus
           3_8_47FAA]
          Length = 861

 Score =  289 bits (739), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 173/449 (38%), Positives = 245/449 (54%), Gaps = 39/449 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+SL    R +DL+ R+TL+EKV  + + +  +PRLG+ +YEWW+EALHGV   G   
Sbjct: 26  YQDTSLAAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
                   AT FP  I   ASFN+SL  ++  A S EAR    + G +       GLT+W
Sbjct: 84  --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PN+N+ RDPRWGR  ET GEDP++ G+  +  VRGLQ  E        ++R  K+ +C 
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DARYDKLHACA 187

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ FDA  +  +D+ ET+L  F+  V++     VMC+YNR  G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +R EW   G +V+DC +I        H    D KE A A  ++AG DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASAGAVRAGTDLE 303

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
           CG  Y +   +AV+ G + E +ID SLK L T    LG  D    +  +    + S E+ 
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTSVLNSKEHQ 362

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLL+N  N LPLN+     VAV+GP+AN +V   GNY GIP   ++ + 
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420

Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
                     + Y+ GCD V  K+  S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449



 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 87/297 (29%), Positives = 129/297 (43%), Gaps = 56/297 (18%)

Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
           AD  +   G+  S+E E +          DR D+ LP  Q  L+  + +V K    +V +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQRDLLKALKKVGKK---VVFI 654

Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           +  G  I      T  +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y    V  L
Sbjct: 655 NYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD--VNQL 712

Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
           P         +     GRTY++     L+PFG+GLSYT F Y     +K      N +  
Sbjct: 713 P-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSK------NTIAK 759

Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
             N+  T                         +   NVG  DG +VV VY + P +    
Sbjct: 760 GENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGDKEGP 794

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
               +  F+RV + AG+ + +       ++    D  +NT+ P  E T  +  GG S
Sbjct: 795 RYT-LRAFKRVHIPAGKTESVAISLTG-ENFEWFDVESNTMRPL-EGTYELLYGGTS 848


>gi|262405256|ref|ZP_06081806.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294644754|ref|ZP_06722499.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294810589|ref|ZP_06769241.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345508031|ref|ZP_08787672.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
 gi|229444722|gb|EEO50513.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
 gi|262356131|gb|EEZ05221.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|292639876|gb|EFF58149.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294442250|gb|EFG11065.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
          Length = 861

 Score =  289 bits (739), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 173/449 (38%), Positives = 245/449 (54%), Gaps = 39/449 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+SL    R +DL+ R+TL+EKV  + + +  +PRLG+ +YEWW+EALHGV   G   
Sbjct: 26  YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
                   AT FP  I   ASFN+SL  ++  A S EAR    + G +       GLT+W
Sbjct: 84  --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PN+N+ RDPRWGR  ET GEDP++ G+  +  VRGLQ  E        ++R  K+ +C 
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DARYDKLHACA 187

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ FDA  +  +D+ ET+L  F+  V++     VMC+YNR  G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +R EW   G +V+DC +I        H    D KE A A  ++AG DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASAGAVRAGTDLE 303

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
           CG  Y +   +AV+ G + E +ID SLK L T    LG  D    +  +    + S E+ 
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTSVLNSKEHQ 362

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLL+N  N LPLN+     VAV+GP+AN +V   GNY GIP   ++ + 
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420

Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
                     + Y+ GCD V  K+  S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449



 Score =  106 bits (264), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 85/290 (29%), Positives = 125/290 (43%), Gaps = 57/290 (19%)

Query: 495 KTADATIIL--AGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVI 542
           K  DA +IL   G+  S+E E +          DR D+ LP  Q    + +  + K    
Sbjct: 594 KVMDADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---DLLKALKKAGKK 650

Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
           +V ++  G  I      T  +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y    
Sbjct: 651 VVFINYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD-- 708

Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
           V  LP         +     GRTY++     L+PFG+GLSYT F Y     +K      N
Sbjct: 709 VNQLP-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTTFTYGEAKLSK------N 755

Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
            +    N+  T                         +   NVG  DG +VV VY + P +
Sbjct: 756 TIAKGENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGD 790

Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
                   +  F+RV + AG+ + +       +S    D A NT+ P  +
Sbjct: 791 KEGPRYT-LRAFKRVHIPAGKTESVAISLTH-ESFEWFDEATNTMHPVAD 838


>gi|160885419|ref|ZP_02066422.1| hypothetical protein BACOVA_03419 [Bacteroides ovatus ATCC 8483]
 gi|156109041|gb|EDO10786.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
          Length = 861

 Score =  288 bits (738), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 174/449 (38%), Positives = 244/449 (54%), Gaps = 39/449 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+SL    R +DL+ R+TL+EKV  + + +  +PRLG+ +YEWW+EALHGV   G   
Sbjct: 26  YQDTSLAAEQRTEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-------LGR-AGLTYW 165
                   AT FP  I   ASFN+SL  ++  A S EAR           L R  GLT+W
Sbjct: 84  --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGDSGVLKRYQGLTFW 135

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PN+N+ RDPRWGR  ET GEDP++ G+  +  VRGLQ  E        ++R  K+ +C 
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPE--------DARYDKLHACA 187

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ FDA  +  +D+ ET+L  F+  V++     VMC+YNR  G P
Sbjct: 188 KHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEGEP 244

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +R EW   G +V+DC +I        H    D KE A A  ++AG DL+
Sbjct: 245 CCGSNRLLMQILRDEWGYEGIVVSDCGAISDFYRPGTHGTHPD-KEHASAGAVRAGTDLE 303

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
           CG  Y +   +AV+ G + E +ID SLK L T    LG  D    +  +    + S E+ 
Sbjct: 304 CGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTSVLNSKEHQ 362

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   ARE +VLL+N  N LPLN+     VAV+GP+AN +V   GNY GIP   ++ + 
Sbjct: 363 ALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTLLE 420

Query: 463 GFSGY---ANVTYKTGCDDVACKSNNSIF 488
                     + Y+ GCD V  K+  S+F
Sbjct: 421 AVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449



 Score =  103 bits (256), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 85/297 (28%), Positives = 127/297 (42%), Gaps = 56/297 (18%)

Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
           AD  +   G+  S+E E +          DR D+ LP  Q    + +  + K    +V +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---DLLKALKKAGKKVVFI 654

Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           +  G  I      T  +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y    V  L
Sbjct: 655 NYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD--VNQL 712

Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
           P         +     GRTY++     L+PFG+GLSYT F Y     +K      N +  
Sbjct: 713 P-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSK------NTIAK 759

Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
             N+  T                         +   NVG  DG +VV VY + P +    
Sbjct: 760 GENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGDKEGP 794

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
               +  F+RV + AG+ + +       ++    D  +NT+ P  E T  +  GG S
Sbjct: 795 RYT-LRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMCPL-EGTYELLYGGTS 848


>gi|380696428|ref|ZP_09861287.1| beta-glucosidase [Bacteroides faecis MAJ27]
          Length = 851

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 169/430 (39%), Positives = 252/430 (58%), Gaps = 41/430 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ + + P   RV DL+SR+T++EK+  L   + G+PRLG+ +Y   +EALHGV  V PG
Sbjct: 27  LYKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG 84

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN---LGRAG-------L 162
                     T FP  I   A++N  L +++   +S EARA +N    GRA        L
Sbjct: 85  RF--------TVFPQAIGLAATWNPELQRRVATVISDEARARWNELDQGRAQKEQFSDVL 136

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+WSP +N+ARDPRWGR  ET GEDPF+ G     +V+GLQ  + H          LK+ 
Sbjct: 137 TFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPHY---------LKIV 187

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S  KH+AA + ++    +R+  + +++E+ + E +   FEMCVKEG A+S+M +YN +N 
Sbjct: 188 STPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALND 243

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P   +  LL + +R +W   GY+V+DC    ++V+ HK+L  +KE A   +LKAGLDL+
Sbjct: 244 VPCTLNAWLLQKVLRKDWGFQGYVVSDCGGPALLVNAHKYLK-TKEAAATLSLKAGLDLE 302

Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
           CG   Y     NA +Q  V + DID +  ++ T  M+LG FDG  +  Y  +    I S 
Sbjct: 303 CGDDVYDGPLLNAYKQYMVSDADIDSAAYHVLTARMKLGLFDGVERNPYTKISPSVIGSK 362

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           E+ ++A +AAR+ IVLLKN +N LPLN++K+K++AVVG   NA     G+Y+G P   + 
Sbjct: 363 EHQQIALDAARQCIVLLKNQKNMLPLNASKLKSIAVVG--INAGKCEFGDYSGAPV--VE 418

Query: 460 PIAGFSGYAN 469
           P++   G  N
Sbjct: 419 PVSILQGIRN 428



 Score =  136 bits (342), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 90/294 (30%), Positives = 144/294 (48%), Gaps = 56/294 (19%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A +A +  +  I + G++ S+E E  DR D+ LP  Q + + ++ +V    +++++    
Sbjct: 595 AGKAVRECETVIAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKVNSNMIVILV---A 651

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
           G  +A    + ++ AI+ A YPGE+GG A+A+V+FG +NP GRLP+T+Y         L 
Sbjct: 652 GSSLAINWMDEHVPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS-------LD 704

Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
            +P  P D      GRTYK++ G  LYPFGYGLSY+ FKY                    
Sbjct: 705 ELP--PFDDYDITKGRTYKYFKGEVLYPFGYGLSYSSFKY-------------------- 742

Query: 669 NLNYTSDASKTRCPGVLVNDLRC-DDYFEFKVDF--QNVGSTDGSDVVIVYSKPPAEIAA 725
                             +DLR  D+  E  V F  +N G  +G +V  VY + P     
Sbjct: 743 ------------------SDLRVKDEADEVAVSFRLKNTGKRNGDEVTQVYVRIPETGGI 784

Query: 726 TYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT-LLPAGEHTIFVG 778
             +K++ GF+RV +++G ++R++   N  + L   D      ++P G   I VG
Sbjct: 785 VPVKELKGFRRVPLKSGESRRVEIRLNK-EQLRYWDVGKGQFVVPKGTFDIMVG 837


>gi|431798021|ref|YP_007224925.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
           DSM 17526]
 gi|430788786|gb|AGA78915.1| beta-glucosidase-like glycosyl hydrolase [Echinicola vietnamensis
           DSM 17526]
          Length = 906

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 167/423 (39%), Positives = 243/423 (57%), Gaps = 34/423 (8%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           F F D    +  RV  LV +M+L+EKV Q+ + +  +PRL +P+Y WW+E LHGV+  G 
Sbjct: 50  FSFLDMEKNFEERVDILVDQMSLEEKVSQMMNASPAIPRLKVPEYNWWNECLHGVARAGY 109

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY-----NLGRA---GLT 163
                     AT FP  I   ASF+++L K IG  +S EARA +     N  R    GL 
Sbjct: 110 ----------ATVFPQSISVAASFDKNLMKDIGSVISDEARAKHHEFIRNGKRGIYTGLD 159

Query: 164 YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
           +WSPNIN+ RDPRWGR  ET GEDP++ G  A  ++ GLQD +G         + LK  +
Sbjct: 160 FWSPNINIFRDPRWGRGHETYGEDPYLTGELASQFIEGLQDSDG---------KYLKTIA 210

Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
             KH+A   V +     R+ FD  V+++D+ ET+L  F   VKE    S+M +YNR  G 
Sbjct: 211 TSKHFA---VHSGPEPLRHTFDVDVSDRDLYETYLPAFRKTVKEAKVYSIMGAYNRFRGE 267

Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
                  LLNQ +R +W   GY+V+DC +IQ +   HK +A +  +A A  +  G DL+C
Sbjct: 268 SCSGHDFLLNQLLREQWGFEGYVVSDCGAIQDIHTGHK-IASTAAEAAAIGVSGGCDLNC 326

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDEN 401
           G YYT+ T  AV +G + E +ID ++K L+    RLG FD      Y  +    +CS+ +
Sbjct: 327 GNYYTHLT-EAVAEGLISEEEIDIAVKRLFLARFRLGMFDPEEAVSYAQIPFGIVCSEAH 385

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
             LA +AA++ +VLLKN +N LPL+  K+K +AV+GP+A+   +++GNY GIP + ++ +
Sbjct: 386 NTLARQAAQKSMVLLKNQKNLLPLSVDKIKRIAVIGPNADNVESLLGNYHGIPKKPVTFL 445

Query: 462 AGF 464
            G 
Sbjct: 446 DGI 448



 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 102/304 (33%), Positives = 149/304 (49%), Gaps = 55/304 (18%)

Query: 485 NSIFAASEAAKTADATIILAGLDLSVEAESLD----------REDLWLPGYQTQLINQVA 534
           + I  A   AK+AD  +++ GL   +E ES+D          R  + LP  Q  L+  V 
Sbjct: 615 SKIDEAVAMAKSADLAVVVLGLSQRLEGESMDVVTPGFDRGDRTAITLPAQQEALLKAVK 674

Query: 535 EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
           E  K PVILV+ +   + I +A+ N  + AI+ AGYPGEEGG A+ADVVFG +NP GRLP
Sbjct: 675 ETGK-PVILVLNAGSAMAINWAKEN--VDAIISAGYPGEEGGNALADVVFGDYNPAGRLP 731

Query: 595 ITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
           IT+Y    V+ LP       P +     GRTY+++ G  LYPFGYGLSYT+F Y  L   
Sbjct: 732 ITYYQS--VEDLP-------PFEDYDMKGRTYRYFEGKPLYPFGYGLSYTRFSYKDLEVP 782

Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
             +                                   D  +  V   N+GS  G +VV 
Sbjct: 783 AKVNAG--------------------------------DPVQISVTVTNIGSRAGDEVVQ 810

Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
           +Y           I+Q+ GFQR+ ++ G +K + F  +A + L++++  +  ++  G  +
Sbjct: 811 LYLNDKEASTMRPIRQLEGFQRIHLKPGESKVVNFTLSA-RQLSMINGESKRVIEEGVFS 869

Query: 775 IFVG 778
           I VG
Sbjct: 870 IHVG 873


>gi|423303655|ref|ZP_17281654.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
           CL03T00C23]
 gi|423307623|ref|ZP_17285613.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
           CL03T12C37]
 gi|392688019|gb|EIY81310.1| hypothetical protein HMPREF1072_00594 [Bacteroides uniformis
           CL03T00C23]
 gi|392689492|gb|EIY82769.1| hypothetical protein HMPREF1073_00363 [Bacteroides uniformis
           CL03T12C37]
          Length = 801

 Score =  287 bits (734), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 234/803 (29%), Positives = 369/803 (45%), Gaps = 147/803 (18%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSN 108
           ++ DS  P  +RV++L+S+MTL+EK  Q+    +G  R+    LP   W +E    G+ N
Sbjct: 55  IYEDSCAPLEVRVQNLLSQMTLEEKSCQMATL-YGSGRVLNDALPSDNWKNEVWKDGIGN 113

Query: 109 VG---------------PGTHF------------------------DDVIPG-----ATS 124
           +                P  H                         ++ I G     AT 
Sbjct: 114 IDEEHNGLGSFKSAYSFPYAHHVKTKHAIQRWFVENTRLGIPVDFTNEGIRGLCHDRATY 173

Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
           FP      A++N+ L  +IG+A   EAR    LG   +  +SP +++A+DPRWGR  ET 
Sbjct: 174 FPAQCGQGATWNKELIAQIGEA---EAREASVLGYTNI--YSPILDIAQDPRWGRCVETY 228

Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
           GEDP+  G+     +  LQ                K+ S  KH+A Y +       +   
Sbjct: 229 GEDPYHAGQMGKQMILSLQKN--------------KLVSTPKHFAVYSIPVGGRDGKTRT 274

Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
           D  V  ++M   +L PF +   E  A  VM SYN  +G P       L + +R EW   G
Sbjct: 275 DPHVAPREMRTLYLDPFRVAFHEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 334

Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAV 355
           Y+V+D ++++ +   H+ +A+  EDAVAQ + AGL++      T+FT          +AV
Sbjct: 335 YVVSDSEAVEFISTKHQ-VANGYEDAVAQAVNAGLNIR-----THFTPPADFILPLRSAV 388

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY-VSLGKQDICSDENIELAAEAAREGIV 414
           ++GK+ +  +++ +  +  V   LG FD   +       Q + S E+ +LA EAAR+ +V
Sbjct: 389 KKGKISQETLNQRVAEILRVKFWLGLFDNPYRGDEKRAGQIVHSPEHQQLALEAARQSLV 448

Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVT 471
           LLKN+  TLPL S  +++VAV+GP+A+    +I  Y        +   G       A+V 
Sbjct: 449 LLKNEHQTLPL-SKSIRSVAVIGPNADERQQLICRYGPANAHITTIYEGIKKMLPQADVV 507

Query: 472 YKTGCDDV--------------ACKSNNSIFAASEAAKTADATI-ILAGLDLSVEAESLD 516
           YK GCD +              A +    +  A EAAK A+ T+ +L G +L+V  E   
Sbjct: 508 YKKGCDIIDPHFPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVR-EDRS 566

Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
           R  L LPG Q +L+ ++ ++ K PV+LV++      I FA   T++ AI+ A +PGE GG
Sbjct: 567 RTSLDLPGRQEELLKKICQLGK-PVVLVMIDGRASSINFAA--THVPAIIHAWFPGEFGG 623

Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
           +AIA+ +FG +NPGGRL +T+     V  +P  + P +P          Y       LYP
Sbjct: 624 QAIAEALFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDESSETSVYG-----ALYP 675

Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           FG+GLSYT F+Y+ L  + + Q                        GV  N         
Sbjct: 676 FGHGLSYTTFQYSDLVISPSKQ------------------------GVQGN-------IS 704

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
                +N+G  +G +VV +Y +       TY + + GF+R+ ++   +  + F     + 
Sbjct: 705 ISCTIKNIGQREGDEVVQLYLRDEVSSVTTYTQVLRGFERITLKPEASHTVHFELTP-QE 763

Query: 757 LNIVDYAANTLLPAGEHTIFVGN 779
           L I D   N  +  G   + +G+
Sbjct: 764 LGIWDKQMNFTVEPGMFKVMIGS 786


>gi|391417909|gb|AFM44649.1| Xyl3A [Caldanaerobius polysaccharolyticus]
          Length = 789

 Score =  287 bits (734), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 232/825 (28%), Positives = 368/825 (44%), Gaps = 162/825 (19%)

Query: 46  GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLG---------------DFAHGVPR 90
           G    S L+ D++ P   RV+DL+SRMTLDEK+ QL                D A  + +
Sbjct: 3   GNSKESALYLDATQPVEKRVEDLLSRMTLDEKIAQLSSVWVYELLDNMEFSVDKAKDLLK 62

Query: 91  LGLPQYEWWSEALHGVSNVGPG------------------------THFDD----VIPGA 122
            G+ Q       + G SN+GP                          H +     +  GA
Sbjct: 63  DGIGQIT----RIGGASNLGPKESAQLANEIQRYLIENTRLGIPALVHEESCSGYMAKGA 118

Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE 182
           T FP  I   +++N  L K++G  +  + +A+           +P ++VARD RWGR+ E
Sbjct: 119 TCFPQTIGVASTWNTELVKQMGSVIREQMKAV-----GAHQALAPLMDVARDARWGRVEE 173

Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD----NWKG 238
           T GEDP+++    V+Y+ GLQ      N  D       + +  KH+  Y       NW  
Sbjct: 174 TFGEDPYLISEMGVSYIEGLQG----GNIKD------GIMATVKHFVGYGFSEGGMNWA- 222

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
                  A + E+++ E FL PFE  VK+   +SVM +Y+ ++GIP     KLL Q +R 
Sbjct: 223 ------PAHIPERELREVFLLPFEAAVKKAKTASVMAAYHELDGIPCHGSKKLLTQILRN 276

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGN 353
           EW   G +V+D   + ++ + H  +A  K +A    L+AG+D+     DC   Y      
Sbjct: 277 EWGFDGLVVSDYFGVNMLYEYH-HVARDKGEAAKIALQAGVDIELPSRDC---YGQPLKE 332

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGI 413
           AVQ+G V+E  ID+ ++ +  +    G F+     V    +   + +  +LA + A++ I
Sbjct: 333 AVQKGLVEEALIDEVVRRILRMKFLSGVFENPYVDVEKAAEVFDTPDQRKLAYKLAQQSI 392

Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS-------------- 459
           VLLKN  + LPL    +K++AV+GP+A++   +IG+YA  PC   S              
Sbjct: 393 VLLKNQGDLLPLKK-DIKSIAVIGPNADSVRNIIGDYA-YPCHIESLVETKEQSNVFNTP 450

Query: 460 ------------PIAGF--------SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADA 499
                       PI           S    + Y  GC+ V          A EAAK +D 
Sbjct: 451 VPDKVSLVDNFVPIKSILEGIKGKISPETELHYAKGCE-VTGDDKGGFAEAIEAAKKSDV 509

Query: 500 TIILAG-----LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
            I++ G      D     ES DR DL LPG Q +L+  +      P ++V+++   + I 
Sbjct: 510 AIVVVGDKAGLTDDCTSGESRDRADLNLPGVQQELVEAIYNTGT-PTVVVLVNGRPLSIN 568

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +   + +I AI+ A  PGEEG  A+ADV+FG +NPGG+LP+++     V  +P+     +
Sbjct: 569 W--ISRHIPAIIEAWLPGEEGAAAVADVLFGDYNPGGKLPVSFPRS--VGQVPVY-YNHK 623

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
           P     +    Y   +   LYPFGYGLSYT+F+++ L    +                  
Sbjct: 624 PSGGRSHWKGDYVEMSTKPLYPFGYGLSYTKFEFSNLEIAPS------------------ 665

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                        ++  D      VD QN G  +G +VV +Y +         +K++ GF
Sbjct: 666 -------------EVYDDGKVRISVDVQNAGKLEGDEVVQLYVRNEVSNVTRPVKELKGF 712

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           +RV +R G  K++ F  +  + L   D     ++  G   + +G+
Sbjct: 713 KRVSLRPGEKKKVVFELSVSQ-LGFYDEDMRYVVQPGTVKVMIGS 756


>gi|389737578|ref|ZP_10190998.1| beta-glucosidase [Rhodanobacter sp. 115]
 gi|388434298|gb|EIL91245.1| beta-glucosidase [Rhodanobacter sp. 115]
          Length = 898

 Score =  286 bits (733), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 173/440 (39%), Positives = 243/440 (55%), Gaps = 39/440 (8%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ D++  +  R  DLVSRMTL EKV Q+ + A  +PRLG+P Y+WW+EALHGV+  G  
Sbjct: 42  LYLDTAHSFQERAADLVSRMTLAEKVAQMQNSAPAIPRLGVPAYDWWNEALHGVARAGE- 100

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-------LGR-AGLTY 164
                    AT FP  I   A+F+ +L      A+S EARA YN        GR  GLT+
Sbjct: 101 ---------ATVFPQAIGLAATFDPALLHHEATAISDEARAKYNDFQRRGMRGRYEGLTF 151

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           WSPN N+ RDPRWGR  ET GEDP++  R  V +VRGL   EG +          K+ + 
Sbjct: 152 WSPNTNIFRDPRWGRGQETYGEDPYLTSRMGVAFVRGL---EGDDPTYQ------KLDAT 202

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
            KH+A   V +    +R+ FD   +E+D+ ET+L  F+  V++G   +VM +YNRV+G+P
Sbjct: 203 AKHFA---VHSGPESERHRFDVHPSERDLHETYLPAFQALVQQGGVDAVMGAYNRVDGVP 259

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
           + A  +LL   +R +W   GY+V+DCD++  +   HK +  + E A A  +  G DL+CG
Sbjct: 260 ATASHRLLQDILRRDWGFKGYVVSDCDAVADIYQFHKVVP-TAEQAAALAVNNGDDLNCG 318

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDENI 402
             Y      AV  G V E  ID ++  L     RLG FD  G   + +L    + S ++ 
Sbjct: 319 TTYATLV-KAVHDGLVNEHTIDTAVTRLMLARFRLGMFDPPGRVPWSTLPMSVVQSPQHD 377

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA   A+E +VLLKND   LPL S  V+ +AV+GP A+   A++GNY G P   ++ + 
Sbjct: 378 ALALRTAQESMVLLKND-GLLPL-SHNVRRIAVIGPTADNVTALLGNYHGTPKAPVTILQ 435

Query: 463 GFSGY---ANVTYKTGCDDV 479
           G       A VTY  G + V
Sbjct: 436 GIREAVPNAQVTYVQGTELV 455



 Score =  137 bits (346), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 93/314 (29%), Positives = 143/314 (45%), Gaps = 54/314 (17%)

Query: 489 AASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAK 538
           AA +AA+ AD  I   GL   +E E +          DR  L LP  Q +L+ Q  +V  
Sbjct: 625 AALDAARHADVVIFAGGLSSDLEGEEMPVDYPGFAGGDRTTLALPATQRKLL-QALQVTG 683

Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
            PV+LV+ +   + I +A+ +  + AIL A YPG++GG A+AD +FG  +P GRLP+T+Y
Sbjct: 684 KPVVLVLTTGSALAIDWAKQH--LPAILLAWYPGQDGGHAVADALFGNVDPAGRLPVTFY 741

Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
                     ++  L P D     GRTY+++ G  L+PFG+GLSYT+F Y+ L   +   
Sbjct: 742 K---------SARQLPPFDDYAMKGRTYRYFTGQPLFPFGFGLSYTRFAYSDLQLDR--- 789

Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSK 718
                                       + L   D     +  +N G   G +VV +Y +
Sbjct: 790 ----------------------------DTLGPSDRMRISLRVKNTGQRAGDEVVQLYLR 821

Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA-GEHTIFV 777
           P     A  IK + GFQR+ ++ G  + + F  +    L   D A +    A G + + V
Sbjct: 822 PLRAPHARAIKSLRGFQRISLKPGEERSVSFDISPQTDLKYYDVAHHAYAVAPGRYQVQV 881

Query: 778 GNGGVSFPIHLNFN 791
           G       +  +F 
Sbjct: 882 GASSADIRLTRDFT 895


>gi|255690202|ref|ZP_05413877.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260624221|gb|EEX47092.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 853

 Score =  286 bits (732), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 168/430 (39%), Positives = 247/430 (57%), Gaps = 41/430 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ +++ P   RV DL+SR+T++EK+  L   + G+PRLG+ +Y   +EALHGV  V PG
Sbjct: 28  LYKNANAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG 85

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
                     T FP  I   A++N  L K+I   +S EARA +N    G          L
Sbjct: 86  RF--------TVFPQAIGLAATWNPELQKRIATVISDEARARWNELDQGRNQKEQFSDVL 137

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+WSP +N+ARDPRWGR  ET GEDPF+ G     +V+GLQ  + H          LK+ 
Sbjct: 138 TFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGDDPHY---------LKIV 188

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S  KH+AA + ++    +R+  + +++E+ + E +   FEMCVKEG A+S+M +YN +N 
Sbjct: 189 STPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALNN 244

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P   +  LL + +R +W   GY+V+DC    ++V+ HK++  +KE A   ++KAGLDL+
Sbjct: 245 VPCTLNSWLLQKVLRRDWGFQGYVVSDCGGPSLLVNAHKYV-KTKEAAATLSIKAGLDLE 303

Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
           CG   Y  +  NA +Q    E DID +  ++ T  M+LG FDG  +  Y  +    I S 
Sbjct: 304 CGDDVYDEYLLNAYKQYMASEADIDSAAYHVLTARMKLGLFDGVERNPYAKISPSVIGSK 363

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           E+  +A  AARE IVLLKN +N LPLN  K+K++AVVG   NA     G+Y+G P   + 
Sbjct: 364 EHQTVALNAARECIVLLKNQKNMLPLNVKKLKSIAVVG--INAGKCEFGDYSGAPV--VE 419

Query: 460 PIAGFSGYAN 469
           P++   G  N
Sbjct: 420 PVSILQGIKN 429



 Score =  134 bits (337), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 93/291 (31%), Positives = 147/291 (50%), Gaps = 49/291 (16%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A +A +  +  + + G++ S+E E  DR D+ LP  Q + + ++ +V   P I++++ AG
Sbjct: 596 AGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIILVLVAG 653

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
              +A    N ++ AI+ A YPGE+GG A+A+V+FG +NP GRLP+T+Y    ++ LP  
Sbjct: 654 S-SLAVNWENEHLPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS--LEQLPAF 710

Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
                  D     GRTY+++    LYPFGYGLSYT FKY+ L                  
Sbjct: 711 D------DYDITKGRTYQYFKKDVLYPFGYGLSYTTFKYSNLK----------------- 747

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY-- 727
                DA KT      VN              +N G   G +V  VY + P EIA +   
Sbjct: 748 ---VDDAGKT------VN---------VSFTLKNTGKRAGDEVAQVYVRLP-EIAGSTQA 788

Query: 728 IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           I+Q+ GF+RV ++AG +++++   +  +     +  A  ++P G  T  VG
Sbjct: 789 IRQLKGFRRVALKAGESRKVEITLDKEQLRYWDEKQACFVVPQGSFTFMVG 839


>gi|270296098|ref|ZP_06202298.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
 gi|270273502|gb|EFA19364.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D20]
          Length = 798

 Score =  286 bits (731), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 234/803 (29%), Positives = 369/803 (45%), Gaps = 147/803 (18%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSN 108
           ++ DS  P   RV++L+S+MTL+EK  Q+    +G  R+    LP   W +E    G+ N
Sbjct: 52  IYEDSYAPLEARVQNLLSQMTLEEKSCQMATL-YGSGRVLNDALPSDNWKNEVWKDGIGN 110

Query: 109 VG---------------PGTHF------------------------DDVIPG-----ATS 124
           +                P  H                         ++ I G     AT 
Sbjct: 111 IDEEHNGLGSFKSAYSFPYAHHVKTKHAIQRWFVENTRLGIPVDFTNEGIRGLCHDRATY 170

Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
           FP      A++N+ L  +IG+A   EAR    LG   +  +SP +++A+DPRWGR  ET 
Sbjct: 171 FPAQCGQGATWNKELIAQIGEA---EAREASVLGYTNI--YSPILDIAQDPRWGRCVETY 225

Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
           GEDP+  G+     +  LQ                K+ S  KH+A Y +       +   
Sbjct: 226 GEDPYHAGQMGKQMILSLQKN--------------KLVSTPKHFAVYSIPVGGRDGKTRT 271

Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
           D  V  ++M   +L PF +   E  A  VM SYN  +G P       L + +R EW   G
Sbjct: 272 DPHVAPREMRTLYLDPFRVAFHEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 331

Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAV 355
           Y+V+D ++++ +   H+ +A+  EDAVAQ + AGL++      T+FT          +AV
Sbjct: 332 YVVSDSEAVEFISTKHQ-VANGYEDAVAQAVNAGLNIR-----THFTPPADFILPLRSAV 385

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY-VSLGKQDICSDENIELAAEAAREGIV 414
           ++GK+ +  +++ +  +  V   LG FD   +       Q + S E+ +LA EAAR+ +V
Sbjct: 386 KKGKISQETLNQRVAEILRVKFWLGLFDNPYRGDEKRAGQIVHSPEHQQLALEAARQSLV 445

Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVT 471
           LLKN+  TLPL S  +++VAV+GP+A+    +I  Y        +   G       A+V 
Sbjct: 446 LLKNEHQTLPL-SKSIRSVAVIGPNADERQQLICRYGPANAHITTIYEGIKKMLPQADVV 504

Query: 472 YKTGCDDV--------------ACKSNNSIFAASEAAKTADATI-ILAGLDLSVEAESLD 516
           YK GCD +              A +    +  A EAAK A+ T+ +L G +L+V  E   
Sbjct: 505 YKKGCDIIDPHFPESEVLEFPKAAQEAQMMEEAIEAAKGAEVTVMVLGGNELTVR-EDRS 563

Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
           R  L LPG Q +L+ ++ ++ K PV+LV++      I FA   T++ AI+ A +PGE GG
Sbjct: 564 RTSLDLPGRQKELLKKICQLGK-PVVLVMIDGRASSINFAA--THVPAIIHAWFPGEFGG 620

Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
           +AIA+ +FG +NPGGRL +T+     V  +P  + P +P          Y       LYP
Sbjct: 621 QAIAEALFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDESSETSVYG-----ALYP 672

Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           FG+GLSYT F+Y+ L+ + + Q                        GV  N         
Sbjct: 673 FGHGLSYTTFQYSDLAISPSKQ------------------------GVQGN-------IS 701

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
                +N+G  +G +VV +Y +       TY + + GF+R+ ++   +  + F     + 
Sbjct: 702 ISCTIKNIGQREGDEVVQLYLRDEVSSVTTYTQVLRGFERITLKPEASHTVHFELTP-QE 760

Query: 757 LNIVDYAANTLLPAGEHTIFVGN 779
           L I D   N  +  G   + +G+
Sbjct: 761 LGIWDKQMNFTVEPGMFKVMIGS 783


>gi|386718620|ref|YP_006184946.1| glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
 gi|384078182|emb|CCH12773.1| Glucan 1,4-beta-glucosidase [Stenotrophomonas maltophilia D457]
          Length = 897

 Score =  285 bits (729), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 174/445 (39%), Positives = 249/445 (55%), Gaps = 41/445 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D S  +  R   LV++MTLDEK  Q+ + A  + RLG+P Y+WW+E LHGV+  G   
Sbjct: 38  WLDVSASFEQRAASLVAQMTLDEKAAQMQNAAPAIERLGVPAYDWWNEGLHGVARAGQ-- 95

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-------GR-AGLTYW 165
                   AT FP  I   A+F+  L  ++   +S EARA ++        GR  GLT+W
Sbjct: 96  --------ATVFPQAIGLAATFDVPLMGQVATTISDEARAKHHQFLRQGAHGRYQGLTFW 147

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPN+N+ RDPRWGR  ET GEDP++  R  V +VRGLQ         D   R  K+ +  
Sbjct: 148 SPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQ-------GDDPVYR--KLDATA 198

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH A   V +    DR+HFDAR + +D+ +T+L  FE  VKEGD  +VM +YNRV G  +
Sbjct: 199 KHLA---VHSGPEADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYNRVYGESA 255

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
            A   LL   +R +W   GY+V+DC +I V +  H  +  ++E A A  ++ G +L+CGQ
Sbjct: 256 SASRFLLRDVLRRDWGFKGYVVSDCWAI-VDIWKHHHIVTTREAAAALAVRNGTELECGQ 314

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE---NI 402
            Y     +AV+QG + E +ID ++  L+T  MRLG FD  P+ V   +     ++   + 
Sbjct: 315 EYATLP-SAVRQGLISEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQAPSHD 372

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA +AA+  +VLLKND   LPL S  +K +AVVGP A+ T+A++GNY G P   ++ + 
Sbjct: 373 ALALKAAQASLVLLKND-GILPL-SRDIKRIAVVGPTADDTMALLGNYFGTPAAPVTILQ 430

Query: 463 GFSGYAN---VTYKTGCDDVACKSN 484
           G    A    V Y  G D V  + +
Sbjct: 431 GIREAAKGVEVRYARGVDLVEGRDD 455



 Score =  135 bits (341), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 87/284 (30%), Positives = 127/284 (44%), Gaps = 53/284 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A +AA+ AD  + + GL   VE E +          DR DL LP  Q  L+  +    K 
Sbjct: 622 ALDAAREADVVVFVGGLTGDVEGEEMTVNYPGFAGGDRTDLRLPAPQRTLLEALHATGK- 680

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV++V+   GG  IA     +++ AIL + YPG+ GG A+   +FG  NP GRLP+T+Y 
Sbjct: 681 PVVMVLT--GGSAIAVDWAQSHLPAILMSWYPGQRGGTAVGQALFGDVNPAGRLPVTFYK 738

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
                     S  L   D     GRTY+++ G  LYPFG+GLSYT+F Y  L        
Sbjct: 739 ---------ASEALPAFDDYAMEGRTYRYFRGTPLYPFGHGLSYTRFDYGTLRLD----- 784

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                        LR D      VD  N G+  G +VV +Y + 
Sbjct: 785 --------------------------AGSLRADGRLGVAVDVTNAGTRSGDEVVQLYVRR 818

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
               +   ++++ GFQR+ +  G ++ + F   A ++L   D A
Sbjct: 819 EHAGSGDAVQELRGFQRIHLAPGEHRTVTFTLEAAQALRHYDEA 862


>gi|167521708|ref|XP_001745192.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776150|gb|EDQ89770.1| predicted protein [Monosiga brevicollis MX1]
          Length = 614

 Score =  285 bits (729), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 190/574 (33%), Positives = 284/574 (49%), Gaps = 53/574 (9%)

Query: 88  VPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAV 147
           V R+GLP+Y+W   A+HGV +       D  +   TSFP  +    ++N S + ++G+ +
Sbjct: 72  VSRIGLPEYDWGMNAIHGVQSSCIKDD-DGTVYCPTSFPNPVNYGFTWNYSAYLELGRII 130

Query: 148 STEARAMYNLG-----------RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAV 196
             E RA++  G             GL  WSPNIN+AR P WGR  E PGEDPF+ G++  
Sbjct: 131 GVETRALWLAGAVEASTWSGRPHIGLDTWSPNINIARSPLWGRNQEVPGEDPFMNGQFGK 190

Query: 197 NYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEET 256
            Y  GLQ   G ++        L+     KH+ AY +++  G  R++F+A V+   + +T
Sbjct: 191 AYTLGLQ---GDDDTY------LQAIVTLKHWDAYSLEDSDGATRHNFNAIVSNFSLMDT 241

Query: 257 FLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM 316
           +   F + V EG A  VMCSYN VNGIP+CA P LL   +R  W   GY+ +D  +++ +
Sbjct: 242 YWPAFRVAVTEGKAKGVMCSYNAVNGIPTCAHP-LLRTVLRDLWKFDGYVSSDTGAVEDI 300

Query: 317 VDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVL 376
            DNHK+       A A       D+D G  Y       V +G  +  D+D +L+    + 
Sbjct: 301 SDNHKYTPSWATAACAAIRDGQTDIDSGAVYMKSLLQGVSEGHCRMEDVDNALRNTLRLR 360

Query: 377 MRLGFFD--GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVA 434
             LG FD   +  Y  +    + ++ +         E +VLL+N  N LPL  A    VA
Sbjct: 361 FELGLFDPVENQSYWHVPLAAVNTNASRATNMLHTLESMVLLQNKNNVLPL--ASNTKVA 418

Query: 435 VVGPHANATVAMIGNYAGIPCR------YMSP---IAGFSGYANVTYKTGCDDVACKSNN 485
           ++GPHA A   M+GNY G  C        +SP   +    G   VTY  G +   C S +
Sbjct: 419 LIGPHAKAQEDMVGNYLGQLCPDNNFDCVVSPHDALVSILGTDAVTYAPGTNVTTC-SQS 477

Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
            I  A   A  AD  +++ G+D S+EAES DR+ + LP  Q QL + +  V K P ++V+
Sbjct: 478 HIDEAVSVATAADVAVLMLGIDESIEAESNDRKSIDLPECQHQLASAIFAVGK-PTVIVL 536

Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
           ++ G   +A         AI+ AGYPG  GG AIA  + G+           + GDY+  
Sbjct: 537 LNGGM--LAIENEKQQADAIIEAGYPGFYGGTAIAQTLTGQNE---------HLGDYINW 585

Query: 606 LPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGY 639
           + ++ M +        PGRTY++Y   TL+ F +
Sbjct: 586 INMSDMEMT-----SGPGRTYRYYKNETLWAFHF 614


>gi|410097219|ref|ZP_11292201.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409224537|gb|EKN17469.1| hypothetical protein HMPREF1076_01379 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 805

 Score =  285 bits (728), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 249/863 (28%), Positives = 400/863 (46%), Gaps = 152/863 (17%)

Query: 8   LLCFSLSI-ALLVFSTNAVDANGSSSP-----VFVCDPGRFSKLGLQMSSFLFCDSSLPY 61
           L C S  +  LL  +T+++ A+   +P     ++      F+K G +    ++ D S P 
Sbjct: 12  LFCLSFGLLPLLNANTSSLQASKPDAPKNQKKIYQKGWIDFNKNGKKD---IYEDLSQPI 68

Query: 62  SIRVKDLVSRMTLDEKVQQLGD-FAHG-VPRLGLPQYEW-------------------W- 99
             RV+DL+ +MT++EK  QLG  + +G V +  LP  EW                   W 
Sbjct: 69  DKRVEDLLKQMTVEEKTCQLGTIYGYGAVLKDTLPTDEWKTRIWKDGIGNIDEHLNGEWK 128

Query: 100 -----------SEALHGV-------SNVG-PGTHFDDVIPG-----ATSFPTVILTTASF 135
                      +EA++ V       + +G P    ++ I G     +T FP  I    ++
Sbjct: 129 RTSLDFPYSNHAEAMNKVQAFFVEETRLGIPADLTNEGIRGLKHEKSTFFPAQIGQGCTW 188

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
           ++ L  +IG+    EA+A   LG   +  +SP ++++RDPRWGR  E+ GED ++ G   
Sbjct: 189 DKELIYEIGRITGEEAKA---LGYTNI--YSPILDLSRDPRWGRTVESYGEDSYLAGELG 243

Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY-HFDARVTEQDME 254
              V G+Q                +V S  KH+A Y +    G D Y   D   + Q++ 
Sbjct: 244 RQQVLGIQSN--------------RVVSTPKHFAIYGIPG-GGRDCYSRTDPHASPQEVH 288

Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
           E  L PF +  +E  A   MCS+N  NG P  A   L+ + +R +W   GY+V+D  +I 
Sbjct: 289 ELHLEPFRIAFQEAGALGTMCSHNDYNGTPVSASHYLMTELLRNQWGFKGYVVSDSWAID 348

Query: 315 VMVDNHKF--LADSKEDAVAQTLKAGLDL----DCGQYYTNFTGNAVQQGKVKETDIDKS 368
               N KF  + D++E+AVA  L AGL++    +  + +      A+Q+G V+E+ +D+ 
Sbjct: 349 ---KNVKFYHIVDTEEEAVASELNAGLNVRTFFEQSEVFIEALRRALQKGLVEESTLDQR 405

Query: 369 LKYLYTVLMRLGFFDGSPQYV---SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
           ++ +  V   LG FD    YV    L  + + SD+N E++  AARE IVLLKN+ NTLPL
Sbjct: 406 VREVLYVKFWLGLFDDP--YVKDTKLADKIVNSDKNREVSLRAARESIVLLKNENNTLPL 463

Query: 426 NSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCD---- 477
            S  +K +AV+GP A+   ++   Y       ++ + G         N+ Y  GC+    
Sbjct: 464 -SKTLKNIAVIGPQADEVKSLTSRYGSHNPNVITGLQGLKNLLGENVNLMYAKGCNVRDK 522

Query: 478 ----------DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
                     +++ K    I  A E AK A+  II  G D     ES  R +L L G Q 
Sbjct: 523 NFPQSDVMYFELSDKEKEEIDEAVEIAKKAEVAIIYVGDDFRTIGESRSRVNLDLSGRQK 582

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
           +L+  V +    PV+LV+ +  G  +     + N+ AI+ A YPGE  G+A+A+V+FG +
Sbjct: 583 ELVRAV-QATGTPVVLVLFN--GRPVTLNWEDANLPAIVEAWYPGEFSGQAVAEVLFGDY 639

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
           NPGG+L  T+     V  +P  + P +P  +    G+ +   +G  LYPFGYGLSYT F+
Sbjct: 640 NPGGKLSTTFPKS--VGQIPW-AFPFKPNAT----GKGFARVDG-ELYPFGYGLSYTTFE 691

Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
            +       +Q +  K+     L  T                            +N GS 
Sbjct: 692 IS------NLQPSATKIADGDTLTVTCKV-------------------------KNTGSV 720

Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
            G +VV +Y        + + K++ GF+RV +  G  K + F  N  ++  + +     +
Sbjct: 721 KGDEVVQLYLNDETSSISRFEKELCGFERVALEPGEEKTVTFKVNR-RAYGMYNDKNEFV 779

Query: 768 LPAGEHTIFVGNGGVSFPIHLNF 790
           +  G+  +F GN   S P++  F
Sbjct: 780 VEPGKFFLFAGNSSKSTPLNAEF 802


>gi|293370605|ref|ZP_06617157.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292634339|gb|EFF52876.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 861

 Score =  285 bits (728), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 172/451 (38%), Positives = 244/451 (54%), Gaps = 43/451 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+SL    R +DL+ R+TL+EKV  + + +  +PRLG+ +YEWW+EALHGV   G   
Sbjct: 26  YQDTSLTAEQRAEDLLPRLTLEEKVSLMQNASPAIPRLGIKEYEWWNEALHGVGRAGL-- 83

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-GRA-------GLTYW 165
                   AT FP  I   ASFN+SL  ++  A S EAR    + G +       GLT+W
Sbjct: 84  --------ATVFPQSIGMGASFNDSLLYEVFNATSDEARVKSRIFGESGVLKRYQGLTFW 135

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE--GHENATDLNSRPLKVSS 223
           +PN+N+ RDPRWGR  ET GEDP++ G+  +  VRGLQ  E  G++          K+ +
Sbjct: 136 TPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEDAGYD----------KLHA 185

Query: 224 CCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           C KH+A +    W   +R+ FDA  +  +D+ ET+L  F+  V++     VMC+YNR  G
Sbjct: 186 CAKHFAVHSGPEW---NRHSFDAENIDPRDLWETYLPAFKDLVQKAHVKEVMCAYNRFEG 242

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD--NHKFLADSKEDAVAQTLKAGLD 340
            P C   +LL Q +R EW   G +V+DC +I        H    D KE A A  ++ G D
Sbjct: 243 EPCCGSNRLLMQILRDEWGYKGIVVSDCGAISDFYRPGTHGTHPD-KEHASAAAVRTGTD 301

Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
           L+CG  Y +   +AV+ G + E +ID SLK L T    LG  D    +  +    + S E
Sbjct: 302 LECGSEYASL-ADAVKAGLIDEKEIDISLKRLLTARFELGEMDEQSAWSEIPTSVLNSKE 360

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
           +  LA   ARE +VLL+N  N LPLN+     VAV+GP+AN +V   GNY GIP   ++ 
Sbjct: 361 HQALALRMARESLVLLQNKNNILPLNTH--LKVAVMGPNANDSVMQWGNYNGIPAHTVTL 418

Query: 461 IAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
           +           + Y+ GCD V  K+  S+F
Sbjct: 419 LEAVRAKLPEGQIIYEPGCDRVDGKTLQSLF 449



 Score =  103 bits (256), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 85/297 (28%), Positives = 127/297 (42%), Gaps = 56/297 (18%)

Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
           AD  +   G+  S+E E +          DR D+ LP  Q    + +  + K    +V +
Sbjct: 598 ADVILFAGGISPSLEGEEMPVEVPGFKGGDRTDIELPDVQR---DLLKALKKAGKKVVFI 654

Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           +  G  I      T  +AIL A YPG+ GG AI D ++G++NPGGRLP+T+Y    V  L
Sbjct: 655 NYSGSAIGLVPETTTCEAILQAWYPGQAGGTAIVDALWGEYNPGGRLPVTFYKD--VNQL 712

Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
           P         +     GRTY++     L+PFG+GLSYT F Y     +K      N +  
Sbjct: 713 P-------DFEDYSMKGRTYRYMQQQPLFPFGHGLSYTDFTYGEAKLSK------NTIAK 759

Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
             N+  T                         +   NVG  DG +VV VY + P +    
Sbjct: 760 GENVVLT-------------------------IPVSNVGQRDGEEVVQVYLRRPGDKEGP 794

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
               +  F+RV + AG+ + +       ++    D  +NT+ P  E T  +  GG S
Sbjct: 795 RYT-LRAFKRVHIPAGKTESVAIPLTG-ENFEWFDVESNTMRPL-EGTYELLYGGTS 848


>gi|383643328|ref|ZP_09955734.1| glycoside hydrolase family 3 [Sphingomonas elodea ATCC 31461]
          Length = 799

 Score =  284 bits (727), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 219/687 (31%), Positives = 321/687 (46%), Gaps = 101/687 (14%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P      EALHG+           V PGATSFP  I   +SF+  L + I    + 
Sbjct: 145 RLGIPML-MHEEALHGL-----------VAPGATSFPQSIALASSFDPKLVENIFSMAAK 192

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           EARA     R      +P ++VARDPRWGRI ET GEDP++V +  +  +RG Q      
Sbjct: 193 EARA-----RGANLVLAPVVDVARDPRWGRIEETYGEDPYLVTQMGLAAIRGFQ------ 241

Query: 210 NATDLNSRPLKVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
             T +  +  KV    KH   +   +N   V      A + E+ + E F  PFE  VK  
Sbjct: 242 -GTTMPLKSDKVFITLKHMTGHGQPENGTNVG----PASLGERTLREDFFPPFEAAVKTL 296

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
              SVM SYN ++GIPS A+  LL   +RGEW   G +V+D  +I+ ++  H    D K 
Sbjct: 297 PVMSVMASYNEIDGIPSHANKWLLTDVLRGEWGFQGAVVSDYFAIRELITRHHLFKDPK- 355

Query: 329 DAVAQTLKAGLDLDC--GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
           DA  + L AG+D++   G+ YT+     V+QG+V + +ID +++ +  +    G F+   
Sbjct: 356 DAAQRALDAGVDVETPDGEAYTHLV-QLVKQGRVSQGEIDNAVRRVLRMKFEGGLFENPY 414

Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
             V L      + E I L+ +AARE IVLLKN Q  LPL++  +K +AV+G HA  T   
Sbjct: 415 PEVKLAAARTNTPEAIALSRQAARESIVLLKNAQGLLPLDARGIKRMAVIGTHAKDTP-- 472

Query: 447 IGNYAGIPCRYMSPIAGFS----GYANVTYKTGCD-------------DVACKSNNSIFA 489
           IG Y+ +P   +S + G      G   V Y  G                V    N+ + A
Sbjct: 473 IGGYSDLPNHVVSVLEGMQAEGKGKFAVDYAEGIRITNHREWSKDAVAQVPASVNDQLRA 532

Query: 490 -ASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQTQLINQVAEVAKGPVI 542
            A E AK AD  +++ G + +V  E+       D E L LPG Q QL  ++  + K PV+
Sbjct: 533 QALETAKNADVVVLVLGGNEAVSREAWADNHLGDSETLDLPGPQDQLAKELIALGK-PVV 591

Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
           +++++  G   A         A++   Y GE+ G AIADVVFG++NPGG+LP++      
Sbjct: 592 VILLN--GRPYAVNYLAEKAPALIEGWYLGEQTGNAIADVVFGRYNPGGKLPVSVARS-- 647

Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
           V  LP+        +      R Y F +   LYPFGYGLSYT F  +             
Sbjct: 648 VGQLPIY------YNKKPSARRGYLFGDTSPLYPFGYGLSYTTFDIS------------- 688

Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
                              P +    +   D    +VD  N G   G +VV ++      
Sbjct: 689 ------------------APRLGTPTIGIADKASVEVDVTNTGKVAGDEVVQLFVHDDEA 730

Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKF 749
                + ++  F+RV ++ G  K ++F
Sbjct: 731 SVTRPVIELKRFERVTLKPGEKKTVRF 757


>gi|393786911|ref|ZP_10375043.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
           CL02T12C05]
 gi|392658146|gb|EIY51776.1| hypothetical protein HMPREF1068_01323 [Bacteroides nordii
           CL02T12C05]
          Length = 863

 Score =  284 bits (727), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 166/439 (37%), Positives = 233/439 (53%), Gaps = 38/439 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           F +  LP   RV+DLV R+TL EKV  + D++  VPRLG+ QY WW+EALHGV   G   
Sbjct: 24  FNNPDLPVEERVEDLVRRLTLHEKVLLMCDYSSSVPRLGIKQYNWWNEALHGVGRAGL-- 81

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYW 165
                   AT FP  I   A+F++   K++ + VS EARA Y+            GLT+W
Sbjct: 82  --------ATVFPQAIGMAATFDDCAVKQVFECVSDEARAKYHHSENKDGSERYRGLTFW 133

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PN+N+ RDPRWGR  ET GEDP++  R  +  VRGLQ            S+  K+ +C 
Sbjct: 134 TPNVNIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQGPS--------ESKYDKLHACA 185

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KHYA +    W   +R+ FD   ++ +D+ ET+L  F+  V++G    VMC+YNR  G P
Sbjct: 186 KHYALHSGPEW---NRHRFDVENISPRDLWETYLPAFKALVQQGGVKEVMCAYNRFEGEP 242

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ-VMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
            C   +LL   +R EW   G +V+DC +I    +  H     +KE AVA  +KAG DLDC
Sbjct: 243 CCGSNRLLYNILREEWGFDGLVVSDCGAISDFYLKGHHETHSTKESAVAAAVKAGTDLDC 302

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDEN 401
           G  Y +    AV++G + E  ID SL  L      LG  D      +  +    + S+++
Sbjct: 303 GVDYQSLE-KAVEKGIITEKQIDVSLSRLLKARFELGLMDEEHLVSWSDIPYTVVDSEKH 361

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
              A E AR+ + LLKN   TLPL S     + V+GP+AN ++ M GNY G P   ++ +
Sbjct: 362 RAKALEVARKSMTLLKNKNGTLPL-SKHCGKIVVIGPNANDSIMMWGNYNGFPSHTVTIL 420

Query: 462 AGFSGY---ANVTYKTGCD 477
            G +       V Y  GC+
Sbjct: 421 EGITHKLDAGQVIYDKGCE 439



 Score =  116 bits (291), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 91/305 (29%), Positives = 143/305 (46%), Gaps = 59/305 (19%)

Query: 488 FAASEAAKT---ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVA 534
           F  +E A T   A+A + + G+   VE E L          DR  + LP  Q  L+ ++ 
Sbjct: 588 FNPNEIAATVSDAEAIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELY 647

Query: 535 EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
           +  K P+IL++ S   + ++ AE +    AI+ A YPG+ GG A+ADV+FG +NP GRLP
Sbjct: 648 KTGK-PIILILCSGSAIGLS-AEVDL-ADAIIQAWYPGQAGGTAVADVLFGDYNPAGRLP 704

Query: 595 ITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
           +T+Y          T+  L   +     GRTY+++ G  L+PFGYGLSYT F+       
Sbjct: 705 VTFYK---------TTEQLPDFEDYNMQGRTYRYFKGEALFPFGYGLSYTSFE------- 748

Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
                 + K Q           SK R        +  ++     +  +N G  DG +V+ 
Sbjct: 749 ------IGKAQ----------LSKKR--------IHANESVNLDLWIKNTGERDGEEVIQ 784

Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL-LPAGEH 773
           VY +   +     +K +  F+RV V++G  K+I  +     S    D   N + + AGE+
Sbjct: 785 VYIRKLKDKEGP-LKTLRAFKRVHVKSGEKKQIS-IHLPNDSFEFFDPEFNVMRVMAGEY 842

Query: 774 TIFVG 778
            +  G
Sbjct: 843 EVLYG 847


>gi|217967241|ref|YP_002352747.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
 gi|217336340|gb|ACK42133.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
           DSM 6724]
          Length = 762

 Score =  284 bits (727), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 235/792 (29%), Positives = 365/792 (46%), Gaps = 146/792 (18%)

Query: 64  RVKDLVSRMTLDEKVQQL------------GDFAH---------GV-------------P 89
           +V+DL+S+MTL+EK+ QL            G+F+          G+             P
Sbjct: 9   KVRDLISKMTLEEKIAQLQSVFGKELVDESGNFSEEKAEKLLKNGIGQISRVAGEKGMDP 68

Query: 90  RLGLPQYEWWSEALHGVSNVG-PGTHFDDVI-----PGATSFPTVILTTASFNESLWKKI 143
              +       + L   + +G P    ++ +      GAT FP  I   ++F   L +++
Sbjct: 69  ERAVELANKIQKFLKEKTRLGIPAIIHEECLSGFMAKGATVFPQAIGMASTFEPELIRRV 128

Query: 144 GQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
              +    RA  N+ +      SP +++ RDPRWGR  ET GEDP++V R A  YV+GLQ
Sbjct: 129 SDVIRQHMRAA-NVHQG----LSPVLDIPRDPRWGRTEETFGEDPYLVSRMAAEYVKGLQ 183

Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
             +  E           + +  KH+ AY +       R    A+V E+++ E FL PFE+
Sbjct: 184 GEDWREG----------IIATVKHFTAYGISEGA---RNLGPAKVGERELREVFLFPFEV 230

Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
            +KEG A S+M +Y+ ++G+P  +   LL + +R EW   GY+V+D  +I+++ + H+  
Sbjct: 231 AIKEGQAGSLMNAYHEIDGVPCASSKFLLTKILRWEWGFKGYVVSDYIAIRMLENFHRVA 290

Query: 324 ADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMR 378
            D+KE AV   L+AG+D+     DC   Y      AV++G + E  I+ S++ +      
Sbjct: 291 KDAKEAAVL-ALEAGIDIELPSVDC---YGEPLIQAVKEGLISEEVINASVERVLRAKFM 346

Query: 379 LGFFDGSPQYVSLGKQDICSD-ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG 437
           LG FDG  +       DI    E  EL+ E AR  IVLLKND   LPL S  ++TVAV+G
Sbjct: 347 LGLFDGDLEKDPKKVYDIFDKPEFRELSREVARRSIVLLKND-GILPL-SKNIRTVAVIG 404

Query: 438 PHANATVAMIGNY---AGIP----------------CRYMSPIAGF----SGYANVTYKT 474
           P+A+    + G+Y   A IP                 R +S + G     S    V Y  
Sbjct: 405 PNADNPRNLHGDYSYTAHIPSVSETLEGVKIPEECAVRTVSILEGIKNKVSAETQVLYAK 464

Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAG-----LDLSVEAESLDREDLWLPGYQTQL 529
           GC+ +   S      A E AK AD  I + G         +  E  DR  L L G Q  L
Sbjct: 465 GCE-ILSDSKEGFDEAIEIAKRADVIIAVMGEESGLFHRGISGEGNDRTTLELFGIQRDL 523

Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
           + ++ ++ K P++LV+++  G   A    + N+ AIL A YPGEEGG A+ADV+FG +NP
Sbjct: 524 LRELHKLGK-PIVLVLVN--GRPQALKWEHENLNAILEAWYPGEEGGDAVADVIFGDYNP 580

Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG--RTYKFYNGPTLYPFGYGLSYTQFK 647
            G+LPI++         P  +  + PV     P     Y   +   LYPFG+GLSYT F+
Sbjct: 581 SGKLPISF---------PAVTGQV-PVYYNRKPSAFTDYVEESAKPLYPFGHGLSYTTFE 630

Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
           Y+ L              H   +N                     +  E     +N G  
Sbjct: 631 YSNLKI------------HPEKVNAL-------------------EKVEISFTIKNTGVR 659

Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
           +G +VV +Y           +K++ GF+++ ++ G +KR+ F+    + L   D     +
Sbjct: 660 EGEEVVQLYVHDQVASLERPVKELKGFKKIHLKPGESKRVTFILYP-EQLAFYDEFMRFV 718

Query: 768 LPAGEHTIFVGN 779
           +  G   I +G+
Sbjct: 719 VEKGIFEIMIGS 730


>gi|182413194|ref|YP_001818260.1| glycoside hydrolase family 3 [Opitutus terrae PB90-1]
 gi|177840408|gb|ACB74660.1| glycoside hydrolase family 3 domain protein [Opitutus terrae
           PB90-1]
          Length = 859

 Score =  284 bits (726), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 246/884 (27%), Positives = 386/884 (43%), Gaps = 140/884 (15%)

Query: 1   MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFL------- 53
           M + V   +C   ++A LVFS + + A   S+ +FV          +    ++       
Sbjct: 1   MLRFVHPTVC---TLAALVFSASPLLAAAPSADLFVPSATPPLAAAVYHDGWIDLNKNGA 57

Query: 54  ---FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WSEAL 103
              + DSS P   R++DL++RM+L+EK  QL    +G PR+     P   W    W + +
Sbjct: 58  RDPYEDSSRPIDARIEDLLARMSLEEKTAQLTTL-YGFPRVLKDERPTSAWREAMWKDGI 116

Query: 104 -----HGVSNVGPGTHFDDVI--------------------------------------- 119
                H   N G   +  D +                                       
Sbjct: 117 GNIDEHLNGNTGWTNNLADPVHDLPWSLHARALNEVQRWFIEQTRLGIPVDFTNEGIRGL 176

Query: 120 --PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPR 176
               ATSFP  +   ++++ +L ++IG+    EARA+      G T  +SP +++ARDPR
Sbjct: 177 LHSKATSFPAELAVASTWDPALVREIGRITGREARAL------GYTNIYSPVLDLARDPR 230

Query: 177 WGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNW 236
           WGR  ET GEDPF+VG   V  VRGLQ                 V S  KH+A Y +   
Sbjct: 231 WGRTIETYGEDPFLVGTLGVEQVRGLQAEH--------------VVSTLKHFAVYSIPKG 276

Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
                   D + T ++++  FL PF   ++E  A  VM SYN  +G+P       L++ +
Sbjct: 277 GRDGEARTDPQATWREVQTIFLEPFRRAIREAGALGVMASYNDYDGVPVEGSALFLSEIL 336

Query: 297 RGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA-- 354
           RG+W   GY+V+D  +++ +   H+ +A +  DA+ Q ++AGL++      TNFT  A  
Sbjct: 337 RGQWGFRGYVVSDSAAVEFIHSKHR-VAPTPADAIRQAVEAGLNI-----RTNFTPPAAY 390

Query: 355 -------VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELA 405
                  V+ GK+    ID  ++ +  V  +LG FD  P        D  + + E++ +A
Sbjct: 391 AEPLRQLVRDGKLAMATIDARVRDVLRVKFQLGLFD-RPYVADPAAADRVVRAPEHLVVA 449

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
             A RE IVLLKN+   LPL+ AK++ V V GP A+   A    Y      +++P+ G  
Sbjct: 450 QRAGREAIVLLKNEPALLPLDRAKLQRVLVAGPLADDAHAWWSRYGAQRLDFVTPLPGLR 509

Query: 466 GY----ANVTYKTGC--------------DDVACKSNNSIFAASEAAKTADATIILAGLD 507
                   V Y  G               D  + +    I AA  AA+  D  I + G  
Sbjct: 510 AKLGAAVEVRYAKGVEAKDAAWPASDVLKDPPSAEVRAGIEAAVAAAQNVDVIIAVLGET 569

Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
             +  ES  R  L LPGYQ +L+  +    K P++LV+ +   + + +A  +      LW
Sbjct: 570 DELCRESSSRISLALPGYQQELLEALHATGK-PLVLVLSNGRPLSVVWAARHVPAIVELW 628

Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
             +PGE+GG A+A V+ G  NP GRLPIT+     V  LP  + P  P    G   R + 
Sbjct: 629 --FPGEDGGAALAAVLLGDANPSGRLPITFPQS--VGQLPY-NFPAHP----GSQARDFG 679

Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
              G +L+PFG+GLSYT F+Y+ L  T   ++ ++        +     S +R     V+
Sbjct: 680 QVEG-SLFPFGHGLSYTTFRYSDLRITPE-RIPVDGFGAAGGGDPGLRGSASRATPYSVS 737

Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
            +     F    D  N G+  G +VV +Y +       TY   + GF RV +  G  K +
Sbjct: 738 TV---PEFTITCDVTNTGTRAGDEVVQLYLRDDYSSVTTYDIALRGFARVTLAPGETKPV 794

Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
            F  +    L + +   + ++  G  T+ +G       +   F 
Sbjct: 795 TFTLHRAH-LELYNRDGDWVVEPGRFTVMLGASSADIRLRGTFT 837


>gi|393782428|ref|ZP_10370612.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
           CL02T12C01]
 gi|392673256|gb|EIY66719.1| hypothetical protein HMPREF1071_01480 [Bacteroides salyersiae
           CL02T12C01]
          Length = 596

 Score =  284 bits (726), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 198/633 (31%), Positives = 313/633 (49%), Gaps = 78/633 (12%)

Query: 162 LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--L 219
           +TYWSPN+N+ RDPRWGR  ET GEDP++       YVRGLQ            + P  L
Sbjct: 1   MTYWSPNVNIFRDPRWGRGQETYGEDPYLTAEIGKAYVRGLQ-----------GNDPFFL 49

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           K ++C KHYA   V +     R+ F+A  +++D+ ET+L  FE  VKE    +VM +YNR
Sbjct: 50  KAAACAKHYA---VHSGPEALRHEFNASPSKRDLFETYLPAFEALVKEAKVEAVMGAYNR 106

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
           V G  +     LL   +R +W   G++V+DC ++  +   HK   D  E A A  LK+GL
Sbjct: 107 VYGESASGSFFLLTDILRKKWGFKGHVVSDCGAVDDIYGGHKIAKDVAE-ASAIALKSGL 165

Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF---DGSPQYVSLGKQDI 396
           +L+CG  +      A+++  + E D+D +L  L    ++LG     D SP Y ++    I
Sbjct: 166 NLNCGGSFHALK-EALERKLITEVDLDNALMPLMMTRLKLGNLTDDDESP-YKNISDSVI 223

Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
            S  +  +A E A++ +VLLKN+ +TLPL    VKT+ V GP+A  T  M+GNY G+  R
Sbjct: 224 ASYTHAMVAREVAQKSMVLLKNNNHTLPLKK-DVKTIFVTGPYAADTYVMMGNYYGVSPR 282

Query: 457 YMSPIAGF----SGYANVTYKTGCDDVACKSNNSIFAASE--AAKTADATIILAGLDLSV 510
             + + G     SG  ++ YK G        N + +   E  AA+ A   I L+G+D   
Sbjct: 283 SNTFLQGIAAKVSGGTSINYKIGILPTTPNMNPADWTVGEVRAAEVAIVVIGLSGIDEGE 342

Query: 511 EAESL------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           E +++      D+++L LP +Q + +  ++      ++ VI   GG  I   E +    A
Sbjct: 343 EGDAIASSHRGDKQNLKLPEHQLKFLRDISRNRWNKLVTVI--TGGSPIDLEEVSELSDA 400

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           ++ A YPG+EGG A+ D++FG  +  GR+P+T+         P+ S  L   +     GR
Sbjct: 401 VIMAWYPGQEGGMALGDLLFGDVSFSGRMPVTF---------PINSDWLPAFEDYNMQGR 451

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           TYK+     +YPFGYGL+Y    Y+                  + LN   D  +      
Sbjct: 452 TYKYMTDNIMYPFGYGLTYGDVSYS----------------DVKILNPKYDGKQE----- 490

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
                        +   +N G+ +  +VV +Y   P     T I  +IGF+RV + +  +
Sbjct: 491 ----------IHVQATLRNNGNNEVEEVVQLYLSAPGAGVITPISSLIGFKRVTLESHLS 540

Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
           + ++F+    +   +++  +  LL  G++TI V
Sbjct: 541 QTVEFIIKPDQLKMVMEDGSKNLL-KGKYTIIV 572


>gi|409197445|ref|ZP_11226108.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
           21150]
          Length = 737

 Score =  283 bits (724), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 232/751 (30%), Positives = 357/751 (47%), Gaps = 112/751 (14%)

Query: 33  PVFVCDPGRFSKLGLQ-MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL 91
           P  +   G FS L     +S+ F ++ L    RV DL+SRMTL+EKV  L      VPRL
Sbjct: 20  PYLLILIGIFSLLNASAQTSYPFQNADLDMETRVDDLLSRMTLEEKVSALST-DPSVPRL 78

Query: 92  GL---PQYEWWSEALHGVSNVGPGT---HFDDVIPGATSFPTVILTTASFNESLWKKIGQ 145
           G+   P  E      HGV+  GP       D+ +P  T FP      A++N  L +K G+
Sbjct: 79  GIKGAPHIE----GYHGVAMGGPANWAPKGDERVP-TTQFPQAYGMGATWNPELIRKAGE 133

Query: 146 AVSTEARAMYN---LGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGL 202
             S EAR ++    + + GL   +PN ++ RDPRWGR  E  GEDPF+VG  +  + +GL
Sbjct: 134 IESIEARYIFQNPEISKGGLVVRAPNADLGRDPRWGRTEEVLGEDPFLVGTLSTAFTKGL 193

Query: 203 QDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
           Q           + +  + +S  KH+ A   +N +     +FD ++  +    TF R   
Sbjct: 194 QGD---------DEKYWRTASLLKHFLANSNENTRDSSSSNFDTQLFYEYYGATFRR--- 241

Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
             + EG +++ M +YN VNG+P+   P +  +     W ++G I  D     ++V  HK 
Sbjct: 242 -AILEGGSNAYMTAYNAVNGVPAHIHP-MHKEISMARWGVNGIICTDGGGYTLLVRAHKA 299

Query: 323 LADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
             D    A    +KAGL+     Y     G A+  G + E D+D+ LK +Y V+++LG  
Sbjct: 300 YDDYYR-AAEGVIKAGLNQFLDNYREGVWG-ALAHGYLAEEDLDEVLKGVYRVMIKLGQL 357

Query: 383 DGSPQ----YVSLGKQD----ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVA 434
           D  PQ    Y S+G+        S E+ E A + ARE +VLLKN++ TLPL   ++  VA
Sbjct: 358 D--PQDKVPYASIGRDGKPAPWTSPEHQEAALQMARESVVLLKNEKQTLPLAGDELGKVA 415

Query: 435 VVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAA 494
           V+G H   T+ ++  Y+G+P    +P+ G      +  K G D V    +N   AA EAA
Sbjct: 416 VIG-HLADTI-LLDWYSGMPPFMSTPLDG------IKEKMGADKVLFAPDNDYNAAVEAA 467

Query: 495 KTADATIILAG-------------LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPV 541
             AD  I++ G              D  +  E++DR+ L L     + + Q    A    
Sbjct: 468 SQADVAIVVLGNHPYCDSERWGDCPDPGMGREAVDRKTLRL---TDEWLAQRVFEANPNT 524

Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
           ILV+ S+    I +++ N  + AI+   + G+  G A+ADV+FG +NPGG+L  TW   +
Sbjct: 525 ILVLQSSFPYGINWSQEN--LPAIVHITHNGQSTGTALADVLFGDYNPGGKLTQTWPKSE 582

Query: 602 YVQMLPLTSMPLRPVDSLGY---PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
             + LP         D + Y    G TY ++NG  LYPFG+GLSYT F++          
Sbjct: 583 --EQLP---------DMMEYDIRKGHTYMYFNGEPLYPFGFGLSYTSFEW---------- 621

Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSK 718
                     ++  T  + K+    V+V            V  +NVG   G +V+ +Y+ 
Sbjct: 622 ---------VDMEITGSSVKSNEEEVIVT-----------VKLKNVGQVKGDEVIQLYAS 661

Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
            P   +    K + GF+RV +  G +K ++ 
Sbjct: 662 FPETSSRRPDKALKGFKRVTLEPGESKNVQI 692


>gi|383125190|ref|ZP_09945844.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
 gi|251838523|gb|EES66609.1| hypothetical protein BSIG_4346 [Bacteroides sp. 1_1_6]
          Length = 853

 Score =  283 bits (724), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 166/430 (38%), Positives = 248/430 (57%), Gaps = 41/430 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ + + P   RV DL+SR+T++EK+  L   + G+PRLG+ +Y   +EALHGV  V PG
Sbjct: 29  LYKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG 86

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
                     T FP  I   A++N  L K++   +S EARA +N    G          L
Sbjct: 87  RF--------TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVL 138

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+WSP +N+ARDPRWGR  ET GEDPF+ G     +V GLQ  + H          LK+ 
Sbjct: 139 TFWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPHY---------LKIV 189

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S  KH+AA + ++    +R+  + +++E+ + E +   FEMCVKEG A+S+M +YN +N 
Sbjct: 190 STPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALND 245

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P   +P LL + +R +W   GY+V+DC    ++V+ HK++  +KE A   ++KAGLDL+
Sbjct: 246 VPCTLNPWLLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKAGLDLE 304

Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
           CG   Y     NA +Q  V + DID +  ++ T  M+LG FD   +  Y  +    I S 
Sbjct: 305 CGDDVYDGPLLNAYKQYMVSDADIDSAAYHVLTARMKLGLFDSGERNPYTKISPSVIGSK 364

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           E+ ++A +AAR+ IVLLKN +N LPLN+ K+K++AVVG   NA     G+Y+G P   + 
Sbjct: 365 EHQQIALDAARQCIVLLKNQKNRLPLNADKLKSIAVVG--INAGKCEFGDYSGAPV--VE 420

Query: 460 PIAGFSGYAN 469
           P++   G  N
Sbjct: 421 PVSILQGIRN 430



 Score =  132 bits (332), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 90/293 (30%), Positives = 142/293 (48%), Gaps = 54/293 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A +A +  +  + + G++ S+E E  DR D+ LP  Q + + ++ +V   P I+V++ AG
Sbjct: 597 AGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAG 654

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
              +A    + +I AI+ A YPGE+GG A+A+V+FG +NP GRLP+T+Y         L 
Sbjct: 655 S-SLAINWMDEHIPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS-------LD 706

Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
            +P  P D      GRTYK++ G  LYPFGYGLSY+ F Y+ L                 
Sbjct: 707 ELP--PFDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFTYSDLQVK-------------- 750

Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF--QNVGSTDGSDVVIVYSKPPAEIAAT 726
                                  D   E  V F  +N G  +G +V  VY + P      
Sbjct: 751 -----------------------DGVGEVTVSFRLKNTGKRNGDEVAQVYVRIPETGGIV 787

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT-LLPAGEHTIFVG 778
            +K++ GF+RV +++G ++R++   N  + L   D      ++P G   + VG
Sbjct: 788 PLKELKGFRRVPLKSGESRRVEIKLNK-EQLRYWDVEKGQFVVPKGAFDVMVG 839


>gi|336417083|ref|ZP_08597412.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
           3_8_47FAA]
 gi|335936708|gb|EGM98626.1| hypothetical protein HMPREF1017_04520 [Bacteroides ovatus
           3_8_47FAA]
          Length = 850

 Score =  283 bits (724), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 166/427 (38%), Positives = 248/427 (58%), Gaps = 41/427 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ + + P   RV DL+SR+T++EK+  L   + G+PRLG+ +Y   +EALHGV  V PG
Sbjct: 26  LYKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG 83

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
                     T FP  I   A++N  L K++   +S EARA +N    G          L
Sbjct: 84  RF--------TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVL 135

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+WSP +N+ARDPRWGR  ET GEDPF+ G     +V+GLQ           + R LK+ 
Sbjct: 136 TFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGD---------DPRYLKIV 186

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S  KH+AA + ++    +R+  + +++E+ + E +   FEMCVKEG A+S+M +YN +N 
Sbjct: 187 STPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALND 242

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P   +  LL + +R +W   GY+V+DC    ++V+ HK++  +KE A   +++AGLDL+
Sbjct: 243 VPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIQAGLDLE 301

Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
           CG   Y  +  NA +Q  V + DID +  ++ T  M+LG FDG+ +  Y  +    I S 
Sbjct: 302 CGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGTERNPYTRISPSVIGSK 361

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           E+ ++A +AARE IVLLKN  N LPLN  KVK++AVVG   NA     G+Y+G P   + 
Sbjct: 362 EHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVG--INAGKCEFGDYSGAPV--VD 417

Query: 460 PIAGFSG 466
           P++   G
Sbjct: 418 PVSILQG 424



 Score =  128 bits (322), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 88/290 (30%), Positives = 139/290 (47%), Gaps = 48/290 (16%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A +A +  +  + + G++ S+E E  DR D+ LP  Q + + ++ +V   P I+V++ AG
Sbjct: 594 AGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAG 651

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
              +A    + +I AI+ A YPGE+GG A+ADV+FG +NP GRLP+T+Y    +  LP  
Sbjct: 652 S-SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS--LDELPAF 708

Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
                  D     GRTYK++ G  LYPFGYGLSY+ FKY+ L                  
Sbjct: 709 D------DYDITKGRTYKYFKGDVLYPFGYGLSYSSFKYSDLK----------------- 745

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
                D + T                      +N G   G +V  VY + P       IK
Sbjct: 746 ---VKDGANT---------------VSVSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIK 787

Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA-ANTLLPAGEHTIFVG 778
           ++ GF+R+ +++G ++ ++   +  + L   D      ++P G   I VG
Sbjct: 788 ELKGFRRIPLKSGESRVVEIELDK-EQLRYWDAGLGRFIVPQGAFDIMVG 836


>gi|206901280|ref|YP_002250567.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
 gi|206740383|gb|ACI19441.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
          Length = 762

 Score =  283 bits (724), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 230/766 (30%), Positives = 363/766 (47%), Gaps = 145/766 (18%)

Query: 62  SIRVKDLVSRMTLDEKVQQL------------GDFAH---------------------GV 88
           S +VKDL+++MTL+EK+ QL            G+F+                      GV
Sbjct: 7   SKKVKDLIAKMTLEEKIAQLQAVYGKDLVDENGNFSEEKAEKLLKNGIGQISRVAGERGV 66

Query: 89  -PRLGLPQYEWWSEALHGVSNVG-PGTHFDDVIPG-----ATSFPTVILTTASFNESLWK 141
            P   +       + L   + +G P    ++ + G     AT FP  I   ++F   L +
Sbjct: 67  SPEKAVELANKIQKFLKEKTRLGIPAIIHEECLSGFMAQGATVFPQAIGMASTFEPELIR 126

Query: 142 KIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRG 201
           ++   +    +A  N+ +      SP +++ RDPRWGR  ET GEDP++V R A  YV+G
Sbjct: 127 RVSDVIRQHMKAA-NVHQG----LSPVLDIPRDPRWGRTEETFGEDPYLVSRMATEYVKG 181

Query: 202 LQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPF 261
           LQ  +  E           + +  KH+ AY +       R    A+V E+++ E FL PF
Sbjct: 182 LQGEDWREG----------IVATVKHFTAYGISEGA---RNLGPAKVGERELREVFLFPF 228

Query: 262 EMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHK 321
           E+ +KEG A S+M +Y+ ++G+P  +   LL + +R EW   GY+V+D  +++++ + HK
Sbjct: 229 EVAIKEGQAGSLMNAYHEIDGVPCASSKFLLTKILRWEWGFKGYVVSDYIAVRMLENFHK 288

Query: 322 FLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVL 376
              D+KE AV   L+AG+D+     DC   Y      AV++G + E  I+ S++ +    
Sbjct: 289 VARDAKEAAVL-ALEAGIDIELPSVDC---YGEPLIQAVKEGLISEEVINASVERVLRAK 344

Query: 377 MRLGFFDGSPQYVSLGKQDICSD-ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAV 435
             LG FD + +       ++    E  +L+ E AR  IVLLKND  TLPL S  +K VAV
Sbjct: 345 FMLGLFDDNLEKDPKKVYEVFDKPEFRDLSREVARRSIVLLKND-GTLPL-SKNLKKVAV 402

Query: 436 VGPHANATVAMIGNY---AGIP----------------CRYMSPIAGF----SGYANVTY 472
           +GP+A+    + G+Y   A IP                 R +S + G     S    V Y
Sbjct: 403 IGPNADNPRNLHGDYSYTAHIPSIAEGLEGVKVEEKCVVRTVSILEGIRNKVSPETEVLY 462

Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAG-----LDLSVEAESLDREDLWLPGYQT 527
             GCD +   S +    A E AK AD  I + G         +  E  DR  L L G Q 
Sbjct: 463 AKGCD-IISDSKDGFAEAIEMAKEADVIIAVMGEESGLFHRGISGEGNDRTTLELFGVQR 521

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
            L+ ++ ++ K P++LV+++  G   A    + N+ AIL A YPGEEGG A+ADV+FG +
Sbjct: 522 DLLKELHKLGK-PIVLVLIN--GRPQALKWEHENLNAILEAWYPGEEGGNAVADVIFGDY 578

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG--RTYKFYNGPTLYPFGYGLSYTQ 645
           NP G+LPI++         P  +  + PV     P     Y   +   LYPFG+GLSYT 
Sbjct: 579 NPSGKLPISF---------PAVTGQI-PVYYNRKPSAFSDYIDESAKPLYPFGHGLSYTT 628

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
           F+Y+ L  +     +L K++    +++T                            +N G
Sbjct: 629 FEYSDLKISPEKVNSLEKVE----ISFT---------------------------IKNTG 657

Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
           + DG +VV +Y           +K++ GF++++++ G +KR+ F  
Sbjct: 658 NRDGEEVVQLYIHDQVASLERPVKELKGFKKIYLKPGESKRVTFTL 703


>gi|261408260|ref|YP_003244501.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
 gi|261284723|gb|ACX66694.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
           Y412MC10]
          Length = 763

 Score =  283 bits (724), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 213/690 (30%), Positives = 337/690 (48%), Gaps = 96/690 (13%)

Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRI 180
           GAT FP  +   +++N  L++ I +AV+ E RA     + G   +SP ++V RDPRWGR 
Sbjct: 123 GATVFPVPLTIGSTWNTELFRSISRAVAAETRA-----QGGSATYSPVLDVVRDPRWGRT 177

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGV 239
            ET GEDP +V  +AV  V+GLQ          L+S    + +  KH+A Y   +  +  
Sbjct: 178 EETFGEDPHLVTEFAVAAVQGLQ-------GERLDSH-TSLLATLKHFAGYGASEGGRNG 229

Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
              H   R    ++ E  L PF   V+ G A SVM +YN ++G+P  +   LL   +R  
Sbjct: 230 APVHMGLR----ELHEVDLLPFRKAVEAG-ALSVMTAYNEIDGVPCTSSGYLLQDVLREA 284

Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQG 358
           W   G+++ DC +I ++   H   A S  +A AQ+LKAG+D++  G  +      A++QG
Sbjct: 285 WGFDGFVITDCGAIHMLACGHN-TAGSGVEAAAQSLKAGVDMEMSGTMFRAHLHQALEQG 343

Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKN 418
            + E D++++   +  +  RLG FD      +  +Q I   E+I LA +AA EGIVLLKN
Sbjct: 344 LITEEDLNRAAGRVLELKFRLGLFDRPYVDPAWAEQVIGCKEHIALAYQAAAEGIVLLKN 403

Query: 419 DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYMSPIAGFS---GYANVTYK 473
           + N LPL+S+   T+AV+GP+A+A    +G+Y     P + ++ + G     G + V Y 
Sbjct: 404 EGNLLPLDSSS-GTIAVIGPNAHAPYHQLGDYTSPQPPGQIVTVLDGIRRRLGDSRVLYA 462

Query: 474 TGCDDVACKSNNSIFAASEAAKTADATIILAG-----------LDLSVEA---------- 512
            GC  +   S      A   A+ AD  +++ G           +DL   A          
Sbjct: 463 PGC-RIQGDSREGFPRALACAEQADVIVMVLGGSSARDFGEGTIDLRTGASVVTGHAESD 521

Query: 513 ----ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
               E +DR  L L G Q +L+ ++ ++ K PVI+V ++  G  I     + +I +I+ A
Sbjct: 522 MECGEGIDRSTLTLMGVQLELLQELHKLGK-PVIVVYIN--GRPITEPWIDEHIPSIVEA 578

Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
            YPG+EGG AIAD++FG  NP GRLP++      V  LP +    R        G+ Y  
Sbjct: 579 WYPGQEGGSAIADMLFGDINPSGRLPLSIPK--EVGQLPNSYNARR------TRGKRYLE 630

Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
            +    YPFG+GLSYT+F+Y  L+    +            +    +A+           
Sbjct: 631 TDLAPRYPFGFGLSYTEFRYGRLTVEPAV------------VPIGGEAT----------- 667

Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
                    ++D  N G+ DG++VV +Y    A       K + GF++VF++AG  + + 
Sbjct: 668 --------VRIDVTNAGARDGAEVVQLYVSDLAASVTRPEKALKGFRKVFLKAGETQEVT 719

Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           F   + + L ++      ++  GE  I VG
Sbjct: 720 FTIGS-EQLELIGLDLKPVVEPGEFRIQVG 748


>gi|329962030|ref|ZP_08300041.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
 gi|328530678|gb|EGF57536.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
          Length = 941

 Score =  283 bits (724), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 240/862 (27%), Positives = 379/862 (43%), Gaps = 149/862 (17%)

Query: 1   MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPG--RFSKLGLQMSSFLFCDSS 58
           M K+++++L  S S  L    T  V A    +   +   G   F+K G+     ++ D +
Sbjct: 1   MRKLIAAVLLLSNSALLTAQKTMKVPATYKPTKSEMYHKGWIDFNKNGVMD---VYEDPA 57

Query: 59  LPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS-------EALH 104
                RV+DL+ +MTLDEK  Q+    +G  R+    LP  EW    W        E L+
Sbjct: 58  ATVDARVEDLLKQMTLDEKTCQMVTL-YGYKRVLKDALPTPEWKQMLWKDGIGAIDEHLN 116

Query: 105 GVSNVG-PGTHFDDVIPG--------------------------------------ATSF 125
           G    G P +  ++V P                                       AT+F
Sbjct: 117 GFQQWGLPPSDNENVWPASRHAWALNEVQRFFVEETRLGIPVDFTNEGIRGVESYKATNF 176

Query: 126 PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWGRITETP 184
           PT +    ++N +L  K+G     EAR +      G T  ++P ++V RD RWGR  E  
Sbjct: 177 PTQLGLGHTWNRALIHKVGLITGREARML------GYTNVYAPILDVGRDQRWGRYEEVY 230

Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
           GE P++V    +  VRGLQ                 V++  KH+AAY  +          
Sbjct: 231 GESPYLVAELGIEMVRGLQQ---------------HVAATGKHFAAYSNNKGAREGMARV 275

Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
           D + +  ++E   + PF   +KE     VM SYN  +GIP       L   +R E    G
Sbjct: 276 DPQTSPHEVENIHIYPFRRVIKEAGLLGVMSSYNDYDGIPIQGSYYWLTTRLRDEMGFRG 335

Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAVQQGKV 360
           Y+V+D D+++ +   H    D KE AV Q+++AGL++ C       +       V++G +
Sbjct: 336 YVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLRELVKEGGL 394

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG-KQDICSDENIELAAEAAREGIVLLKND 419
            E  ++  ++ +  V   +G FD   Q    G  +++  +EN  +A +A+RE +VLLKN+
Sbjct: 395 DEETVNDRVRDILRVKFLIGLFDAPYQTDLAGADKEVEKEENEAVALQASRESVVLLKNE 454

Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGYANVTYKTG 475
            +TLPLN   VK +AV GP+A+     + +Y  +     + + G     +G A V Y  G
Sbjct: 455 NSTLPLNINTVKKIAVCGPNADEDGYALTHYGPLAVEVTTVLKGIQDKVNGKAEVLYTKG 514

Query: 476 CDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEAESLDREDLW 521
           CD V      S              I  A E A+ AD  +++ G       E+  R  L 
Sbjct: 515 CDLVDANWPESEIIDYPLTPDEQAEINKAVENARRADVAVVVLGGGQRTCGENKSRSSLD 574

Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
           LPG Q QL+  V    K PV+L++++   + + +A+    + AIL A YPG +GG A+AD
Sbjct: 575 LPGRQLQLLQAVQATGK-PVVLILINGRPLSVNWAD--KYVPAILEAWYPGSKGGVALAD 631

Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL-----GYPGRTYKFYNGPTLYP 636
           ++FG +NPGG+L +T+     V  +P  + P +P   +       P       NG  LYP
Sbjct: 632 ILFGDYNPGGKLTVTFPK--TVGQIPF-NFPCKPASQIDGGKNAGPDGNMSRING-ALYP 687

Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           FGYGLSYT F+Y+ L  T                           P V+  + +      
Sbjct: 688 FGYGLSYTTFEYSNLEIT---------------------------PKVITPNEKA----T 716

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
            ++   N G   G +VV +Y++       TY K + GF+R+ +  G  K + F+ +  K 
Sbjct: 717 VRLKVTNTGKYAGDEVVQLYTRDVLSSVTTYEKNLAGFERIHLEPGETKEVTFILDR-KH 775

Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
           L ++D     ++  G+  I  G
Sbjct: 776 LELLDADMKRVVEPGDFAIMAG 797


>gi|383113364|ref|ZP_09934136.1| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
 gi|382948729|gb|EFS32368.2| hypothetical protein BSGG_3068 [Bacteroides sp. D2]
          Length = 850

 Score =  283 bits (723), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 166/427 (38%), Positives = 248/427 (58%), Gaps = 41/427 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ + + P   RV DL+SR+T++EK+  L   + G+PRLG+ +Y   +EALHGV  V PG
Sbjct: 26  LYKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG 83

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
                     T FP  I   A++N  L K++   +S EARA +N    G          L
Sbjct: 84  RF--------TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVL 135

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+WSP +N+ARDPRWGR  ET GEDPF+ G     +V+GLQ           + R LK+ 
Sbjct: 136 TFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGD---------DPRYLKIV 186

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S  KH+AA + ++    +R+  + +++E+ + E +   FEMCVKEG A+S+M +YN +N 
Sbjct: 187 STPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALND 242

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P   +  LL + +R +W   GY+V+DC    ++V+ HK++  +KE A   +++AGLDL+
Sbjct: 243 VPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIQAGLDLE 301

Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
           CG   Y  +  NA +Q  V + DID +  ++ T  M+LG FDG+ +  Y  +    I S 
Sbjct: 302 CGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGTERNPYTRISPSVIGSK 361

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           E+ ++A +AARE IVLLKN  N LPLN  KVK++AVVG   NA     G+Y+G P   + 
Sbjct: 362 EHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVG--INAGKCEFGDYSGAPV--VD 417

Query: 460 PIAGFSG 466
           P++   G
Sbjct: 418 PVSILQG 424



 Score =  128 bits (321), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 87/290 (30%), Positives = 139/290 (47%), Gaps = 48/290 (16%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A +A +  +  + + G++ S+E E  DR D+ LP  Q + + ++ +V   P I+V++ AG
Sbjct: 594 AGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAG 651

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
              +A    + +I AI+ A YPGE+GG A+ADV+FG +NP GRLP+T+Y    +  LP  
Sbjct: 652 S-SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS--LDELPAF 708

Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
                  D     GRTYK++ G  LYPFGYGLSY+ FKY+ L                  
Sbjct: 709 D------DYDITQGRTYKYFKGDVLYPFGYGLSYSSFKYSDLK----------------- 745

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
                D + T                      +N G   G +V  VY + P       IK
Sbjct: 746 ---VKDGANT---------------VSVSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIK 787

Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA-ANTLLPAGEHTIFVG 778
           ++ GF+R+ +++G ++ ++   +  + L   D      ++P G   I +G
Sbjct: 788 ELKGFRRIPLKSGESRVVEIELDK-EQLRYWDAGLGQFIVPQGAFDIMIG 836


>gi|46127231|ref|XP_388169.1| hypothetical protein FG07993.1 [Gibberella zeae PH-1]
          Length = 712

 Score =  283 bits (723), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 204/621 (32%), Positives = 301/621 (48%), Gaps = 85/621 (13%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
            CD++   + R   LVS +T  EKV  L   A G PR+GLP+Y WW+EALHGV+   PG 
Sbjct: 41  ICDTTASPAERAAALVSALTPREKVNNLVSNATGAPRIGLPRYNWWNEALHGVAGA-PGN 99

Query: 114 HFDDVIP--GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG-LTYWSPNIN 170
            ++D  P   ATSFP  +L  ++F++ L   IG+ + TEARA  N G  G + YW     
Sbjct: 100 DYNDKPPYDSATSFPMPLLMGSTFDDDLIHDIGEVIGTEARAWNNGGWGGGVDYW----- 154

Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
                       TP  +PF   R+            G E                     
Sbjct: 155 ------------TPNVNPFKDPRWG----------RGSETP------------------- 173

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
                  G D  H             + R  E C ++    S+MCSYN VNGIP+CA+  
Sbjct: 174 -------GEDALHV----------SRYARAME-CTRDAKVGSIMCSYNAVNGIPACANSY 215

Query: 291 LLNQTVRGEWDL---HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
           L    +R  W+    + +I +DC ++Q +  +H +     E A A   + G D  C    
Sbjct: 216 LQETLLRKHWNWTHTNNWITSDCGAMQDIWQHHNYTKTGAEAAKA-AFENGQDSSCEYTT 274

Query: 348 TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG-SPQYVSLGKQDICSDENIELAA 406
           T    ++ +QG + E  +D++LK L+  L+  GFFDG   ++ SL   D+ +    +LA 
Sbjct: 275 TKDISDSYEQGLLTEKVMDRALKRLFEGLVHTGFFDGDKSEWSSLDFDDVNTRHAQDLAL 334

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI--AGF 464
           ++A  G VLLKND NTLPLN  K ++VA++G  A+    + G Y+G      +P   A  
Sbjct: 335 QSAVRGAVLLKND-NTLPLNIKKKESVALIGFWADDKTKLQGGYSGPAPHVRTPAYAAKM 393

Query: 465 SGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLP 523
            G   NV +     + +   N +  A  EAAK +D  + L GLD +   E  DR DL  P
Sbjct: 394 LGLNTNVAWGPTLQNSSVPDNWTTNAL-EAAKKSDYIVYLGGLDATAAGEERDRTDLDWP 452

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
             Q  L+ +++ + K   ++V+     VD      N  + +ILW  YPG+EGG A+ +++
Sbjct: 453 STQLTLLKKLSNLGK--PLVVVQLGDQVDDTPLLKNKGVNSILWVNYPGQEGGTAVMELI 510

Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
            G+  P GRLP+T Y   Y + + +  M LRP  S   PGRTY++Y+   L PFG+G  Y
Sbjct: 511 TGRKGPAGRLPLTQYPSKYTEQVGMLEMELRPTKS--SPGRTYRWYSDSVL-PFGFGKHY 567

Query: 644 TQFKYNLLSFTKTIQVNLNKL 664
           T FK    S  + I++N+ K+
Sbjct: 568 TTFKAMFKS--QKIEMNIQKI 586


>gi|423302093|ref|ZP_17280116.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
           CL09T03C10]
 gi|408471184|gb|EKJ89716.1| hypothetical protein HMPREF1057_03257 [Bacteroides finegoldii
           CL09T03C10]
          Length = 1039

 Score =  283 bits (723), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 247/868 (28%), Positives = 392/868 (45%), Gaps = 155/868 (17%)

Query: 13  LSIALLVFSTNAVDANGSSSPVFVCDPGR----------FSKLGLQMSSFLFCDSSLPYS 62
           L I+L + S   + A  +S    V  P R          F+K G++    ++ D S P  
Sbjct: 97  LLISLFLGSCATLPAQKTSKIPTVYKPVRTEMYQKGWIDFNKNGIKD---VYEDPSAPID 153

Query: 63  IRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS-------EALHGVSN 108
            R++DL+S+MTL+EK  Q+    +G  R+    LP  EW    W        E L+G   
Sbjct: 154 ARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKNQLWKDGIGAIDEHLNGFQQ 212

Query: 109 VG-PGTHFDDVIPG--------------------------------------ATSFPTVI 129
            G P +  + V P                                       AT+FPT +
Sbjct: 213 WGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESYKATNFPTQL 272

Query: 130 LTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPF 189
               ++N  L ++IG     EAR    LG   +  ++P ++V RD RWGR  E  GE P+
Sbjct: 273 GLGHTWNRQLLRQIGLITGREARM---LGYTNV--YAPILDVGRDQRWGRYEEVYGESPY 327

Query: 190 VVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVT 249
           +V    +  V+G+Q    H +         +V++  KH+ AY  +          D +++
Sbjct: 328 LVAELGIEMVKGMQ----HNH---------QVAATGKHFIAYSNNKGAREGMARVDPQMS 374

Query: 250 EQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVAD 309
            +++E   + PF+  ++E     VM SYN  +G P  +    L   +RG+    GY+V+D
Sbjct: 375 PREVEMIHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGDMGFRGYVVSD 434

Query: 310 CDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAVQQGKVKETDI 365
            D+++ +   H    D KE AV Q+++AGL++ C       Y       V++G++ E  I
Sbjct: 435 SDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNIRCTFRSPDSYVLPLRELVKEGELSEEII 493

Query: 366 DKSLKYLYTVLMRLGFFDGSPQYVSLG-KQDICSDENIELAAEAAREGIVLLKNDQNTLP 424
           +  ++ +  V   +G FD   Q    G  +++    N E+A +A+RE IVLLKND+N LP
Sbjct: 494 NDRVRDILRVKFLVGLFDHPYQTDLKGADEEVEKASNEEIALQASRESIVLLKNDKNVLP 553

Query: 425 LNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----FSGYANVTYKTGCDDV- 479
           LN++ +K +AV GP+A+     + +Y  +     S + G      G A V Y  GC+ V 
Sbjct: 554 LNASTIKKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKLGGKAEVLYTKGCELVD 613

Query: 480 -------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
                        +      I  A    K AD  +++ G       E+  R  L LPG Q
Sbjct: 614 ANWPESELMEYPLSENEQEEIEKAVSQTKQADVAVVVLGGGQRTCGENKSRSSLALPGRQ 673

Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
             L+  V    K PV+LV+++   + I +A+    + AIL A YPG +GG+A+ADV+FG 
Sbjct: 674 LDLLKAVVATGK-PVVLVLINGRPLSINWAD--KFVPAILEAWYPGSKGGKAVADVLFGD 730

Query: 587 FNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGRTYKF--YNGPTLYPFGYGL 641
           +NPGG+L +T+     V  +P  + P +P   +D    PG        NG  LYPFG+GL
Sbjct: 731 YNPGGKLTVTF--PKTVGQIPF-NFPCKPSSQIDGGKNPGLNGNMSRVNG-ALYPFGFGL 786

Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
           SYT F+Y+ L  +                           P ++  + +   Y   KV  
Sbjct: 787 SYTTFEYSDLKIS---------------------------PAIITPNQKT--YVTCKV-- 815

Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
            N G   G +VV +Y +       TY K + GF+RV ++ G  K I F  +  K+L +++
Sbjct: 816 TNTGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERVHLKPGETKEITFPIDR-KALELLN 874

Query: 762 YAANTLLPAGEHTIFVGNGGVSFPIHLN 789
              + ++  GE T+ +  G  S  I LN
Sbjct: 875 ADMHWVVEPGEFTLMI--GASSTDIRLN 900


>gi|299149391|ref|ZP_07042448.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298512578|gb|EFI36470.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 853

 Score =  283 bits (723), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 166/427 (38%), Positives = 248/427 (58%), Gaps = 41/427 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ + + P   RV DL+SR+T++EK+  L   + G+PRLG+ +Y   +EALHGV  V PG
Sbjct: 29  LYKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG 86

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
                     T FP  I   A++N  L K++   +S EARA +N    G          L
Sbjct: 87  RF--------TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVL 138

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+WSP +N+ARDPRWGR  ET GEDPF+ G     +V+GLQ           + R LK+ 
Sbjct: 139 TFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGD---------DPRYLKIV 189

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S  KH+AA + ++    +R+  + +++E+ + E +   FEMCVKEG A+S+M +YN +N 
Sbjct: 190 STPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMTAYNALND 245

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P   +  LL + +R +W   GY+V+DC    ++V+ HK++  +KE A   +++AGLDL+
Sbjct: 246 VPCTLNAWLLKKVLRQDWGFQGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIQAGLDLE 304

Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
           CG   Y  +  NA +Q  V + DID +  ++ T  M+LG FDG+ +  Y  +    I S 
Sbjct: 305 CGDDVYDEYLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDGTERNPYTRISPSVIGSK 364

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           E+ ++A +AARE IVLLKN  N LPLN  KVK++AVVG   NA     G+Y+G P   + 
Sbjct: 365 EHQQIALDAARECIVLLKNKNNMLPLNVNKVKSIAVVG--INAGKCEFGDYSGAPV--VD 420

Query: 460 PIAGFSG 466
           P++   G
Sbjct: 421 PVSILQG 427



 Score =  128 bits (321), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 88/290 (30%), Positives = 138/290 (47%), Gaps = 48/290 (16%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A +A +  +  + + G++ S+E E  DR D+ LP  Q + + ++ +V   P I+V++ AG
Sbjct: 597 AGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAG 654

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
              +A    + +I AI+ A YPGE+GG A+ADV+FG +NP GRLP+T+Y    +  LP  
Sbjct: 655 S-SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS--LDELPAF 711

Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
                  D     GRTYK++ G  LYPFGYGLSY+ FKY+ L                  
Sbjct: 712 D------DYDITKGRTYKYFKGDVLYPFGYGLSYSSFKYSDLK----------------- 748

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
                D + T                      +N G   G +V  VY + P       IK
Sbjct: 749 ---VKDGANT---------------ISVSFRLKNTGKRKGDEVAQVYVRIPETGGVVPIK 790

Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA-ANTLLPAGEHTIFVG 778
           ++ GF+R+ +++G ++ +    +  + L   D      ++P G   I VG
Sbjct: 791 ELKGFRRIPLKSGESRVVDIELDK-EQLRYWDAGLGQFIVPQGAFDIMVG 839


>gi|380694609|ref|ZP_09859468.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
           [Bacteroides faecis MAJ27]
          Length = 804

 Score =  283 bits (723), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 222/719 (30%), Positives = 340/719 (47%), Gaps = 110/719 (15%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    EA HG   +G            T FPT I  +A+++ +L +++G+A++ 
Sbjct: 142 RLGIPVF-LAEEAPHGHMAIG-----------TTVFPTGIGMSATWSPTLIEEVGKAIAK 189

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R+     +     + P ++++RDPRW R+ ET GEDP + GR     V GL       
Sbjct: 190 EIRS-----QGAHISYGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMVTGL------- 237

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
            + DL SR     +  KH+ AY V        Y   A V  +D+ E FL PF   ++ G 
Sbjct: 238 GSGDL-SREHATIATLKHFLAYAVPEGGQNGNY---ASVGARDLHENFLPPFREAIEAG- 292

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           A SVM SYN ++GIP  A+  LL Q +R EW   G++V+D  SI+ + ++H F+A + E+
Sbjct: 293 ALSVMTSYNSIDGIPCTANHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FVASTMEE 351

Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
           A  Q L AG+D+D G   + N    AV+ GK+ ET I+ ++  +  +   +G F+     
Sbjct: 352 AAVQALSAGVDIDLGGDAFMNLL-QAVRSGKLDETQINAAVDRILRMKFEMGLFEHPYVN 410

Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
                + + + E+++LA + A+  +VLL+N  + LPL S K+K VAVVGP+A+    M+G
Sbjct: 411 PKTTTKMVRNKEHVKLARKVAQSSVVLLENKNSILPL-SKKIKRVAVVGPNADNRYNMLG 469

Query: 449 NYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATII 502
           +Y        I       I+  S  + V Y  GC  +   + N I  A EAA  ++  I 
Sbjct: 470 DYTAPQEDKDIRTVLDGVISKLSP-SRVEYVRGCA-IRDTTVNEIAEAVEAAHRSEVIIA 527

Query: 503 LAGLDLSVE-----------------------AESLDREDLWLPGYQTQLINQVAEVAKG 539
           + G   + +                        E  DR  L L G Q  L+N +    K 
Sbjct: 528 VVGGSSARDFKTSYQETGAAIADEKSISDMECGEGFDRATLTLLGKQQDLLNALKTTGK- 586

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           P+I+V +    +D  +A    +  A+L A YPG+ GG AIADV+FG +NP GRLP++   
Sbjct: 587 PLIVVYIEGRPLDKVWASECAD--ALLTASYPGQAGGDAIADVLFGDYNPAGRLPVSVPR 644

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
              V  +P+      P +        Y       LY FGYGLSYT F+Y+ L  T+    
Sbjct: 645 S--VGQIPVYYNKKAPRN------HDYVEMAASPLYGFGYGLSYTTFEYSDLQITQ---- 692

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                             K+ C            +FE     +N G+ DG +V  +Y K 
Sbjct: 693 ------------------KSPC------------HFEVSFKVKNTGNYDGEEVAQLYLKD 722

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                   +KQ+  F+R F+R G  K I F     K L+I+D +   ++  G+  I +G
Sbjct: 723 EYASVVQPLKQLKHFERFFLRKGEEKEILFTLTE-KDLSIIDRSMKRVVETGDFRIMIG 780


>gi|319643197|ref|ZP_07997825.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
 gi|345520511|ref|ZP_08799899.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|254835034|gb|EET15343.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|317385101|gb|EFV66052.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
          Length = 788

 Score =  283 bits (723), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 239/817 (29%), Positives = 371/817 (45%), Gaps = 151/817 (18%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSN 108
           L+ +   P   RV+DL+S+MTL+EK  Q+    +G  R+    LPQ  W +E    G+ N
Sbjct: 42  LYENPKAPLEDRVQDLLSQMTLEEKTCQMATL-YGSGRVLKDALPQDNWKTEVWKDGIGN 100

Query: 109 V-----GPGT-----------HFD-----------------------DVIPG-----ATS 124
           +     G G            H D                       + I G     AT 
Sbjct: 101 IDEEHNGLGAFKSEYSFPYAKHVDAKHTIQRWFVEKTRLGIPVDFTNEGIRGLCHDRATY 160

Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
           FP      A++N+ L  +IG+  + EA A   LG   +  +SP +++A+DPRWGR  ET 
Sbjct: 161 FPAQCGQGATWNKKLIARIGEVEAKEAVA---LGYTNI--YSPILDIAQDPRWGRCVETY 215

Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
           GEDP++VG      +  LQ         +L + P       KH+A Y +       +   
Sbjct: 216 GEDPYLVGELGKQMITSLQKY-------NLVATP-------KHFAVYSIPIGGRDGKTRT 261

Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
           D  V  ++M   ++ PF M  +E  A  VM SYN  +G P       L + +R EW   G
Sbjct: 262 DPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 321

Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAV 355
           Y+V+D ++++ + + HK +AD+ ED +AQ + AGL++      T+FT           AV
Sbjct: 322 YVVSDSEAVEFISNKHK-VADTYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAV 375

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD---ICSDENIELAAEAAREG 412
             GK+ +  +DK +  +  +   LG FD    Y   GKQ    + S E+  ++ EAAR+ 
Sbjct: 376 DDGKISQETLDKRVAEILRIKFWLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQS 433

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY--AGIPCRYM-SPIAGFSGYAN 469
           +VLLKN+ + LPL S  ++++AV+GP+A+    +I  Y  A  P + +   I     +A 
Sbjct: 434 LVLLKNETHLLPL-SKSIRSIAVIGPNADEQTQLICRYGPANAPIKTVYQGIKELLPHAE 492

Query: 470 VTYKTGCDDVA-----------CKSNNSIFAASE---AAKTADATI-ILAGLDLSVEAES 514
           V YK GCD +             K+   +    E   AAK A+  + +L G +L+V  E 
Sbjct: 493 VIYKKGCDIIDPHFPESEILDFPKTAEEVRLMQEVIRAAKQAEVVVMVLGGNELTVR-ED 551

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
             R  L LPG Q +L+  V    K PVILV++      I +A    ++ AIL A +PGE 
Sbjct: 552 RSRTSLNLPGRQEELLKAVCATGK-PVILVMLDGRASSINYAA--AHVPAILHAWFPGEF 608

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
            G+A+A+ +FG +NPGGRL +T+     V  +P  + P +P          Y       L
Sbjct: 609 CGQAVAEALFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDESSSTSVYG-----AL 660

Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
           YPFG+GLSYT F Y+ L  + + Q                              ++ D +
Sbjct: 661 YPFGHGLSYTTFTYSDLHISPSHQ-----------------------------GVQGDIH 691

Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
              K+  +N G   G +VV +Y +       TY K + GF+R+ ++AG  + + F     
Sbjct: 692 VSCKI--KNTGKIKGDEVVQLYLRDEISSVTTYTKVLRGFERISLKAGEEQTVHFRLRP- 748

Query: 755 KSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
           + L + D   N  +  G   + +G       +H  F 
Sbjct: 749 QDLGLWDKNMNFRVEPGSFKVMLGASSTDIRLHGQFE 785


>gi|410634080|ref|ZP_11344720.1| beta-glucosidase [Glaciecola arctica BSs20135]
 gi|410146740|dbj|GAC21587.1| beta-glucosidase [Glaciecola arctica BSs20135]
          Length = 772

 Score =  283 bits (723), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 205/632 (32%), Positives = 318/632 (50%), Gaps = 71/632 (11%)

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           ++P ++VARDPRWGRI+E  GED ++    A   V+G Q         DL S+P  + + 
Sbjct: 176 FAPMVDVARDPRWGRISEGSGEDVYLTTAIARARVQGFQ-------GDDL-SQPHTILAT 227

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
            KH+AAY      G D +  D  ++++++ +T+L PF+  V  G  +S M S+N +NG+P
Sbjct: 228 AKHFAAYG-QGQAGRDYHTTD--MSDRELRDTYLPPFKAAVDAG-VTSFMTSFNELNGVP 283

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC- 343
           + A+  LL   +R EW   G++V D  SI  MV  H F  D+ + A    +KAG+D+D  
Sbjct: 284 ASANKYLLTDILRDEWSFEGFVVTDYTSINEMV-KHGFARDN-DHAGELAVKAGVDMDMQ 341

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK--QDICSDEN 401
           G  Y ++  N V QGKV    ID + + +  +  RLG F+   +Y +  +  Q+I  + N
Sbjct: 342 GSVYFDYLANQVTQGKVSPQQIDNAARRILEMKYRLGLFEDPYRYSNEEREAQEIYKEYN 401

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
           ++ A + AR+ +VLLKN+   LPL+ + + T+AV+GP A++   +IG+++    RY  PI
Sbjct: 402 LQAAQDVARKSMVLLKNENQQLPLSKSDL-TIAVIGPLADSKEDLIGSWSAAGDRYEKPI 460

Query: 462 AGFSGY-------ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA-GLDLSVEAE 513
              +G        + V Y  G        +NS F A+ A       I+LA G    +  E
Sbjct: 461 TLLTGIKAKVADPSKVLYAKGASYEFSHQDNSGFEAAIAIAKKADVIVLAMGEKWDMTGE 520

Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
           +  R  L  PG Q  L+ Q+ ++AK P++LV+M+   + I +A  + N+ AIL A YPG 
Sbjct: 521 ATSRTSLDFPGNQLALMQQLKKLAK-PMVLVLMNGRPMTIEWA--DQNVDAILEAWYPGT 577

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPVDS-LGYPGRTY 626
            GG AIADV+FG +NP G+LP+T+     V  +PL      T  P    ++   Y  R  
Sbjct: 578 MGGPAIADVLFGDYNPSGKLPVTFPRN--VGQIPLYYNMKNTGRPYSKDNAEQKYVSRYI 635

Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV 686
              N P LY FG+GLSYT F Y+ +S  K +     KL                      
Sbjct: 636 DSLNTP-LYHFGHGLSYTTFDYSKISLNKAVITAKEKLTAS------------------- 675

Query: 687 NDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKR 746
                       +D  N G+ DG +VV +Y +         +KQ+ GF+++F+  G  K 
Sbjct: 676 ------------IDVTNSGNYDGEEVVQLYIRDRIGSVTRPVKQLKGFKKIFLHKGETKT 723

Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           + F   + + L       +    AGE  +F+G
Sbjct: 724 VSFSI-STEDLAFHRQDMSFGAEAGEFDLFIG 754


>gi|29347188|ref|NP_810691.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
 gi|29339087|gb|AAO76885.1| beta-glucosidase (gentiobiase) [Bacteroides thetaiotaomicron
           VPI-5482]
          Length = 853

 Score =  282 bits (722), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 165/430 (38%), Positives = 248/430 (57%), Gaps = 41/430 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ + + P   RV DL+SR+T++EK+  L   + G+PRLG+ +Y   +EALHGV  V PG
Sbjct: 29  LYKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG 86

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
                     T FP  I   A++N  L K++   +S EARA +N    G          L
Sbjct: 87  RF--------TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVL 138

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+WSP +N+ARDPRWGR  ET GEDPF+ G     +V GLQ  + H          LK+ 
Sbjct: 139 TFWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPHY---------LKIV 189

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S  KH+AA + ++    +R+  + +++E+ + E +   FEMCVKEG A+S+M +YN +N 
Sbjct: 190 STPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALND 245

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P   +P LL + +R +W   GY+V+DC    ++V+ HK++  +KE A   ++KAGLDL+
Sbjct: 246 VPCTLNPWLLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKAGLDLE 304

Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
           CG   Y     NA +Q  V + DID +  ++ T  M+LG FD   +  Y  +    I S 
Sbjct: 305 CGDDVYDGPLLNAYKQYMVSDADIDSAAYHVLTARMKLGLFDSGERNPYTKISPSVIGSK 364

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           E+ ++A +AAR+ +VLLKN +N LPLN+ K+K++AVVG   NA     G+Y+G P   + 
Sbjct: 365 EHQQIALDAARQCVVLLKNQKNRLPLNADKLKSIAVVG--INAGKCEFGDYSGAPV--VE 420

Query: 460 PIAGFSGYAN 469
           P++   G  N
Sbjct: 421 PVSILQGIRN 430



 Score =  130 bits (326), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 88/291 (30%), Positives = 144/291 (49%), Gaps = 50/291 (17%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A +A +  +  + + G++ S+E E  DR D+ LP  Q + + ++ +V   P I+V++ AG
Sbjct: 597 AGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAG 654

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
              +A    + +I AI+ A YPGE+GG A+A+V+FG +NP GRLP+T+Y         L 
Sbjct: 655 S-SLAINWMDEHIPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS-------LD 706

Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
            +P  P D      GRTYK++ G  LYPFGYGLSY+ F Y+ L                 
Sbjct: 707 ELP--PFDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFTYSDLQ---------------- 748

Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
                            V D   +    F++  +N G  +G +V  VY + P       +
Sbjct: 749 -----------------VKDGGGEVTVSFRL--KNTGKRNGDEVAQVYVRIPETGGIVPL 789

Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT-LLPAGEHTIFVG 778
           K++ GF+RV +++G ++R++   +  + L   D      ++P G   + VG
Sbjct: 790 KELKGFRRVPLKSGESRRVEIKLDK-EQLRYWDVEKGQFVVPKGAFDVMVG 839


>gi|189463167|ref|ZP_03011952.1| hypothetical protein BACCOP_03878 [Bacteroides coprocola DSM 17136]
 gi|189430146|gb|EDU99130.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           coprocola DSM 17136]
          Length = 865

 Score =  282 bits (722), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 175/452 (38%), Positives = 241/452 (53%), Gaps = 39/452 (8%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           F + ++SL    R  DL+ R+TL+EKV  + + +  +PRLG+  Y+WW+EALHGV   G 
Sbjct: 25  FPYQNTSLTPEQRASDLLERLTLEEKVSLMQNASPAIPRLGIKAYDWWNEALHGVGRAGI 84

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY-------NLGR-AGLT 163
                     AT FP  I   ASF++ L  K+  AVS EARA Y       NL R  GLT
Sbjct: 85  ----------ATVFPQTIGMAASFDDELIYKVFTAVSDEARAKYTEFSKSGNLKRYQGLT 134

Query: 164 YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
           +W+PNIN+ RDPRWGR  ET GEDP++  R  V  VRGLQ  +        N +  K+ +
Sbjct: 135 FWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVRGLQGPD--------NMKYDKLHA 186

Query: 224 CCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           C KHYA +    W   +R+ F+A  +  +D+ ET+L  F+  V+E D   VMC+YNR  G
Sbjct: 187 CAKHYAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKALVQEADVKEVMCAYNRFEG 243

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM--VDNHKFLADSKEDAVAQTLKAGLD 340
            P C   +LL Q +R EW   G IV+DC +I       +H+   D KE A A  + +G D
Sbjct: 244 EPCCGSNRLLMQILRDEWKYKGIIVSDCGAISDFWRKGDHETHPD-KETASAGAVLSGTD 302

Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
           L+CG  Y +    AVQ+G + E  ID S+K L T    LG  D    + S+    + S  
Sbjct: 303 LECGNNYKSLP-EAVQKGLIDEKQIDISVKRLLTARFELGEMDEHVCWDSIPYSVVDSKA 361

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
           + +LA E AR+ IVLL+N  N LPL       +A++GP+AN +V   GNY G P    + 
Sbjct: 362 HKDLALEIARKSIVLLQNRNNILPLKED--MKIALIGPNANDSVMQWGNYNGFPSHTSTL 419

Query: 461 IAGFSGYA---NVTYKTGCDDVACKSNNSIFA 489
                       + Y  GCD  +  S  S+F+
Sbjct: 420 YEALKERIPANQLIYDFGCDRTSGISLESVFS 451



 Score =  103 bits (256), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 84/302 (27%), Positives = 132/302 (43%), Gaps = 58/302 (19%)

Query: 489 AASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAK 538
           A+ +  K AD  +   G+  S+E E +          DR  + LP  Q +LI+++ ++ K
Sbjct: 593 ASIDKVKAADVIVFAGGISPSLEGEEMPVNAEGFKGGDRTTIELPAIQRRLISELKKLGK 652

Query: 539 GPVILVIMSAGGVDIAFAETNTNI-KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW 597
            P+I V  S   V +   E  + I  AIL A YPG+ GG A+ADV+FG +NP G+LP+T+
Sbjct: 653 -PIIFVNYSGSAVGL---EPESKICDAILQAWYPGQAGGTAVADVLFGDYNPSGKLPVTF 708

Query: 598 YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
           Y   +   LP               GRTY++     LY FG+GLSYT F Y   + ++  
Sbjct: 709 YK--HTDQLP-------DFQDYSMKGRTYRYMTESPLYSFGHGLSYTNFTYGPATLSQ-- 757

Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
                               +T   G  V            +  QN G+ DG +VV VY 
Sbjct: 758 --------------------QTISQGKEVT---------LTIPVQNTGNYDGEEVVQVYL 788

Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL-LPAGEHTIF 776
               +        +  F+RV +  G+   + F  ++ ++    D   NT+ +  G + + 
Sbjct: 789 SCSGDKEGPS-HTLRAFKRVHIAKGQRANVSFTLDS-ETFQWFDTNTNTMRMVEGNYELL 846

Query: 777 VG 778
            G
Sbjct: 847 YG 848


>gi|424792251|ref|ZP_18218496.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
 gi|422797157|gb|EKU25539.1| exported beta-glucosidase [Xanthomonas translucens pv. graminis
           ART-Xtg29]
          Length = 909

 Score =  282 bits (721), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 170/422 (40%), Positives = 236/422 (55%), Gaps = 44/422 (10%)

Query: 71  RMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVIL 130
           +MT +EKV Q  + A  +PRLG+P YEWW+E LHG++  G           AT FP  I 
Sbjct: 67  KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNG----------YATVFPQAIG 116

Query: 131 TTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARDPRWGRIT 181
             A++N +L +++G   STEARA +NL           AGLT WSPNIN+ RDPRWGR  
Sbjct: 117 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 176

Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
           ET GEDP++ G+ AV ++ GLQ         DL + P  +++  KH A   V +     R
Sbjct: 177 ETYGEDPYLTGQLAVGFIHGLQ-------GDDL-THPRTIATP-KHLA---VHSGPEPGR 224

Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
           + FD  V+  D+E T+   F   + +G A SVMC+YN ++G P+CA   LLN  +RG+W 
Sbjct: 225 HGFDVDVSPHDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPACAADWLLNGRLRGDWG 284

Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVK 361
             G++V+DCD++  M   H F AD+   + A  LKAG DL+CG  Y +  G A+ +G   
Sbjct: 285 FTGFVVSDCDAVDDMTQFHYFRADNAGSSAA-ALKAGHDLNCGYAYRDL-GKAIARGDAD 342

Query: 362 ETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDENIELAAEAAREGIVLLK 417
           E  +DKSL  L+    RLG     PQ    Y  LG +D+ S  +  LA +AA++ IVLL+
Sbjct: 343 EALLDKSLVRLFAARYRLGEL--QPQRKDPYARLGAKDVDSAAHRALALQAAQQSIVLLQ 400

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYKT 474
           N   TLPL       +AV+GP+A+A  A+  NY G     ++P+ G     G ANV Y  
Sbjct: 401 NRNATLPLRPG--LRLAVIGPNADALAALEANYQGTSAAPVTPLLGLRERFGAANVRYAQ 458

Query: 475 GC 476
           G 
Sbjct: 459 GA 460



 Score =  126 bits (316), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 88/286 (30%), Positives = 138/286 (48%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR DL LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 646 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 704

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+ + +  AI+ A YPG+ GG AIA V+ G  NPGGRLP+T+Y          ++  L 
Sbjct: 705 WAKQHAD--AIVAAWYPGQSGGTAIAQVLAGDVNPGGRLPVTFYR---------STKDLP 753

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
              S    GRTY+++ G  L+ FG GLSYT+F Y                          
Sbjct: 754 AYVSYDMKGRTYRYFKGEPLFAFGSGLSYTRFTYA------------------------- 788

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                  P +    L+   + + +   +N G+  G +VV VY + P + A + ++ ++GF
Sbjct: 789 ------APQLSATTLQAGAHLQVRTQVRNSGTRAGDEVVQVYLEFP-QRAQSPLRTLVGF 841

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV ++ G  + + F   A + L+ VD A    +  G++ +FVG G
Sbjct: 842 QRVTLQPGEARDVSFEL-APRQLSDVDRAGQRAVQPGDYRVFVGGG 886


>gi|399029285|ref|ZP_10730258.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
 gi|398072895|gb|EJL64089.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
          Length = 871

 Score =  281 bits (720), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 177/446 (39%), Positives = 243/446 (54%), Gaps = 48/446 (10%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
            +F F + +L    RV DLVSRM++DEK+ QL D +  + RLG+P+Y WW+E+LHGV+  
Sbjct: 22  ENFAFKNPNLTTEQRVDDLVSRMSIDEKISQLMDSSPAIERLGVPEYNWWNESLHGVARA 81

Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGRA------G 161
           G           AT FP  I   +S++  L   +   +S EARA ++  L R       G
Sbjct: 82  G----------YATVFPQSISIASSWDRQLIFDVANVISDEARAKHHEYLRRGQHGMYQG 131

Query: 162 LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
           LT+WSPN+N+ RDPRWGR  ET GEDPF+ G+  + YV GLQ           N + LKV
Sbjct: 132 LTFWSPNVNIFRDPRWGRGHETYGEDPFLTGQLGLKYVNGLQGT---------NEKYLKV 182

Query: 222 SSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVN 281
            +  KHYA   V +     R+ F+A  ++ D+ ET+L  F   VKEG   SVM +YNR  
Sbjct: 183 IATAKHYA---VHSGPEPSRHLFNAETSDIDLYETYLPAFRTLVKEGHVYSVMGAYNRFR 239

Query: 282 GIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL 341
           G    A P L N  +R  W   GYIV+DC ++  +   HK   D+   A A  LK GLDL
Sbjct: 240 GESCSASPFLFN-ILRNVWGFDGYIVSDCGAVTDIWKYHKITGDAA-TASALALKDGLDL 297

Query: 342 DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDEN 401
           +CG  + +    A+ +  + E DID ++K L+T   +LG FD   + VS  +     + N
Sbjct: 298 ECGSSFKSLK-EAIDRKLISEADIDIAVKRLFTARFKLGMFD-PEEIVSYAQIPYSVNNN 355

Query: 402 IE---LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM 458
                LA  A+++ IVLLKN  NTLPL S  +KTVAV+GP+AN   ++ GNY+G+P    
Sbjct: 356 SAHDWLARVASQKSIVLLKNQNNTLPL-SRDIKTVAVIGPNANDVQSLWGNYSGVPS--- 411

Query: 459 SPIAGFSGYAN-------VTYKTGCD 477
           +PI    G  N       V Y  G D
Sbjct: 412 NPITVLKGIQNKLEPNTKVLYAKGTD 437



 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 99/309 (32%), Positives = 152/309 (49%), Gaps = 55/309 (17%)

Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQL 529
           A    N +  A + A  ADA +++ GL+  +E E +          DR  L LP  Q +L
Sbjct: 582 AEPQENVLQEAVQVAGQADAIVLVLGLNERLEGEEMKVEADGFEGGDRTSLDLPSNQEEL 641

Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
           +  +    K PVILV+++   + I +A  N ++ AIL AGYPG++GG AIADV+FG +NP
Sbjct: 642 MKAMTATGK-PVILVLINGSALSINWA--NDHVPAILTAGYPGQQGGNAIADVLFGDYNP 698

Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
            GRLP+T+Y     + LP         ++    GRTY+++    LYPFG+GLSYT+FKY+
Sbjct: 699 AGRLPVTYYKS--TEQLP-------AFENYDMKGRTYRYFQKKPLYPFGFGLSYTKFKYS 749

Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
            L                                 L  ++  +  FE  VD  N+G  DG
Sbjct: 750 NLK--------------------------------LPTNVTPEKDFEILVDVTNIGERDG 777

Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
            +V+ +Y K         I Q+ GF+RV ++ G  K ++F     + L++++     ++ 
Sbjct: 778 DEVIELYLKDEKASTPRPILQLEGFERVNLKKGETKTVRFTITP-RQLSLINKKGQRVIE 836

Query: 770 AGEHTIFVG 778
            G  TI VG
Sbjct: 837 PGWFTISVG 845


>gi|340616359|ref|YP_004734812.1| xylosidase/arabinosidase [Zobellia galactanivorans]
 gi|339731156|emb|CAZ94420.1| Xylosidase/arabinosidase, family GH3 [Zobellia galactanivorans]
          Length = 801

 Score =  281 bits (720), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 245/864 (28%), Positives = 383/864 (44%), Gaps = 154/864 (17%)

Query: 12  SLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSR 71
           +L I +L+FS      + S   ++  +   F+K G      ++ D + P  +R++DL+S+
Sbjct: 5   ALIIGILLFSFPKELHSQSKQKIYHKNWVDFNKNG---KKDVYEDPTRPVDLRIEDLLSQ 61

Query: 72  MTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSNVG----------------- 110
           MTL+EK  Q+    +G  R+    LP  +W ++    G+ N+                  
Sbjct: 62  MTLEEKSCQMATL-YGFGRVLKDELPTPDWKNQIWKDGIGNIDEQLNNLAYHPSAVTDKA 120

Query: 111 --PGTHFDDV--------------IP--------------GATSFPTVILTTASFNESLW 140
             P  H   +              IP               ATSFP+ +   A++N++L 
Sbjct: 121 WPPSNHIKALNTIQEFFVEDTRLGIPVDFTNEGIRGLCHEKATSFPSQLGVGATWNKNLV 180

Query: 141 KKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYV 199
            KIG     EAR +      G T  +SP +++ARDPRWGR+ E  GEDP++VG      V
Sbjct: 181 GKIGHITGKEARLL------GYTNVYSPILDIARDPRWGRVVECYGEDPYLVGELGYQMV 234

Query: 200 RGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLR 259
           +G+Q                KV S  KH+A Y             DA +TE+++   +L 
Sbjct: 235 KGIQQE--------------KVVSTPKHFAIYSAPKGGRDGDARTDAHITERELFSLYLH 280

Query: 260 PFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN 319
           PF+  +K+  A  VM SYN  NG+P  +    LN  +R +W   GY+V+D  +++ + D 
Sbjct: 281 PFKRAIKDAGAMGVMSSYNDYNGVPVSSSKYFLNDILREDWGFKGYVVSDSRAVEFIADK 340

Query: 320 HKFLADSKEDAVAQTLKAGL----DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTV 375
           H    D K DAV Q + AGL    D    + +       V++G +    ID  ++ +  V
Sbjct: 341 HHVAKDRK-DAVRQAVLAGLNVRTDFTMPEDFILPVRELVKEGGLDMATIDDRVRDILRV 399

Query: 376 LMRLGFFDGSPQYVSLGKQDICSDENI------ELAAEAAREGIVLLKNDQNTLPLNSAK 429
               G FD        GKQ   +D+ +      E+A +A+ E IVLLKN++N LPL+ +K
Sbjct: 400 KFWQGLFDA-----PYGKQMKEADKTVGKPEYQEVAYQASLESIVLLKNEENILPLDFSK 454

Query: 430 VKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----FSGYANVTYKTGC---DDVACK 482
            K+V V GP+A A    +  Y       +S   G    F     + Y  GC   D+    
Sbjct: 455 YKSVLVTGPNAKAINHSVSRYGPSHIDVVSVFDGIKEKFPKDVEIKYTKGCVFFDENWPD 514

Query: 483 S-----------NNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLIN 531
           S            + I  A   AKT    I++ G D     ES  R  L LPG Q +L+ 
Sbjct: 515 SELMNTPPTEAEQSEIDKAVAMAKTVGLAIVVLGDDEETVGESRSRTSLDLPGNQQKLVE 574

Query: 532 QVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGG 591
           ++ +    PVI+V+++   + I +   +  +  I+   + G+ GG AIADV+ G +NPGG
Sbjct: 575 EIYKTGT-PVIVVLINGRPMTINWV--DKYVPGIVEGWFQGKFGGSAIADVLVGSYNPGG 631

Query: 592 RLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR----TYKFYNGPTLYPFGYGLSYTQFK 647
           +LP+++     V  LP+ + P +P      P +    + K   G  LYPFGYGLSYT F+
Sbjct: 632 KLPVSFPK--TVGQLPM-NFPSKPGAQADQPAKGPNGSGKTRVGGFLYPFGYGLSYTTFE 688

Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
           Y  L     I+  L                      V+V+           VD  N G  
Sbjct: 689 YTNLKIRSNIKNGLGD--------------------VVVS-----------VDITNSGKR 717

Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
            G ++V +Y          Y KQ+ GF+R+ + AG  K + F   + + L++ +     +
Sbjct: 718 KGDEIVQLYFSDETSSVTVYEKQLRGFERISLEAGETKTVNFTL-SPEDLSLYNRQMEFV 776

Query: 768 LPAGEHTIFVGNGGVSFPIHLNFN 791
           L  G  TI +G+      IH++ N
Sbjct: 777 LEPGSFTIMIGSSAED--IHVSGN 798


>gi|427383551|ref|ZP_18880271.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
           12058]
 gi|425728735|gb|EKU91590.1| hypothetical protein HMPREF9447_01304 [Bacteroides oleiciplenus YIT
           12058]
          Length = 939

 Score =  281 bits (720), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 231/808 (28%), Positives = 367/808 (45%), Gaps = 142/808 (17%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
           ++ D +     R++DL+S+MTLDEK  Q+    +G  R+    LP  EW    W      
Sbjct: 49  VYEDPNATLDARIEDLLSQMTLDEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGA 107

Query: 101 --EALHGVSNVG-PGTHFDDVIPG------------------------------------ 121
             E L+G    G P +   +V P                                     
Sbjct: 108 IDEHLNGFQQWGLPPSDNPNVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGVES 167

Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
             AT+FPT +    ++N  L  ++G     EAR +      G T  ++P ++V RD RWG
Sbjct: 168 YRATNFPTQLGLGHTWNRKLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWG 221

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  E  GE P++V    +  VRG+Q    H +         +V++  KH+ AY  +    
Sbjct: 222 RYEEVYGESPYLVAELGIEMVRGMQ----HNH---------QVAATGKHFVAYSNNKGAR 268

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
                 D +++ +++E   + PF+  +KE     VM SYN  +GIP       L + +RG
Sbjct: 269 EGMARVDPQMSPREVEMIHVYPFKRVIKEAGMLGVMSSYNDYDGIPIQGSYYWLTKRLRG 328

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
           E    GY+V+D D+++ +   H    D KE AV Q+++AGL++ C       Y       
Sbjct: 329 EMGFRGYVVSDSDAVEYLYTKHSTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLREL 387

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG-KQDICSDENIELAAEAAREGI 413
           V++G + E  I+  ++ +  V   +G FD   Q    G  +++   EN  +A +A+RE +
Sbjct: 388 VKEGGLSEDIINDRVRDILRVKFLIGLFDAPYQTDLAGADKEVEKAENEAVALQASRESL 447

Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGYAN 469
           +LLKN+ N LPL+   +KT+AV GP+AN     + +Y  +    ++ + G      G A 
Sbjct: 448 ILLKNENNVLPLDINNIKTIAVCGPNANEEGYALTHYGPLAVEVITVLEGIRQKAEGKAE 507

Query: 470 VTYKTGCDDVAC--------------KSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
           V Y  GCD V                +    I  A E A+ AD  +++ G       E+ 
Sbjct: 508 VLYAKGCDLVDANWPESELIEYPMTNEEQAEINKAVENARKADVAVVVLGGGQRTCGENK 567

Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
            R  L LPG Q +L+  V    K PV+LV+++   + I +A+    + AIL   YPG +G
Sbjct: 568 SRSSLDLPGRQLKLLQAVQATGK-PVVLVLINGRPLSINWAD--KFVPAILETWYPGSKG 624

Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGR--TYKFYN 630
           G A+ADV+FG +NPGG+L +T+     V  +P  + P +P   +D    PG        N
Sbjct: 625 GTAVADVLFGDYNPGGKLTVTFPKS--VGQIPF-NFPCKPSSQIDGGKNPGPDGNMSRVN 681

Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
           G +LYPFGYGLSYT F+Y+ +  +  +               T++   T         +R
Sbjct: 682 G-SLYPFGYGLSYTTFEYSNIEISPKMM--------------TANQKAT---------VR 717

Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
           C           N G   G +VV +Y +       TY K + GF+RV ++ G  K + F+
Sbjct: 718 C--------KVTNTGKRAGDEVVQLYIRDMLSSVTTYEKNLAGFERVHLQPGETKEVTFI 769

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVG 778
            +  K L ++D     ++  G+ +I VG
Sbjct: 770 LDR-KHLELLDKHMEWVVEPGDFSIMVG 796


>gi|333379224|ref|ZP_08470948.1| hypothetical protein HMPREF9456_02543 [Dysgonomonas mossii DSM
           22836]
 gi|332885492|gb|EGK05741.1| hypothetical protein HMPREF9456_02543 [Dysgonomonas mossii DSM
           22836]
          Length = 745

 Score =  281 bits (720), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 221/758 (29%), Positives = 358/758 (47%), Gaps = 105/758 (13%)

Query: 65  VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQY-----EWWSEALHG--VSNVG------- 110
           V DL+ RMTL+EK+ Q   +  G   +  P       E+  + + G   + VG       
Sbjct: 35  VDDLLRRMTLEEKIGQTVLYTSGYDVITGPTVDPNYKEYLKKGMVGGIFNAVGADYTRSL 94

Query: 111 ------------PGTHFDDVIPGA-TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
                       P     DVI G  T FP  +  + S++    ++  +  ++EA A    
Sbjct: 95  QKIAVEETRLGIPLIFGYDVIHGQRTIFPIPLAESCSWDLEAMERSARIAASEATA---- 150

Query: 158 GRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
              G+ + ++P ++++RDPRWGR+ E  GED ++    A   V+G Q     +N + +N+
Sbjct: 151 --EGINWIYAPMVDISRDPRWGRVAEGAGEDVYLGSLIAAARVKGFQ----GDNLSAVNT 204

Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
               V +C KHYAAY      G D    D  + E  +  T+L PF+  +  G   ++M S
Sbjct: 205 ----VVACVKHYAAYGA-TMAGRDYNTVDMSLNE--LWNTYLPPFKAALDAG-CGTIMTS 256

Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
           +N +NGIP+  +  LL   +R +W+ +G++V D  SI  M+  H +  D K  A    + 
Sbjct: 257 FNDLNGIPATGNKYLLKDILRDKWNFNGFVVTDYTSINEMIP-HGYANDEKHSA-EIAMN 314

Query: 337 AGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQ- 394
           AG+D+D  G  Y N     +++GKV E D+ ++ + +  +  +LG F+   +Y    ++ 
Sbjct: 315 AGVDMDMQGGVYMNHLKTLIEEGKVSEKDVTEAARAILKIKYKLGLFEDPYRYCDANREK 374

Query: 395 -DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
            DI +  N E A + AR+ +VLLKND+ TLPL   K   VA++GP       ++G ++ +
Sbjct: 375 TDILTPANKEAARDMARKSMVLLKNDKQTLPLKENK--RVALIGPLVKDKYEILGCWSAM 432

Query: 454 PCRYMSPIAGFSGYA------NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
             R   P++ + G         ++Y  GCD +  +       A   A  +D  +++ G  
Sbjct: 433 GNRDTIPVSVYDGLVEAIGKDKISYAKGCD-IQSEDTKGFAEAVRVASASDVVVMVMGEF 491

Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
            ++  E+  R +L LPG Q  L+  + +  K PV+LV+M+   + I + + N  + AIL 
Sbjct: 492 HNMSGENNSRTNLSLPGVQVDLLKAIKKTGK-PVVLVLMNGRPLTINWEKDN--LDAILE 548

Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRP-VDSLG 620
           A +PG  GG AIADV+ GK+NP G+L +T+     V  +PL      T  P  P V    
Sbjct: 549 AWFPGTMGGAAIADVLTGKYNPSGKLTMTFPQN--VGQIPLFYNHKNTGRPYDPNVPQFA 606

Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
           Y  R +   N P LYPFGYGLSYT F Y+ L+ +       N L+               
Sbjct: 607 YGSRYWDVSNEP-LYPFGYGLSYTTFTYSDLTLSSKEITKENPLK--------------- 650

Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
                             V   N G  DG +VV +Y++         +K++ GF++VF++
Sbjct: 651 ----------------VSVKLTNSGEYDGEEVVQLYTRDLVGSVTRPVKELKGFKKVFLK 694

Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           AG +K I F   +   L   +     +   G+  +FVG
Sbjct: 695 AGESKVIDFTL-SVNDLRFYNSQLEYVYEPGDFHLFVG 731


>gi|238620766|ref|YP_002915592.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.4]
 gi|238381836|gb|ACR42924.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           M.16.4]
          Length = 755

 Score =  281 bits (720), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 210/705 (29%), Positives = 349/705 (49%), Gaps = 122/705 (17%)

Query: 118 VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRW 177
           ++  AT+FP  I   ++++  L +++   +  +A+ +           SP ++V RDPRW
Sbjct: 97  MVKTATAFPQAIGLASTWDPDLIREVSSTIRYQAKLI-----GTNQCLSPVLDVCRDPRW 151

Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNW 236
           GR  ET GED ++V    + YV+GLQ     EN         ++ +  KH+AA+   +  
Sbjct: 152 GRCEETYGEDQYLVASIGLAYVKGLQG----EN---------ELIATVKHFAAHGFPEGG 198

Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
           + +   H    V  +++ E FL PFE+ +K G A SVM +Y+ ++GIP  ++ +LL + +
Sbjct: 199 RNIAPVH----VGNRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTKIL 254

Query: 297 RGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD-----LDCGQYYTNFT 351
           R EW   G +V+D D+I+ +   HK   + KE A+   L+AG+D     +DC   +    
Sbjct: 255 RQEWGFEGIVVSDYDAIRQLEAIHKVSLNKKEAAIL-ALEAGVDTEFPNIDC---FGEPL 310

Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK--QDICSDENIELAAEAA 409
             AV++G + E+ ID++++ +  +  +LG F+    Y++     + + + ++ ELA + A
Sbjct: 311 LEAVKEGLISESIIDRAVERVLRIKEKLGLFND--HYINENNVPEKLDNSKSRELALDVA 368

Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY-------AGIPCRYMSPIA 462
           R+ IVLLKND N LPLN   + T+AV+GP+AN    ++G+Y       A +    ++ + 
Sbjct: 369 RKSIVLLKND-NILPLNK-NIGTIAVIGPNANEPRNLLGDYTYTGHLNADVGIEVVTVLE 426

Query: 463 GF----SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL----AGLDLS----- 509
           G     S   NV Y  GCD +A +S      A E AK  D  I +    +GL LS     
Sbjct: 427 GIMRKVSNNTNVLYAKGCD-IAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVP 485

Query: 510 ----------VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
                     V  E  DR  L LPG Q +L+ ++ +  K P+ILV+++  G  +A +   
Sbjct: 486 GKDEFEKYQAVTGEGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVN--GRPLALSSIF 542

Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---R 614
             + AI+ A +PGEEGG AIADV+FG +NP GRLPI++         P+ +  +P+   R
Sbjct: 543 NEVNAIIDAWFPGEEGGNAIADVIFGDYNPSGRLPISF---------PIDTGQIPIYYNR 593

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
              SL    R Y       L+PFGYGLSYT+FKY+ L  T                    
Sbjct: 594 KPSSL----RPYVMMKSKPLFPFGYGLSYTEFKYSNLEVTP------------------- 630

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                        ++      +  ++ +NVG  +G + V +Y        +  IK++ GF
Sbjct: 631 ------------KEVNSSGKIKISLEVENVGKREGEETVQLYISKQYSGVSRPIKELKGF 678

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            +V+++    ++I F     ++L   D     ++  G++ I +G 
Sbjct: 679 AKVYLKPNEKRKITFSL-PLEALAFYDQYMRLIIDTGDYEILIGK 722


>gi|433679952|ref|ZP_20511614.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
 gi|430814928|emb|CCP42243.1| beta-glucosidase [Xanthomonas translucens pv. translucens DSM
           18974]
          Length = 909

 Score =  281 bits (719), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 168/422 (39%), Positives = 237/422 (56%), Gaps = 44/422 (10%)

Query: 71  RMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVIL 130
           +MT +EKV Q  + A  +PRLG+P YEWW+E LHG++  G           AT FP  I 
Sbjct: 67  KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNG----------YATVFPQAIG 116

Query: 131 TTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARDPRWGRIT 181
             A++N +L +++G   STEARA +NL           AGLT WSPNIN+ RDPRWGR  
Sbjct: 117 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 176

Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
           ET GEDP++ G+ AV ++RGLQ         DL + P  +++  KH A   V +     R
Sbjct: 177 ETYGEDPYLTGQLAVGFIRGLQ-------GDDL-THPRTIATP-KHLA---VHSGPEPGR 224

Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
           + FD  V+  D+E T+   F   + +G A +VMC+YN ++G P+CA   LLN  +RG+W 
Sbjct: 225 HGFDVDVSPHDLEATYTPAFRAAIVDGRAGAVMCAYNSLHGTPACAADWLLNGRLRGDWG 284

Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVK 361
             G++V+DCD++  M   H F AD+   + A  LKAG DL+CG  Y +  G A+ +G   
Sbjct: 285 FTGFVVSDCDAVDDMTQFHYFRADNAGSSAA-ALKAGHDLNCGYAYRDL-GKAIARGDAD 342

Query: 362 ETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDENIELAAEAAREGIVLLK 417
           E  +D+SL  L+    RLG     PQ    Y  LG +D+ S  +  LA +AA++ IVLL+
Sbjct: 343 EAVLDQSLVRLFAARYRLGEL--QPQRKDPYARLGAKDVDSAAHRALALQAAQQSIVLLQ 400

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYKT 474
           N   TLPL       +AV+GP+A+A  A+  NY G     ++P+ G     G AN+ Y  
Sbjct: 401 NRNATLPLRPG--LRLAVIGPNADALAALEANYQGTSAAPVTPLLGLRERFGAANLRYAQ 458

Query: 475 GC 476
           G 
Sbjct: 459 GA 460



 Score =  129 bits (324), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 91/286 (31%), Positives = 140/286 (48%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR DL LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 646 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 704

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+ + +  AI+ A YPG+ GG AIA V+ G  NPGGRLP+T+Y          ++  L 
Sbjct: 705 WAKQHAD--AIVAAWYPGQSGGTAIAQVLAGDVNPGGRLPVTFYR---------STKDLP 753

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
              S    GRTY+++ G  L+ FG GLSYT+F Y         Q++   LQ   NL    
Sbjct: 754 AYVSYDMKGRTYRYFKGEPLFAFGSGLSYTRFTY------AAPQLSATTLQAGANL---- 803

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                                + +   +N G+  G +VV VY +PP + A + ++ ++GF
Sbjct: 804 ---------------------QVRTQVRNSGTRAGDEVVQVYLQPP-QGAQSPLRTLVGF 841

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV ++ G  + + F     + L+ VD A    +  G++ +FVG G
Sbjct: 842 QRVTLQPGEAREVGFELTP-RQLSDVDRAGQRAVQPGDYRVFVGGG 886


>gi|440733337|ref|ZP_20913088.1| beta-glucosidase [Xanthomonas translucens DAR61454]
 gi|440362904|gb|ELQ00083.1| beta-glucosidase [Xanthomonas translucens DAR61454]
          Length = 895

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 169/422 (40%), Positives = 236/422 (55%), Gaps = 44/422 (10%)

Query: 71  RMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVIL 130
           +MT +EKV Q  + A  +PRLG+P YEWW+E LHG++  G           AT FP  I 
Sbjct: 53  KMTREEKVAQAMNAAPAIPRLGVPAYEWWNEGLHGIARNG----------YATVFPQAIG 102

Query: 131 TTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARDPRWGRIT 181
             A++N +L +++G   STEARA +NL           AGLT WSPNIN+ RDPRWGR  
Sbjct: 103 LAATWNTALLEQVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWGRGM 162

Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
           ET GEDP++ G+ AV ++ GLQ         DL + P  +++  KH A   V +     R
Sbjct: 163 ETYGEDPYLTGQLAVGFIHGLQ-------GDDL-THPRTIATP-KHLA---VHSGPEPGR 210

Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
           + FD  V+  D+E T+   F   + +G A SVMC+YN ++G P+CA   LLN  +RG+W 
Sbjct: 211 HGFDVDVSPHDLEATYTPAFRAAIVDGRAGSVMCAYNALHGTPACAADWLLNGRLRGDWG 270

Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVK 361
             G++V+DCD++  M   H F AD+   + A  LKAG DL+CG  Y +  G A+ +G   
Sbjct: 271 FTGFVVSDCDAVDDMTQFHYFRADNAGSSAA-ALKAGHDLNCGYAYRDL-GKAIARGDAD 328

Query: 362 ETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDENIELAAEAAREGIVLLK 417
           E  +D+SL  L+    RLG     PQ    Y  LG +D+ S  +  LA +AA++ IVLL+
Sbjct: 329 EALLDQSLVRLFAARYRLGEL--QPQRKDPYAQLGAKDVDSAAHRALALQAAQQSIVLLQ 386

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYKT 474
           N   TLPL       +AV+GP+A+A  A+  NY G     ++P+ G     G ANV Y  
Sbjct: 387 NRNATLPLRPG--LRLAVIGPNADALAALEANYQGTSAAPVTPLLGLRERFGAANVRYAQ 444

Query: 475 GC 476
           G 
Sbjct: 445 GA 446



 Score =  129 bits (323), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 91/286 (31%), Positives = 139/286 (48%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR DL LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 632 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 690

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+ + +  AI+ A YPG+ GG AIA V+ G  NPGGRLP+T+Y          ++  L 
Sbjct: 691 WAKQHAD--AIVAAWYPGQSGGTAIAQVLAGDVNPGGRLPVTFYR---------STKDLP 739

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
              S    GRTY+++ G  L+ FG GLSYT+F Y         Q++   LQ   NL    
Sbjct: 740 AYVSYDMKGRTYRYFKGEPLFAFGSGLSYTRFTY------AAPQLSATTLQAGANL---- 789

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                                + +    N G+  G +VV VY +PP + A + ++ ++GF
Sbjct: 790 ---------------------QVRTQVSNSGTRAGDEVVQVYLQPP-QGAQSPLRTLVGF 827

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV ++ G  + + F     + L+ VD A    +  G++ +FVG G
Sbjct: 828 QRVTLQPGEAREVGFELTP-RQLSDVDRAGQRAVQPGDYRVFVGGG 872


>gi|206901921|ref|YP_002251428.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
 gi|206741024|gb|ACI20082.1| xylosidase/arabinosidase [Dictyoglomus thermophilum H-6-12]
          Length = 756

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 220/697 (31%), Positives = 338/697 (48%), Gaps = 110/697 (15%)

Query: 101 EALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA 160
           EALHG            +  G+T FP  I   +++N  L  ++  A+  E R+     R 
Sbjct: 138 EALHGC-----------MAKGSTIFPQAIGMASTWNPELIYQVATAIGKETRS-----RG 181

Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
                SP IN+ARDPR GR  ET GEDP++  R AV Y++G+Q+ +G             
Sbjct: 182 IHQVLSPTINIARDPRCGRTEETYGEDPYLASRMAVAYIKGVQE-QG------------- 227

Query: 221 VSSCCKHYAAYDVDNWKGVDRY--HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
           V +  KH+ A  V +  G D Y  HF  R+    + E +   F   ++E  A S+M +YN
Sbjct: 228 VIATPKHFVANFVGD-GGRDSYPIHFSERL----LREIYFPAFRASIEEAGALSLMAAYN 282

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
            ++GIP  ++  LL + +R EW   GY+V+D  S+  ++  HK +A+SK +A   +L+AG
Sbjct: 283 SLDGIPCSSNKWLLTRILRKEWGFKGYVVSDYFSVLHLMTKHK-VAESKAEAAKLSLEAG 341

Query: 339 LDL-----DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG---SPQYVS 390
           LD+     DC   +    G  +++ K+ +  +D++++ +  V   +G FD     P Y  
Sbjct: 342 LDMELPDSDC---FEEIPG-LIRESKLSQDTLDEAVRRVLRVKFWIGLFDNPFVDPDYAE 397

Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
             + + CS E+ ELA   ARE IVLLKN +  LPLN   ++++AV+GP  NA V  +G Y
Sbjct: 398 --RINDCS-EHRELALRVARESIVLLKN-EGILPLNK-DIRSIAVIGP--NAAVPRLGGY 450

Query: 451 AGIPCRYMSPIAGFSGY----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
           +G   + ++P+ G          V +  GC  +   S +    A + A+ +D  I+  G 
Sbjct: 451 SGYGVKVVTPLEGIKNKLGDKVKVYFAEGCG-LNDTSKSGFDEAIKIAQKSDVAILFMGN 509

Query: 507 DL-SVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
            +   E E  DR +L LPG Q  LI ++      PVI+V+++  G  I        ++A+
Sbjct: 510 SVPETEGEQRDRHNLNLPGVQEDLIKEICN-TNTPVIVVLIN--GSAITMMNWIDKVQAV 566

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL--TSMPLRPVDS-LGYP 622
           + A YPGEEGG AIADV+FG +NPGG+LPI++    Y   LPL     P   VD  +   
Sbjct: 567 IEAWYPGEEGGNAIADVLFGDYNPGGKLPISF--PKYSSQLPLYYNHKPSGRVDDYVDLR 624

Query: 623 GRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP 682
           G  Y       L+PFGYGLSYT FKY+                   NL  T +       
Sbjct: 625 GNQY-------LFPFGYGLSYTDFKYS-------------------NLRITPE------- 651

Query: 683 GVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
                ++  D       D +N+G   G +VV +Y        A  IK++  F+RV +  G
Sbjct: 652 -----EIPRDGEVVITFDIENIGKYKGDEVVQLYLHDEFASVARPIKELKRFERVTLDVG 706

Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             K + F  N  + L  +      ++  G   + +G+
Sbjct: 707 ERKTVSFKLNR-RDLEFLSMDMELVVEPGRFEVLIGS 742


>gi|380692929|ref|ZP_09857788.1| glycoside hydrolase family protein [Bacteroides faecis MAJ27]
          Length = 777

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 238/805 (29%), Positives = 367/805 (45%), Gaps = 153/805 (19%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSE-------- 101
           ++ D S P   RVKDL+S+M +DEK  Q+    +G  R+    LP  +W SE        
Sbjct: 31  IYEDPSAPIEERVKDLLSQMNMDEKTCQMATL-YGSGRVLADALPTEKWKSEIWKDGIGN 89

Query: 102 ------------------------ALHGV-------SNVGPGTHF-DDVIPG-----ATS 124
                                   A+H +       + +G    F ++ I G     AT 
Sbjct: 90  IDEEHNGLGKFGSEYAFPYAKHVKAIHDIQRWFVEETRLGIPVDFTNEGIRGVCHEKATF 149

Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
           FP      +++N+ L  +IG+  + EA A   LG   +  +SP +++A+DPRWGR  E  
Sbjct: 150 FPAQCGQGSTWNKELIARIGEVEAKEAVA---LGYTNI--YSPILDIAQDPRWGRAVECY 204

Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
           GEDP++VG+     ++ LQ                K+ +  KH+A Y +           
Sbjct: 205 GEDPYLVGQLGKQMIQSLQK--------------HKLVATPKHFAVYSIPVGGRDGGTRT 250

Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
           D  V  ++M   +L PF +  +E  A  VM SYN  +G P     + L Q +R EW   G
Sbjct: 251 DPHVAPREMRTLYLEPFRVAFQEAGALGVMSSYNDYDGEPITGSYRFLTQILRQEWGFKG 310

Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG---------NAV 355
           Y+V+D D+++ +   HK +AD+ E+AV Q++ AGL++      TNF+          +A+
Sbjct: 311 YVVSDSDAVEFISSKHK-VADNNEEAVVQSVNAGLNV-----RTNFSSPAGFIKPLRSAI 364

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK---QDICSDENIELAAEAAREG 412
            +GKV +  ID+ +  +  V   LG FD    Y   GK   + +   E+  +A EAAR+ 
Sbjct: 365 AKGKVSQATIDQRVSEILYVKFWLGLFDNP--YRGDGKLADKIVHCKEHQAVALEAARQS 422

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY----AGIPCRYMSPIAGFSGYA 468
           IVLLKN  N LPL    +K+VAV+GP+A+    +I  Y    A I   Y        G A
Sbjct: 423 IVLLKNQDNLLPLQKT-LKSVAVIGPNADEQKELICRYGPSNAPIKTVYKGIKEALPG-A 480

Query: 469 NVTYKTGCD--------------DVACKSNNSIFAASEAAKTADATI-ILAGLDLSVEAE 513
            V YK GC+              D+  K    +  A EAAK+A+  I +L G +++V  E
Sbjct: 481 KVVYKKGCEIVDPHFPESEVLPFDITPKEQQIMDEAIEAAKSAEVVIMVLGGSEVTVREE 540

Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
              R  L LPG Q +L+  V ++ K P ILV++      I +A+    + AIL A +PGE
Sbjct: 541 R-SRTSLDLPGRQEELLKAVCKLGK-PTILVMIDGRASSINYAK--KYVPAILHAWFPGE 596

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
             G+A+A+ +FG  NPGG+L +T+     V  +P  + P +P    G             
Sbjct: 597 FCGQAVAETIFGDNNPGGKLAVTFPKS--VGQIPF-AFPFKPGSDSGCGTSVTG-----A 648

Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
           L+PFG+GLSYT F+YN L  +   Q  L ++             K  C            
Sbjct: 649 LFPFGHGLSYTTFEYNNLKISPEQQGVLGEV-------------KVSC------------ 683

Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
                   +N G   G +VV +Y +       TY+K + GF+R+ ++    K++ F  + 
Sbjct: 684 ------TVKNTGKRPGDEVVQLYLRDEISSVTTYVKILRGFERITLQPNEEKKVTFTLSP 737

Query: 754 CKSLNIVDYAANTLLPAGEHTIFVG 778
            + L I D      +  G   + +G
Sbjct: 738 -QDLAIWDKNMKFQVEPGTFKVMIG 761


>gi|160884133|ref|ZP_02065136.1| hypothetical protein BACOVA_02110 [Bacteroides ovatus ATCC 8483]
 gi|423291392|ref|ZP_17270240.1| hypothetical protein HMPREF1069_05283 [Bacteroides ovatus
           CL02T12C04]
 gi|156110475|gb|EDO12220.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
 gi|392663392|gb|EIY56942.1| hypothetical protein HMPREF1069_05283 [Bacteroides ovatus
           CL02T12C04]
          Length = 735

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 222/768 (28%), Positives = 358/768 (46%), Gaps = 115/768 (14%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ D+  P   R+ DL+SRMTL+EK+ QL  +  G         E   E     S +G  
Sbjct: 29  LYKDAKAPIEKRIDDLISRMTLEEKILQLNQYTLGRNNNVNNVGE---EVKKVPSEIGSL 85

Query: 113 THFD---------------------------DVIPG-ATSFPTVILTTASFNESLWKKIG 144
            +FD                           D I G  T +P  +    S+N  L ++  
Sbjct: 86  IYFDINPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQAC 145

Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
              + EAR       +G+ + +SP I+VARDPRWGR+ E  GEDP+  G +A   VRG Q
Sbjct: 146 AVSAQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRGYQ 199

Query: 204 -DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
            D    EN         ++++C KHY  Y         R +    ++ Q + +T+L P+E
Sbjct: 200 GDDMSAEN---------RMAACLKHYVGYGASE---AGRDYVYTEISAQTLWDTYLLPYE 247

Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
           M VK G A+++M S+N ++G+P  A+P ++ + ++  W   G+IV+D  +++ +   ++ 
Sbjct: 248 MGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQG 304

Query: 323 LADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
           LA +K+DA      AGL++D   + Y  +    V++GKV    +D+S++ +  V  RLG 
Sbjct: 305 LAATKKDAARYAFNAGLEMDMMSHAYDRYLKELVEEGKVTMAQVDESVRRVLRVKFRLGL 364

Query: 382 FDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
           F+     V+  K      +++ +AA+ A E +VLLKND   LPL +   K +AVVGP A 
Sbjct: 365 FERPYTPVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLTNK--KKIAVVGPMAK 422

Query: 442 ATVAMIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA-ASEAA 494
               ++G++ G      +   Y    A F G A + Y  GC      ++ S FA A + A
Sbjct: 423 NGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKPQG--NDRSGFAGALDVA 480

Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           + +D  I+  G  L+   E+  R  + LP  Q +L+ ++ E  K P+ILV+  + G  + 
Sbjct: 481 RWSDVVIVCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PIILVL--SNGRPLE 537

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MP 612
                    AIL    PG  G R++A ++ G+ NP G+L +T          P ++  +P
Sbjct: 538 LNRMEPLCDAILEIWQPGINGARSMAGILSGRINPSGKLAMT---------FPYSTGQIP 588

Query: 613 L---RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
           +   R     G+ G  YK      LYPFG+GLSYT+FKY  +                  
Sbjct: 589 IYYNRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGTV------------------ 629

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
              T  A+K          ++  D    +V   N G+ DG++ V  +   P       +K
Sbjct: 630 ---TPSATK----------VKRGDKLSAEVTVTNTGARDGAETVHWFISDPYCSITRPVK 676

Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
           ++  F++ F++AG  K  +F  +  +    V+      L AGE+ I V
Sbjct: 677 ELKHFEKQFIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724


>gi|427385138|ref|ZP_18881643.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727306|gb|EKU90166.1| hypothetical protein HMPREF9447_02676 [Bacteroides oleiciplenus YIT
           12058]
          Length = 863

 Score =  280 bits (717), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 183/461 (39%), Positives = 246/461 (53%), Gaps = 51/461 (11%)

Query: 52  FLFCDSSLPY-------SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
           FL C S  PY         R  DLV R+TL+EK   + + +  +PRLG+  Y+WW+EALH
Sbjct: 18  FLSC-SQPPYKNPALTPEERAADLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALH 76

Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY-------NL 157
           GV   G           AT FP  I   ASFN  L   +  AVS EARA          L
Sbjct: 77  GVGRAGL----------ATVFPQAIGMGASFNNDLLYDVFTAVSDEARAKTAEFSKEGGL 126

Query: 158 GR-AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
            R  GLT W+PN+N+ RDPRWGR  ET GEDP++ G+  +  VRGLQ  EG         
Sbjct: 127 KRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEG--------G 178

Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMC 275
           +  K+ +C KH+A +    W   +R+ FDA  V  +D+ ET+L  F+  V++     VMC
Sbjct: 179 KYDKLHACAKHFAVHSGPEW---NRHSFDAENVDPRDLWETYLPAFKDLVQKAHVKEVMC 235

Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN--HKFLADSKEDAVAQ 333
           +YNR  G P C   +LL Q +R EW   G IV+DC +I    +   H+   D KE A A+
Sbjct: 236 AYNRFEGEPCCGSNRLLVQILRDEWAYDGIIVSDCWAINDFFNKGAHETEPD-KEHASAK 294

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK 393
            +  G D++CG+ Y +    AV+ G + E  ID SLK L      LG  D +P+ VS  +
Sbjct: 295 AVLTGTDVECGESYASLP-QAVKAGLIDEKKIDISLKRLMKARFELGEMD-NPELVSWAQ 352

Query: 394 ---QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
                + S E+ ELA   ARE +VLL+N+QN LPLN  K   VAVVGP+AN +V   GNY
Sbjct: 353 IPYSVVDSKEHRELALRMARESLVLLQNNQNVLPLN--KSLKVAVVGPNANDSVMQWGNY 410

Query: 451 AGIPCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
            G P   ++ + G   Y   A + Y+ GCD  +  +  S+F
Sbjct: 411 NGFPGHTVTLLEGIRQYLPEAQLIYEPGCDLTSDVTLQSVF 451



 Score =  107 bits (266), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 83/298 (27%), Positives = 128/298 (42%), Gaps = 56/298 (18%)

Query: 492 EAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPV 541
           +  K AD  +   G+  +VE E +          DRE + LP  Q++L+   AE+ K   
Sbjct: 595 QRVKDADIIVFAGGISPAVEGEEMRVTIPGFKGGDRETIELPSIQSRLL---AELKKAGK 651

Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
            +V ++  G  IA         AIL A YPG+ GG AIA+V+FG +NP GRLP+T+Y   
Sbjct: 652 KVVFVNFSGSAIALTPETKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTFYK-- 709

Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
                  ++  L   +     GRTY++     L+PFG+GLSYT F+Y             
Sbjct: 710 -------STSQLPDFEDYSMKGRTYRYMAEAPLFPFGHGLSYTTFRY------------- 749

Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
                        DAS      +   +++  +     +   N G  DG +VV VY + P 
Sbjct: 750 ------------GDAS------LSTQEVKEGEQAILTIPVSNTGERDGEEVVQVYLRRPG 791

Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVG 778
           +        +  F+RV +  G    +    +  +     D   NT+ P  G++ I  G
Sbjct: 792 DKEGPS-HALRAFKRVNIAKGTTGNVTISLSK-EDFEWFDTETNTMRPIEGDYEILYG 847


>gi|218262493|ref|ZP_03476939.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
           DSM 18315]
 gi|218223341|gb|EEC95991.1| hypothetical protein PRABACTJOHN_02617 [Parabacteroides johnsonii
           DSM 18315]
          Length = 868

 Score =  280 bits (717), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 165/452 (36%), Positives = 235/452 (51%), Gaps = 44/452 (9%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           +   + F +  LP   R+ DL+ R+T +EKV Q+ +    + RLG+PQY+WW+EALHGV+
Sbjct: 22  RQEDYPFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEALHGVA 81

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
             G           AT FP  I   A+F++    +    VS EARA Y+  +        
Sbjct: 82  RAGK----------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKDKEYDRY 131

Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT+W+PNIN+ RDPRWGR  ET GEDP++  R  V  V+GLQ           + +  
Sbjct: 132 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGD---------DPKYF 182

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           K  +C KHYA +    W   +R+ FD  VT +D+ +T+L  FE  VKEG+   VMC+YNR
Sbjct: 183 KTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKEGNVQEVMCAYNR 239

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD------NHKFLADSKEDAVAQ 333
             G P C+  KLL   +R  W     I++DC +I    +       H+   D+ E A A 
Sbjct: 240 YQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWERDERTPRHETHPDA-ESASAD 298

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
            +  G DL+CG  Y      A++ GK+ E D+D SL+ L      LG FD   Q  Y  +
Sbjct: 299 AVLNGTDLECGNSYRALV-KALKDGKISENDLDVSLRRLLKGRFELGMFDPDEQVPYAQI 357

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
               + S E++  A E A + +VLLKN  NTLPL S  ++ +AVVGP+A  +  +  NY 
Sbjct: 358 PYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPL-SKTIRKIAVVGPNAADSTMLWANYN 416

Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
           G P   ++ + G         V Y+ GC+  A
Sbjct: 417 GFPTHTVTILEGIRNKVPDTEVIYELGCNHAA 448



 Score =  114 bits (284), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 85/270 (31%), Positives = 124/270 (45%), Gaps = 56/270 (20%)

Query: 490 ASEAAKTADATIIL--AGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVA 537
           A+ AAK  DA +I+   G+   +E E +          DR ++ LP  Q +++  +    
Sbjct: 596 AATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKVQQEMVKALKATG 655

Query: 538 KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW 597
           K PV+ V+ +   + + + E N  I AIL A Y G+E G A+AD++FG +NP GRLP+T+
Sbjct: 656 K-PVVYVLCTGSALALNWEEAN--IDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVTF 712

Query: 598 YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
           Y    +  LP         +     GRTY++     LYPFGYGLSYT F Y         
Sbjct: 713 YKS--IDQLP-------DFEDYSMKGRTYRYMTETPLYPFGYGLSYTNFAY--------- 754

Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
                     RN   +S        G +  D      F    D  N G  DG +V  +Y 
Sbjct: 755 ----------RNAKLSS--------GKIAKDQSVTLTF----DIANTGKMDGDEVAQIYI 792

Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
           K P +     IK +  F RV V+AG ++ +
Sbjct: 793 KNPNDPEGP-IKALKAFLRVHVKAGDSQEV 821


>gi|408824590|ref|ZP_11209480.1| Glucan 1,4-beta-glucosidase [Pseudomonas geniculata N1]
          Length = 897

 Score =  280 bits (716), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 172/445 (38%), Positives = 248/445 (55%), Gaps = 41/445 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D S  +  R   LV++MTL+EK  Q+ + A  + RLG+P Y+WW+E LHGV+  G   
Sbjct: 38  WLDVSASFEQRAAALVAQMTLEEKAAQMQNAAPAIERLGVPAYDWWNEGLHGVARAGQ-- 95

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-------GR-AGLTYW 165
                   AT FP  I   A+F+  L  ++   +S EARA ++        GR  GLT+W
Sbjct: 96  --------ATVFPQAIGLAATFDVPLMGQVAATISDEARAKHHQFLREGAHGRYQGLTFW 147

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPN+N+ RDPRWGR  ET GEDP++  R  V +VRGLQ         D   R  K+ +  
Sbjct: 148 SPNVNIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQ-------GDDPVYR--KLDATA 198

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH A   V +    DR+HFDAR + +D+ +T+L  FE  VKEGD  +VM +YNRV G  +
Sbjct: 199 KHLA---VHSGPEADRHHFDARPSRRDLYDTYLPAFEALVKEGDVDAVMGAYNRVYGESA 255

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
            A   LL   +R +W   GY+V+DC +I V +  H  +  ++E A A  ++ G +L+CGQ
Sbjct: 256 SASRFLLRDVLRRDWGFKGYVVSDCWAI-VDIWKHHRIVTTREAAAALAVRNGTELECGQ 314

Query: 346 YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE---NI 402
            Y     +AV+QG + E +ID ++  L+T  MRLG FD  P+ V   +     ++   + 
Sbjct: 315 EYATLP-SAVRQGLISEAEIDDAVTRLFTARMRLGMFD-PPERVRWARIPASVNQAPAHD 372

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            LA +AA+  +VLLKND   LPL S   + +AVVGP A+ T+A++GNY G P   ++ + 
Sbjct: 373 ALALKAAQASLVLLKND-GILPL-SRNTRRIAVVGPTADDTMALLGNYFGTPAAPVTILQ 430

Query: 463 GFSGYAN---VTYKTGCDDVACKSN 484
           G    A    V Y  G D V  + +
Sbjct: 431 GIREAAKGVEVRYARGVDLVEGRDD 455



 Score =  126 bits (316), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 87/289 (30%), Positives = 127/289 (43%), Gaps = 54/289 (18%)

Query: 501 IILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
           + + GL   VE E +          DR DL LP  Q  L+  +    K PV++V+   GG
Sbjct: 633 VFVGGLTGDVEGEEMTVNYPGFAGGDRTDLRLPAPQRTLLEALHGTGK-PVVMVLT--GG 689

Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
             IA      ++ AIL + YPG+ GG A+   +FG  NP GRLP+T+Y           +
Sbjct: 690 SAIAVDWAQAHLPAILMSWYPGQRGGTAVGQALFGDVNPSGRLPVTFYKAG-------EA 742

Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
           MP    D     GRTY+++ G  LYPFG+GLSYT+F Y  L                   
Sbjct: 743 MPA--FDDYAMEGRTYRYFRGTPLYPFGHGLSYTRFDYGTLRLD---------------- 784

Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
                           + LR D      VD  N G+  G +VV +Y +     +   +++
Sbjct: 785 ---------------ADSLRADGRLGVAVDVANTGTRSGDEVVQLYVRREHAGSGDAVQE 829

Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA-ANTLLPAGEHTIFVG 778
           + GFQRV +  G  + + F   A ++L   D A A   +  G + + VG
Sbjct: 830 LRGFQRVQLAPGERRTVTFTLEAAQALRHYDEARAAYAVQPGAYEVRVG 878


>gi|237712573|ref|ZP_04543054.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
 gi|345512524|ref|ZP_08792050.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
 gi|423239901|ref|ZP_17221016.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
           CL03T12C01]
 gi|229435409|gb|EEO45486.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
 gi|229453894|gb|EEO59615.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
 gi|392644890|gb|EIY38624.1| hypothetical protein HMPREF1065_01639 [Bacteroides dorei
           CL03T12C01]
          Length = 788

 Score =  280 bits (716), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 237/820 (28%), Positives = 369/820 (45%), Gaps = 157/820 (19%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSN 108
           L+ +   P   RV+DL+S+MTL+EK  Q+    +G  R+    LPQ  W +E    G+ N
Sbjct: 42  LYENPKAPLEERVQDLLSQMTLEEKSCQMATL-YGSGRVLKDALPQDNWKTEVWKDGIGN 100

Query: 109 V-----GPGT-----------HFD-----------------------DVIPG-----ATS 124
           +     G GT           H D                       + I G     AT 
Sbjct: 101 IDEEHNGLGTFKSEYSFPYTKHVDAKHAIQRWFVEETRLGIPVDFTNEGIRGLCHDRATY 160

Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
           FP      A++N+ L  +IG+  + EA A   LG   +  +SP +++A+DPRWGR  ET 
Sbjct: 161 FPAQCGQGATWNKELIARIGEVEAKEAVA---LGYTNI--YSPILDIAQDPRWGRCVETY 215

Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
           GEDP++VG      +  LQ         +L + P       KH+A Y +       +   
Sbjct: 216 GEDPYLVGELGKQMITSLQK-------HNLVATP-------KHFAVYSIPVGGRDGKTRT 261

Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
           D  V  ++M   ++ PF M  +E  A  VM SYN  +G P       L + +R EW   G
Sbjct: 262 DPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 321

Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAV 355
           Y+V+D ++++ +   HK +A++ ED +AQ + AGL++      T+FT           AV
Sbjct: 322 YVVSDSEAVEFISSKHK-VANTYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAV 375

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD---ICSDENIELAAEAAREG 412
             GK+ +  +DK +  +  V   LG FD    Y   GKQ    + S E+  ++ EAAR+ 
Sbjct: 376 ADGKISQETLDKRVAEILRVKFWLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQS 433

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY--AGIPCRYM-SPIAGFSGYAN 469
           +VLLKN+ N LPL S  ++++AV+GP+A+    +I  Y  A  P + +   I     +  
Sbjct: 434 LVLLKNEMNLLPL-SKSLRSIAVIGPNADERTQLICRYGPANAPIKTVYQGIKERLPHTE 492

Query: 470 VTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATI-ILAGLDLSVEAES 514
           V Y+ GCD +                +    +  A  AAK A+  + +L G +L+V  E 
Sbjct: 493 VIYRKGCDIIDPHFPESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGNELTVR-ED 551

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
             R  L LPG Q +L+  V    K PV+LV++      I +A    ++ AIL A +PGE 
Sbjct: 552 RSRTSLNLPGRQEELLKAVCATGK-PVVLVLLDGRASSINYAA--AHVPAILHAWFPGEF 608

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
            G+A+A+ +FG +NPGGRL +T+     V  +P  + P +P          Y       L
Sbjct: 609 CGQAVAEALFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDESSSTSVYG-----VL 660

Query: 635 YPFGYGLSYTQFKYNLLSFT---KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
           YPFG+GLSYT F Y  L  +   + +Q ++N                          + C
Sbjct: 661 YPFGHGLSYTTFSYGDLKISPLRQGVQGDIN--------------------------ISC 694

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
                     +N G   G +VV +Y +       TY K + GF+R+ + AG  + + F  
Sbjct: 695 --------KIKNTGKIKGDEVVQLYLRDEVSSVTTYTKVLRGFERISLEAGEEQMVHFRL 746

Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
              + L + D   N  +  G+  + +G+      +H  F 
Sbjct: 747 RP-QDLGLWDKNMNFRVEPGKFKVMIGSSSTDIRLHGRFE 785


>gi|298386950|ref|ZP_06996504.1| beta-glucosidase [Bacteroides sp. 1_1_14]
 gi|298260100|gb|EFI02970.1| beta-glucosidase [Bacteroides sp. 1_1_14]
          Length = 846

 Score =  280 bits (715), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 165/442 (37%), Positives = 255/442 (57%), Gaps = 42/442 (9%)

Query: 42  FSKLGL-QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWS 100
           F +LG+  M+  L+ + + P   R++DL+S++T++EK+  L   + G+ R+G+ +Y   +
Sbjct: 10  FMQLGVVSMAQDLYKNMNAPIHERIQDLLSKLTIEEKISLLRATSPGIERMGIDKYYMGN 69

Query: 101 EALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA 160
           EALHG+  + PG          T FP  I   + +N  L   I   +S EARA +N    
Sbjct: 70  EALHGI--IRPGKF--------TVFPQAIGLASMWNPELHHIIASVISDEARARWNELER 119

Query: 161 G----------LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
           G          LT+WSP +N+ARDPRWGR  ET GEDP++ G     +V+GLQ       
Sbjct: 120 GKKQKDQFSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQGD----- 174

Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
               + R LK  S  KH+AA + ++    +R++ DA +TE DM E +L  FE C++EG A
Sbjct: 175 ----HPRYLKSVSTPKHFAANNEEH----NRFYCDAAITETDMREYYLPAFEKCIREGKA 226

Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
            S+M +YN +NG+P  A+  LLN+ ++ +W  +GYIV+DC +  +++ +H+++  + E A
Sbjct: 227 ESIMTAYNAINGVPCTANNWLLNKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVK-TPEAA 285

Query: 331 VAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-- 387
               +KAGLDL+CG Y +     NA +Q  V   +ID +  ++    MRLG FD   +  
Sbjct: 286 AMIAIKAGLDLECGDYVFGAPLLNAYKQYMVSTAEIDSAAYHVLRARMRLGMFDDPEKNP 345

Query: 388 YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMI 447
           Y  L  + +  +++ ELA EAAR+ IVLLKN +NTLPLN+ K+K++AVVG   NA     
Sbjct: 346 YNHLSPEIVGCEKHKELALEAARQSIVLLKNQKNTLPLNAKKIKSIAVVG--INAANCEF 403

Query: 448 GNYAGIPCRYMSPIAGFSGYAN 469
           G+Y+G P    +P++   G  N
Sbjct: 404 GDYSGTPVN--APVSVLDGIRN 423



 Score =  126 bits (316), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 92/289 (31%), Positives = 135/289 (46%), Gaps = 46/289 (15%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           AS+  + +D  I + G++ S+E E  DR  + LP  Q   I +  +    P  +V++ AG
Sbjct: 590 ASKVIRESDVVIAVMGINQSIEREGQDRSSIELPKDQQIFIREAYKA--NPNTIVVLVAG 647

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
              +A    + NI AI+ A YPGE+GG AIA+V+FG +NP GRLP+T+YN   ++ LP  
Sbjct: 648 S-SMAVGWMDQNIPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFYNS--IEDLPAF 704

Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
           +      D      RTY ++ G  LY FGYGLSYT+F Y                   RN
Sbjct: 705 N------DYNVKNNRTYMYFEGKPLYAFGYGLSYTKFDY-------------------RN 739

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
           LN   D+                    F V  +N G  +G +V  VY + P     T +K
Sbjct: 740 LNIKQDSQNIT--------------LNFSV--KNSGKYNGDEVAQVYVQFPDLGIKTPLK 783

Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           Q+ GF+RV ++ G  ++I       +     D       P+G +   VG
Sbjct: 784 QLKGFKRVHIKKGATEQISIEIPKEELRLWDDQKKQFYTPSGTYNFMVG 832


>gi|320105647|ref|YP_004181237.1| glycoside hydrolase family protein [Terriglobus saanensis SP1PR4]
 gi|319924168|gb|ADV81243.1| glycoside hydrolase family 3 domain protein [Terriglobus saanensis
           SP1PR4]
          Length = 885

 Score =  280 bits (715), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 178/450 (39%), Positives = 244/450 (54%), Gaps = 42/450 (9%)

Query: 42  FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE 101
           F + GL      + D +L    R +DLV RMTL+EK  Q+ + A  + RLG+P Y++WSE
Sbjct: 19  FGQSGLAQKP-AYLDPTLSPPARARDLVHRMTLEEKTAQMINTAPAIDRLGVPAYDFWSE 77

Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA- 160
            LHGV+  G           AT FP  I   A+++E L  +IG  VSTEARA YN     
Sbjct: 78  GLHGVARSG----------YATLFPQAIGMAATWDEPLMHEIGTVVSTEARAKYNDAVQH 127

Query: 161 -------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
                  GLT WSPNIN+ RDPRWGR  ET GEDPF+  R    +VRG+Q         D
Sbjct: 128 GVHSIYFGLTIWSPNINIFRDPRWGRGQETYGEDPFLTARMGTAFVRGIQ-------GDD 180

Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
            N    +  +  KH+A   V +     R+ F+  V++ D+ +T+L  F   + EG A S+
Sbjct: 181 PNY--FRTIATPKHFA---VHSGPESTRHTFNVDVSQHDLWDTYLPAFRSTIIEGKADSI 235

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN--HKFLADSKEDAV 331
           MC+YNR++G P+CA   LL Q +RG+W   G++ +DC +I        H F +  KEDA 
Sbjct: 236 MCAYNRIDGQPACASDLLLKQILRGDWGFRGFVTSDCGAIDDFYTKIGHHF-SKEKEDAS 294

Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YV 389
           A  +KAG D  CG+ Y   T +AV+ G + E ++D SL+ L+   +RLG FD   +  Y 
Sbjct: 295 AAGVKAGTDTACGKTYLGLT-SAVKSGLITEHEMDISLERLFEARIRLGLFDDPARMPYA 353

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
            L   ++ S  +  LA  AARE IVLLKN  N LPL+   VK +AV+GP+A +  A+ GN
Sbjct: 354 RLTMAEVNSPAHRALALRAARESIVLLKNANNLLPLHG--VKNIAVIGPNAASLDALEGN 411

Query: 450 YAGIPCRYMSPIAGFSGY---ANVTYKTGC 476
           Y  I      P+ G +     A V Y  G 
Sbjct: 412 YNAIARDPAMPVDGIAAAFPGAKVVYAQGA 441



 Score =  117 bits (292), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 87/292 (29%), Positives = 132/292 (45%), Gaps = 59/292 (20%)

Query: 498 DATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMS 547
           D  +   GL   +E E +          DR D+ LP  Q +L+  V    K P+I+V+M+
Sbjct: 620 DVVVAFVGLSPELEGEEMPIKVKGFAGGDRTDIELPQTQLELLRAVKATGK-PLIVVLMN 678

Query: 548 AGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP 607
              + +  +ET+    A+L A YPGE G +AIA+ + GK NP GRLP+T+Y+   +  LP
Sbjct: 679 GSAIALKDSETD----ALLEAWYPGEAGAQAIAETLAGKNNPSGRLPLTFYSN--IDQLP 732

Query: 608 LTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC 667
                    D      RTY+++ G  LY FG GLSYT F+Y  +S + T           
Sbjct: 733 A-------FDDYSMANRTYRYFKGQPLYAFGGGLSYTTFRYGKVSLSAT----------- 774

Query: 668 RNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP-AEIAAT 726
                                L   +    + +  N G   G +V  VY  PP   IA  
Sbjct: 775 --------------------HLHAGEDLTVEAEVTNTGKVAGDEVAQVYLTPPQTSIAPR 814

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           +   ++G+QRV +  G++K ++F  +  + L+ VD        AG + I VG
Sbjct: 815 F--ALVGYQRVHLLPGQSKPMRFTLHP-RELSQVDAQGVRAASAGHYEIKVG 863


>gi|227828570|ref|YP_002830350.1| glycoside hydrolase [Sulfolobus islandicus M.14.25]
 gi|229585800|ref|YP_002844302.1| glycoside hydrolase family protein [Sulfolobus islandicus M.16.27]
 gi|227460366|gb|ACP39052.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           M.14.25]
 gi|228020850|gb|ACP56257.1| glycoside hydrolase family 3 domain protein [Sulfolobus islandicus
           M.16.27]
          Length = 755

 Score =  279 bits (714), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 210/705 (29%), Positives = 348/705 (49%), Gaps = 122/705 (17%)

Query: 118 VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRW 177
           ++  AT+FP  I   ++++  L +++   +  +A+ +           SP ++V RDPRW
Sbjct: 97  MVKTATAFPQAIGLASTWDPDLIREVSSTIRYQAKLI-----GTNQCLSPVLDVCRDPRW 151

Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNW 236
           GR  ET GED ++V    + YV+GLQ     EN         ++ +  KH+AA+   +  
Sbjct: 152 GRCEETYGEDQYLVASIGLAYVKGLQG----EN---------ELIATVKHFAAHGFPEGG 198

Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
           + +   H    V  +++ E FL PFE+ +K G A SVM +Y+ ++GIP  ++ +LL + +
Sbjct: 199 RNIAPVH----VGNRELREVFLFPFEVAIKLGKAMSVMPAYHEIDGIPCHSNAELLTKIL 254

Query: 297 RGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD-----LDCGQYYTNFT 351
           R EW   G +V+D D+I+ +   HK   + KE A+   L+AG+D     +DC   +    
Sbjct: 255 RQEWGFEGIVVSDYDAIRQLEAIHKVSLNKKEAAIL-ALEAGVDTEFPNIDC---FGEPL 310

Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK--QDICSDENIELAAEAA 409
             AV++G + E+ ID++++ +  +  +LG F+    Y++     + + + ++ ELA + A
Sbjct: 311 LEAVKEGLISESIIDRAVERVLRIKEKLGLFNN--HYINENNVPEKLDNSKSRELALDVA 368

Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY-------AGIPCRYMSPIA 462
           R+ IVLLKND N LPLN   + T+AV+GP+AN    ++G+Y       A      ++ + 
Sbjct: 369 RKSIVLLKND-NILPLNK-NIGTIAVIGPNANEPRNLLGDYTYTGHLNADGGIEVVTVLE 426

Query: 463 GF----SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL----AGLDLS----- 509
           G     S   NV Y  GCD +A +S      A E AK  D  I +    +GL LS     
Sbjct: 427 GIMRKVSNNTNVLYAKGCD-IAAESKEGFSEAIEIAKKGDIIIAVMGEKSGLPLSWTDVP 485

Query: 510 ----------VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
                     V  E  DR  L LPG Q +L+ ++ +  K P+ILV+++  G  +A +   
Sbjct: 486 GKDEFEKYQAVTGEGNDRTSLRLPGVQEELLKELHKTGK-PIILVLVN--GRPLALSSIF 542

Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---R 614
             + AI+ A +PGEEGG AIADV+FG +NP GRLPI++         P+ +  +P+   R
Sbjct: 543 NEVNAIIDAWFPGEEGGNAIADVIFGDYNPSGRLPISF---------PIDTGQIPIYYNR 593

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
              SL    R Y       L+PFGYGLSYT+FKY+ L  T                    
Sbjct: 594 KPSSL----RPYVMMKSKPLFPFGYGLSYTEFKYSNLEVTP------------------- 630

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                        ++      +  ++ +NVG  +G + V +Y        +  IK++ GF
Sbjct: 631 ------------KEVNSSGKIKISLEVENVGKREGEETVQLYISKQYSGVSRPIKELKGF 678

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            +V+++    ++I F     ++L   D     ++  G++ I +G 
Sbjct: 679 AKVYLKPNEKRKITFSL-PLEALAFYDQYMRLIIDTGDYEILIGK 722


>gi|298387490|ref|ZP_06997042.1| beta-glucosidase [Bacteroides sp. 1_1_14]
 gi|298259697|gb|EFI02569.1| beta-glucosidase [Bacteroides sp. 1_1_14]
          Length = 853

 Score =  279 bits (714), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 165/430 (38%), Positives = 247/430 (57%), Gaps = 41/430 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ + + P   RV DL+SR+T++EK+  L   + G+PRLG+ +Y   +EALHGV  V PG
Sbjct: 29  LYKNENAPVHERVMDLISRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG 86

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
                     T FP  I   A++N  L K++   +S EARA +N    G          L
Sbjct: 87  RF--------TVFPQAIGLAATWNPELQKRVATVISDEARARWNELDQGREQKEQFSDVL 138

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+WSP +N+ARDPRWGR  ET GEDPF+ G     +V GLQ  + H          LK+ 
Sbjct: 139 TFWSPTVNMARDPRWGRTPETYGEDPFLSGIMGTAFVNGLQGDDPHY---------LKIV 189

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S  KH+AA + ++    +R+  + +++E+ + E +   FEMCVKEG A+S+M +YN +N 
Sbjct: 190 STPKHFAANNEEH----NRFVCNPQISEKQLREYYFPAFEMCVKEGKAASIMSAYNALND 245

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P   +  LL + +R +W   GY+V+DC    ++V+ HK++  +KE A   ++KAGLDL+
Sbjct: 246 VPCTLNSWLLQKVLRQDWGFQGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKAGLDLE 304

Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
           CG   Y     NA +Q  V + DID +  ++ T  M+LG FD   +  Y  +    I S 
Sbjct: 305 CGDDVYDGPLLNAYKQYMVSDADIDSAACHVLTARMKLGLFDSGERNPYTKISPSVIGSK 364

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           E+ ++A +AAR+ IVLLKN +N LPLN+ K+K++AVVG   NA     G+Y+G P   + 
Sbjct: 365 EHQQIALDAARQCIVLLKNQKNRLPLNADKLKSIAVVG--INAGKCEFGDYSGAPV--VE 420

Query: 460 PIAGFSGYAN 469
           P++   G  N
Sbjct: 421 PVSILQGIRN 430



 Score =  129 bits (323), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 88/293 (30%), Positives = 142/293 (48%), Gaps = 54/293 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A +A +  +  + + G++ S+E E  DR D+ LP  Q + + ++ +V   P I+V++ AG
Sbjct: 597 AGKAVRECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAG 654

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
              +A    + ++ AI+ A YPGE+GG A+A+V+FG +NP GRLP+T+Y         L 
Sbjct: 655 S-SLAVNWMDEHVPAIVNAWYPGEQGGTAVAEVLFGDYNPAGRLPLTYYKS-------LD 706

Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
            +P  P D      GRTYK++ G  LYPFGYGLSY+ F Y+ L                 
Sbjct: 707 ELP--PFDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFTYSDLQVK-------------- 750

Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF--QNVGSTDGSDVVIVYSKPPAEIAAT 726
                                  D   E  V F  +N G  +G +V  VY + P      
Sbjct: 751 -----------------------DGGDEVTVSFRLKNTGKRNGDEVAQVYVRIPETGGIV 787

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT-LLPAGEHTIFVG 778
            +K++ GF+RV +++G ++R++   +  + L   D      ++P G   + VG
Sbjct: 788 PLKELKGFRRVPLKSGESRRVEIKLDK-EQLRYWDVEKGQFVVPKGAFDVMVG 839


>gi|298376791|ref|ZP_06986746.1| beta-glucosidase [Bacteroides sp. 3_1_19]
 gi|298266669|gb|EFI08327.1| beta-glucosidase [Bacteroides sp. 3_1_19]
          Length = 868

 Score =  279 bits (714), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 167/452 (36%), Positives = 235/452 (51%), Gaps = 44/452 (9%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           +   + F +  LP   R+ DL+SR+T +EK+ Q+ +    + RLG+P Y+WW+EALHGV+
Sbjct: 22  KQQDYPFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVA 81

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
             G           AT FP  I   A+F+++   +    VS EARA Y+  +        
Sbjct: 82  RAG----------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRY 131

Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT+W+PNIN+ RDPRWGR  ET GEDP++  +  V   RGLQ         D N    
Sbjct: 132 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQ-------GDDPNY--Y 182

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           K  +C KHYA +    W   +R+ F+A  T +D+ ET+L  FE  VKEGD   VMC+YNR
Sbjct: 183 KTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNR 239

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM------VDNHKFLADSKEDAVAQ 333
             G P C+  KLL   +R  W     I++DC +I            H+   D+ E A A 
Sbjct: 240 FEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDA-ESASAD 298

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
            +  G DL+CG  Y      A+  GK+ E D+D SL+ L      LG FD   +  Y  +
Sbjct: 299 AVLNGTDLECGGSYRALN-KALADGKISEKDLDVSLRRLLKGRFELGMFDPDERVPYSKI 357

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
               + S E+I  A + AR+ IVLLKN  N LPL+   +K +AVVGP+A  +  +  NY 
Sbjct: 358 PYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYN 416

Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
           G P + ++ + G       A V Y+ GC+  A
Sbjct: 417 GFPSKTVTIVEGIRNKVPNAEVIYELGCNHTA 448



 Score =  118 bits (295), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 114/429 (26%), Positives = 174/429 (40%), Gaps = 85/429 (19%)

Query: 382 FDGSPQYVSLGKQ-------DICSDENIELAAEAAR---------EGIVLLK---NDQNT 422
           F+G+P Y  L K+       +     N+ L    AR         +G V  K   ND   
Sbjct: 477 FEGTPAYKGLAKELHYTTGGNTQFAPNVNLTNFTARFTGEFESPIDGPVEFKLSGNDAFR 536

Query: 423 LPLNSAKVKTV--AVVGPHANATV-AMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDV 479
           L +++AKV  V     G     T+ A  G    I   YM      +G A++ +  G    
Sbjct: 537 LYIDTAKVAEVWENEYGAEKLYTLNAKKGEKYPIKIEYMQR----TGSADLNFTVGV--- 589

Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQL 529
             ++     A +   K AD  + + G+   +E E +          DR ++ +P  Q ++
Sbjct: 590 --RTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEM 647

Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
           +  +    K PV+ V+ +  G  +A    N ++ AIL A Y G+EGG A+ADV+FG +NP
Sbjct: 648 VKALVATGK-PVVYVVCT--GSALALNWENDHVNAILNAWYGGQEGGTAVADVLFGDYNP 704

Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
            GRLPIT+Y    V  LP               GRTY++     LYPFGYGLSYT F Y 
Sbjct: 705 AGRLPITFYKS--VDQLP-------DFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYK 755

Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
               +K                               + +  ++      D  N G  DG
Sbjct: 756 NAKLSK-------------------------------DKIASNESVTLSFDIANTGKMDG 784

Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
            +V  +Y K P + A   +K +  F+RV V+AG  + +          +  D      + 
Sbjct: 785 DEVAQIYIKNPNDPAGP-LKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVR 843

Query: 770 AGEHTIFVG 778
            G++ I  G
Sbjct: 844 PGKYQILYG 852


>gi|262381651|ref|ZP_06074789.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           2_1_33B]
 gi|262296828|gb|EEY84758.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           2_1_33B]
          Length = 868

 Score =  279 bits (713), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 167/452 (36%), Positives = 235/452 (51%), Gaps = 44/452 (9%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           +   + F +  LP   R+ DL+SR+T +EK+ Q+ +    + RLG+P Y+WW+EALHGV+
Sbjct: 22  KQQDYPFRNPELPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVA 81

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
             G           AT FP  I   A+F+++   +    VS EARA Y+  +        
Sbjct: 82  RAG----------RATVFPQAIAMAATFDDNAVHETFTIVSDEARAKYHQYQKDKEYDRY 131

Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT+W+PNIN+ RDPRWGR  ET GEDP++  +  V   RGLQ         D N    
Sbjct: 132 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQ-------GDDPNY--Y 182

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           K  +C KHYA +    W   +R+ F+A  T +D+ ET+L  FE  VKEGD   VMC+YNR
Sbjct: 183 KTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNR 239

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM------VDNHKFLADSKEDAVAQ 333
             G P C+  KLL   +R  W     I++DC +I            H+   D+ E A A 
Sbjct: 240 FEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDA-ESASAD 298

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
            +  G DL+CG  Y      A+  GK+ E D+D SL+ L      LG FD   +  Y  +
Sbjct: 299 AVLNGTDLECGGSYRALN-KALADGKISEKDLDVSLRRLLKGRFELGMFDPDERVPYSKI 357

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
               + S E+I  A + AR+ IVLLKN  N LPL+   +K +AVVGP+A  +  +  NY 
Sbjct: 358 PYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYN 416

Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
           G P + ++ + G       A V Y+ GC+  A
Sbjct: 417 GFPSKTVTIVEGIRNKVPNAEVIYELGCNHTA 448



 Score =  117 bits (293), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 113/429 (26%), Positives = 174/429 (40%), Gaps = 85/429 (19%)

Query: 382 FDGSPQYVSLGKQ-------DICSDENIELAAEAAR---------EGIVLLK---NDQNT 422
           F+G+P Y  L K+       +     N+ L    AR         +G +  K   ND   
Sbjct: 477 FEGTPAYKGLAKELHYTTGGNTQFAPNVNLTNFTARFTGEFESPIDGPIEFKLSGNDAFR 536

Query: 423 LPLNSAKVKTV--AVVGPHANATV-AMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDV 479
           L +++AKV  V     G     T+ A  G    I   YM      +G A++ +  G    
Sbjct: 537 LYIDTAKVAEVWENEYGAEKLYTLNAKKGEKYPIKIEYMQR----TGSADLNFTVGV--- 589

Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQL 529
             ++     A +   K AD  + + G+   +E E +          DR ++ +P  Q ++
Sbjct: 590 --RTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEM 647

Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
           +  +    K PV+ V+ +  G  +A    N ++ AIL A Y G+EGG A+ADV+FG +NP
Sbjct: 648 VKALVATGK-PVVYVVCT--GSALALNWENDHVNAILNAWYGGQEGGTAVADVLFGDYNP 704

Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
            GRLPIT+Y    V  LP               GRTY++     LYPFGYGLSYT F Y 
Sbjct: 705 AGRLPITFYKS--VDQLP-------DFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYK 755

Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
               +K                               + +  ++      D  N G  DG
Sbjct: 756 NAKLSK-------------------------------DKIASNESVTLSFDIANTGKMDG 784

Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
            +V  +Y K P + A   +K +  F+RV V+AG  + +          +  D      + 
Sbjct: 785 DEVAQIYIKNPNDPAGP-LKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVR 843

Query: 770 AGEHTIFVG 778
            G++ I  G
Sbjct: 844 PGKYQILYG 852


>gi|423229063|ref|ZP_17215468.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
           CL02T00C15]
 gi|423244903|ref|ZP_17225977.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
           CL02T12C06]
 gi|392634816|gb|EIY28728.1| hypothetical protein HMPREF1063_01288 [Bacteroides dorei
           CL02T00C15]
 gi|392640944|gb|EIY34735.1| hypothetical protein HMPREF1064_02183 [Bacteroides dorei
           CL02T12C06]
          Length = 788

 Score =  279 bits (713), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 235/820 (28%), Positives = 367/820 (44%), Gaps = 157/820 (19%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSN 108
           L+ +   P   RV+DL+S+MTL+EK  Q+    +G  R+    LPQ  W +E    G+ N
Sbjct: 42  LYENPKAPLEERVQDLLSQMTLEEKSCQMATL-YGSGRVLKDALPQDNWKTEVWKDGIGN 100

Query: 109 V-----GPGT-----------HFD-----------------------DVIPG-----ATS 124
           +     G GT           H D                       + I G     AT 
Sbjct: 101 IDEEHNGLGTFKSEYSFPYTKHVDAKHAIQRWFVEETRLGIPVDFTNEGIRGLCHDRATY 160

Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
           FP      A++N+ L  +IG+  + EA A+          +SP +++A+DPRWGR  ET 
Sbjct: 161 FPAQCGQGATWNKELIARIGEVEAKEAVAL-----EYTNIYSPILDIAQDPRWGRCVETY 215

Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
           GEDP++VG      +  LQ         +L + P       KH+A Y +       +   
Sbjct: 216 GEDPYLVGELGKQMITSLQK-------HNLVATP-------KHFAVYSIPVGGRDGKTRT 261

Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
           D  V  ++M   ++ PF M  +E  A  VM SYN  +G P       L + +R EW   G
Sbjct: 262 DPHVAPREMRTLYIEPFRMAFQEAGALGVMSSYNDYDGEPITGSYHFLTEILRQEWGFKG 321

Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAV 355
           Y+V+D ++++ +   HK +A++ ED +AQ + AGL++      T+FT           AV
Sbjct: 322 YVVSDSEAVEFISSKHK-VANTYEDGIAQAVNAGLNIR-----THFTPPADFILPLRKAV 375

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD---ICSDENIELAAEAAREG 412
             GK+ +  +DK +  +  V   LG FD    Y   GKQ    + S E+  ++ EAAR+ 
Sbjct: 376 ADGKISQETLDKRVAEILRVKFWLGLFDNP--YRGNGKQAEQIVHSKEHQAVSLEAARQS 433

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY--AGIPCRYM-SPIAGFSGYAN 469
           +VLLKN+ N LPL S  ++++AV+GP+A+    +I  Y  A  P + +   I     +  
Sbjct: 434 LVLLKNEMNLLPL-SKSLRSIAVIGPNADERTQLICRYGPANAPIKTVYQGIKERLPHTE 492

Query: 470 VTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATI-ILAGLDLSVEAES 514
           V Y+ GCD +                +    +  A  AAK A+  + +L G +L+V  E 
Sbjct: 493 VIYRKGCDIIDPHFPESEVLDFPKTTEEARLMEEAIHAAKQAEVVVMVLGGNELTVR-ED 551

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
             R  L LPG Q +L+  V    K PV+LV++      I +A    ++ AIL A +PGE 
Sbjct: 552 RSRTSLNLPGRQEELLKAVCATGK-PVVLVLLDGRASSINYAA--AHVPAILHAWFPGEF 608

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
            G+A+A+ +FG +NPGGRL +T+     V  +P  + P +P          Y       L
Sbjct: 609 CGQAVAEALFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDESSSTSVYG-----VL 660

Query: 635 YPFGYGLSYTQFKYNLLSFT---KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
           YPFG+GLSYT F Y  L  +   + +Q ++N                          + C
Sbjct: 661 YPFGHGLSYTTFSYGDLKISPLRQGVQGDIN--------------------------ISC 694

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
                     +N G   G +VV +Y +       TY K + GF+R+ + AG  + + F  
Sbjct: 695 --------KIKNTGKIKGDEVVQLYLRDEVSSVTTYTKVLRGFERISLEAGEEQMVHFRL 746

Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
              + L + D   N  +  G+  + +G+      +H  F 
Sbjct: 747 RP-QDLGLWDKNMNFRVEPGKFKVMIGSSSTDIRLHGRFE 785


>gi|423342048|ref|ZP_17319763.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409219455|gb|EKN12417.1| hypothetical protein HMPREF1077_01193 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 868

 Score =  279 bits (713), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 164/452 (36%), Positives = 235/452 (51%), Gaps = 44/452 (9%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           +   + F +  LP   R+ DL+ R+T +EKV Q+ +    + RLG+PQY+WW+EALHGV+
Sbjct: 22  RQEDYPFRNPDLPIDERIDDLLKRLTAEEKVGQMMNTTPAIERLGIPQYDWWNEALHGVA 81

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
             G           AT FP  I   A+F++    +    VS EARA Y+  +        
Sbjct: 82  RAGK----------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKDKEYDRY 131

Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT+W+PNIN+ RDPRWGR  ET GEDP++  R  V  V+GLQ           + +  
Sbjct: 132 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGD---------DPKYF 182

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           K  +C KHYA +    W   +R+ FD  VT +D+ +T+L  FE  VKEG+   VMC+YNR
Sbjct: 183 KTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKEGNVQEVMCAYNR 239

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVD------NHKFLADSKEDAVAQ 333
             G P C+  KLL   +R  W     I++DC +I    +       H+   D+ E A A 
Sbjct: 240 YQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWERDERTPRHETHPDA-ESASAD 298

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
            +  G DL+CG  Y      A++ GK+ E D+D SL+ L      LG FD   +  Y  +
Sbjct: 299 AVLNGTDLECGNSYRALV-KALKDGKISENDLDVSLRRLLKGRFELGMFDPDERVPYAQI 357

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
               + S E++  A E A + +VLLKN  NTLPL S  ++ +AVVGP+A  +  +  NY 
Sbjct: 358 PYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPL-SKTIRKIAVVGPNAADSTMLWANYN 416

Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
           G P   ++ + G         V Y+ GC+  A
Sbjct: 417 GFPTHTVTILEGIRNKVPDTEVIYELGCNHAA 448



 Score =  113 bits (282), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 84/270 (31%), Positives = 124/270 (45%), Gaps = 56/270 (20%)

Query: 490 ASEAAKTADATIIL--AGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVA 537
           A+ AAK  DA +I+   G+   +E E +          DR ++ LP  Q +++  +    
Sbjct: 596 AATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIELPKVQQEMVKALKATG 655

Query: 538 KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW 597
           K PV+ V+ +   + + + E N  I AIL A Y G+E G A+AD++FG +NP GRLP+T+
Sbjct: 656 K-PVVYVLCTGSALALNWEEAN--IDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVTF 712

Query: 598 YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
           Y    +  LP         +     GRTY++     LYPFGYGLSYT F Y         
Sbjct: 713 YKS--IDQLP-------DFEDYSMKGRTYRYMTETPLYPFGYGLSYTNFAY--------- 754

Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
                     RN   +S        G +  D      F    D  N G  DG ++  +Y 
Sbjct: 755 ----------RNAKLSS--------GKIAKDQSVTLTF----DIANTGKMDGDEIAQIYI 792

Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
           K P +     IK +  F RV V+AG ++ +
Sbjct: 793 KNPNDPEGP-IKALKAFLRVHVKAGDSQEV 821


>gi|423331656|ref|ZP_17309440.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
           CL03T12C09]
 gi|409230226|gb|EKN23094.1| hypothetical protein HMPREF1075_01453 [Parabacteroides distasonis
           CL03T12C09]
          Length = 868

 Score =  279 bits (713), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 167/452 (36%), Positives = 234/452 (51%), Gaps = 44/452 (9%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           +   + F +  LP   R+ DL+SR+T +EK+ Q+ +    + RLG+P Y+WW+EALHGV+
Sbjct: 22  KQQDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVA 81

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
             G           AT FP  I   A+F+++   +    VS EARA Y+  +        
Sbjct: 82  RAG----------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRY 131

Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT+W+PNIN+ RDPRWGR  ET GEDP++  +  V   RGLQ         D N    
Sbjct: 132 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQ-------GDDPNY--Y 182

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           K  +C KHYA +    W   +R+ FD   T +D+ ET+L  FE  VKEGD   VMC+YNR
Sbjct: 183 KTHACAKHYAVHSGPEW---NRHEFDVEATPRDLYETYLPAFEALVKEGDVQEVMCAYNR 239

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM------VDNHKFLADSKEDAVAQ 333
             G P C+  KLL   +R  W     I++DC +I            H+   D+ E A A 
Sbjct: 240 FEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDA-ESASAD 298

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
            +  G DL+CG  Y      A+  GK+ E D+D SL+ L      LG FD   +  Y  +
Sbjct: 299 AVLNGTDLECGGSYRALN-KALADGKISEKDLDVSLRRLLKGRFELGMFDPDERVPYSKI 357

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
               + S E+I  A + AR+ IVLLKN  N LPL+   +K +AVVGP+A  +  +  NY 
Sbjct: 358 PYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYN 416

Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
           G P + ++ + G       A V Y+ GC+  A
Sbjct: 417 GFPTKTVTIVEGIRNKVPNAEVIYELGCNHTA 448



 Score =  117 bits (293), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 114/429 (26%), Positives = 174/429 (40%), Gaps = 85/429 (19%)

Query: 382 FDGSPQYVSLGKQ-------DICSDENIELAAEAAR---------EGIVLLK---NDQNT 422
           F+G+P Y  L K+       +     N+ L    AR         +G V  K   ND   
Sbjct: 477 FEGTPAYKGLAKELHYTTGGNTQFAPNVNLTNFTARFTGEFESPIDGPVEFKLSGNDAFR 536

Query: 423 LPLNSAKVKTV--AVVGPHANATV-AMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDV 479
           L +++AKV  V     G     T+ A  G    I   YM      +G A++ +  G    
Sbjct: 537 LYIDTAKVAEVWENEYGAEKLYTLNAKKGEKYPIKIEYMQR----TGSADLNFTVGV--- 589

Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQL 529
             ++     A +   K AD  + + G+   +E E +          DR ++ +P  Q ++
Sbjct: 590 --RTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEM 647

Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
           +  +    K PV+ V+ +  G  +A    N ++ AIL A Y G+EGG A+ADV+FG +NP
Sbjct: 648 VKALVATGK-PVVYVVCT--GSALALNWENDHVNAILNAWYGGQEGGTAVADVLFGDYNP 704

Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
            GRLPIT+Y    V  LP               GRTY++     LYPFGYGLSYT F Y 
Sbjct: 705 AGRLPITFYKS--VDQLP-------DFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYK 755

Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
               +K                               + +  ++      D  N G  DG
Sbjct: 756 NAKLSK-------------------------------DKIASNESVTLSFDIANTGKMDG 784

Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
            +V  +Y K P + A   +K +  F+RV V+AG  + +          +  D      + 
Sbjct: 785 DEVAQIYIKNPNDPAGP-LKAMKAFKRVNVKAGSAQPVSIQLEPKAFQSFNDNTQTMEVR 843

Query: 770 AGEHTIFVG 778
            G++ I  G
Sbjct: 844 PGKYQILYG 852


>gi|150007848|ref|YP_001302591.1| glycoside hydrolase family protein [Parabacteroides distasonis ATCC
           8503]
 gi|301310124|ref|ZP_07216063.1| beta-glucosidase [Bacteroides sp. 20_3]
 gi|423336365|ref|ZP_17314112.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
           CL09T03C24]
 gi|149936272|gb|ABR42969.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Parabacteroides distasonis ATCC 8503]
 gi|300831698|gb|EFK62329.1| beta-glucosidase [Bacteroides sp. 20_3]
 gi|409240840|gb|EKN33614.1| hypothetical protein HMPREF1059_00064 [Parabacteroides distasonis
           CL09T03C24]
          Length = 868

 Score =  279 bits (713), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 167/452 (36%), Positives = 235/452 (51%), Gaps = 44/452 (9%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           +   + F +  LP   R+ DL+SR+T +EK+ Q+ +    + RLG+P Y+WW+EALHGV+
Sbjct: 22  KQQDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVA 81

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
             G           AT FP  I   A+F+++   +    VS EARA Y+  +        
Sbjct: 82  RAG----------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRY 131

Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT+W+PNIN+ RDPRWGR  ET GEDP++  +  V   RGLQ         D N    
Sbjct: 132 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQ-------GDDPNY--Y 182

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           K  +C KHYA +    W   +R+ F+A  T +D+ ET+L  FE  VKEGD   VMC+YNR
Sbjct: 183 KTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNR 239

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM------VDNHKFLADSKEDAVAQ 333
             G P C+  KLL   +R  W     I++DC +I            H+   D+ E A A 
Sbjct: 240 FEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDA-ESASAD 298

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
            +  G DL+CG  Y      A+  GK+ E D+D SL+ L      LG FD   +  Y  +
Sbjct: 299 AVLNGTDLECGGSYRALN-KALADGKISEKDLDVSLRRLLKGRFELGMFDPDERVPYSKI 357

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
               + S E+I  A + AR+ IVLLKN  N LPL+   +K +AVVGP+A  +  +  NY 
Sbjct: 358 PYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYN 416

Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
           G P + ++ + G       A V Y+ GC+  A
Sbjct: 417 GFPTKTVTIVEGIRNKVPNAEVIYELGCNHTA 448



 Score =  118 bits (295), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 114/429 (26%), Positives = 174/429 (40%), Gaps = 85/429 (19%)

Query: 382 FDGSPQYVSLGKQ-------DICSDENIELAAEAAR---------EGIVLLK---NDQNT 422
           F+G+P Y  L K+       +     N+ L    AR         +G V  K   ND   
Sbjct: 477 FEGTPAYKGLAKELHYTTGGNTQFAPNVNLTNFTARFTGEFESPIDGPVEFKLSGNDAFR 536

Query: 423 LPLNSAKVKTV--AVVGPHANATV-AMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDV 479
           L +++AKV  V     G     T+ A  G    I   YM      +G A++ +  G    
Sbjct: 537 LYIDTAKVAEVWENEYGAEKLYTLNAKKGEKYPIKIEYMQR----TGSADLNFTVGV--- 589

Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQL 529
             ++     A +   K AD  + + G+   +E E +          DR ++ +P  Q ++
Sbjct: 590 --RTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEM 647

Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
           +  +    K PV+ V+ +  G  +A    N ++ AIL A Y G+EGG A+ADV+FG +NP
Sbjct: 648 VKALVATGK-PVVYVVCT--GSALALNWENDHVNAILNAWYGGQEGGTAVADVLFGDYNP 704

Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
            GRLPIT+Y    V  LP               GRTY++     LYPFGYGLSYT F Y 
Sbjct: 705 AGRLPITFYKS--VDQLP-------DFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYK 755

Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
               +K                               + +  ++      D  N G  DG
Sbjct: 756 NAKLSK-------------------------------DKIASNESVTLSFDIANTGKMDG 784

Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
            +V  +Y K P + A   +K +  F+RV V+AG  + +          +  D      + 
Sbjct: 785 DEVAQIYIKNPNDPAGP-LKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVR 843

Query: 770 AGEHTIFVG 778
            G++ I  G
Sbjct: 844 PGKYQILYG 852


>gi|255013451|ref|ZP_05285577.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. 2_1_7]
 gi|410103695|ref|ZP_11298616.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
 gi|409236424|gb|EKN29231.1| hypothetical protein HMPREF0999_02388 [Parabacteroides sp. D25]
          Length = 868

 Score =  279 bits (713), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 167/452 (36%), Positives = 234/452 (51%), Gaps = 44/452 (9%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           +   + F +  LP   R+ DL+SR+T +EK+ Q+ +    + RLG+P Y+WW+EALHGV+
Sbjct: 22  KQQDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVA 81

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
             G           AT FP  I   A+F+++   +    VS EARA Y+  +        
Sbjct: 82  RAG----------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRY 131

Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT+W+PNIN+ RDPRWGR  ET GEDP++  +  V   RGLQ         D N    
Sbjct: 132 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQ-------GDDPNY--Y 182

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           K  +C KHYA +    W   +R+ FD   T +D+ ET+L  FE  VKEGD   VMC+YNR
Sbjct: 183 KTHACAKHYAVHSGPEW---NRHEFDVEATPRDLYETYLPAFEALVKEGDVQEVMCAYNR 239

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM------VDNHKFLADSKEDAVAQ 333
             G P C+  KLL   +R  W     I++DC +I            H+   D+ E A A 
Sbjct: 240 FEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDA-ESASAD 298

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
            +  G DL+CG  Y      A+  GK+ E D+D SL+ L      LG FD   +  Y  +
Sbjct: 299 AVLNGTDLECGGSYRALN-KALADGKISEKDLDVSLRRLLKGRFELGMFDPDERVPYSKI 357

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
               + S E+I  A + AR+ IVLLKN  N LPL+   +K +AVVGP+A  +  +  NY 
Sbjct: 358 PYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYN 416

Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
           G P + ++ + G       A V Y+ GC+  A
Sbjct: 417 GFPTKTVTIVEGIRNKVPNAEVIYELGCNHTA 448



 Score =  117 bits (294), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 114/429 (26%), Positives = 174/429 (40%), Gaps = 85/429 (19%)

Query: 382 FDGSPQYVSLGKQ-------DICSDENIELAAEAAR---------EGIVLLK---NDQNT 422
           F+G+P Y  L K+       +     N+ L    AR         +G V  K   ND   
Sbjct: 477 FEGTPAYKGLAKELHYTTGGNTQFAPNVNLTNFTARFTGEFESPIDGPVEFKLSGNDAFR 536

Query: 423 LPLNSAKVKTV--AVVGPHANATV-AMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDV 479
           L +++AKV  V     G     T+ A  G    I   YM      +G A++ +  G    
Sbjct: 537 LYIDTAKVAEVWENEYGAEKLYTLNAKKGEKYPIKIEYMQR----TGSADLNFTVGV--- 589

Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQL 529
             ++     A +   K AD  + + G+   +E E +          DR ++ +P  Q ++
Sbjct: 590 --RTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEM 647

Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
           +  +    K PV+ V+ +  G  +A    N ++ AIL A Y G+EGG A+ADV+FG +NP
Sbjct: 648 VKALVATGK-PVVYVVCT--GSALALNWENDHVNAILNAWYGGQEGGTAVADVLFGDYNP 704

Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
            GRLPIT+Y    V  LP               GRTY++     LYPFGYGLSYT F Y 
Sbjct: 705 AGRLPITFYKS--VDQLP-------DFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYK 755

Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
               +K                               + +  ++      D  N G  DG
Sbjct: 756 NAKLSK-------------------------------DKIASNESVTLSFDIANTGKMDG 784

Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
            +V  +Y K P + A   +K +  F+RV V+AG  + +          +  D      + 
Sbjct: 785 DEVAQIYIKNPNDPAGP-LKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEVR 843

Query: 770 AGEHTIFVG 778
            G++ I  G
Sbjct: 844 PGKYQILYG 852


>gi|256840106|ref|ZP_05545615.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
           D13]
 gi|256739036|gb|EEU52361.1| glycoside hydrolase family beta-glycosidase [Parabacteroides sp.
           D13]
          Length = 868

 Score =  279 bits (713), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 167/452 (36%), Positives = 235/452 (51%), Gaps = 44/452 (9%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           +   + F +  LP   R+ DL+SR+T +EK+ Q+ +    + RLG+P Y+WW+EALHGV+
Sbjct: 22  KQQDYPFRNPDLPLEERIDDLLSRLTPEEKIGQMMNVTPAIERLGIPTYDWWNEALHGVA 81

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
             G           AT FP  I   A+F+++   +    VS EARA Y+  +        
Sbjct: 82  RAG----------RATVFPQAIAMAATFDDNAVHETFTMVSDEARAKYHQYQKDKEYDRY 131

Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT+W+PNIN+ RDPRWGR  ET GEDP++  +  V   RGLQ         D N    
Sbjct: 132 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTEKMGVAVTRGLQ-------GDDPNY--Y 182

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           K  +C KHYA +    W   +R+ F+A  T +D+ ET+L  FE  VKEGD   VMC+YNR
Sbjct: 183 KTHACAKHYAVHSGPEW---NRHEFNAEATPRDLYETYLPAFEALVKEGDVQEVMCAYNR 239

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM------VDNHKFLADSKEDAVAQ 333
             G P C+  KLL   +R  W     I++DC +I            H+   D+ E A A 
Sbjct: 240 FEGKPCCSSDKLLIDILRNSWGYDNIILSDCGAIDDFWRKDKNTPRHETHPDA-ESASAD 298

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
            +  G DL+CG  Y      A+  GK+ E D+D SL+ L      LG FD   +  Y  +
Sbjct: 299 AVLNGTDLECGGSYRALN-KALADGKISEKDLDVSLRRLLKGRFELGMFDPDERVPYSKI 357

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
               + S E+I  A + AR+ IVLLKN  N LPL+   +K +AVVGP+A  +  +  NY 
Sbjct: 358 PYSVVESPEHIAKALDMARKSIVLLKNKNNMLPLDK-NIKKIAVVGPNAADSTMLWANYN 416

Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
           G P + ++ + G       A V Y+ GC+  A
Sbjct: 417 GFPSKTVTIVEGIRNKVPNAEVIYELGCNHTA 448



 Score =  118 bits (296), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 114/429 (26%), Positives = 174/429 (40%), Gaps = 85/429 (19%)

Query: 382 FDGSPQYVSLGKQ-------DICSDENIELAAEAAR---------EGIVLLK---NDQNT 422
           F+G+P Y  L K+       +     N+ L    AR         +G V  K   ND   
Sbjct: 477 FEGTPAYKGLAKELHYTTGGNTQFAPNVNLTNFTARFTGEFESPIDGPVEFKLSGNDAFR 536

Query: 423 LPLNSAKVKTV--AVVGPHANATV-AMIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDV 479
           L +++AKV  V     G     T+ A  G    I   YM      +G A++ +  G    
Sbjct: 537 LYIDTAKVAEVWENEYGAEKLYTLNAKKGEKYPIKIEYMQR----TGSADLNFTVGV--- 589

Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQL 529
             ++     A +   K AD  + + G+   +E E +          DR ++ +P  Q ++
Sbjct: 590 --RTPVDFQATASKVKDADVIVFVGGISPRLEGEEMPVDAEGFRKGDRTNIEIPAVQKEM 647

Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
           +  +    K PV+ V+ +  G  +A    N ++ AIL A Y G+EGG A+ADV+FG +NP
Sbjct: 648 VKALVATGK-PVVYVVCT--GSALALNWENDHVNAILNAWYGGQEGGTAVADVLFGDYNP 704

Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
            GRLPIT+Y    V  LP               GRTY++     LYPFGYGLSYT F Y 
Sbjct: 705 AGRLPITFYKS--VDQLP-------DFQDYSMKGRTYRYMTQTPLYPFGYGLSYTTFDYK 755

Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
               +K                               + +  ++      D  N G  DG
Sbjct: 756 NAKLSK-------------------------------DKIASNESVTLSFDIANTGKMDG 784

Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
            +V  +Y K P + A   +K +  F+RV V+AG  + +          +  D      + 
Sbjct: 785 DEVAQIYIKNPNDPAGP-LKAMKAFKRVNVKAGSEQPVSIQLEPKAFQSFNDNTQTMEIR 843

Query: 770 AGEHTIFVG 778
            G++ I  G
Sbjct: 844 PGKYQILYG 852


>gi|399030621|ref|ZP_10730998.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
 gi|398071229|gb|EJL62496.1| beta-glucosidase-like glycosyl hydrolase [Flavobacterium sp. CF136]
          Length = 876

 Score =  278 bits (712), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 162/461 (35%), Positives = 245/461 (53%), Gaps = 38/461 (8%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           +   FLF +  L +  RV DLV+R+TL+EKV Q+ + +  +PRL +P Y+WW+E LHGV+
Sbjct: 25  KQKEFLFQNPDLSFEKRVDDLVNRLTLEEKVSQMLNSSPAIPRLDIPAYDWWNETLHGVA 84

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL----GRA--- 160
                T F       T +P  I   A+F+++   K+    + E RA+YN     GR    
Sbjct: 85  R----TPFK-----VTVYPQAIAMAATFDKNSLYKMADFSALEGRAIYNKAVESGRTNER 135

Query: 161 --GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
             GLTYW+PNIN+ RDPRWGR  ET GEDP++ G    ++V+GLQ  +          + 
Sbjct: 136 YLGLTYWTPNINIFRDPRWGRGQETYGEDPYLTGVLGDSFVKGLQGDD---------PKY 186

Query: 219 LKVSSCCKHYAAYDVDNWKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
           LK ++C KHYA +      G +  R+ FD  VT  ++ +T+L  F+  V E   + VMC+
Sbjct: 187 LKAAACAKHYAVHS-----GPEPLRHTFDVDVTPYELWDTYLPAFQKLVTESKVAGVMCA 241

Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
           YN     P CA   L+   +R +W   GY+ +DC +I     NHK   D+ E A A  + 
Sbjct: 242 YNAFRTQPCCASDILMTDILRNQWKFEGYVTSDCWAIDDFFKNHKTHPDA-ESASADAVF 300

Query: 337 AGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQ 394
            G D+DCG         AV+ GK+ E  ID S+K L+ +  RLG FD     +Y      
Sbjct: 301 HGTDIDCGTDAYKALVQAVKDGKISEKQIDISVKRLFMIRFRLGMFDPVEMVKYAQTPTS 360

Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
            + +DE+   A + AR+ IVLL+N+  TLPL S K+K + V+GP+ +  +A++GNY G P
Sbjct: 361 VLENDEHKAHALKMARQSIVLLRNENKTLPL-SKKLKKIVVLGPNVDNAIAILGNYNGTP 419

Query: 455 CRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAK 495
            +  + + G         +   +     +N+++   S+  K
Sbjct: 420 SKLTTVLEGIKEKVGSNTEVVYEKAVNFTNDTLLVYSDVKK 460



 Score =  114 bits (285), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 80/268 (29%), Positives = 119/268 (44%), Gaps = 54/268 (20%)

Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
           K ADA + + G+   +E E +          DR  + LP  QT L+  +    K P++ V
Sbjct: 606 KDADAFVFVGGISPQLEGEEMKVNFPGFKGGDRTSILLPKIQTDLMKALKTTGK-PIVFV 664

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
           +M+   + I +   N  I AI  A Y G+  G A+ADV+FG +NP GRLP+T+Y  D   
Sbjct: 665 MMTGSAIAIPWEAEN--IPAIANAWYGGQAAGTAVADVLFGNYNPAGRLPVTFYKSD--- 719

Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
                   L P        RTY+++ G  LY FGYGLSYT FKY+ L    ++       
Sbjct: 720 ------ADLSPFVDYKMDNRTYRYFKGKPLYGFGYGLSYTTFKYDNLKIAPSV------- 766

Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
              +N+  T                         V   N G   G +VV +Y        
Sbjct: 767 IKGKNVPIT-------------------------VKVTNTGKVSGEEVVQLYVINQNTAI 801

Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFN 752
              +K + GF+R+ ++AG++K I F  +
Sbjct: 802 KAPLKTLKGFERISLKAGKSKTITFTLS 829


>gi|354580734|ref|ZP_08999639.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
 gi|353203165|gb|EHB68614.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
          Length = 766

 Score =  278 bits (712), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 221/738 (29%), Positives = 339/738 (45%), Gaps = 111/738 (15%)

Query: 76  EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
           E V  +  +A    RLG+P   +  E  HG   +G           AT FP  +   +++
Sbjct: 90  EAVNVIQRYAIEHSRLGIPIL-FGEECSHGHMAIG-----------ATVFPVPLTIGSTW 137

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
           N  L++ + +AV+ E R+     + G   +SP ++V RDPRWGR  ET GEDP +V  +A
Sbjct: 138 NPELFRSMCRAVAAETRS-----QGGAATYSPVLDVVRDPRWGRTEETFGEDPHLVAEFA 192

Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGVDRYHFDARVTEQDME 254
           V  V+GLQ      +A D       + +  KH+A Y   +  +     H   R    ++ 
Sbjct: 193 VAAVQGLQG--DRLDAED------SLLATLKHFAGYGASEGGRNGAPVHMGLR----ELH 240

Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
           E  L PF   V+ G A SVM +YN ++G+P  +   LL+  +R  W   G+++ DC +I 
Sbjct: 241 EIDLLPFRKAVEAG-AQSVMTAYNEIDGVPCTSSRYLLHDVLREAWGFDGFVITDCGAID 299

Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
           ++   H   A S E+A AQ L AG+D++  G  +  +   A++QG + E D++ ++  + 
Sbjct: 300 MLKSGHNTAA-SGEEAAAQALTAGVDMEMSGSMFRVYLRQALEQGHITEDDLNTAVGRVL 358

Query: 374 TVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
            +  RLG FD         ++ I  +E+IELA   A EGIVLLKN+ N LPLN  K   +
Sbjct: 359 AMKFRLGLFDRPYTDPERAEKVIGCEEHIELARRVAAEGIVLLKNEGNVLPLNP-KTGKI 417

Query: 434 AVVGPHANATVAMIGNYAG--IPCRYMSPIAGFSGY------ANVTYKTGCDDVACKSNN 485
           AV+GP+ANA    +G+Y     P + ++ + G   +        V Y  GC  +   S  
Sbjct: 418 AVIGPNANAPYNQLGDYTSPQPPGQIITVLEGIRRHIGEDADTRVLYAPGC-RIQGDSRE 476

Query: 486 SIFAASEAAKTADATIILAG-----------LDLSVEA--------------ESLDREDL 520
            +  A   A  AD  ++  G           +DL   A              E +DR  L
Sbjct: 477 GLSHALACAAEADVIVMAIGGSSARDFGEGTIDLRTGASVVTGLAQSDMECGEGIDRSTL 536

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
            L G Q +L+ ++ ++ K PV++V ++  G  I     + +I AIL A YPG+EGG AIA
Sbjct: 537 HLMGVQLELLQEIHKLGK-PVVVVYIN--GRPITEPWIDEHIPAILEAWYPGQEGGSAIA 593

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
           D++FG  NP GRL +T      V  LP+     R        G+ Y   +    YPFGYG
Sbjct: 594 DILFGDVNPSGRLTLTIPK--EVGQLPINYNAKR------TRGKRYLETDLEPRYPFGYG 645

Query: 641 LSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
           LSYT F Y  LS    +                               +  D     ++ 
Sbjct: 646 LSYTDFHYGNLSVEPAV-------------------------------IPADGSAAVRIV 674

Query: 701 FQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIV 760
             N G  DG++VV +Y    A       K +  F +VF++AG ++ + F     + L ++
Sbjct: 675 VTNTGPRDGAEVVQLYVSDLAASVTRPEKALKAFSKVFLKAGESREVTFTVGP-EQLELI 733

Query: 761 DYAANTLLPAGEHTIFVG 778
                 ++  GE  I VG
Sbjct: 734 GPDMKAVVEPGEFRIRVG 751


>gi|365122193|ref|ZP_09339098.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363642907|gb|EHL82241.1| hypothetical protein HMPREF1033_02444 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 853

 Score =  278 bits (712), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 165/440 (37%), Positives = 251/440 (57%), Gaps = 41/440 (9%)

Query: 43  SKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEA 102
           + + +  +  L+ D + P   R+ DL+SR+T++EK+  L   + G+PRLG+ +Y   +EA
Sbjct: 19  TSVAVAQTKELYKDMNAPQHERIMDLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEA 78

Query: 103 LHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG- 161
           LHGV  V PG          T FP  I   + +N  L  +I  A+S EAR  +N    G 
Sbjct: 79  LHGV--VRPGNF--------TVFPQAIGLASMWNPELLYEISTAISDEARGRWNELNRGK 128

Query: 162 ---------LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
                    LT+WSP +N+ARDPRWGR  ET GEDPF+ G+  V +V+GLQ   G++   
Sbjct: 129 DQKGFFSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVAFVKGLQ---GND--- 182

Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
               R LK+ S  KH+AA + ++    +R+  +  ++E+++ E +L  FE C+KEG A S
Sbjct: 183 ---PRYLKIVSTPKHFAANNEEH----NRFECNPHISERNLREYYLPAFESCIKEGKAQS 235

Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
           +M +YN +N +P   +P LL Q +R EW  +GY+V+DC     +V +HK++  + E A  
Sbjct: 236 IMSAYNAINDVPCTLNPWLLTQVLRKEWGFNGYVVSDCGGPGFLVTHHKYVK-TPEAAAT 294

Query: 333 QTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YV 389
            ++KAGLDL+CG   Y     NA +Q  V + DID +   +    M LG FD   +  Y 
Sbjct: 295 LSIKAGLDLECGDNVYIEPLMNAYKQCMVTDADIDTAAYRILRARMMLGLFDDPEKNPYN 354

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
           ++    +  +++ +LA EAAR+ +VLLKN++N LPLN  KVK++AVVG   NA     G+
Sbjct: 355 AISPSIVGCEKHRQLALEAARQSLVLLKNEKNFLPLNPKKVKSIAVVG--INAGNCEFGD 412

Query: 450 YAGIPCRYMSPIAGFSGYAN 469
           Y+G P    +P++   G  N
Sbjct: 413 YSGTPVN--APVSVLEGIKN 430



 Score =  124 bits (312), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 90/291 (30%), Positives = 132/291 (45%), Gaps = 48/291 (16%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A  A +  D T+ + G++ S+E E  DR  + LP  Q   I +  +     V++++    
Sbjct: 597 AGRAIRECDVTVAVLGINKSIEREGQDRYTIELPADQQLFIKEAYKANPNTVVVLV---A 653

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
           G  +A    + NI AIL A YPGE+GG A+A+ +FG +NPGGRLP+T+Y    +  LP  
Sbjct: 654 GSSLAINWIDENIPAILNAWYPGEQGGTAVAEALFGDYNPGGRLPLTYYRS--LDELPAF 711

Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
                  D     GRTY ++    LYPFGYGLSYT+F Y                     
Sbjct: 712 D------DYDIQKGRTYMYFENKPLYPFGYGLSYTRFDYK-------------------- 745

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
            N  S+ S              DD    K   +N G   G +V  VY + P       +K
Sbjct: 746 -NLKSEVS--------------DDAVNLKFTVKNTGKYAGDEVAQVYVRFPESGIKVPLK 790

Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVGN 779
           Q+ GF+RV +  G++ ++  V    K L + D        P+G +   VG+
Sbjct: 791 QLKGFERVHIGKGKSAQVS-VSIPKKELRLWDEKDGKFYTPSGNYIFMVGS 840


>gi|262405981|ref|ZP_06082531.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|345510488|ref|ZP_08790055.1| beta-glucosidase [Bacteroides sp. D1]
 gi|262356856|gb|EEZ05946.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|345454434|gb|EEO48987.2| beta-glucosidase [Bacteroides sp. D1]
          Length = 735

 Score =  278 bits (712), Expect = 6e-72,   Method: Compositional matrix adjust.
 Identities = 224/768 (29%), Positives = 356/768 (46%), Gaps = 115/768 (14%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ D+  P   R+ DL+SRMTL+EKV QL  +  G         E   E     S +G  
Sbjct: 29  LYKDAKAPIEKRIDDLISRMTLEEKVLQLNQYTLGRNNNVNNVGE---EVKKVPSEIGSL 85

Query: 113 THFD---------------------------DVIPG-ATSFPTVILTTASFNESLWKKIG 144
            +FD                           D I G  T +P  +    S+N  L ++  
Sbjct: 86  IYFDINPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQAC 145

Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
              + EAR       +G+ + +SP I+VARDPRWGR+ E  GEDP+  G +A   VRG Q
Sbjct: 146 AVSAQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRGYQ 199

Query: 204 -DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
            D    EN         ++++C KHY  Y         R +    ++ Q + +T+L P+E
Sbjct: 200 GDDMSAEN---------RMAACLKHYVGYGASE---AGRDYVYTEISAQTLWDTYLLPYE 247

Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
           M VK G A ++M S+N ++G+P  A+P ++ + ++  W   G+IV+D  +++ +   ++ 
Sbjct: 248 MGVKAG-APTLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQG 304

Query: 323 LADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
           LA +K+DA      AGL++D   + Y       V++GKV    +D+S++ +  V  RLG 
Sbjct: 305 LAATKKDAARYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGL 364

Query: 382 FDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
           F+     V+  K      +++ +AA+ A E +VLLKND   LPL +   K +AVVGP A 
Sbjct: 365 FERPYTPVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLTNK--KKIAVVGPMAK 422

Query: 442 ATVAMIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA-ASEAA 494
               ++G++ G      +   Y    A F G A + Y  GC      ++ S FA A + A
Sbjct: 423 NGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKPQG--NDRSGFAGALDVA 480

Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           + +D  I+  G  L+   E+  R  + LP  Q +L+ ++ E  K PVILV+  + G  + 
Sbjct: 481 RWSDVVIVCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PVILVL--SNGRPLE 537

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MP 612
                    AIL    PG  G R++A ++ G+ NP G+L +T+         P ++  +P
Sbjct: 538 LNRMEPLCDAILEIWQPGINGARSMAGILSGRINPSGKLAMTF---------PYSTGQIP 588

Query: 613 L---RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
           +   R     G+ G  YK      LYPFG+GLSYT+FKY  +                  
Sbjct: 589 IYYNRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGTV------------------ 629

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
              T  A+K          ++  D    +V   N GS DG++ V  +   P       +K
Sbjct: 630 ---TPSATK----------VKRGDKLSAEVTVTNTGSRDGAETVHWFISDPYCSITRPVK 676

Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
           ++  F++  ++AG  K  +F  +  +    V+      L AGE+ I V
Sbjct: 677 ELKHFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724


>gi|317477153|ref|ZP_07936394.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316906696|gb|EFV28409.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 863

 Score =  278 bits (710), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 166/464 (35%), Positives = 254/464 (54%), Gaps = 35/464 (7%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           +  D S P S+R+++++ +MTL+EKV QL + +  +PRL LP Y +W+E LHGV+  G  
Sbjct: 48  IIGDLSQPISVRIENIIRQMTLEEKVAQLSNESDSIPRLNLPSYNYWNECLHGVARAGE- 106

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY-NLGRAGLTYWSPNINV 171
                     T FP  I   ++++  L K+I  A+STEAR  Y ++G+ GLTYW+P IN+
Sbjct: 107 ---------VTVFPQAINLASTWDTLLVKRIASAISTEARLKYLDIGK-GLTYWAPTINM 156

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
           ARDPRWGR  ET GEDP++  R  V +V+GLQ    H N        LK  +  KH+ A 
Sbjct: 157 ARDPRWGRNEETYGEDPYLTSRLGVAFVKGLQG--DHPNY-------LKTVATVKHFVAN 207

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
           + +N    DR+   +++  + + E +   +E CVKE +  S+M +YN  NGIP      L
Sbjct: 208 NQEN----DRFSSSSQIPTKQLYEYYFPAYEACVKEANVQSIMTAYNAFNGIPPSGSTWL 263

Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
           L   +R EW   G++V+DC +I VM   H+ + +S E+A A  + +G DL+CG  Y    
Sbjct: 264 LEDVLRKEWGFDGFVVSDCGAIGVMNWQHR-IVNSLEEAAALGINSGCDLECGGTYRENL 322

Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAA 409
             AVQ+G V E  ID++L  + T+  +LG FD      Y    K+ +  ++   LA EAA
Sbjct: 323 VAAVQRGLVSEYAIDRALTRVLTMRFKLGEFDPIELVPYNHYDKKLLAGEQFRRLAYEAA 382

Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY-- 467
            + I+LLKN+ N LP++   V+++A+VGP A+     +G Y+G P   +S + G      
Sbjct: 383 VKSIILLKNEDNFLPIDKKDVRSIAIVGPFADNN--YLGGYSGKPVHNISLLQGVKKMVG 440

Query: 468 --ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLS 509
               ++Y  G   V    ++S   AS+          + G DL+
Sbjct: 441 EEVEISYIEGT-SVVSPVDSSYLLASDGVNNGLTADYIDGHDLN 483



 Score =  100 bits (249), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 80/309 (25%), Positives = 141/309 (45%), Gaps = 46/309 (14%)

Query: 485 NSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
           N I    +    AD  ++  G D  +  E+ D   ++LP  Q  L+ ++ +V   P I +
Sbjct: 597 NQIDKVKKIVSRADLVLVALGNDGKLARENRDLPSIYLPMTQELLLKEIYKV--NPRIAL 654

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
           I+  G   +       ++ +IL A YPG+EGG A+A ++FG  NP G+LP+T Y  +  Q
Sbjct: 655 ILQTGN-PLTSQWAAEHVPSILQAWYPGQEGGAALAGILFGLENPSGKLPMTIYESE--Q 711

Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
            LP        +D   + GRTY++ +   LY FG+GLSY+ F+Y  L     + V+    
Sbjct: 712 QLP------NILDYDIWKGRTYQYLSSKPLYGFGHGLSYSNFEYADLQCNDVVHVD---- 761

Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY-SKPPAEI 723
                                   L+C       +  +N+    G +V+ VY S+    +
Sbjct: 762 ----------------------GTLQC------SIKVKNISDVVGEEVIQVYVSREKTPV 793

Query: 724 AATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
               +K++I F RV ++   +K + F     + L++       +L +G++++FVG G   
Sbjct: 794 YTFPLKKLIAFARVNLKPNESKTVTFTITP-RQLSVWQDGEWKML-SGKYSLFVGGGQKE 851

Query: 784 FPIHLNFNY 792
               +N ++
Sbjct: 852 LSKGMNKDF 860


>gi|336412679|ref|ZP_08593032.1| hypothetical protein HMPREF1017_00140 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942725|gb|EGN04567.1| hypothetical protein HMPREF1017_00140 [Bacteroides ovatus
           3_8_47FAA]
          Length = 735

 Score =  278 bits (710), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 221/776 (28%), Positives = 357/776 (46%), Gaps = 113/776 (14%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ D+  P   R+ DL+SRMTL+EKV QL  +  G         E   E     S +G  
Sbjct: 29  LYKDAKAPIEKRIDDLISRMTLEEKVLQLNQYTLGRNNNVNNVGE---EVKKVPSEIGSL 85

Query: 113 THFD---------------------------DVIPG-ATSFPTVILTTASFNESLWKKIG 144
            +FD                           D I G  T +P  +    S+N  L ++  
Sbjct: 86  IYFDINPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQAC 145

Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
              + EAR       +G+ + +SP I+VARDPRWGR+ E  GEDP+  G +A   VRG Q
Sbjct: 146 AVSAQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRGYQ 199

Query: 204 -DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
            D    EN         ++++C KHY  Y         R +    ++ Q + +T+L P+E
Sbjct: 200 GDDMSAEN---------RMAACLKHYVGYGASE---AGRDYVYTEISAQTLWDTYLLPYE 247

Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
           M VK G A+++M S+N ++G+P  A+P ++ + ++  W   G+IV+D  +++ +   ++ 
Sbjct: 248 MGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQG 304

Query: 323 LADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
           LA +K+DA      AGL++D   + Y       V++GKV    +D+S++ +  V  RLG 
Sbjct: 305 LAATKKDAARYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGL 364

Query: 382 FDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
           F+     V+  K      +++ +AA+ A E +VLLKND   LPL +   K +AVVGP A 
Sbjct: 365 FERPYTPVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLTNK--KRIAVVGPMAK 422

Query: 442 ATVAMIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA-ASEAA 494
               ++G++ G      +   Y    A F G A + Y  GC      ++ S FA A +  
Sbjct: 423 NGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGEAELRYAMGCKPQG--NDRSGFAGALDVV 480

Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           + +D  I+  G  L+   E+  R  + LP  Q +L+ ++ E  K P+ILV+  + G  + 
Sbjct: 481 RWSDVVIVCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PIILVL--SNGRPLE 537

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
                    AIL    PG  G R++A ++ G+ NP G+L IT          P ++  + 
Sbjct: 538 LNRMEPLCDAILEIWQPGINGARSMAGILSGRINPSGKLAIT---------FPYSTGQIP 588

Query: 615 PVDSLGYPGRTYK-FYNGPT---LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
              +    GR ++ FY   T    Y FGYGLSYT+F+Y +++ + T      KL      
Sbjct: 589 IYYNRRKSGRWHQGFYKDITSDPFYSFGYGLSYTEFQYGVVTPSSTTVKRGEKLS----- 643

Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
                                      +V   NVG  DG++ V  +   P       +K+
Sbjct: 644 --------------------------VEVTVTNVGKRDGAETVHWFISDPYCSITRPVKE 677

Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPI 786
           +  F++ F++ G  +  +F  +  + L  VD      L AGE+ I+V +  V   +
Sbjct: 678 LKHFEKQFIKVGETRTFRFDVDLERDLGFVDGNGKRFLEAGEYNIWVQDQKVKIEL 733


>gi|336404202|ref|ZP_08584900.1| hypothetical protein HMPREF0127_02213 [Bacteroides sp. 1_1_30]
 gi|335943530|gb|EGN05369.1| hypothetical protein HMPREF0127_02213 [Bacteroides sp. 1_1_30]
          Length = 735

 Score =  278 bits (710), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 222/768 (28%), Positives = 357/768 (46%), Gaps = 115/768 (14%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ D+  P   R+ DL+SRMTL+EK+ QL  +  G         E   E     S +G  
Sbjct: 29  LYKDAKAPIEKRIDDLISRMTLEEKILQLNQYTLGRNNNVNNVGE---EVKKVPSEIGSL 85

Query: 113 THFD---------------------------DVIPG-ATSFPTVILTTASFNESLWKKIG 144
            +FD                           D I G  T +P  +    S+N  L ++  
Sbjct: 86  IYFDINPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQAC 145

Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
              + EAR       +G+ + +SP I+VARDPRWGR+ E  GEDP+  G +A   VRG Q
Sbjct: 146 AVSAQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRGYQ 199

Query: 204 -DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
            D    EN         ++++C KHY  Y         R +    ++ Q + +T+L P+E
Sbjct: 200 GDDMSAEN---------RMAACLKHYVGYGASE---AGRDYVYTEISAQTLWDTYLLPYE 247

Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
           M VK G A+++M S+N ++G+P  A+P ++ + ++  W   G+IV+D  +++ +   ++ 
Sbjct: 248 MGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQG 304

Query: 323 LADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
           LA +K+DA      AGL++D   + Y       V++GKV    +D+S++ +  V  RLG 
Sbjct: 305 LAATKKDAAQYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGL 364

Query: 382 FDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
           F+     V+  K      +++ +AA+ A E +VLLKND   LPL +   K +AVVGP A 
Sbjct: 365 FERPYTPVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLTNK--KKIAVVGPMAK 422

Query: 442 ATVAMIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA-ASEAA 494
               ++G++ G      +   Y    A F G A + Y  GC      ++ S FA A + A
Sbjct: 423 NGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKPQG--NDRSGFAGALDVA 480

Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           + +D  I+  G  L+   E+  R  + LP  Q +L+ ++ E  K PVILV+  + G  + 
Sbjct: 481 RWSDVVIVCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PVILVL--SNGRPLE 537

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MP 612
                    AIL    PG  G R++A ++ G+ NP G+L +T+         P ++  +P
Sbjct: 538 LNRMEPLCDAILEIWQPGINGARSMAGILSGRINPSGKLAMTF---------PYSTGQIP 588

Query: 613 L---RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
           +   R     G+ G  YK      LYPFG+GLSYT+FKY  +                  
Sbjct: 589 IYYNRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGTV------------------ 629

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
              T  A+K          ++  D    +V   N G+ DG++ V  +   P       +K
Sbjct: 630 ---TPSATK----------VKRGDKLSAEVTVTNTGARDGAETVHWFISDPYCSITRPVK 676

Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
           ++  F++  ++AG  K  +F  +  +    V+      L AGE+ I V
Sbjct: 677 ELKHFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724


>gi|15837447|ref|NP_298135.1| family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
 gi|9105751|gb|AAF83655.1|AE003924_1 family 3 glycoside hydrolase [Xylella fastidiosa 9a5c]
          Length = 882

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 163/423 (38%), Positives = 235/423 (55%), Gaps = 40/423 (9%)

Query: 68  LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
           LV++MTL EK+ Q  + A  +PRLG+P Y+WWSE LHG++  G           AT FP 
Sbjct: 37  LVAKMTLQEKITQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YATVFPQ 86

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLG---------RAGLTYWSPNINVARDPRWG 178
            I   AS+N  L + +G   STEARA +NL           AGLT WSPNIN+ RDPRWG
Sbjct: 87  AIGLAASWNTDLLQHVGTVTSTEARAKFNLAGGPGKDHPRYAGLTLWSPNINIFRDPRWG 146

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  ET GEDP++ G+ AV+++RGLQ         ++   P  +++  KH+A   V +   
Sbjct: 147 RGMETYGEDPYLTGQLAVSFIRGLQG--------NIPDHPRTIATP-KHFA---VHSGPE 194

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
             R+ FD  V+  D+E T+   F   + +G A SVMC+YN ++G P+CA   LLN  +R 
Sbjct: 195 PGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRN 254

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQG 358
           +W  +G++V+DCD+I  M   H F  D+   A A  LK+G DL+CG  Y +    A+ +G
Sbjct: 255 DWGFNGFVVSDCDAIDDMTRFHFFRQDNAS-ASAAALKSGNDLNCGNTYRDLN-QAIARG 312

Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLL 416
            + E  +D++L  L+    RLG         Y ++G + I +  +  LA +AA + +VLL
Sbjct: 313 DIDEALLDQALIRLFAARQRLGTLQPREHDPYATIGIKHIDTPAHRALALQAAVQSLVLL 372

Query: 417 KNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYK 473
           KN  NTLPL      T+AV+GP A++  A+  NY G     ++P+ G     G A + Y 
Sbjct: 373 KNSGNTLPLTPG--TTLAVLGPDADSLTALEANYQGTSSTPVTPLTGLRTRFGAAKIHYA 430

Query: 474 TGC 476
            G 
Sbjct: 431 QGA 433



 Score =  147 bits (370), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 100/301 (33%), Positives = 139/301 (46%), Gaps = 55/301 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A  A   ADA +   GL   VE E L          DR  + LP  Q  L+  V    K 
Sbjct: 604 AERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK- 662

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           P+I+V+MS   V + +A+ + N  AIL A YPG+ GG AIA  + G  NPGGRLP+T+Y 
Sbjct: 663 PLIVVLMSGSAVALNWAQHHAN--AILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFYR 720

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
               Q LP       P  S    GRTY+++ G  LYPFGYGLSYTQF Y           
Sbjct: 721 S--TQDLP-------PYISYDMTGRTYRYFKGQPLYPFGYGLSYTQFTYE---------- 761

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                 P +    L+  D        +N G+  G +VV +Y +P
Sbjct: 762 ---------------------APQLSTATLKAGDTLTVTAHVRNTGTRAGDEVVQLYLEP 800

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           P    A  ++ ++GF+RV +R G ++ + F  +  + L+ V       + AG + +FVG 
Sbjct: 801 PHSPQAP-LRNLVGFKRVTLRPGESRLLTFTLD-TRQLSSVQQTGQRSVEAGHYHLFVGG 858

Query: 780 G 780
           G
Sbjct: 859 G 859


>gi|189464219|ref|ZP_03013004.1| hypothetical protein BACINT_00556 [Bacteroides intestinalis DSM
           17393]
 gi|189438009|gb|EDV06994.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 865

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 161/408 (39%), Positives = 231/408 (56%), Gaps = 28/408 (6%)

Query: 58  SLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDD 117
           S P S RV++L+S+MTL+EKV QL +    +PRL LP Y +W+E LHGV+  G       
Sbjct: 53  SQPISARVENLISKMTLEEKVAQLSNETDSIPRLNLPSYNYWNECLHGVARAGE------ 106

Query: 118 VIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRW 177
                T FP  I   ++++  L KK+  A+STEAR  Y     GLTYWSP IN+ARDPRW
Sbjct: 107 ----VTVFPQAINLASTWDTLLIKKVASAISTEARLKYLEIGKGLTYWSPTINMARDPRW 162

Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWK 237
           GR  ET GEDP++  R  V +V+GLQ    H +        LK  +  KH+ A + +N  
Sbjct: 163 GRNEETYGEDPYLTSRLGVAFVKGLQG--DHPDY-------LKTVATIKHFVANNQEN-- 211

Query: 238 GVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVR 297
             DR+   +++  + + E +   +E CVKE DA SVM +YN  NG+       LL   +R
Sbjct: 212 --DRFSSSSQIPTKQLYEYYFPAYEACVKEADAQSVMTAYNAFNGVAPSGSTWLLGDVLR 269

Query: 298 GEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQ 357
            EW   G++V+DC +I VM   H+ + +S E+A A  + +G DL+CG  Y      AV+ 
Sbjct: 270 KEWGFDGFVVSDCGAIGVMNWQHR-VVNSLEEAAALGINSGCDLECGGTYREKLVAAVKM 328

Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAAREGIVL 415
           G V E  IDK+L  + T   +LG FD      Y    K+ +  ++  +LA EAA + IVL
Sbjct: 329 GLVSEQAIDKALTRVLTARFKLGEFDPIELVPYNHYDKKLLAGEKFGKLAYEAAVKSIVL 388

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
           LKND + LP++  K+++VA+VGP A+     +G Y+G P   +S + G
Sbjct: 389 LKNDNDFLPVDKKKIRSVAIVGPFADNN--YLGGYSGKPVHNVSLLQG 434



 Score =  115 bits (287), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 95/331 (28%), Positives = 153/331 (46%), Gaps = 55/331 (16%)

Query: 467 YANVTYKTGCDDVACKSN-NSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGY 525
           Y N T    C  V+   N + I    E    AD  ++  G D  +  E+ D   ++LP  
Sbjct: 578 YINKTGAAACMLVSDFGNSDQIDKVKEFVSGADLVLVALGNDEKLARENRDLPSIYLPMT 637

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q  L+ ++ +V   P   +I+  G   +       N+ AIL A YPG+EGG+A+A ++FG
Sbjct: 638 QELLLKEIYKV--NPRTALILHTGN-PLTSKWAAENVPAILQAWYPGQEGGKALAGILFG 694

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY---PGRTYKFYNGPTLYPFGYGLS 642
             NP G+LP+T Y  +  + LP         D L Y    GRTY++ +   LY FG+GLS
Sbjct: 695 SENPSGKLPMTIYESE--EQLP---------DILDYDIWKGRTYQYLSSKPLYGFGHGLS 743

Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ 702
           Y+ F+Y  L     +                                R D   +  ++ +
Sbjct: 744 YSNFEYTHLQSDDVV--------------------------------RPDGTLQCSIEIK 771

Query: 703 NVGSTDGSDVVIVY-SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
           N+    G +VV VY S+    +    +K+++ F RV ++ G +K + F   A + L+I  
Sbjct: 772 NISDVAGEEVVQVYISRENTPVYTFPLKKLVAFARVDLKPGESKTVTFTI-APRQLSIWQ 830

Query: 762 YAANTLLPAGEHTIFVGNG--GVSFPIHLNF 790
                +LP G++++FVG+G  G+S  I+ NF
Sbjct: 831 EGIWKMLP-GKYSLFVGSGQEGLSKGINRNF 860


>gi|94970273|ref|YP_592321.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
 gi|94552323|gb|ABF42247.1| Beta-glucosidase [Candidatus Koribacter versatilis Ellin345]
          Length = 881

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 167/446 (37%), Positives = 245/446 (54%), Gaps = 52/446 (11%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + + SL    R  DLV RMT++EKV QL + +  VPRL +P Y+WWSEALHGV+      
Sbjct: 30  YLNPSLAPEKRAADLVHRMTVEEKVSQLTNDSRAVPRLNVPDYDWWSEALHGVAQ----- 84

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                 PG T +P  +   A+F+    +++ + +  E R  +  G          GL +W
Sbjct: 85  ------PGVTEYPQPVALAATFDNDKVQRMARFIGIEGRIKHEEGMKDGHSDIFQGLDFW 138

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN+ RDPRWGR  ET GEDPF+  R  V YV+GLQ         D     L +S+  
Sbjct: 139 APNINIFRDPRWGRGQETYGEDPFLTARMGVAYVKGLQG--------DDPKYYLAISTP- 189

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KHYA   V +     R+  D +V++ D  +T+L  F   V E  A SVMC+YN +NG P+
Sbjct: 190 KHYA---VHSGPETTRHFADVKVSKHDELDTYLPAFRATVTEAKAGSVMCAYNSINGQPA 246

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC-- 343
           C +  LL   +RG+W+  GY+V+DC++I  +  +HKF   ++ +A A  ++ G+D +C  
Sbjct: 247 CVNEFLLQDQLRGKWNFQGYVVSDCEAIINIYRDHKF-TKTQAEASALAVQRGMDNECVD 305

Query: 344 -------GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK--- 393
                    Y   F  +A +QG +KE++ID +L  L+T  M+LG FD  P+ V   K   
Sbjct: 306 FGKQKDDHDYRPYF--DAYKQGILKESEIDTALVRLFTARMKLGMFD-PPEMVPYSKIDP 362

Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
           +++ S E+ ELA   A E +VLLKND  TLPL  + +K +AV+GP A  T  ++GNY G 
Sbjct: 363 KELESAEHRELARTLANESMVLLKND-GTLPLKKSGLK-IAVIGPLAEQTRYLLGNYNGT 420

Query: 454 PCRYMSPIAGFSGY---ANVTYKTGC 476
           P   +S + G       A +T++ G 
Sbjct: 421 PSHTVSVLEGLRAEFPDAQITFERGT 446



 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 99/302 (32%), Positives = 147/302 (48%), Gaps = 55/302 (18%)

Query: 489 AASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAK 538
           AA  AAK AD  I + G+   +E E +          DR  L LP  + QL+  ++   K
Sbjct: 602 AAVTAAKNADVVIAVLGITSDLEGEEMPVSEEGFNGGDRTSLDLPKPEQQLLESISAAGK 661

Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
            PV+LV+ +   + + +A+ + N  AIL   YPGEEGG AIA  + GK NP GRLP+T+Y
Sbjct: 662 -PVVLVLSNGSALSVNWAQQHAN--AILEGWYPGEEGGTAIAQTLSGKNNPAGRLPVTFY 718

Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
            G   + LP       P +     GRTY+++ G  LYPFGYGLSYT F Y  L+  K   
Sbjct: 719 TG--TEQLP-------PFEDYAMKGRTYRYFEGKPLYPFGYGLSYTTFSYRDLALPKA-- 767

Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSK 718
                                         L   D    +V   N G  +G +V  +Y  
Sbjct: 768 -----------------------------PLNAGDPVTAQVTVTNTGKVEGDEVAQLYLS 798

Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
            P  IA   ++ + GF+R+ ++AG ++ IKF     + L++V+ A + ++  GE+++ VG
Sbjct: 799 FP-NIAGAPLRALRGFRRIHLKAGESQTIKFELKD-RDLSMVNEAGDPIIAEGEYSVSVG 856

Query: 779 NG 780
            G
Sbjct: 857 GG 858


>gi|225873995|ref|YP_002755454.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
 gi|225792796|gb|ACO32886.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
          Length = 896

 Score =  277 bits (709), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 171/432 (39%), Positives = 232/432 (53%), Gaps = 42/432 (9%)

Query: 60  PYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVI 119
           P   RV +LVS+MTL E+  Q+ + A  +PRLG+P Y WWSE LHG++  G         
Sbjct: 45  PIQKRVHELVSQMTLQEEAAQMMNTAPAIPRLGVPAYNWWSEGLHGIARSG--------- 95

Query: 120 PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINV 171
             AT FP  I  +A+F+ +   ++G  VSTEARA YN            GLT W+PNIN+
Sbjct: 96  -YATVFPQAIGMSATFDPAAIHQMGTTVSTEARAKYNWAIRHDIHSIYFGLTLWAPNINI 154

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
            RDPRWGR  ET GEDPF+ G  A  YV GLQ           N + LK  +  KH++ Y
Sbjct: 155 VRDPRWGRGQETYGEDPFLTGTMAAEYVSGLQGN---------NPKYLKTVATPKHFSVY 205

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
              N     R+  +A  +  DM++T+L  F M + +G A S+MCSYN V G+PSCA+ KL
Sbjct: 206 ---NGPESMRHKINANPSAHDMQDTYLAAFRMAITKGHADSMMCSYNAVYGVPSCAN-KL 261

Query: 292 LNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
           L   VRG+W   GYI +DC +I        H +  D+   A +  L AG D DCG  Y  
Sbjct: 262 LADVVRGKWGFDGYITSDCGAISDFYRPGAHGYSPDAVHAAASAVL-AGTDTDCGTGYKV 320

Query: 350 FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAE 407
               +VQQG + +  ID++++ L+T   RLG FD      Y S+    + S  +   A E
Sbjct: 321 LP-QSVQQGLISKAAIDRAVERLFTARFRLGMFDPKADVPYNSIPYSVVDSAAHRAQALE 379

Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG- 466
            A + +VLLKN+   LPL +A  +T+AVVGP+A    ++ GNY  IP     P+ G    
Sbjct: 380 DASKSMVLLKNEGGILPLRNA--RTIAVVGPNAANLNSIEGNYNAIPSHPSLPVDGIEAA 437

Query: 467 --YANVTYKTGC 476
              A+V Y  G 
Sbjct: 438 FPQAHVVYAQGS 449



 Score =  128 bits (322), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 84/263 (31%), Positives = 133/263 (50%), Gaps = 45/263 (17%)

Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
           DR  L LP  Q  L++ +    K PV+LV+++   + I +A+ +  ++ IL A YPGE G
Sbjct: 655 DRTRLSLPQTQQDLLHALVATGK-PVVLVLLNGSALSIDWAKQH--VQGILEAWYPGEAG 711

Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLY 635
           G AI + + G+ +PGG+LPIT+Y    V+ LP       P       GRTY++Y G  L+
Sbjct: 712 GEAIGETLSGQNDPGGKLPITFYTS--VKDLP-------PFTDYSMKGRTYRYYTGKPLF 762

Query: 636 PFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYF 695
           PFGYGLSYT F+Y+                H R               +  ++L+  +  
Sbjct: 763 PFGYGLSYTTFEYS----------------HVR---------------LSTSNLKAGEPL 791

Query: 696 EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACK 755
             + + +N G   G  V  VY  PP +     +K++ GF RV +  G+++++ F  N  +
Sbjct: 792 TVEAEVKNTGHVAGDAVTEVYVTPP-QNGVNPLKELKGFDRVHLAPGQSRQLTFTLNP-R 849

Query: 756 SLNIVDYAANTLLPAGEHTIFVG 778
            L++VD A    +  G ++IFVG
Sbjct: 850 DLSLVDEAGKRSVQPGVYSIFVG 872


>gi|110737298|dbj|BAF00595.1| xylosidase [Arabidopsis thaliana]
          Length = 303

 Score =  277 bits (709), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 135/242 (55%), Positives = 171/242 (70%), Gaps = 13/242 (5%)

Query: 7   SLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVK 66
           +LL  +  + +LVF    V ++ S  P+F CDP      GL   +  FC +++P  +RV+
Sbjct: 7   ALLIGNKVVVILVFLLCLVHSSESLRPLFACDPAN----GLT-RTLRFCRANVPIHVRVQ 61

Query: 67  DLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFP 126
           DL+ R+TL EK++ L + A  VPRLG+  YEWWSEALHG+S+VGPG  F    PGATSFP
Sbjct: 62  DLLGRLTLQEKIRNLVNNAAAVPRLGIGGYEWWSEALHGISDVGPGAKFGGAFPGATSFP 121

Query: 127 TVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGE 186
            VI T ASFN+SLW++IG+ VS EARAMYN G AGLTYWSPN+N+ RDPRWGR  ETPGE
Sbjct: 122 QVITTAASFNQSLWEEIGRVVSDEARAMYNGGVAGLTYWSPNVNILRDPRWGRGQETPGE 181

Query: 187 DPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDA 246
           DP V  +YA +YVRGLQ        T   +R LKV++CCKHY AYD+DNW GVDR+HF+A
Sbjct: 182 DPIVAAKYAASYVRGLQ-------GTAAGNR-LKVAACCKHYTAYDLDNWNGVDRFHFNA 233

Query: 247 RV 248
           +V
Sbjct: 234 KV 235


>gi|380512525|ref|ZP_09855932.1| beta-glucosidase [Xanthomonas sacchari NCPPB 4393]
          Length = 885

 Score =  277 bits (708), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 167/423 (39%), Positives = 230/423 (54%), Gaps = 40/423 (9%)

Query: 68  LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
           LV++MT  EK+ Q  + A  +PRLG+P YEWWSE LHG++  G           AT FP 
Sbjct: 40  LVAKMTRAEKIAQAMNAAPAIPRLGVPAYEWWSEGLHGIARNGE----------ATVFPQ 89

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARDPRWG 178
            I   A++N  L   +G   STEARA +NL           AGLT WSPNIN+ RDPRWG
Sbjct: 90  AIGLAATWNPELLHDVGTVTSTEARAKFNLAGGPGKDHPRYAGLTIWSPNINIFRDPRWG 149

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  ET GEDP++ GR AV ++ GLQ         D  + P  +++  KH A   V +   
Sbjct: 150 RGMETYGEDPYLTGRLAVGFIHGLQ--------GDDPAHPRTIATP-KHLA---VHSGPE 197

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
             R+ FD  V+  D E T+   F   + +G A SVMC+YN ++G P+CA   L++  VRG
Sbjct: 198 PGRHGFDVDVSPHDFEATYSPAFRAAIVDGQAGSVMCAYNSLHGTPACAADWLIDGRVRG 257

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQG 358
           +W   G++V+DCD+I  M   H +  D+   + A  LKAG DL+CG  Y    G A  +G
Sbjct: 258 DWGFKGFVVSDCDAIDDMTQFHYYRPDNAGSSAA-ALKAGHDLNCGTAYREL-GIAFDRG 315

Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDG--SPQYVSLGKQDICSDENIELAAEAAREGIVLL 416
           +  E  +D+SL  L+    RLG      +  Y  LG +DI S  +  LA +AA++ +VLL
Sbjct: 316 EADEALLDRSLVRLFAARYRLGELQPRRNDPYARLGARDIDSAAHRALALQAAQQSLVLL 375

Query: 417 KNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYK 473
           KN   TLPL       +AV+GP+A+A  A+  NY G   + ++P+ G     G A V Y 
Sbjct: 376 KNANATLPLRPG--LRLAVLGPNADALAALEANYQGTSVQPVTPLQGLRTRFGAAQVAYA 433

Query: 474 TGC 476
            G 
Sbjct: 434 QGA 436



 Score =  140 bits (353), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 92/286 (32%), Positives = 141/286 (49%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR DL LP  Q  L+ + A+ +  P+++V+MS   V + 
Sbjct: 622 GLSPDVEGEELRIDVPGFDGGDRNDLALPAAQQALLER-AKASGKPLVVVLMSGSAVALN 680

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +AE + +  AI+ A YPG+ GG AIA  + G  NPGGRLP+T+Y          ++  L 
Sbjct: 681 WAEQHAD--AIIAAWYPGQSGGTAIAQALAGDINPGGRLPVTFYR---------STKDLP 729

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
           P  S    GRTY+++ G  L+PFGYGLSYTQF Y+                         
Sbjct: 730 PYVSYDMKGRTYRYFKGEPLFPFGYGLSYTQFAYD------------------------- 764

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                  P +    L+     +     +N G+  G +VV VY + P + A + ++ ++GF
Sbjct: 765 ------APQLSTTTLQAGQPLQVSTTVRNTGARAGDEVVQVYLQYP-QRAQSPLRSLVGF 817

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV ++ G  + + F  +A + L+ VD +    + AG++ +FVG G
Sbjct: 818 QRVHLQPGEARTLSFALDA-RQLSDVDRSGQRAVEAGDYRLFVGGG 862


>gi|393784338|ref|ZP_10372503.1| hypothetical protein HMPREF1071_03371 [Bacteroides salyersiae
           CL02T12C01]
 gi|392666114|gb|EIY59631.1| hypothetical protein HMPREF1071_03371 [Bacteroides salyersiae
           CL02T12C01]
          Length = 857

 Score =  277 bits (708), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 228/795 (28%), Positives = 355/795 (44%), Gaps = 146/795 (18%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQ-------------------LGDFAHGVP----- 89
           +  SSLP S RV DL+ RMTL+EK+ Q                   LG F  GV      
Sbjct: 28  YRQSSLPISERVDDLLGRMTLEEKIAQIRHIHSWNVFNGQDLDMEKLGKFTGGVSWGFVE 87

Query: 90  ------------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
                                   RLG+P +   +E+LHG            V  G+T +
Sbjct: 88  GFPLTGVNCKKNMQLIQKFMVENTRLGIPVFTV-AESLHG-----------SVHEGSTIY 135

Query: 126 PTVILTTASFNESLWKKIGQAVSTE--ARAMYNLGRAGLTYWSPNINVARDPRWGRITET 183
           P  I   ++F   L  +    ++ +  A+ M+ +        +P I+V RD RWGR+ E+
Sbjct: 136 PQNIAMGSTFRPELAYRKAAMITKDLHAQGMHQV-------LAPCIDVVRDLRWGRVEES 188

Query: 184 PGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYH 243
            GEDP + G + +  V+G  D     N          +S   KHY  +  +   G++   
Sbjct: 189 FGEDPVLCGLFGIAEVKGYMD-----NG---------ISPMLKHYGPHG-NPLSGLNLAS 233

Query: 244 FDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLH 303
            +  +  +D+ E +L+PFEM ++     +VM +YN  N +P+ A   LL + +RG++   
Sbjct: 234 VECGL--RDLHEVYLKPFEMVIRNTPVLAVMSTYNSWNHVPNSASHYLLTEVLRGQFGFK 291

Query: 304 GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKET 363
           GY+ +D  +I+++   H+ +A + E+A  Q   AGLD++            +Q+GK+ E 
Sbjct: 292 GYVYSDWGAIEMLKTLHR-VAHNSEEAAMQAFTAGLDVEASSNCYPLLAGLIQKGKLDEE 350

Query: 364 DIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTL 423
            +++S++ +     ++G F+  P        ++   E+I L+ E A E +VLLKN+   L
Sbjct: 351 VLNESVRRVLYAKFKMGLFE-DPYGEQYSHSEMHGAESIRLSKEIADESVVLLKNENGLL 409

Query: 424 PLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY--MSPIAG----FSGYANVTYKTGCD 477
           PLN+ K+K+VAV+GP  NA     G+Y         ++P+ G      G A V Y  GCD
Sbjct: 410 PLNADKLKSVAVIGP--NADQVQFGDYTWSRNNKDGVTPLEGIRRLLGGKATVRYAKGCD 467

Query: 478 DVACKSNNSIFAASEAAKTADATIILAG---------LDLSVEAESLDREDLWLPGYQTQ 528
            V+  +   I  A EAA+ ++  I+  G            S   E  D  DL L G Q Q
Sbjct: 468 LVSLNAG-GIKEAVEAARKSEVAILFCGSASAALARDYKSSTCGEGFDLNDLNLTGVQGQ 526

Query: 529 LINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFN 588
           LI +V E    PV+LV+++  G   A +    +I AIL   Y GE+ G +IAD++FG  +
Sbjct: 527 LIKEVYETGT-PVVLVLVT--GKPFAISWEKKHIPAILTQWYAGEQAGNSIADILFGSIS 583

Query: 589 PGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
           P GRL  ++         Y   LP      +   S   PGR Y F +   L+ FG+GL+Y
Sbjct: 584 PSGRLTFSYPQTTGHLPVYYNYLPSDKGFYKNPGSYESPGRDYVFSSPDALWAFGHGLTY 643

Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
           T F Y           NL   +    LN                     D     VD +N
Sbjct: 644 TSFVYK----------NLRTDKEHYGLN---------------------DTIYIDVDIKN 672

Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
            G  +G +VV +Y         T +KQ+  F++V V AG+ + +K    A   L IV+  
Sbjct: 673 TGKREGKEVVQLYVNDKVSTVVTPVKQLRDFKKVDVEAGKTETVKLKV-AVNDLYIVNAG 731

Query: 764 ANTLLPAGEHTIFVG 778
              ++  GE  + VG
Sbjct: 732 NKRVVEPGEFELQVG 746


>gi|380694149|ref|ZP_09859008.1| glycoside hydrolase 3 [Bacteroides faecis MAJ27]
          Length = 946

 Score =  277 bits (708), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 231/821 (28%), Positives = 372/821 (45%), Gaps = 148/821 (18%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
           ++ D +     R++DL+S+MTL+EK  Q+    +G  R+    LP  EW    W      
Sbjct: 52  VYEDPTATIDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKNQLWKDGIGA 110

Query: 101 --EALHGVSNVG-PGTHFDDVIPG------------------------------------ 121
             E L+G    G P +  +++ P                                     
Sbjct: 111 IDEHLNGFQQWGLPPSDNENIWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVES 170

Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
             AT+FPT +    ++N  L  ++G     EAR +      G T  ++P ++V RD RWG
Sbjct: 171 YKATNFPTQLGLGHTWNRRLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWG 224

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  E  GE P++V    +  VRG+Q    H +         ++++  KH+ AY  +    
Sbjct: 225 RYEEVYGESPYLVAELGIEMVRGMQ----HNH---------QIAATGKHFIAYSNNKGAR 271

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
                 D +++ +++E T + PF+  ++E     VM SYN  +G P  +    L   +RG
Sbjct: 272 EGMARVDPQMSPREVEMTHVYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRG 331

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
           E    GY+V+D D+++ +   H    D KE AV Q+++AGL++ C       Y       
Sbjct: 332 EMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLREL 390

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSL--GKQDICSDENIELAAEAAREG 412
           V++G + E  I+  ++ +  V   +G FD  P  + L    +++    N E+A +A+RE 
Sbjct: 391 VKEGGLSEEVINDRVRDILRVKFLVGLFD-HPYQIDLKGADEEVEKAANEEIALQASRES 449

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----FSGYA 468
           IVLLKND+N LPL+++ ++ +AV GP+A+     + +Y  +     S + G      G A
Sbjct: 450 IVLLKNDKNILPLDASGIQKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKGKA 509

Query: 469 NVTYKTGCDDVAC--------------KSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
            V Y  GCD V                +    I  A +  K AD  +++ G       E+
Sbjct: 510 EVLYTKGCDLVDANWPESELIDYPLTDEEQKEIEKAVDQTKQADVAVVVLGGGQRTCGEN 569

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
             R  L LPG Q  L+  VA   K PV+LV+++   + I +A  +  + AI+ A YPG +
Sbjct: 570 KSRSSLDLPGRQLDLLKAVAATGK-PVVLVLINGRPLSINWA--DKFVPAIVEAWYPGSK 626

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGRTYKFY-- 629
           GG+A+ADV+FG++NPGG+L +T+     V  +P  + P +P   +D    PG        
Sbjct: 627 GGKAVADVLFGEYNPGGKLTVTFPK--TVGQIPF-NFPCKPSSQIDGGKNPGMEGNMSRA 683

Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQV-NLNKLQHCRNLNYTSDASKTRCPGVLVND 688
           NG  LYPFGYGLSYT F+Y+ L  +  I   N      C+                    
Sbjct: 684 NG-ALYPFGYGLSYTTFEYSDLKISPAIITPNQQTFVTCK-------------------- 722

Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
                         N G   G +VV +Y +       TY K + GF+RV ++ G  K + 
Sbjct: 723 ------------VTNTGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERVHLQPGETKEVT 770

Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLN 789
           F  +  K+L +++   + ++  G+ T+ V  G  S  I LN
Sbjct: 771 FPIDR-KALELLNADMHWVVEPGDFTLMV--GASSTDIRLN 808


>gi|217968103|ref|YP_002353609.1| glycoside hydrolase family 3 [Dictyoglomus turgidum DSM 6724]
 gi|217337202|gb|ACK42995.1| glycoside hydrolase family 3 domain protein [Dictyoglomus turgidum
           DSM 6724]
          Length = 756

 Score =  277 bits (708), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 220/697 (31%), Positives = 339/697 (48%), Gaps = 110/697 (15%)

Query: 101 EALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA 160
           EALHG            +  G+T FP  I   +++N  L  ++  A+  E R+     R 
Sbjct: 138 EALHGC-----------MAKGSTIFPQAIGMASTWNPELIYQVATAIGKETRS-----RG 181

Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
                SP IN+ARDPR GR  ET GEDP++  R AV Y++G+Q+ +G             
Sbjct: 182 IHQVLSPTINIARDPRCGRTEETYGEDPYLASRMAVAYIKGVQE-QG------------- 227

Query: 221 VSSCCKHYAAYDVDNWKGVDRY--HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
           V +  KH+AA  V +  G D Y  HF  R+    + E +   F+  +KE  A S+M +YN
Sbjct: 228 VIATPKHFAANFVGD-GGRDSYPIHFSERL----LREVYFPAFKASIKEAGALSLMAAYN 282

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
            ++GIP  ++  LL   +R EW   GY+V+D  S+  ++  HK +A+SK +A    L+AG
Sbjct: 283 SLDGIPCSSNKWLLTDVLRKEWGFKGYVVSDYFSVLHLMTKHK-VAESKAEAARLALEAG 341

Query: 339 LDL-----DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG---SPQYVS 390
           LD+     DC +   N     V+ GK+ E  I+++++ +  V    G FD     P Y  
Sbjct: 342 LDMELPDSDCFEEMINL----VKGGKLSEETINEAVRRILGVKFWAGLFDNPFVDPDYAE 397

Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
             + + C+ E+ ELA   ARE IVLLKN +  LPL S  + ++AV+GP  NA V  +G Y
Sbjct: 398 --RVNDCA-EHRELALRVARESIVLLKN-EGILPL-SKDIGSIAVIGP--NAAVPRLGGY 450

Query: 451 AGIPCRYMSPIAG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
           +G   + ++P+ G        A + +  GC  +   S +    A + A+ +D  I+  G 
Sbjct: 451 SGYGVKIVTPLEGIKNKMENKAKIYFAEGCG-LNDTSKSGFDEAIKIAQKSDVAILFVGN 509

Query: 507 DL-SVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
            +   E E  DR +L LPG Q +LI ++      PVI+V+++  G  I        ++A+
Sbjct: 510 SVPETEGEQRDRHNLNLPGVQEELIKEICN-TNTPVIVVLIN--GSAITMMNWIDKVQAV 566

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL--TSMPLRPVDSLGYPG 623
           + A YPGEEGG AIADV+FG +NPGG+LPIT+    Y   LPL     P   VD      
Sbjct: 567 IEAWYPGEEGGNAIADVLFGDYNPGGKLPITF--PKYSSQLPLYYNHKPSGRVDD----- 619

Query: 624 RTYKFYNGPT-LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP 682
             Y     P  L+PFGYGLSYT+F+Y+                   NL  T +       
Sbjct: 620 --YVDLRSPQYLFPFGYGLSYTEFRYS-------------------NLRITPE------- 651

Query: 683 GVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
                ++  D       + +N+G   G +VV +Y           +K++  F+R+ +  G
Sbjct: 652 -----EIPMDGEITITFEVENIGKYKGDEVVQLYLHDEFASVVRPVKELKRFKRITLAVG 706

Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             K + F  +  + L  ++     ++  G   +F+G+
Sbjct: 707 EKKTVSFKLDR-RDLEFLNIDMEPIVEPGRFEVFIGS 742


>gi|253574420|ref|ZP_04851761.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
 gi|251846125|gb|EES74132.1| glycoside hydrolase [Paenibacillus sp. oral taxon 786 str. D14]
          Length = 782

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 229/820 (27%), Positives = 374/820 (45%), Gaps = 154/820 (18%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEK----VQQLG--DFAH--------------- 86
           ++   L+ DSS P   RV+ L+  MTL+EK    VQ  G   + H               
Sbjct: 14  EVEMLLYKDSSKPIPERVEHLLGLMTLEEKAGQLVQPFGWQTYEHKDGEIKLTEAFKAQV 73

Query: 87  ---GVPRL-GLPQYEWWS--------------EALHGVSN-------------VGPGTHF 115
              GV  L G+ + + W+              EA++ +               +G     
Sbjct: 74  KNGGVGSLYGVLRADPWTGVTLETGLSPREGTEAVNAIQRYAIENSRLGIPILIGEECSH 133

Query: 116 DDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDP 175
             +  GAT FP  +   +++N  L++++ +AV+ E RA     + G   +SP ++V RDP
Sbjct: 134 GHMAIGATVFPVPLSLGSTWNVELYREMCRAVARETRA-----QGGAVTYSPVLDVVRDP 188

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQ--DVEGHENATDLNSRPLKVSSCCKHYAAY-D 232
           RWGR  E  GED +++   AV  V GLQ   ++G ++          V++  KH+  Y  
Sbjct: 189 RWGRTEECFGEDAYLISEMAVASVEGLQGESLDGEDS----------VAATLKHFVGYGS 238

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
            +  +     H   R    ++ E  L PF   V+ G A+S+M +YN ++G+P   + +LL
Sbjct: 239 SEGGRNAGPVHMGRR----ELLEVDLLPFRKAVEAG-AASIMPAYNEIDGVPCTTNEELL 293

Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFT 351
           +  +RGEW   G ++ DC +I ++   H    D + DA  Q ++AG+D++  G  +    
Sbjct: 294 DGVLRGEWGFDGMVITDCGAIDMLASGHDVAEDGR-DAAIQAIRAGIDMEMSGVMFGKHL 352

Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAARE 411
             AV+ G+++E  +D++++ + T+  RLG F+         ++ I S E++ELA + A E
Sbjct: 353 VEAVRSGQLEEEVLDRAVRRVLTLKFRLGLFERPYADPERAERVIGSAEHVELARQLASE 412

Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR--YMSPIAGFSGY-- 467
           G+VLLKN    LPL SA   T+AV+GP+A+A    +G+Y     R    + + G      
Sbjct: 413 GVVLLKNKDGVLPL-SADAGTIAVIGPNADAGYNQLGDYTSPQPRSKVTTVLGGIRSKLA 471

Query: 468 ---ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG-----------LDLSVEA- 512
                V Y  GC  +   S      A   A+ AD  +++ G           +DL   A 
Sbjct: 472 ETPERVLYAPGC-RINGNSREGFDVALSCAEKADTVVMVVGGSSARDFGEGTIDLRTGAS 530

Query: 513 -------------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
                        E +DR +L L G Q +LI ++ ++ K P+++V ++  G  IA    +
Sbjct: 531 KVTDNAESDMDCGEGIDRMNLSLSGVQLELIQEIHKLGK-PLVVVYIN--GRPIAEPWID 587

Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
            +  AIL A YPG+EGG AIAD++FG  NP GRL I+     +V  +P+     R     
Sbjct: 588 EHADAILEAWYPGQEGGHAIADILFGDVNPSGRLTISIPK--HVGQVPVYYHGKRS---- 641

Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
              G+ Y   +    YPFGYGLSYT+F YN                   NL   SD    
Sbjct: 642 --RGKRYLEGDSQPRYPFGYGLSYTEFTYN-------------------NLKLESDT--- 677

Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
                    +  D   +  V+  NVG   G++V+ +Y    A       K++ GF+++F+
Sbjct: 678 ---------INKDGSTKVTVEVTNVGERAGAEVIQLYITDVASKVTRPAKELKGFRKIFL 728

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           + G  + ++F     + L  +      ++  GE  + VG 
Sbjct: 729 QPGETQTVEFTVGP-EQLQYIGQNYKPVVEPGEFRVHVGK 767


>gi|332982620|ref|YP_004464061.1| glycoside hydrolase [Mahella australiensis 50-1 BON]
 gi|332700298|gb|AEE97239.1| glycoside hydrolase family 3 domain protein [Mahella australiensis
           50-1 BON]
          Length = 753

 Score =  276 bits (707), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 219/678 (32%), Positives = 337/678 (49%), Gaps = 84/678 (12%)

Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRI 180
           GAT FP  I   ++++    + +   +  + +A       GL   SP ++VARDPRWGR+
Sbjct: 108 GATVFPQAIGLASTWDAEAIEAMAGVIRQQMKAAG--AHQGL---SPVLDVARDPRWGRV 162

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            ET GEDP++V   AV+YVRGLQ         DL      + +  KH+A +   ++    
Sbjct: 163 EETFGEDPYLVASMAVSYVRGLQ-------GQDLTK---GIFATLKHFAGH---SFSEGG 209

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
           R      V E+++ + FL PFE  V+E +A SVM +Y+ ++G+P  A  +LL   +RG +
Sbjct: 210 RNCAPVHVGERELWDIFLFPFEAAVREANAKSVMNAYHDIDGVPCAASRELLTDILRGHF 269

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY--YTNFTGNAVQQG 358
              G +V+D D+I  +   H F A +K++A  Q L+AG+D++  +   Y     +AV++G
Sbjct: 270 GFDGIVVSDYDAIDRLRKAH-FTAGNKKEAAVQALEAGIDIELPKMDCYGQPLMDAVKEG 328

Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKN 418
            + E  I++S++ + T    LG FDG    V        + E  E++ + AR+ IVLLKN
Sbjct: 329 MISEATINESVERVLTAKFELGLFDGVYVDVDSVPGLFETPEQREMSRDIARKSIVLLKN 388

Query: 419 DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY-----MSPIAGFSGYAN---- 469
           D N LPL S  +K++AV+GP+A+    M+G+YA +  R      +  +    G  N    
Sbjct: 389 D-NVLPL-SKDIKSIAVIGPNADNARNMLGDYAFMAHRSYDKTSVHIVTVLEGIKNKVLD 446

Query: 470 ---VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSV-----EAESLDREDLW 521
              +TY  GCD +   S +    A  AA+ ADA I++ G +  +       E+ DR D+ 
Sbjct: 447 SCRITYAKGCD-IIDPSTDGFVEAVNAARAADAAIVVVGDNSGIFGKGTSGENDDRTDIT 505

Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
           LPG Q QL+  + +  K PVI+V+++  G   A  E   N  A++ A YPGEEGG A+AD
Sbjct: 506 LPGVQMQLVKAIKDTGK-PVIVVLIN--GRAFAAKELADNASALMEAWYPGEEGGNAVAD 562

Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
           V+FG +NP GRLPI+      V  +P+ +  L+P   + Y     K       + FGYG+
Sbjct: 563 VLFGDYNPAGRLPISLPC--EVGQIPI-NYNLKPASYINYLSTETK-----PAFAFGYGM 614

Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
           SYT F Y+ LS T  +  +  K+                                FKV  
Sbjct: 615 SYTTFGYSDLSITPAVAPSAGKVD-----------------------------ISFKV-- 643

Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
            N G   G +VV +Y +         +K++ GF+RV ++ G  K I F   A   L   D
Sbjct: 644 TNAGQLAGDEVVQLYIRDEVSSIVRPVKELKGFKRVNLQPGETKEITFTLYA-DQLAFHD 702

Query: 762 YAANTLLPAGEHTIFVGN 779
                ++  G   I VG+
Sbjct: 703 KDMRLVVEPGTFKIMVGS 720


>gi|317477144|ref|ZP_07936385.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316906687|gb|EFV28400.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 814

 Score =  276 bits (707), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 224/724 (30%), Positives = 335/724 (46%), Gaps = 109/724 (15%)

Query: 86  HGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQ 145
           HG  RLG+P +    E  HG   +G            T FPT I   +++N  L +++G+
Sbjct: 147 HG--RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRRMGR 192

Query: 146 AVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV 205
           A++TEA A     +     + P +++ARDPRW R+ ET GED ++ G      V+G Q  
Sbjct: 193 AIATEASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDAYLNGVMGAALVKGFQG- 246

Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
                  +      KV +  KH+AAY    W         A V  ++MEE    PF   V
Sbjct: 247 -------EFPRTKGKVIATLKHFAAY---GWTEGGHNGGSAHVGNREMEEAIYPPFREAV 296

Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
             G A SVM SYN ++GIP  A+  LL   ++  W   G++V+D  +I  + ++   +AD
Sbjct: 297 AAG-ALSVMSSYNEIDGIPCTANSNLLTGLLKKRWQFKGFVVSDLYAIGGLREHG--VAD 353

Query: 326 SKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
           +  +A  + + AG+D D G   Y     NAV++G V+E  I+K++  +  +   +G FD 
Sbjct: 354 TDYEAAVKAVNAGVDSDLGTNVYAGQLVNAVKRGDVQEVVINKAVSRILALKFHMGLFDH 413

Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
                   +Q + S E++ELA E AR+ I+LLKN    LPLN  K+KT+AV+GP+A+   
Sbjct: 414 PFVDEREPEQVVASTEHLELAREVARQSIILLKNKNELLPLNK-KMKTIAVIGPNADNIY 472

Query: 445 AMIGNYAGIPCRYMSPIAGFSGY-------ANVTYKTGCDDVACKSNNSIFAASEAAKTA 497
            M+G+Y   P    S +    G         ++ Y  GC  V   S +    A EAA+ +
Sbjct: 473 NMLGDYTA-PQSESSVVTVLDGIRQKVSNDTHIIYAKGCA-VRDSSKSGFQEAIEAARQS 530

Query: 498 DATIILAG----LDLSVE-------------------AESLDREDLWLPGYQTQLINQVA 534
           D  +++ G     D S +                    E  DR  L L G Q +LI +V 
Sbjct: 531 DVVVMVMGGSSARDFSSKYEETGAAKVSDSHISDMESGEGYDRSTLELLGRQRELIREVG 590

Query: 535 EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
           ++ K P++LV++   G  +        + AI+ A YPG +GG A+ADV+FG +NP GRL 
Sbjct: 591 KLNK-PIVLVLIK--GRPLLLEGIEAEVDAIVDAWYPGMQGGNAVADVLFGDYNPAGRLT 647

Query: 595 ITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
           I+      V  LP+     R  +        Y    G   YPFGYGLSYT F Y+     
Sbjct: 648 ISVPRS--VGQLPVYYNTKRKGNR-----SKYIEEEGTPRYPFGYGLSYTSFNYS----- 695

Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
                         +L      ++  C   LVN           V  +N GS DG +VV 
Sbjct: 696 --------------DLKAEVVEAEDSC---LVN---------ISVKVRNEGSRDGDEVVQ 729

Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
           +Y +       T  KQ+ GFQR+ ++ G  K I F  +  KSL +        +  G  T
Sbjct: 730 LYLRDEVASFTTPFKQLCGFQRIHLKVGETKEITFRLDK-KSLALYMQNEEWAVEPGRFT 788

Query: 775 IFVG 778
           + +G
Sbjct: 789 LMLG 792


>gi|154493680|ref|ZP_02033000.1| hypothetical protein PARMER_03021 [Parabacteroides merdae ATCC
           43184]
 gi|423723902|ref|ZP_17698051.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
           CL09T00C40]
 gi|154086890|gb|EDN85935.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Parabacteroides merdae ATCC 43184]
 gi|409240709|gb|EKN33484.1| hypothetical protein HMPREF1078_02038 [Parabacteroides merdae
           CL09T00C40]
          Length = 868

 Score =  276 bits (706), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 161/452 (35%), Positives = 235/452 (51%), Gaps = 44/452 (9%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           +   + F +  LP   R+ DL+ R+T +EK+ Q+ +    + RLG+P+Y+WW+EALHGV+
Sbjct: 22  RQEDYPFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEALHGVA 81

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
             G           AT FP  I   A+F++    +    VS EARA Y+  +        
Sbjct: 82  RAGK----------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKNKEYDRY 131

Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT+W+PNIN+ RDPRWGR  ET GEDP++  R  V  V+GLQ  +          +  
Sbjct: 132 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGVAVVKGLQGDD---------PKYF 182

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           K  +C KHYA +    W   +R+ FD  VT +D+ +T+L  FE  VK+G+   VMC+YNR
Sbjct: 183 KTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKKGNVQEVMCAYNR 239

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ------VMVDNHKFLADSKEDAVAQ 333
             G P C+  KLL   +R  W     I++DC +I            H+   D+ E A A 
Sbjct: 240 YQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWQRDERTPRHETHPDA-ESASAD 298

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
            +  G DL+CG  Y      A+++GK+ E D+D SL+ L      LG FD   +  Y  +
Sbjct: 299 AVLNGTDLECGNSYKALI-KALKEGKISENDLDVSLRRLLKGRFELGMFDPDERVPYAQI 357

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
               + S E++  A E A + +VLLKN  NTLPL S  ++ +AVVGP+A  +  +  NY 
Sbjct: 358 PYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPL-SKTIRKIAVVGPNAADSTMLWANYN 416

Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
           G P   ++ + G         V Y+ GC+  A
Sbjct: 417 GFPTHTVTILEGIRNKVPDTEVIYELGCNHAA 448



 Score =  113 bits (282), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 92/320 (28%), Positives = 140/320 (43%), Gaps = 58/320 (18%)

Query: 473 KTGCDDVACK--SNNSIFAASEAAKTADATIIL--AGLDLSVEAESL----------DRE 518
           +TG  D+  +  +   +  A+ AAK  DA +I+   G+   +E E +          DR 
Sbjct: 577 RTGSADLNFQIGTRRPVDYAATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRT 636

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
           ++ +P  Q +++  +    K PV+ V+ +  G  +A    + NI AIL A Y G+E G A
Sbjct: 637 NIEIPKVQQEMVKALKATGK-PVVYVLCT--GSALALNWEDANIDAILNAWYGGQEAGTA 693

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           +AD++FG +NP GRLP+T+Y    +  LP         +     GRTY++     LYPFG
Sbjct: 694 VADILFGDYNPSGRLPVTFYKS--IDQLP-------DFEDYSMKGRTYRYMTETPLYPFG 744

Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
           YGLSYT F Y                   RN   +S        G +  D      F   
Sbjct: 745 YGLSYTNFAY-------------------RNAKLSS--------GKITKDQSVTLTF--- 774

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
            D  N G  DG +V  +Y K P +     IK +  F RV V+AG ++ +          +
Sbjct: 775 -DIANTGKMDGDEVAQIYIKNPNDPEGP-IKALKAFLRVHVKAGDSQEVNIELTPEAFHS 832

Query: 759 IVDYAANTLLPAGEHTIFVG 778
             D      +  G++ I  G
Sbjct: 833 FNDNTQTMEVRPGKYQILYG 852


>gi|189464211|ref|ZP_03012996.1| hypothetical protein BACINT_00548 [Bacteroides intestinalis DSM
           17393]
 gi|189438001|gb|EDV06986.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 814

 Score =  276 bits (706), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 224/724 (30%), Positives = 334/724 (46%), Gaps = 109/724 (15%)

Query: 86  HGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQ 145
           HG  RLG+P +    E  HG   +G            T FPT I   +++N  L +++G+
Sbjct: 147 HG--RLGIPLF-LAEECPHGHMAIG-----------TTVFPTSIGQASTWNPELIRRMGR 192

Query: 146 AVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV 205
           A++TEA A     +     + P +++ARDPRW R+ ET GED ++ G      V+G Q  
Sbjct: 193 AIATEASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDAYLNGVMGAALVKGFQG- 246

Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
                  +      KV +  KH+AAY    W         A V  ++MEE    PF   V
Sbjct: 247 -------EFPRTKGKVIATLKHFAAY---GWTEGGHNGGSAHVGNREMEEAIYPPFREAV 296

Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
             G A SVM SYN ++GIP  A+  LL   ++  W   G++V+D  +I  + ++   +AD
Sbjct: 297 AAG-ALSVMSSYNEIDGIPCTANSNLLTGLLKERWQFKGFVVSDLYAIGGLREHG--VAD 353

Query: 326 SKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
           +  +A  + + AG+D D G   Y     NAV++G V+E  I+K++  +  +   +G FD 
Sbjct: 354 TDYEAAVKAVNAGVDSDLGTNVYAGQLVNAVKRGDVQEVVINKAVSRILALKFHMGLFDH 413

Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
                   +Q + S E++ELA E AR+ I+LLKN    LPLN  K KT+AV+GP+A+   
Sbjct: 414 PFVDEREPEQVVASTEHLELAREVARQSIILLKNKNELLPLNK-KTKTIAVIGPNADNIY 472

Query: 445 AMIGNYAGIPCRYMSPIAGFSGY-------ANVTYKTGCDDVACKSNNSIFAASEAAKTA 497
            M+G+Y   P    S +    G         ++ Y  GC  V   S +    A EAA+ +
Sbjct: 473 NMLGDYTA-PQSESSVVTVLDGIRQKVSNDTHIIYAKGCA-VRDSSKSGFQEAIEAARQS 530

Query: 498 DATIILAG----LDLSVE-------------------AESLDREDLWLPGYQTQLINQVA 534
           D  +++ G     D S +                    E  DR  L L G Q +LI +V 
Sbjct: 531 DVVVMVMGGSSARDFSSKYEETGAAKVSDSHISDMESGEGYDRSTLELLGRQRELIREVG 590

Query: 535 EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
           ++ K P++LV++   G  +        + AI+ A YPG +GG A+ADV+FG +NP GRL 
Sbjct: 591 KLNK-PIVLVLIK--GRPLLLEGIEAEVDAIVDAWYPGMQGGNAVADVLFGDYNPAGRLT 647

Query: 595 ITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
           I+      V  LP+     R  +        Y    G   YPFGYGLSYT F Y+     
Sbjct: 648 ISVPRS--VGQLPVYYNTKRKGNR-----SKYIEEEGTPRYPFGYGLSYTSFNYS----- 695

Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
                         +L      ++  C   LVN           V  +N GS DG +VV 
Sbjct: 696 --------------DLKAEVVEAEDSC---LVN---------ISVKVRNEGSRDGDEVVQ 729

Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
           +Y +       T  KQ+ GFQR+ ++ G  K I F  +  KSL +        +  G  T
Sbjct: 730 LYLRDEVASFTTPFKQLCGFQRIHLKVGETKEITFRLDK-KSLALYMQNEEWAVEPGRFT 788

Query: 775 IFVG 778
           + +G
Sbjct: 789 LMLG 792


>gi|322437617|ref|YP_004219707.1| glycoside hydrolase family protein [Granulicella tundricola
           MP5ACTX9]
 gi|321165510|gb|ADW71213.1| glycoside hydrolase family 3 domain protein [Granulicella
           tundricola MP5ACTX9]
          Length = 892

 Score =  276 bits (705), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 172/455 (37%), Positives = 245/455 (53%), Gaps = 45/455 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D +L    RV DLVSRMTL+EKV Q  + A  + RL +P+Y++WSE LHG++  G   
Sbjct: 34  YMDPALTTQQRVDDLVSRMTLEEKVSQTINSAPAISRLNVPEYDYWSEGLHGIARSG--- 90

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG--------RAGLTYW 165
                   AT FP  I   A+++  L ++IG  +S EARA +N            GLT W
Sbjct: 91  -------YATMFPQAIGMAATWDAPLLQQIGDVISIEARAKFNEAIRHNIHSIYYGLTIW 143

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPNIN+ RDPRWGR  ET GEDPF+ GR  V +V+G+Q         D N    +  +  
Sbjct: 144 SPNINIFRDPRWGRGQETYGEDPFLTGRLGVAFVKGIQ-------GPDPNY--FRAIATP 194

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A   V +     R+  +   T  D+ +T+L  F   + E  A S+MC+YN V G P+
Sbjct: 195 KHFA---VHSGPESTRHSANIEPTPHDLHDTYLPAFRATITEAHADSIMCAYNAVEGSPA 251

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQ----VMVDNHKFLADSKEDAVAQTLKAGLDL 341
           CA   LL  T+R +W   G++ +DC +I         +H    D KE A A  +KAG D 
Sbjct: 252 CASKLLLQDTLRRDWGFKGFVTSDCGAIDDFYATDYPSHHTSPD-KEAAAAAGIKAGTDS 310

Query: 342 DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
           +CGQ Y    G+AV++G V E +ID +LK+L+T   +LG FD + +  + ++   ++ S 
Sbjct: 311 NCGQTYLTL-GSAVKKGLVTEAEIDTALKHLFTARFQLGLFDPAAKVAFNAIPFSEVNSP 369

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
            +  LA +AA E IVLLKND +TLP   + V+T+AV+GP A     + GNY  IP   + 
Sbjct: 370 AHQALALKAAEESIVLLKNDAHTLPFKPS-VRTIAVIGPSAATLNNLEGNYNAIPLHPVL 428

Query: 460 PIAGFSGY---ANVTYKTG---CDDVACKSNNSIF 488
           P+ G       + V Y  G    D VA     ++F
Sbjct: 429 PLDGILTQFKSSKVLYAQGSSFADGVAIAVPRTVF 463



 Score =  108 bits (269), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 77/278 (27%), Positives = 124/278 (44%), Gaps = 48/278 (17%)

Query: 503 LAGLDLSVEAESL---DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
           L G ++ +  E     DR D+ LP  Q Q++  VA   K P+++V+++   + + +A  N
Sbjct: 636 LEGEEMPIHIEGFAGGDRTDIKLPAAQQQMLEAVAATGK-PLVVVLLNGSALAVNWA--N 692

Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
            +  AIL A YPG+ GG AIA+ + GK NP GRLP+T+Y+   +  +P         D  
Sbjct: 693 DHAAAILEAWYPGQAGGTAIAETLAGKNNPAGRLPVTFYSS--IDQIPA-------FDDY 743

Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
               RTY++     L+ FGYGLSYT F Y+ +  +                         
Sbjct: 744 SMANRTYRYSKAKPLFEFGYGLSYTTFTYSNIKLS------------------------- 778

Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
                    L   D    + D +N G   G +V  +Y  PP   A +  + +  F RV +
Sbjct: 779 ------TQTLHAGDPLTVEADVRNTGRVAGDEVAELYLTPP-HTAVSPQRALSAFTRVHL 831

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
             G  + + F  +  ++L+ VD      +  G +T+ V
Sbjct: 832 APGELRHVTFTLDP-RTLSQVDEKGARAVTPGNYTLSV 868


>gi|423289663|ref|ZP_17268513.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
           CL02T12C04]
 gi|423298156|ref|ZP_17276215.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
           CL03T12C18]
 gi|392663697|gb|EIY57244.1| hypothetical protein HMPREF1070_04880 [Bacteroides ovatus
           CL03T12C18]
 gi|392667374|gb|EIY60884.1| hypothetical protein HMPREF1069_03556 [Bacteroides ovatus
           CL02T12C04]
          Length = 850

 Score =  276 bits (705), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 162/430 (37%), Positives = 246/430 (57%), Gaps = 41/430 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ + + P   RV DL+SR+T++EK+  L   + G+PRLG+ +Y   +EALHGV  V PG
Sbjct: 26  LYKNENAPVHERVADLLSRLTVEEKISLLRATSPGIPRLGIDKYYHGNEALHGV--VRPG 83

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
                     T FP  I   A++N  L +K+   +S EARA +N    G          L
Sbjct: 84  RF--------TVFPQAIGLAATWNPVLQQKVATVISDEARARWNELDQGRNQKEQFSDVL 135

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+WSP +N+ARDPRWGR  ET GEDPF+ G     +V+GLQ           + R LK+ 
Sbjct: 136 TFWSPTVNMARDPRWGRTPETYGEDPFLSGVMGTAFVKGLQGE---------DPRYLKIV 186

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S  KH+ A + ++    +R+  + +++E+ + E +   FEMCVK+G A+S+M +YN +N 
Sbjct: 187 STPKHFVANNEEH----NRFICNPQISEKQLREYYFPAFEMCVKKGKAASIMTAYNALND 242

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P   +  LL + +R +W   GY+V+DC    ++V+ HK++  +KE A   ++KAGLDL+
Sbjct: 243 VPCTLNAWLLQKVLRQDWGFRGYVVSDCGGPSLLVNAHKYVK-TKETAATLSIKAGLDLE 301

Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
           CG   Y  +  NA +Q  V + DID +  ++    M+LG FD   +  Y  +    I S 
Sbjct: 302 CGDDVYDEYLLNAYKQYMVSDADIDSAACHVLAARMKLGMFDSKERNPYARISPSVIGSK 361

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           ++ ++A +AARE IVLLKN +N LPLN  K+K++AVVG   NA     G+Y+G P   + 
Sbjct: 362 DHQQVALDAARECIVLLKNQKNMLPLNVDKLKSIAVVG--INAGTCEFGDYSGAPV--IE 417

Query: 460 PIAGFSGYAN 469
           P++   G  N
Sbjct: 418 PVSVLQGIKN 427



 Score =  132 bits (332), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 89/290 (30%), Positives = 134/290 (46%), Gaps = 48/290 (16%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A +A    +  + + G++ S+E E  DR D+ LP  Q + + ++ +V   P I+V++ AG
Sbjct: 594 AGKAVSECETVVAVMGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIIVVLVAG 651

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
              +A    + +I AI+ A YPGE+GG A+ADV+FG +NP GRLP+T+Y         L 
Sbjct: 652 S-SLAVNWMDEHIPAIVNAWYPGEQGGTAVADVLFGDYNPAGRLPLTYYKS-------LD 703

Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
            +P  P D      GRTYK++ G  LYPFGYGLSY+ FKY+ L    +            
Sbjct: 704 ELP--PFDDYDITKGRTYKYFKGDVLYPFGYGLSYSSFKYSDLKVKDST----------- 750

Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
                                   D        +N G   G +V  VY + P       I
Sbjct: 751 ------------------------DKVTVSFRLKNTGRRKGDEVAQVYVRIPETGGIVPI 786

Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           K++ GF+RV +  G ++ I    +  +           +LPAG   + VG
Sbjct: 787 KELKGFRRVPLEPGESRAIDIELDKEQLRYWDTTKEQFILPAGTFDVMVG 836


>gi|423293434|ref|ZP_17271561.1| hypothetical protein HMPREF1070_00226 [Bacteroides ovatus
           CL03T12C18]
 gi|392678377|gb|EIY71785.1| hypothetical protein HMPREF1070_00226 [Bacteroides ovatus
           CL03T12C18]
          Length = 735

 Score =  276 bits (705), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 220/768 (28%), Positives = 357/768 (46%), Gaps = 115/768 (14%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ D+  P   R+ DL+SRMTL+EK+ QL  +  G         E   E     S +G  
Sbjct: 29  LYKDAKAPIEKRIDDLISRMTLEEKILQLNQYTLGRNNNVNNVGE---EVKKVPSEIGSL 85

Query: 113 THFD---------------------------DVIPG-ATSFPTVILTTASFNESLWKKIG 144
            +FD                           D I G  T +P  +    S+N  L ++  
Sbjct: 86  IYFDINPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQAC 145

Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
              + EAR       +G+ + +SP I+VARDPRWGR+ E  GEDP+  G +A   VRG Q
Sbjct: 146 AVSAQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRGYQ 199

Query: 204 -DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
            D    EN         ++++C KHY  Y         R +    ++ Q + +T+L P+E
Sbjct: 200 GDDMSAEN---------RMAACLKHYVGYGASE---AGRDYVYTEISAQTLWDTYLLPYE 247

Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
           M VK G A+++M S+N ++G+P  A+P ++ + ++  W   G+IV+D  +++ +   ++ 
Sbjct: 248 MGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQG 304

Query: 323 LADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
           LA +K+DA      AGL++D   + Y       V++GKV    +D+S++ +  V  RLG 
Sbjct: 305 LAATKKDAARYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGL 364

Query: 382 FDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
           F+     V+  K      +++ +AA+ A E +VLLKN+   LPL +   K +AVVGP A 
Sbjct: 365 FERPYTPVTNEKDRFFRPQSMAVAAQLAAESMVLLKNNNQILPLTNK--KKIAVVGPMAK 422

Query: 442 ATVAMIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA-ASEAA 494
               ++G++ G      +   Y    A F G A + Y  GC      ++ S FA A + A
Sbjct: 423 NGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKPQG--NDRSGFAGALDVA 480

Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           + +D  I+  G  L+   E+  R  + LP  Q +L+ ++ E  K P+ILV+  + G  + 
Sbjct: 481 RWSDVVIVCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PIILVL--SNGRPLE 537

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MP 612
                    AIL    PG  G R++A ++ G+ NP G+L +T+         P ++  +P
Sbjct: 538 LNRMEPLCDAILEIWQPGINGARSMAGILSGRINPSGKLAMTF---------PYSTGQIP 588

Query: 613 L---RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
           +   R     G+ G  YK      LYPFG+GLSYT+FKY  +                  
Sbjct: 589 IYYNRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGTV------------------ 629

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
              T  A+K          ++  D    +V   N G+ DG++ V  +   P       +K
Sbjct: 630 ---TPSATK----------VKRGDKLSAEVTVTNTGARDGAETVHWFISDPYCSITRPVK 676

Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
           ++  F++ F++ G  K  +F  +  +    V+      L AGE+ I V
Sbjct: 677 ELKHFEKQFIKVGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724


>gi|298479985|ref|ZP_06998184.1| periplasmic beta-glucosidase [Bacteroides sp. D22]
 gi|298273794|gb|EFI15356.1| periplasmic beta-glucosidase [Bacteroides sp. D22]
          Length = 735

 Score =  276 bits (705), Expect = 4e-71,   Method: Compositional matrix adjust.
 Identities = 223/768 (29%), Positives = 356/768 (46%), Gaps = 115/768 (14%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ D+  P   R+ DL+SRMTL+EKV QL  +  G         E   E     S +G  
Sbjct: 29  LYKDAKAPIEKRIDDLISRMTLEEKVLQLNQYTLGRNNNVNNVGE---EVKKVPSEIGSL 85

Query: 113 THFD---------------------------DVIPG-ATSFPTVILTTASFNESLWKKIG 144
            +FD                           D I G  T +P  +    S+N  L ++  
Sbjct: 86  IYFDINPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQAC 145

Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
              + EAR       +G+ + +SP I+VARDPRWGR+ E  GEDP+  G +A   VRG Q
Sbjct: 146 AVSAQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRGYQ 199

Query: 204 -DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFE 262
            D    EN         ++++C KHY  Y         R +    ++ Q + +T+L P+E
Sbjct: 200 GDDMSAEN---------RMAACLKHYVGYGASE---AGRDYVYTEISAQTLWDTYLLPYE 247

Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
           M VK G A+++M S+N ++G+P  A+P ++ + ++  W   G+IV+D  +++ +   ++ 
Sbjct: 248 MGVKAG-AATLMSSFNDISGVPGSANPYIMTEILKKRWKHDGFIVSDWGAVEQL--KNQG 304

Query: 323 LADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
           LA +K+DA      AGL++D   + Y       V++GKV    +D+S++ +  V   LG 
Sbjct: 305 LAATKKDAAQYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFCLGL 364

Query: 382 FDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
           F+     V+  K      +++ +AA+ A E +VLLKND   LPL +   K +AVVGP A 
Sbjct: 365 FERPYTPVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLTNK--KKIAVVGPMAK 422

Query: 442 ATVAMIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA-ASEAA 494
               ++G++ G      +   Y    A F G A + Y  GC      ++ S FA A + A
Sbjct: 423 NGWDLLGSWCGHGKDTDVEMLYDGLTAEFGGDAELRYAMGCKPQG--NDRSGFAGALDVA 480

Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           + +D  I+  G  L+   E+  R  + LP  Q +L+ ++ E  K PVILV+  + G  + 
Sbjct: 481 RWSDVVIVCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PVILVL--SNGRPLE 537

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MP 612
                    AIL    PG  G R++A ++ G+ NP G+L +T+         P ++  +P
Sbjct: 538 LNRMEPLCDAILEIWQPGINGARSMAGILSGRINPSGKLAMTF---------PYSTGQIP 588

Query: 613 L---RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
           +   R     G+ G  YK      LYPFG+GLSYT+FKY  +                  
Sbjct: 589 IYYNRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGTV------------------ 629

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
              T  A+K          ++  D    +V   N GS DG++ V  +   P       +K
Sbjct: 630 ---TPSATK----------VKRGDKLSAEVTVTNTGSRDGAETVHWFISDPYCSITRPVK 676

Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
           ++  F++  ++AG  K  +F  +  +    V+      L AGE+ I V
Sbjct: 677 ELRHFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724


>gi|393781221|ref|ZP_10369422.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
           CL02T12C01]
 gi|392677556|gb|EIY70973.1| hypothetical protein HMPREF1071_00290 [Bacteroides salyersiae
           CL02T12C01]
          Length = 946

 Score =  275 bits (704), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 233/819 (28%), Positives = 371/819 (45%), Gaps = 145/819 (17%)

Query: 42  FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW 98
           F+K G++    ++ D + P   R++DL+S+M L+EK  Q+    +G  R+    LP  EW
Sbjct: 44  FNKNGIKD---IYEDPTAPIDARIEDLLSQMNLNEKTCQMVTL-YGYKRVLKDDLPTPEW 99

Query: 99  ----WS-------EALHGVSNVG-PGTHFDDVIPG------------------------- 121
               W        E L+G    G P +  + V P                          
Sbjct: 100 KQMLWKDGMGAIDEHLNGFQQWGLPPSDNEYVWPASRHAWALNEVQRFFVEETRLGIPVD 159

Query: 122 -------------ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSP 167
                        AT+FPT +    ++N  L  +IG     EAR +      G T  ++P
Sbjct: 160 FTNEGIRGVESYKATNFPTQLGLGHTWNRKLIHQIGLITGREARML------GYTNVYAP 213

Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
            ++V RD RWGR  E  GE P++V    +  VRG+Q    H +         +V++  KH
Sbjct: 214 ILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQ----HNH---------QVAATGKH 260

Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
           + AY  +          D +++ +++E   + PF+  ++E     VM SYN  +G P  +
Sbjct: 261 FIAYSNNKGAREGMARVDPQMSPREVEMIHVYPFKRVIQEAGLLGVMSSYNDYDGFPIQS 320

Query: 288 DPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG--- 344
               L   +RG+    GY+V+D D+++ +   H    D KE AV Q+++AGL++ C    
Sbjct: 321 SYYWLTTRLRGQMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRS 379

Query: 345 -QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD-ICSDENI 402
              Y       VQ+G + E  I+  ++ +  V   +G FD   Q    G  D +  +EN 
Sbjct: 380 PDSYVLPLRELVQEGGLSEEVINDRVRDILRVKFLVGLFDAPYQTDLKGADDEVEKEENE 439

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            +A +A+RE IVLLKN+ NTLPL+   VK +AV GP+A      + +Y  +     + + 
Sbjct: 440 AVALQASRESIVLLKNENNTLPLDITSVKKIAVCGPNAAEKAYALTHYGPLAVEVTTVVD 499

Query: 463 G----FSGYANVTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATIILA 504
           G     +G A V Y  GCD V              +    + I  A   A+ AD  +++ 
Sbjct: 500 GLREKLNGKAEVLYTKGCDLVDAHWPESEIIDYPLSKDEQSEIDKAVAQAQEADVAVVVL 559

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           G       E+  R  L LPG Q  L+  V    K PVILV+++   + + +A  +  + A
Sbjct: 560 GGGQRTCGENKSRSSLDLPGRQLDLLKAVQATGK-PVILVLINGRPLSVNWA--DKFVPA 616

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGY 621
           IL A YPG +GG AIADV+FG +NPGG+L +T+     V  +P  + P +P   +D    
Sbjct: 617 ILEAWYPGSKGGTAIADVLFGDYNPGGKLTVTFPKS--VGQIPF-NFPHKPSSQIDGGKN 673

Query: 622 PGRT--YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
           PG        NG  LYPFGYGLSYT F+Y+ ++ +  +     K+Q              
Sbjct: 674 PGTKGDMSRVNG-ALYPFGYGLSYTTFEYSDINISPKVITPNQKVQ-------------- 718

Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
                    +RC           N G   G +VV +Y +       TY K + GF+R+ +
Sbjct: 719 ---------VRC--------KVTNTGKHAGDEVVQLYVRDLISSVTTYEKNLEGFERIHL 761

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           + G  K + F  +  K+L +++   + ++  G+ +I +G
Sbjct: 762 QPGETKEVSFTLDR-KALELLNAKNDWVVEPGDFSIMLG 799


>gi|198274480|ref|ZP_03207012.1| hypothetical protein BACPLE_00628 [Bacteroides plebeius DSM 17135]
 gi|198272682|gb|EDY96951.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           plebeius DSM 17135]
          Length = 912

 Score =  275 bits (704), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 205/689 (29%), Positives = 329/689 (47%), Gaps = 99/689 (14%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P  ++ +E + GV N             AT+FPT +    ++N  L ++IG     
Sbjct: 118 RLGIPA-DFTNEGIRGVENYI-----------ATNFPTQLALGHTWNRELIRQIGYITGR 165

Query: 150 EARAMYNLGRAGLT-YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
           EAR +      G T  ++P ++V RD RWGR  E  GE P++V    +   +GLQ     
Sbjct: 166 EARLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIAMGKGLQ----- 214

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
              TD+     +V+S  KH+ AY  +          D +++ +++E     PF   ++E 
Sbjct: 215 ---TDM-----QVASTAKHFIAYSNNKGAREGFARVDPQMSWREVENIHAYPFTRVIQEA 266

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
               VM SYN  +G P  +    L Q +RG     GY+V+D D+++ +   HK   D KE
Sbjct: 267 GILGVMSSYNDYDGFPIQSSYYWLTQRLRGTMGFRGYVVSDSDAVEYLYSKHKTAKDMKE 326

Query: 329 DAVAQTLKAGLDLDCG----QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
            AV Q+++AGL++ C     + Y       +Q+G +    ID  ++ +  V    G FD 
Sbjct: 327 -AVRQSVEAGLNVRCTFRSPESYVLPLRELIQEGGLSMETIDNRVRDILRVKFLTGLFDT 385

Query: 385 SPQY-VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
             Q  ++L  +++ S+ + ++A +A+REG+VLLKN  N LPL+ +++K +AV GP+A+  
Sbjct: 386 PYQTDLALADKEVNSEAHQQVALQASREGLVLLKNANNLLPLDKSQIKRIAVCGPNADEA 445

Query: 444 VAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCDDV--------------ACKSNN 485
              + +Y  +     + + G          VTY  GCD V                +   
Sbjct: 446 SFALTHYGPVAVEVTTVLEGIKQQVKEGTKVTYTKGCDLVDANWPESEIISYPLTAEEKT 505

Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
            I  A +  K +D  +++ G  +    E+  R  L LPG+Q QL+  +    K PV+LV+
Sbjct: 506 EIQKAVDNVKESDVAVVVLGGGIRTCGENKSRTSLDLPGHQQQLLEAIVATGK-PVVLVL 564

Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
           ++   + I +A  +  + AIL A YPG +GG AIA+ +FG +NPGG+L +T+     V  
Sbjct: 565 INGRPLSINWA--DKFVPAILEAWYPGSQGGTAIAEALFGDYNPGGKLTVTF--PKTVGQ 620

Query: 606 LPLTSMPLRP---VDSLGYPGR--TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVN 660
           +P  + P +P   VD    PG        NGP LYPFGYGLSYT F+Y+           
Sbjct: 621 IPF-NFPAKPASQVDGGQTPGMKGNQSRINGP-LYPFGYGLSYTTFEYS----------- 667

Query: 661 LNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP 720
                   NL  +S     + P  +   ++            N G+  G +VV +Y++  
Sbjct: 668 --------NLQLSSPVITDKEPVTVTCKIK------------NTGTRSGDEVVQLYTRDV 707

Query: 721 AEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
                TY K + GF+RV +  G  K++ F
Sbjct: 708 ISSVTTYEKNLRGFERVHLEPGETKKVSF 736


>gi|323344052|ref|ZP_08084278.1| beta-glucosidase [Prevotella oralis ATCC 33269]
 gi|323094781|gb|EFZ37356.1| beta-glucosidase [Prevotella oralis ATCC 33269]
          Length = 779

 Score =  275 bits (703), Expect = 6e-71,   Method: Compositional matrix adjust.
 Identities = 211/726 (29%), Positives = 335/726 (46%), Gaps = 123/726 (16%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    EA HG   +G           AT FPT +   A+++  + ++ G  ++ 
Sbjct: 128 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGLGMAATWSTDVIEQAGVIIAK 175

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R      + G   + P +++A +PRW R+ ET GEDP + G  AV  V+GL       
Sbjct: 176 EIRL-----QGGHISYGPVLDLAHEPRWSRVEETMGEDPVLSGTIAVAQVKGL------- 223

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
            A D+ ++P    +  KH+ AY +       +    + +  +D+ + FL PF   +  G 
Sbjct: 224 GAGDI-TKPFATIATLKHFIAYGIPE---SGQNGAPSIIGTRDLLDNFLPPFRRAIDAG- 278

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           A SVM SYN ++GIP  ++  LL + +R +W   G++V+D  SI  +   H  ++  +E 
Sbjct: 279 ALSVMTSYNSMDGIPCTSNGHLLTEILRNQWGFKGFVVSDLYSIDGIYGTHHTVSSLQEA 338

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV 389
            + + L+AG+D+D G        +AV+QG+V E  ID+++  +  + + +G F+      
Sbjct: 339 GI-EALRAGVDVDLGANAFALLCDAVRQGRVSEAAIDEAVLRILRMKIEMGLFEHPYVNP 397

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
              K  + + ENI++A   A E I LLKN    LPL S  +K +AV+GP+A+    M+G+
Sbjct: 398 KTAKTGVRTAENIQVAKRVAEESITLLKNSNKLLPL-SKNIK-IAVIGPNADNRYNMLGD 455

Query: 450 YA-------------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKT 496
           Y              GI  + +SP       + +TY  GC  +     N I  A  AA+ 
Sbjct: 456 YTAPQQDSNVKTILDGIRSK-LSP-------SQITYVKGCS-IRDTVFNEIGEAVRAARE 506

Query: 497 ADATIILAGLDLSVE-----------------------AESLDREDLWLPGYQTQLINQV 533
           AD  ++  G   + +                        E  DR  L L G Q++L+  +
Sbjct: 507 ADVIVVAVGGSSARDFKTSYQETGAAITSSKVVSDMESGEGFDRASLSLMGIQSRLLQSL 566

Query: 534 AEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRL 593
            E  K P++++ +    +D  +A    +  A+L A YPG+EGG AIA+V+FG +NP GRL
Sbjct: 567 KETGK-PMVVIYIEGRPLDKTWASEQAD--ALLTAYYPGQEGGNAIANVLFGDYNPAGRL 623

Query: 594 PITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSF 653
           PIT      V  LP+     RPV         Y       LYPFGYGLSYT F Y+ L+ 
Sbjct: 624 PITVPRS--VGQLPVYYNKKRPVV------HNYVEMASTPLYPFGYGLSYTSFDYSHLNI 675

Query: 654 TKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVV 713
           TK                                  + ++ +E   D +N G  DG +V 
Sbjct: 676 TK----------------------------------KSEEEYEVSFDIRNSGERDGDEVA 701

Query: 714 IVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEH 773
            +Y           +KQ+ GF R+ ++ G  KRI  +      L+I D     ++ AG+ 
Sbjct: 702 QLYISDKVASVVQPVKQLKGFARIHLKKGETKRITLILKK-DDLSITDRNMERVVEAGDF 760

Query: 774 TIFVGN 779
            I +G+
Sbjct: 761 EIQIGS 766


>gi|423344787|ref|ZP_17322476.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
           CL03T12C32]
 gi|409224378|gb|EKN17311.1| hypothetical protein HMPREF1060_00148 [Parabacteroides merdae
           CL03T12C32]
          Length = 866

 Score =  275 bits (703), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 160/452 (35%), Positives = 235/452 (51%), Gaps = 44/452 (9%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           +   + F +  LP   R+ DL+ R+T +EK+ Q+ +    + RLG+P+Y+WW+EALHGV+
Sbjct: 20  RQEDYPFRNPDLPIDERIDDLLKRLTAEEKIGQMMNTTPAIERLGIPEYDWWNEALHGVA 79

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
             G           AT FP  I   A+F++    +    VS EARA Y+  +        
Sbjct: 80  RAGK----------ATVFPQAIAMAATFDDDALYETFTMVSDEARAKYHQYQKNKEYDRY 129

Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT+W+PNIN+ RDPRWGR  ET GEDP++  R  +  V+GLQ  +          +  
Sbjct: 130 KGLTFWTPNINIFRDPRWGRGMETYGEDPYLTERMGLAVVKGLQGDD---------PKYF 180

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           K  +C KHYA +    W   +R+ FD  VT +D+ +T+L  FE  VK+G+   VMC+YNR
Sbjct: 181 KTHACAKHYAVHSGPEW---NRHEFDVTVTPRDLWQTYLPAFEALVKKGNVQEVMCAYNR 237

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ------VMVDNHKFLADSKEDAVAQ 333
             G P C+  KLL   +R  W     I++DC +I            H+   D+ E A A 
Sbjct: 238 YQGKPCCSSDKLLIDILRNSWGYENIILSDCGAINDFWQRDERTPRHETHPDA-ESASAD 296

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSL 391
            +  G DL+CG  Y      A+++GK+ E D+D SL+ L      LG FD   +  Y  +
Sbjct: 297 AVLNGTDLECGNSYKALI-KALKEGKISENDLDVSLRRLLKGRFELGMFDPDERVPYAQI 355

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
               + S E++  A E A + +VLLKN  NTLPL S  ++ +AVVGP+A  +  +  NY 
Sbjct: 356 PYNVVESPEHVAQALEMAHKSMVLLKNKNNTLPL-SKTIRKIAVVGPNAADSTMLWANYN 414

Query: 452 GIPCRYMSPIAGFSGY---ANVTYKTGCDDVA 480
           G P   ++ + G         V Y+ GC+  A
Sbjct: 415 GFPTHTVTILEGIRNKVPDTEVIYELGCNHAA 446



 Score =  113 bits (282), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 89/301 (29%), Positives = 132/301 (43%), Gaps = 56/301 (18%)

Query: 490 ASEAAKTADATIIL--AGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVA 537
           A+ AAK  DA +I+   G+   +E E +          DR ++ +P  Q +++  +    
Sbjct: 594 AATAAKVKDADVIVYVGGISPRLEGEEMPVNVEGFKKGDRTNIEIPKVQQEMVKALKATG 653

Query: 538 KGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW 597
           K PV+ V+ +  G  +A    + NI AIL A Y G+E G A+AD++FG +NP GRLP+T+
Sbjct: 654 K-PVVYVLCT--GSALALNWEDANIDAILNAWYGGQEAGTAVADILFGDYNPSGRLPVTF 710

Query: 598 YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
           Y    +  LP         +     GRTY++     LYPFGYGLSYT F Y         
Sbjct: 711 YKS--IDQLP-------DFEDYSMKGRTYRYMTETPLYPFGYGLSYTNFAY--------- 752

Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
                     RN   +S        G +  D      F    D  N G  DG +V  +Y 
Sbjct: 753 ----------RNAKLSS--------GKITKDQSVTLTF----DIANTGKMDGDEVAQIYI 790

Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
           K P +     IK +  F RV V+AG ++ +          +  D      +  G++ I  
Sbjct: 791 KNPNDPEGP-IKALKAFLRVHVKAGDSQEVNIELTPEAFHSFNDNTQTMEVRPGKYQILY 849

Query: 778 G 778
           G
Sbjct: 850 G 850


>gi|423226625|ref|ZP_17213090.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392628884|gb|EIY22909.1| hypothetical protein HMPREF1062_05276 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 863

 Score =  275 bits (702), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 175/459 (38%), Positives = 241/459 (52%), Gaps = 47/459 (10%)

Query: 52  FLFCDSSLPY-------SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
           FL C S  PY         R  DLV R+TL+EK   + + +  +PRLG+  Y+WW+EALH
Sbjct: 18  FLSC-SQPPYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALH 76

Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-------L 157
           GV   G           AT FP  I   ASFN  L   +  AVS EARA          L
Sbjct: 77  GVGRAGL----------ATVFPQAIGMGASFNNELLYDVFTAVSDEARAKNTEFSKEGGL 126

Query: 158 GR-AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
            R  GLT W+PNIN+ RDPRWGR  ET GEDP++ G+  +  VRGLQ  EG +       
Sbjct: 127 KRYQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGEKYD----- 181

Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMC 275
              K+ +C KHYA +    W   +R+ F+A  +  +D+ ET+L  F+  V++     VMC
Sbjct: 182 ---KLHACAKHYAVHSGPEW---NRHSFNAENIDPRDLWETYLPAFKDLVQKAHVKEVMC 235

Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD-SKEDAVAQT 334
           +YNR  G P C   +LL Q +R EW     +V+DC +I    +      D  K+ A A+ 
Sbjct: 236 AYNRFEGEPCCGSNRLLMQILRDEWGYKEIVVSDCWAISDFYNKDAHETDPDKQHASAKA 295

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLG 392
           + +G D++CG  Y +    AV++G + E  ID SLK L      LG  D   Q  +  + 
Sbjct: 296 VLSGTDVECGDSYASLP-EAVKEGLIDEKQIDISLKRLMKARFELGEMDEPSQVSWAQIP 354

Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
              + S E+ ELA   ARE +VLL+N+Q+ LPLN  K   VAVVGP+AN +V   GNY G
Sbjct: 355 YSVVDSKEHRELALRMARESLVLLQNNQSLLPLN--KNLKVAVVGPNANDSVMQWGNYNG 412

Query: 453 IPCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
            P   ++ + G   Y   + + Y+ GCD  +  +  S+F
Sbjct: 413 FPSHTITLLEGIREYLPESQIIYEPGCDLTSDVTLQSVF 451



 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 84/298 (28%), Positives = 132/298 (44%), Gaps = 56/298 (18%)

Query: 492 EAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPV 541
           +  K AD  I   G+  +VE E +          DRE + LP  Q++L+   AE+ K   
Sbjct: 595 DKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLL---AELKKAGK 651

Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
            +V ++  G  IA    +    AIL A YPG+ GG AIA+V+FG +NP GRLP+T+Y   
Sbjct: 652 KIVFVNFSGSAIALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTFYK-- 709

Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
                  ++  L   +     GRTY++     L+PFG+GLSYT F+Y   S      +N 
Sbjct: 710 -------STKQLPDFEDYSMKGRTYRYMTENPLFPFGHGLSYTTFQYGNAS------LNT 756

Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
           ++++    +  T                         +   N G  DG +VV VY + P 
Sbjct: 757 SEIKDGEQVTLT-------------------------IPVSNTGKYDGEEVVQVYLRHPG 791

Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVG 778
           +        +  F+RV +  G    +    +  ++    D + NT+ P  G++ I  G
Sbjct: 792 DKEGPS-HALRAFKRVAIAKGATNNVTIPLSK-ENFEWFDTSTNTMRPIEGDYEILYG 847


>gi|325105296|ref|YP_004274950.1| beta-glucosidase [Pedobacter saltans DSM 12145]
 gi|324974144|gb|ADY53128.1| Beta-glucosidase [Pedobacter saltans DSM 12145]
          Length = 884

 Score =  275 bits (702), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 192/564 (34%), Positives = 279/564 (49%), Gaps = 75/564 (13%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
           F C  G  S L  Q   + F +  LP + R+++L+  +TL+EKV  + + +  V RLG+P
Sbjct: 14  FYCLLGN-SNLKSQEIPYKFRNPDLPVNERIENLLGLLTLEEKVGLMMNSSKPVGRLGIP 72

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            Y+WW+EALHGV+  G           AT FP  I   A++NES  K+    +S EARA 
Sbjct: 73  AYDWWNEALHGVARSGK----------ATVFPQAIGMAATWNESGHKQTFDLISDEARAK 122

Query: 155 YN-------LGRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
           YN        GR  GL++W+PNIN+ RDPRWGR  ET GEDP++  R  V  VRGLQ  +
Sbjct: 123 YNEAIRNGERGRYYGLSFWTPNINIFRDPRWGRGQETYGEDPYLTARLGVAAVRGLQGDD 182

Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
                     +  K  +C KH+A +    W   +R+ +DA  + +D+ ET+L  F+  VK
Sbjct: 183 ---------PKYFKTHACAKHFAVHSGPEW---NRHSYDATASGRDLWETYLPAFKALVK 230

Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV-----DNHK 321
           E +   VMC+YN   G P C   +LL   +R  W+  G +V+DC +I         + HK
Sbjct: 231 EANVQEVMCAYNAYEGQPCCGSDRLLTDILRNRWEYKGIVVSDCWAIDDFFRKGHHETHK 290

Query: 322 FLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
             A +  DAV  +     DL+CG  YTN    AV+QG + +  ID SL+ +      LG 
Sbjct: 291 DAAAAAADAVIHS----TDLECGSAYTNLL-EAVRQGLISQQQIDISLRRVLRGWFELGM 345

Query: 382 FDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPH 439
            D + +  +  L  Q + S E+++ A + ARE + LLKN+ + LPL S  +K +AV+GP+
Sbjct: 346 LDPAERLPWSQLPYQIVASKEHVQQALKVARESMTLLKNNGSILPL-SKSIKKIAVIGPN 404

Query: 440 ANATVAMIGNYAGIPCRYMSPIAGFSG---YANVTYKTGCDDVACKSNNSI---FAASEA 493
           A  +V + GNY G P   ++ + G      +A + Y  GCD V      S+   F +S  
Sbjct: 405 AADSVMLWGNYNGTPNSTVTILQGIKNKLPHAEIIYDKGCDWVDPWVRTSLFEGFTSSPK 464

Query: 494 AKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDI 553
            +            LS   E             T LIN +A        +   +AGG  +
Sbjct: 465 GQKGMKVEFFNNTQLSGSPE-------------TTLINTLA--------IKYNNAGGTAL 503

Query: 554 A----FAETNTNIKAILWAGYPGE 573
           A       T+T I  +  A Y GE
Sbjct: 504 AQGVNLQNTSTRISGVFTAPYTGE 527



 Score =  110 bits (275), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 83/298 (27%), Positives = 131/298 (43%), Gaps = 50/298 (16%)

Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
           K  DA +   GL   +E E +          D+  + LP  Q +L++ +    K PV+ V
Sbjct: 606 KEVDAIVYAGGLSPQLEGEEMPVNADGFRGGDKISIDLPKIQRELLSSLKSTGK-PVVFV 664

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYV- 603
           + +  G  +A  +   N  A+L A Y G+E G A+ADV+FG +NP GRLPIT+Y      
Sbjct: 665 LCT--GSSLALEQDEKNYNALLCAWYGGQEAGTAVADVLFGDYNPAGRLPITFYKSLSQL 722

Query: 604 --QMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
              +L  +    +  ++    GRTY++     LY FG+GLSY++F Y     T       
Sbjct: 723 DNALLKTSDTSRQDFENYSMQGRTYRYMTEKPLYAFGHGLSYSKFNYGEAKLTS------ 776

Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
                                      ++  +     +   N+ +  G +VV VY K   
Sbjct: 777 -------------------------GTVKIGNTLNISIPLTNISNNKGEEVVQVYVKRNG 811

Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVG 778
           +  A  +K + GF+RV + AG  K + F   A ++    D + + L P AG +TI  G
Sbjct: 812 DPDAP-VKSLKGFKRVAIAAGETKHLDFQLTA-EAFEFYDPSKDELGPKAGNYTIMYG 867


>gi|224537384|ref|ZP_03677923.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521009|gb|EEF90114.1| hypothetical protein BACCELL_02262 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 863

 Score =  275 bits (702), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 175/459 (38%), Positives = 241/459 (52%), Gaps = 47/459 (10%)

Query: 52  FLFCDSSLPY-------SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
           FL C S  PY         R  DLV R+TL+EK   + + +  +PRLG+  Y+WW+EALH
Sbjct: 18  FLSC-SQPPYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALH 76

Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-------L 157
           GV   G           AT FP  I   ASFN  L   +  AVS EARA          L
Sbjct: 77  GVGRAGL----------ATVFPQAIGMGASFNNELLYDVFTAVSDEARAKNTEFSKEGGL 126

Query: 158 GR-AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
            R  GLT W+PNIN+ RDPRWGR  ET GEDP++ G+  +  VRGLQ  EG +       
Sbjct: 127 KRYQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTGQMGMAVVRGLQGPEGEKYD----- 181

Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMC 275
              K+ +C KHYA +    W   +R+ F+A  +  +D+ ET+L  F+  V++     VMC
Sbjct: 182 ---KLHACAKHYAVHSGPEW---NRHSFNAENIDPRDLWETYLPAFKNLVQKAHVKEVMC 235

Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD-SKEDAVAQT 334
           +YNR  G P C   +LL Q +R EW     +V+DC +I    +      D  K+ A A+ 
Sbjct: 236 AYNRFEGEPCCGSNRLLMQILRDEWGYKEIVVSDCWAISDFYNKGAHETDPDKQHASAKA 295

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLG 392
           + +G D++CG  Y +    AV++G + E  ID SLK L      LG  D   Q  +  + 
Sbjct: 296 VLSGTDVECGDSYASLP-EAVKEGLIDEKQIDISLKRLMKARFELGEMDEPSQVSWAQIP 354

Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
              + S E+ ELA   ARE +VLL+N+Q+ LPLN  K   VAVVGP+AN +V   GNY G
Sbjct: 355 YSVVDSKEHRELALRMARESLVLLQNNQSLLPLN--KNLKVAVVGPNANDSVMQWGNYNG 412

Query: 453 IPCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
            P   ++ + G   Y   + + Y+ GCD  +  +  S+F
Sbjct: 413 FPSHTITLLEGIREYLPESQIIYEPGCDLTSDVTLQSVF 451



 Score =  110 bits (276), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 84/298 (28%), Positives = 132/298 (44%), Gaps = 56/298 (18%)

Query: 492 EAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPV 541
           +  K AD  I   G+  +VE E +          DRE + LP  Q++L+   AE+ K   
Sbjct: 595 DKVKEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLL---AELKKAGK 651

Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
            +V ++  G  IA    +    AIL A YPG+ GG AIA+V+FG +NP GRLP+T+Y   
Sbjct: 652 KIVFVNFSGSAIALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTFYK-- 709

Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
                  ++  L   +     GRTY++     L+PFG+GLSYT F+Y   S      +N 
Sbjct: 710 -------STKQLPDFEDYSMKGRTYRYMTENPLFPFGHGLSYTTFQYGNAS------LNT 756

Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
           ++++    +  T                         +   N G  DG +VV VY + P 
Sbjct: 757 SEIKDGEQVTLT-------------------------IPVSNTGKYDGEEVVQVYLRHPG 791

Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVG 778
           +        +  F+RV +  G    +    +  ++    D + NT+ P  G++ I  G
Sbjct: 792 DKEGPS-HALRAFKRVAIAKGATNNVTIPLSK-ENFEWFDTSTNTMRPIEGDYEILYG 847


>gi|103486503|ref|YP_616064.1| glycoside hydrolase [Sphingopyxis alaskensis RB2256]
 gi|98976580|gb|ABF52731.1| glycoside hydrolase, family 3-like protein [Sphingopyxis alaskensis
           RB2256]
          Length = 772

 Score =  275 bits (702), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 229/737 (31%), Positives = 344/737 (46%), Gaps = 110/737 (14%)

Query: 65  VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP------------- 111
           + DL+ +MTLDEK  QL          G    + + E +     VG              
Sbjct: 57  IADLMVKMTLDEKTGQLTLLTSNWESTGPTMRDSYKEDIR-AGRVGAIFNAYTAKYTREL 115

Query: 112 ------GTHFD-------DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
                 GT          DVI G  T FP  +   AS++    +K  +  + EA A    
Sbjct: 116 QALAVEGTRLKIPLLFGYDVIHGHRTIFPISLGEAASWDLQAIEKAARISAIEASA---- 171

Query: 158 GRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
              G+ + +SP +++ARDPRWGRI+E  GED ++    A   VRG Q         DL S
Sbjct: 172 --EGIHWTFSPMVDIARDPRWGRISEGAGEDVYLGSLIAKARVRGYQ-------GGDL-S 221

Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
           RP  + +  KH+AAY      G D +  D  ++E+ M + +L PF+       A++ M +
Sbjct: 222 RPDTILATAKHFAAYGAAQ-AGRDYHTVD--ISERTMRDVYLPPFKAAADA-GAATFMTA 277

Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
           +N  +G+P+     LL   +R +W   G++V D  SI  MV  H +  D K+ A  Q ++
Sbjct: 278 FNEYDGVPASGSHYLLTDVLRKKWGFKGFVVTDYTSINEMVP-HGYAKDLKQ-AGEQAMR 335

Query: 337 AGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD 395
           AG+D+D  G  +      +V +GKV    ID ++K +  +  RLG FD   +Y    ++ 
Sbjct: 336 AGVDMDMQGAVFMENLAKSVAEGKVDTARIDAAVKAILEMKYRLGLFDDPYRYADAAREK 395

Query: 396 --ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
             I     +E A + AR+ IVLLKN  N LPL +A  K++AV+GP  N+   MIG+++  
Sbjct: 396 ATIYKPAFLEAARDVARKSIVLLKNKDNVLPL-AASAKSIAVIGPLGNSKEDMIGSWSAA 454

Query: 454 PCRYMSPI-------AGFSGYANVTYKTGC----DDVACKSNNSIFAASEAAKTADATII 502
             R   P+       AG      + Y  G     DDV     +    A   A+ +D  I 
Sbjct: 455 GDRRTRPVTLLEGLQAGAPKGTTIAYAKGASYHFDDVG--KTDGFAEALALAEKSDVIIA 512

Query: 503 LAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
             G   ++  E+  R  L LPG Q  L+  + +  K PVILV+MS     I +A  + N+
Sbjct: 513 AMGEHWNMTGEAASRTSLDLPGNQQALLEALEKTGK-PVILVLMSGRPNSIEWA--DANV 569

Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL---TSMPLRPVDSL 619
            AIL A YPG  GG AIAD+++G++NP G+LP+T+     V  +P+        RP++ L
Sbjct: 570 DAILEAWYPGTMGGHAIADILYGRYNPSGKLPVTFPR--TVGQVPIHYDMKNTGRPIE-L 626

Query: 620 GYPGRTY--KFYNGPT--LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
           G PG  Y  ++ N P   LYPFGYGLSYT F Y+ ++                      D
Sbjct: 627 GAPGAKYVSRYLNTPNTPLYPFGYGLSYTSFTYSPVTL---------------------D 665

Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
            SK R PG         +     V   N G  DG +VV +Y +         +K++ GFQ
Sbjct: 666 RSKIR-PG---------EPLTASVTVTNSGPRDGEEVVQLYVRDLVGSVTRPVKELKGFQ 715

Query: 736 RVFVRAGRNKRIKFVFN 752
           ++ ++ G  + ++F   
Sbjct: 716 KIGLKKGETRTVRFTLT 732


>gi|296081549|emb|CBI20072.3| unnamed protein product [Vitis vinifera]
          Length = 333

 Score =  275 bits (702), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 143/335 (42%), Positives = 204/335 (60%), Gaps = 12/335 (3%)

Query: 446 MIGNYAGIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG 505
           MIGNY G P +Y +P+ G +     TY  GC +VAC +   I  A + A  ADAT+++ G
Sbjct: 1   MIGNYEGTPGKYTTPLQGLTALVATTYLPGCSNVACGTAQ-IDEAKKIAAAADATVLIVG 59

Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
           +D S+EAE  DR ++ LPG Q  LI +VA+ +KG VILV+MS GG DI+FA+ +  I +I
Sbjct: 60  IDQSIEAEGRDRVNIQLPGQQPLLITEVAKASKGNVILVVMSGGGFDISFAKNDDKITSI 119

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           LW GYPGE GG AIADV+FG +NP GRLP TWY   YV  +P+T+M +RP  + GYPGRT
Sbjct: 120 LWVGYPGEAGGAAIADVIFGFYNPSGRLPTTWYPQSYVDKVPMTNMNMRPDPASGYPGRT 179

Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVL 685
           Y+FY G T+Y FG GLSYTQF ++L+   K++ + + +   C +         ++C  V 
Sbjct: 180 YRFYTGETIYTFGDGLSYTQFNHHLIQAPKSVSIPIEEGHSCHS---------SKCKSVD 230

Query: 686 VNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
                C +  F+  +   N G+  GS  V ++S PP+ +  +  K ++GF++VFV A   
Sbjct: 231 AVQESCQNLAFDIHLRVNNAGNISGSHTVFLFSSPPS-VHNSPQKHLLGFEKVFVTAKAE 289

Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             ++F  + CK L+IVD      +  G H + VGN
Sbjct: 290 ALVRFKVDVCKDLSIVDELGTRKVALGLHVLHVGN 324


>gi|319901412|ref|YP_004161140.1| glycoside hydrolase 3 [Bacteroides helcogenes P 36-108]
 gi|319416443|gb|ADV43554.1| glycoside hydrolase family 3 domain protein [Bacteroides helcogenes
           P 36-108]
          Length = 944

 Score =  275 bits (702), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 228/807 (28%), Positives = 359/807 (44%), Gaps = 144/807 (17%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
           ++ D +     R++DL+ +M+L+EK  Q+    +G  R+    LP  EW    W      
Sbjct: 52  IYEDPTAAIDARIEDLLKQMSLEEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGA 110

Query: 101 --EALHGVSNVG-PGTHFDDVIPG------------------------------------ 121
             E L+G    G P +  ++V P                                     
Sbjct: 111 IDEHLNGFRQWGLPPSDNENVWPASRHAWALNEVQRFFVEETRLGIPVDFTNEGIRGVES 170

Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
             AT+FPT +    ++N  L  KIG     EAR +      G T  ++P ++V RD RWG
Sbjct: 171 YKATNFPTQLGLGHTWNRELIHKIGFITGREARML------GYTNVYAPILDVGRDQRWG 224

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  E  GE P++V    +  VRG+Q    H+           V++  KH+AAY  +    
Sbjct: 225 RYEEVYGESPYLVAELGIEMVRGMQ--YNHQ-----------VAATGKHFAAYSNNKGAR 271

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
                 D +++ +++E   + PF   ++E     VM SYN  +GIP       L   +RG
Sbjct: 272 EGMSRVDPQISPREVENIHIYPFRRVIREAGLLGVMSSYNDYDGIPIQGSHYWLTTRLRG 331

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
           E    GY+V+D D+++ +   H    D KE A+ Q+++AGL++ C       +       
Sbjct: 332 EIGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AIRQSVEAGLNIRCTFRSPDSFVLPLREL 390

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREG 412
           V++G + E  I+  ++ +  V    G FD +P    L   D  +  +EN  +A +A+RE 
Sbjct: 391 VKEGGLSEEIINDRVRDILRVKFLTGLFD-TPYQSDLAGADREVEKEENGSIALQASRES 449

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGYA 468
           IVLLKN+ N LPL+ + VK +AV GP+A+     + +Y  +    ++ + G     SG A
Sbjct: 450 IVLLKNENNMLPLDLSTVKRIAVCGPNADEKNYALTHYGPLAVEVITVLKGIQDKVSGKA 509

Query: 469 NVTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
            V Y  GCD V                     I  A+E A+ +D  +++ G       E+
Sbjct: 510 EVLYTKGCDLVDANWPESEIINHPLTADEQAEINKAAENARQSDVAVVVLGGGQRTCGEN 569

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
             R  L LPG Q QL+  +    K PVILV+++   + + +A  +  + AIL A YPG +
Sbjct: 570 KSRSSLDLPGRQLQLLQAIQATGK-PVILVLINGRPLSVNWA--DKYVPAILEAWYPGAK 626

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGRTYKF--Y 629
           GG A+ADV+FG +NPGG+L +T+     V  +P  + P +P   +D    PG        
Sbjct: 627 GGIALADVLFGDYNPGGKLTVTFPK--TVGQIPF-NFPYKPASQIDGGKNPGPEGNMSRI 683

Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
           NG  LYPFGYGLSYT F+Y+ L  T  +                               +
Sbjct: 684 NG-ALYPFGYGLSYTTFEYSDLEITPKV-------------------------------I 711

Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
             ++    ++   N G   G +VV +Y +       TY K + GF+RV +  G  K + F
Sbjct: 712 TPNEEATVRLKVTNTGKRAGDEVVQLYIRDVVSSVITYEKNLAGFERVHLEPGETKEVVF 771

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIF 776
                K L ++D     ++  G+ TI 
Sbjct: 772 TL-GRKHLELLDANMQWVVEPGDFTIM 797


>gi|333377782|ref|ZP_08469515.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
           22836]
 gi|332883802|gb|EGK04082.1| hypothetical protein HMPREF9456_01110 [Dysgonomonas mossii DSM
           22836]
          Length = 727

 Score =  274 bits (701), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 220/740 (29%), Positives = 341/740 (46%), Gaps = 114/740 (15%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + F ++SL    R+ +L+S MT+DEK+  L     GVPRLG+ +    SE LHG++  GP
Sbjct: 24  YPFQNTSLSDEKRLDNLLSIMTIDEKINALST-NLGVPRLGI-RNTGHSEGLHGMALGGP 81

Query: 112 GT----------HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR-- 159
           G              DV P  T+FP       +++  L KK+    +TE R      R  
Sbjct: 82  GNWGGFKMVNYQRVPDVYP-TTTFPQAYGLGETWDTELIKKVADIEATEIRYYTQNERYT 140

Query: 160 -AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
             GL   +PN ++ARDPRWGR  E+ GEDPF+V   AV +++GLQ           N R 
Sbjct: 141 KGGLVMRAPNADLARDPRWGRTEESFGEDPFLVSEMAVAFIKGLQGE---------NPRY 191

Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
            K +S  KH+ A   ++ +     +FD R+      E +  PF   +++G + + M +YN
Sbjct: 192 WKSASLMKHFLANSNEDGRDSTSSNFDNRL----FHEYYSYPFRKGIEKGGSQAFMAAYN 247

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
             N IP    P L  + +R +W+  G I  D  ++ +++  HK      E + A  +KAG
Sbjct: 248 SWNEIPMTIHPIL--KKIRKDWNFKGIICTDGGALDLLIKAHKTFPTHTEGSAA-IVKAG 304

Query: 339 LDLDCGQYYTNFTG---NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLG 392
           +    GQ+  NF      A+++G + E +IDK+++  + + ++LG  DG      Y  +G
Sbjct: 305 V----GQFLDNFRPYIYQALEKGMLTEAEIDKAIRGNFYIALKLGLLDGDQTKLPYAHIG 360

Query: 393 KQDICS----DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
             D  S     E  +       + +VLLKN++  LPLN   +K +AV+GP AN    ++ 
Sbjct: 361 VTDTVSVWRNKEIQDFVRLVTAKSVVLLKNEKKLLPLNKGNIKRIAVIGPRANEV--LLD 418

Query: 449 NYAGIPCRYMSPIAGFSGYA--NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
            Y+G P   +S + G       NV       +V  +S+N I  A  AA+ AD  I+  G 
Sbjct: 419 WYSGTPPYTVSILQGIKNAVGNNV-------EVIYESSNEIDKAYLAAQKADIAIVCVGN 471

Query: 507 DL-------------SVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDI 553
            +             S   E++DR+ L L   Q  L+  V +     V++++ S      
Sbjct: 472 HVYGTDPKWKYSPVPSDGREAVDRKALSLE--QEDLVKIVHKANPNTVMVLVSS---FPF 526

Query: 554 AFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPL 613
           A   +  NI AIL      +E G  +ADV+FG +NP GR   TW       + P+    +
Sbjct: 527 AINWSQENIPAILHITNNSQELGNGLADVIFGNYNPAGRTNQTWVKS-IADLPPMMDYDI 585

Query: 614 RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT 673
           R        GRTY +     LYPFGYGLSYT F Y+ ++ + +       L   +NL   
Sbjct: 586 R-------NGRTYMYAKEKPLYPFGYGLSYTNFTYSDMALSSS------ALSKGKNL--- 629

Query: 674 SDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIG 733
                                 +  V+ +N G  DG +V  +Y   P       IKQ+ G
Sbjct: 630 ----------------------KVSVNVKNTGDMDGEEVAQLYVSFPQSKVVRPIKQLKG 667

Query: 734 FQRVFVRAGRNKRIKFVFNA 753
           F R+ ++ G +K  +F  +A
Sbjct: 668 FDRISIKKGESKTFEFTLSA 687


>gi|317480750|ref|ZP_07939836.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
 gi|316903091|gb|EFV24959.1| glycosyl hydrolase family 3 C terminal domain-containing protein
           [Bacteroides sp. 4_1_36]
          Length = 942

 Score =  274 bits (700), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 232/809 (28%), Positives = 360/809 (44%), Gaps = 144/809 (17%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
           ++ D S P   R+++L+ +MTLDEK  Q+    +G  R+    LP  EW    W      
Sbjct: 52  VYEDPSAPLEARIENLLQQMTLDEKTCQMVTL-YGYKRVLKDDLPTPEWKELLWKDGIGA 110

Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
             E L+G    G                                  P    ++ I G   
Sbjct: 111 IDEHLNGFQQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGVES 170

Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
             AT+FPT +    ++N  L +++G     EAR +      G T  ++P ++V RD RWG
Sbjct: 171 YRATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWG 224

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  E  GE P++V    +  VRGLQ    H +         +V++  KH+AAY  +    
Sbjct: 225 RYEEVYGESPYLVAELGIEMVRGLQ----HNH---------QVAATGKHFAAYSNNKGAR 271

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
                 D +++ +++E   + PF+  ++E     VM SYN  +GIP       L   +RG
Sbjct: 272 EGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRG 331

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
           E    GY+V+D D+++ +   H    D KE AV Q+++AGL++ C       +       
Sbjct: 332 EMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLREL 390

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREG 412
           V++G + E  I+  ++ +  V   +G FD +P    L   D  +  +EN  +A +A+RE 
Sbjct: 391 VKEGGLSEEVINDRVRDILRVKFLIGLFD-APYQTDLADADREVEKEENEAIALQASRES 449

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS----GYA 468
           IVLLKN    LPL+    K +AV GP+AN     + +Y  +     + + G      G A
Sbjct: 450 IVLLKNAGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKA 509

Query: 469 NVTYKTGCDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEAES 514
            V Y  GCD V      S              I  A E A+ AD  I++ G       E+
Sbjct: 510 EVLYTKGCDLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGEN 569

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
             R  L LPG Q QL+  +    K PV+L++++   + I +A  +  + AIL A YPG +
Sbjct: 570 KSRTSLDLPGRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSK 626

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGRTYKF--Y 629
           GG A+AD++FG +NPGG+L +T+     V  +P  + P +P   +D    PG T      
Sbjct: 627 GGTALADILFGDYNPGGKLTVTFPK--TVGQIPF-NFPCKPSSQIDGGKNPGPTGNMSRI 683

Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
           NG  LYPFGYGLSYT F+Y+ L  T  +               T + S T          
Sbjct: 684 NG-ALYPFGYGLSYTTFEYSDLDITPRV--------------ITPNESAT---------- 718

Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
                   ++   N G   G +VV +Y +       TY K + GFQR+ +  G  + + F
Sbjct: 719 -------VRLKVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSF 771

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
             +  K L ++D     ++  G+  +  G
Sbjct: 772 TIDR-KHLELLDADMKWVVEPGDFVLMAG 799


>gi|333380551|ref|ZP_08472242.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332826546|gb|EGJ99375.1| hypothetical protein HMPREF9455_00408 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 854

 Score =  274 bits (700), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 163/427 (38%), Positives = 240/427 (56%), Gaps = 41/427 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           ++ D   P   R+ DL+SR+T++EK+  L   + G+PRL +P+Y   +E+LHGV  V PG
Sbjct: 29  VYLDEKAPTHDRIMDLLSRLTIEEKISLLRATSPGIPRLQIPKYYHGNESLHGV--VRPG 86

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
                     T FP  I   + +N  L  KI  A+S EAR  +N    G          L
Sbjct: 87  RF--------TVFPQAIGLASMWNPELHHKIATAISDEARGRWNELEQGKLQTQRFTDLL 138

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+WSP +N+ARDPRWGR  ET GEDP++ G     +VRGLQ  +          R LK+ 
Sbjct: 139 TFWSPTVNMARDPRWGRTPETYGEDPYLSGILGTAFVRGLQGDD---------PRYLKIV 189

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S  KH+AA + ++    +R+  + +++E+ + E +   FEMCVK+G ++S+M +YN +N 
Sbjct: 190 STPKHFAANNEEH----NRFVCNPQISERQLREYYFPAFEMCVKDGKSASIMSAYNAIND 245

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P  A+P LL + +R +W  +GY+V+DC    ++V   K++  +KE A   ++KAGLDL+
Sbjct: 246 VPCTANPWLLTKVLRHDWGFNGYVVSDCGGPSLLVSAMKYVK-TKEAAATLSIKAGLDLE 304

Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSD 399
           CG   Y     NA  Q  V   DID +   +    M LG FD      Y  +    + S 
Sbjct: 305 CGDDVYMQPLLNAYNQYMVSRADIDTAAYRVLRARMHLGLFDDPDLNPYNKISPSVVGSA 364

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           E+ +LA EAAR+ IVLLKN+  TLPLN  KVK++AVVG   NA  +  G+Y+GIP    +
Sbjct: 365 EHKQLALEAARQSIVLLKNNNRTLPLNPKKVKSIAVVG--INAGNSEFGDYSGIPAN--A 420

Query: 460 PIAGFSG 466
           P++   G
Sbjct: 421 PVSILQG 427



 Score =  142 bits (357), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 103/316 (32%), Positives = 158/316 (50%), Gaps = 51/316 (16%)

Query: 478 DVACKSNNSIFA-ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEV 536
           DV  K    ++  A +A +  +  I + G++ ++E E  DR D+ LP  Q + I ++ +V
Sbjct: 584 DVGSKQRLDMYGEAGKAVRECEQVIAVLGINKTIEREGQDRYDIHLPADQEEFIREIYKV 643

Query: 537 AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT 596
              P I+V++ AG   +A    + ++ AI+ A YPGE+GG A+A+V+FG++NPGGRLP+T
Sbjct: 644 --NPNIVVVLVAGS-SLAINWMDEHVPAIVNAWYPGEQGGTAVAEVLFGEYNPGGRLPVT 700

Query: 597 WYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT 656
           +YN   ++ +P         D     GRTY+++ G  LYPFGYGLSYT F Y      K 
Sbjct: 701 YYNS--LEEIPSFD------DYDITKGRTYQYFKGKPLYPFGYGLSYTTFAY------KN 746

Query: 657 IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY 716
           +Q+N N                        N+++    FE K    N G  DG +V  VY
Sbjct: 747 LQINDNG-----------------------NNIKVS--FELK----NTGRMDGDEVSQVY 777

Query: 717 SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS-LNIVDYAANTLL-PAGEHT 774
            K P+      IK++ GFQR  ++ G  K ++   N  K  L   D A  T + P GE+ 
Sbjct: 778 VKIPSSGIFMPIKELKGFQRSTLKKGATKNVE--INIRKDLLRYWDDATETFITPKGEYE 835

Query: 775 IFVGNGGVSFPIHLNF 790
             +G       +  +F
Sbjct: 836 FMIGTSSQDIQLTKSF 851


>gi|167765093|ref|ZP_02437206.1| hypothetical protein BACSTE_03479 [Bacteroides stercoris ATCC
           43183]
 gi|167696721|gb|EDS13300.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           stercoris ATCC 43183]
          Length = 944

 Score =  274 bits (700), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 231/819 (28%), Positives = 365/819 (44%), Gaps = 145/819 (17%)

Query: 42  FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW 98
           F+K G++    ++ D +     R+++L+ +MTL+EK  Q+    +G  R+    LP  EW
Sbjct: 44  FNKNGIKD---IYEDPAATLDARIENLLQQMTLEEKTCQMVTL-YGYKRVLKDALPTPEW 99

Query: 99  ----WS-------EALHGVSNVG-PGTHFDDVIPG------------------------- 121
               W        E L+G    G P +  ++V P                          
Sbjct: 100 KQMLWKDGIGAIDEHLNGFQQWGLPPSDNENVWPASRHAWALNEIQRFFVEDTRLGIPVD 159

Query: 122 -------------ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSP 167
                        AT+FPT +    ++N  L +++G     EAR +      G T  ++P
Sbjct: 160 FTNEGIRGVESYKATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAP 213

Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
            ++V RD RWGR  E  GE P++V    +  VRGLQ    H +         +V++  KH
Sbjct: 214 ILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQ----HNH---------QVAATAKH 260

Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
           +AAY  +          D ++  +++E   + PF+  ++E     VM SYN  +GIP   
Sbjct: 261 FAAYSNNKGAREGMARVDPQMPPREVENIHIYPFKRVIREAGLLGVMSSYNDYDGIPIQG 320

Query: 288 DPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG--- 344
               L   +R E    GY+V+D D+++ +   H    D KE AV Q+++AGL++ C    
Sbjct: 321 SYYWLTTRLRKEMGFRGYVVSDSDAVEYLYTKHNTAKDMKE-AVRQSVEAGLNVRCTFRS 379

Query: 345 -QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE-NI 402
              +       V++G + E  I+  ++ +  V   +G FD   Q    G  D    E N 
Sbjct: 380 PDSFVLPLRELVKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAGADDEVEKEANE 439

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
            +A +A+RE IVLLKN  NTLPLN  K+K +AV GP+A+     + +Y  +     + + 
Sbjct: 440 AVALQASRESIVLLKNTDNTLPLNIDKIKKIAVCGPNADEEGYALTHYGPLAVEVTTVLE 499

Query: 463 GF----SGYANVTYKTGCDDVACKSNNS--------------IFAASEAAKTADATIILA 504
           G      G A V Y  GCD V      S              I  A   A+ AD  +++ 
Sbjct: 500 GIREKAQGKAEVLYTKGCDLVDAHWPESEIMEYPLTPDEQAEIDRAVANARQADVAVVVL 559

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           G       E+  R  L LPG+Q +L+  V    K PVIL++++   + + +A  +  + A
Sbjct: 560 GGGQRTCGENKSRTSLELPGHQLKLLQAVQATGK-PVILILINGRPLSVNWA--DKFVPA 616

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGY 621
           IL A YPG +GG  +AD++FG +NPGG+L +T+     V  +P  + P +P   +D    
Sbjct: 617 ILEAWYPGSKGGTVVADILFGDYNPGGKLTVTF--PKTVGQIPF-NFPYKPASQIDGGKN 673

Query: 622 PGR--TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
           PG        NG  LYPFGYGLSYT F+Y+ L  T                         
Sbjct: 674 PGPDGNMSRING-ALYPFGYGLSYTTFEYSDLEIT------------------------- 707

Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
             P V+  + +       ++   N G   G +VV +Y++       TY K + GF+R+ +
Sbjct: 708 --PKVITPNQKAT----IRLKVTNTGKRAGDEVVQLYTRDILSSVTTYEKNLAGFERIHL 761

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           + G +K I F  +  K L +++      +  GE  I  G
Sbjct: 762 KPGESKEIVFTLDR-KHLELLNADMKWTVEPGEFAIMAG 799


>gi|160892207|ref|ZP_02073210.1| hypothetical protein BACUNI_04671 [Bacteroides uniformis ATCC 8492]
 gi|156858685|gb|EDO52116.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           uniformis ATCC 8492]
          Length = 990

 Score =  274 bits (700), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 232/809 (28%), Positives = 360/809 (44%), Gaps = 144/809 (17%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
           ++ D S P   R+++L+ +MTLDEK  Q+    +G  R+    LP  EW    W      
Sbjct: 100 VYEDPSAPLEARIENLLQQMTLDEKTCQMVTL-YGYKRVLKDDLPTPEWKELLWKDGIGA 158

Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
             E L+G    G                                  P    ++ I G   
Sbjct: 159 IDEHLNGFQQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGVES 218

Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
             AT+FPT +    ++N  L +++G     EAR +      G T  ++P ++V RD RWG
Sbjct: 219 YRATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWG 272

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  E  GE P++V    +  VRGLQ    H +         +V++  KH+AAY  +    
Sbjct: 273 RYEEVYGESPYLVAELGIEMVRGLQ----HNH---------QVAATGKHFAAYSNNKGAR 319

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
                 D +++ +++E   + PF+  ++E     VM SYN  +GIP       L   +RG
Sbjct: 320 EGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRG 379

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
           E    GY+V+D D+++ +   H    D KE AV Q+++AGL++ C       +       
Sbjct: 380 EMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLREL 438

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREG 412
           V++G + E  I+  ++ +  V   +G FD +P    L   D  +  +EN  +A +A+RE 
Sbjct: 439 VKEGGLSEEVINDRVRDILRVKFLIGLFD-APYQTDLADADREVEKEENEAIALQASRES 497

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS----GYA 468
           IVLLKN    LPL+    K +AV GP+AN     + +Y  +     + + G      G A
Sbjct: 498 IVLLKNAGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKA 557

Query: 469 NVTYKTGCDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEAES 514
            V Y  GCD V      S              I  A E A+ AD  I++ G       E+
Sbjct: 558 EVLYTKGCDLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGEN 617

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
             R  L LPG Q QL+  +    K PV+L++++   + I +A  +  + AIL A YPG +
Sbjct: 618 KSRTSLDLPGRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSK 674

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGRTYKF--Y 629
           GG A+AD++FG +NPGG+L +T+     V  +P  + P +P   +D    PG T      
Sbjct: 675 GGTALADILFGDYNPGGKLTVTFPK--TVGQIPF-NFPCKPSSQIDGGKNPGPTGNMSRI 731

Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
           NG  LYPFGYGLSYT F+Y+ L  T  +               T + S T          
Sbjct: 732 NG-ALYPFGYGLSYTTFEYSDLDITPRV--------------ITPNESAT---------- 766

Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
                   ++   N G   G +VV +Y +       TY K + GFQR+ +  G  + + F
Sbjct: 767 -------VRLKVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSF 819

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
             +  K L ++D     ++  G+  +  G
Sbjct: 820 TIDR-KHLELLDADMKWVVEPGDFVLMAG 847


>gi|335433420|ref|ZP_08558246.1| glycoside hydrolase family 3 domain protein [Halorhabdus tiamatea
           SARL4B]
 gi|335434171|ref|ZP_08558974.1| glycoside hydrolase family 3 domain protein [Halorhabdus tiamatea
           SARL4B]
 gi|334898028|gb|EGM36149.1| glycoside hydrolase family 3 domain protein [Halorhabdus tiamatea
           SARL4B]
 gi|334898759|gb|EGM36857.1| glycoside hydrolase family 3 domain protein [Halorhabdus tiamatea
           SARL4B]
          Length = 783

 Score =  274 bits (700), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 222/754 (29%), Positives = 340/754 (45%), Gaps = 121/754 (16%)

Query: 76  EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
           E + +L  +     RLG+P  E   E L G              PG T FP  I   +++
Sbjct: 89  ETINELQRYLVEETRLGIPAIEH-EECLTGYRG-----------PGGTIFPQSIGLASTW 136

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
           + +L + I  ++ T   A+       +   SP ++V+RD RWGR+ ET GEDP +VG   
Sbjct: 137 SPALVESITDSIRTRLDAV-----GTVQALSPVLDVSRDMRWGRVEETYGEDPQLVGALG 191

Query: 196 VNYVRGLQ-DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
             YV GLQ D EG             + +  KH+AA+      G +R     ++ E+++ 
Sbjct: 192 AAYVAGLQSDGEG-------------IDATLKHFAAHG-SGEGGKNRSSV--QIGERELR 235

Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
           E  L PFE+ ++E DA +VM +Y+ ++G+P  +   LL   +RGEW   G++VAD  S+ 
Sbjct: 236 EVHLYPFEVAIQEADARAVMNAYHDIDGVPCASSEWLLTDVLRGEWGFDGHVVADYFSVD 295

Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNAVQQGKVKETDIDKSL 369
           ++ + H  +AD++ +A    L+AGLD+     DC   Y      AV+ G++ E  +D ++
Sbjct: 296 LLKEEHG-IADTQREAGVAALEAGLDVELPATDC---YDENLRKAVEDGELSEATVDTAV 351

Query: 370 KYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAK 429
           + +    +  G FD          +   +DE  ELAA AARE I LL+ND   LPL   +
Sbjct: 352 RRVLRAKIESGVFDDPYVDPDAATEPFDTDEQTELAARAARESITLLEND-GLLPLAGGE 410

Query: 430 VKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----------------FSGYANVTYK 473
           + +VA+VGP A+   A +G+Y     R+ +  AG                 +G+ +V Y 
Sbjct: 411 LDSVALVGPQADDGRAQVGDYTHA-ARFDTEEAGDFESVTPRDALEARGETAGF-DVEYV 468

Query: 474 TGCDDVACKSNNSIFAASEAAKTADATIILAGL----------------DLSVEAESLDR 517
            G   +   S +   AA E    AD  +   G                 D+    E+ D 
Sbjct: 469 EGA-TMTGPSTDGFDAAEETVADADLAVACVGARSDIDFADRENPAELPDVPTSGENCDV 527

Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
            DL LPG Q  L++++AE    P+I+V +S  G   A  E   ++ A+L A  PG+EGG 
Sbjct: 528 TDLELPGVQEALVDRLAET-DTPLIVVQVS--GKPHAIPEIAESVPALLHAWLPGQEGGT 584

Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
           AIADV+FG++NP G LP++       Q +  +  P             + + +G  LY F
Sbjct: 585 AIADVLFGEYNPSGHLPVSVPKSVGQQPVYYSRKP-------NSANEEHVYMDGEPLYSF 637

Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
           GYGLSYT F+Y  L                       DA      G L            
Sbjct: 638 GYGLSYTDFEYGDLEV---------------------DAETVAPMGTLTA---------- 666

Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
            V   N G   G DVV +Y        A  +++++GF+RV +  G  KR+ F F+A + L
Sbjct: 667 SVTVTNAGDVAGDDVVQLYQHAENPSQARPVQELLGFERVHLEPGETKRVTFSFDATQ-L 725

Query: 758 NIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
              D   N  +  G + + VG          +F 
Sbjct: 726 AYHDLDMNLAVEEGPYELRVGKSAAEIVDTADFE 759


>gi|270296173|ref|ZP_06202373.1| conserved hypothetical protein [Bacteroides sp. D20]
 gi|270273577|gb|EFA19439.1| conserved hypothetical protein [Bacteroides sp. D20]
          Length = 942

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 231/808 (28%), Positives = 360/808 (44%), Gaps = 142/808 (17%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
           ++ D S P   R+++L+ +MTLDEK  Q+    +G  R+    LP  EW    W      
Sbjct: 52  VYEDPSAPLEARIENLLQQMTLDEKTCQVVTL-YGYKRVLKDDLPTPEWKELLWKDGIGA 110

Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
             E L+G    G                                  P    ++ I G   
Sbjct: 111 IDEHLNGFQQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGVES 170

Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
             AT+FPT +    ++N  L +++G     EAR +      G T  ++P ++V RD RWG
Sbjct: 171 YRATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWG 224

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  E  GE P++V    +  VRGLQ    H +         +V++  KH+AAY  +    
Sbjct: 225 RYEEVYGESPYLVAELGIEMVRGLQ----HNH---------QVAATGKHFAAYSNNKGAR 271

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
                 D +++ +++E   + PF+  ++E     VM SYN  +GIP       L   +RG
Sbjct: 272 EGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRG 331

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
           E    GY+V+D D+++ +   H    D KE AV Q+++AGL++ C       +       
Sbjct: 332 EMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLREL 390

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG-KQDICSDENIELAAEAAREGI 413
           V++G + E  I+  ++ +  V   +G FD   Q    G  +++  +EN  +A +A+RE I
Sbjct: 391 VKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAGADREVEKEENEAIALQASRESI 450

Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS----GYAN 469
           VLLKN    LPL+    K +AV GP+AN     + +Y  +     + + G      G A 
Sbjct: 451 VLLKNAGELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKGKAE 510

Query: 470 VTYKTGCDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEAESL 515
           V Y  GCD V      S              I  A E A+ AD  I++ G       E+ 
Sbjct: 511 VLYTKGCDLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAIVVLGGGQRTCGENK 570

Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
            R  L LPG Q QL+  +    K PV+L++++   + I +A  +  + AIL A YPG +G
Sbjct: 571 SRTSLDLPGRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKG 627

Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGRTYKF--YN 630
           G A+AD++FG +NPGG+L +T+     V  +P  + P +P   +D    PG T      N
Sbjct: 628 GTALADILFGDYNPGGKLTVTFPK--TVGQIPF-NFPCKPSSQIDGGKNPGPTGNMSRIN 684

Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
           G  LYPFGYGLSYT F+Y+ L  T  +               T + S T           
Sbjct: 685 G-ALYPFGYGLSYTTFEYSDLDITPRV--------------ITPNESAT----------- 718

Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
                  ++   N G   G +VV +Y +       TY K + GFQR+ +  G  + + F 
Sbjct: 719 ------VRLKVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSFT 772

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVG 778
            +  K L ++D     ++  G+  +  G
Sbjct: 773 IDR-KHLELLDADMKWVVEPGDFVLMAG 799


>gi|374320547|ref|YP_005073676.1| glycoside hydrolase [Paenibacillus terrae HPL-003]
 gi|357199556|gb|AET57453.1| glycoside hydrolase family protein [Paenibacillus terrae HPL-003]
          Length = 767

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 207/692 (29%), Positives = 328/692 (47%), Gaps = 98/692 (14%)

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
            T FP  +   +++N  L++ + +AV++E RA     + G   +SP ++V RDPRWGR  
Sbjct: 124 GTVFPVPLSIGSTWNVDLYRDMCRAVASETRA-----QGGAVTYSPVLDVVRDPRWGRTE 178

Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-DVDNWKGVD 240
           E  GEDP+++G +AV  V GLQ     E+    +S    V++  KH+A Y   +  +   
Sbjct: 179 ECFGEDPYLIGEFAVAAVEGLQG----ESLLSEHS----VAATLKHFAGYGSSEGGRNAG 230

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
             H   R    +  E  L PF+  V+ G A SVM +YN ++G+P   + +LL+  +R  W
Sbjct: 231 PVHMGWR----EFLEVDLYPFQKAVEAG-AQSVMPAYNEIDGVPCTVNAELLDGILRQTW 285

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGK 359
              G I+ DC +I+++ + H  +A+   DA  Q ++AG+D++  G+ + +    AV  GK
Sbjct: 286 GFDGLIITDCGAIEMLANGHD-VAEDGSDAAVQAIRAGIDMEMSGEMFGSHLVEAVHAGK 344

Query: 360 VKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKND 419
           ++ + +D++++ + T+  RLG FD         +Q I   E+I LA + A EGIVLLKN 
Sbjct: 345 LETSVLDRAVRRVLTLKFRLGLFDKPYVDAERAEQVIGQTEHIRLARQLATEGIVLLKNV 404

Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP--CRYMSPIAGFSG-----YANVTY 472
             TLPL     K +A++GP+A+     +G+Y       R ++ + G  G      A V Y
Sbjct: 405 DGTLPLPKTS-KRIAIIGPNADQVYNQLGDYTSPQPRSRVITVLDGIRGKLGKDQAGVLY 463

Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAG-----------LDLSVEA--------- 512
             GC  +  +S      A   A   D  +++ G           +DL   A         
Sbjct: 464 APGC-RIKGESREGFENALACAAEVDTVVMVVGGSSARDFGEGTIDLKTGASKVSDHDWN 522

Query: 513 -----ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
                E +DR  L L G Q QL+ +V  + K    LV++   G  IA      +  AI+ 
Sbjct: 523 DMESGEGIDRMTLGLAGVQLQLMQEVYRLGKE---LVVVYMNGRPIAEPWVEEHAHAIVE 579

Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYK 627
           A YPG+EGG AIAD++FG  NP GRL ++     +V  LP+     R        G+ Y 
Sbjct: 580 AWYPGQEGGHAIADILFGDVNPSGRLTLSIPK--HVGQLPVYYNGKRS------RGKRYL 631

Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
             +    YPFGYGLSYT F Y  L+ +                                N
Sbjct: 632 EDDAEPRYPFGYGLSYTTFSYERLTLS-------------------------------AN 660

Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
            +R D+     VD  N G  +G++VV +Y           I+++ GF +V ++ G  + +
Sbjct: 661 SIRADESVTVTVDVTNTGEREGAEVVQLYISDTVSSVTRPIRELKGFCKVVLKPGETRTV 720

Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           +FV  + K L  +     +++ AG  +I VG 
Sbjct: 721 EFVVGSDK-LQYIGRDLKSVVEAGRFSIEVGR 751


>gi|402304900|ref|ZP_10823963.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
 gi|400380686|gb|EJP33499.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
          Length = 866

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 157/440 (35%), Positives = 237/440 (53%), Gaps = 30/440 (6%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R +DL SR+TL+EK + + + +  +PRLG+PQ+EWWSEALHG++  G           AT
Sbjct: 35  RAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG----------FAT 84

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
            FP      AS+++ L  ++  A S EA A  NL R         G++ W+PNIN+ RDP
Sbjct: 85  VFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLARKSGDIKRYQGVSIWTPNINIFRDP 144

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAAYDV 233
           RWGR  ET GEDP++  R  +  V GLQ      +      RP   K  +C KHYA +  
Sbjct: 145 RWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDMRPFTERPRYYKTLACAKHYAVHSG 204

Query: 234 DNWKGVDRYHFDA-RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
             W   +R+ FD  R+ E+D+ ET+L  F+  V+EG+   VMC+Y R++G P C + + L
Sbjct: 205 PEW---NRHVFDVERLPERDLWETYLPAFKSLVQEGNVREVMCAYQRIDGSPCCGNTRYL 261

Query: 293 NQTVRGEWDLHGYIVADCDSIQ-VMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
           +Q +RGEW  +G +V+DC +I     + H  + ++  +A A  ++AG D++CG  Y    
Sbjct: 262 HQILRGEWGYNGLVVSDCGAISDFYREGHHHVVETPAEASAMGVRAGTDVECGAVYATLP 321

Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAA 409
             AV+QG +    ID S+  L      +G FD      +   G + I S+ +  LA + A
Sbjct: 322 -RAVEQGLISREAIDTSVVRLLKARFEVGDFDSEKLVPWKLTGPEVIASETHRRLALDMA 380

Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF-SGYA 468
           RE + LL+N    LPL+   ++ +AV+GP+AN +V + GNY G P    + + G  S   
Sbjct: 381 RESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSVMLWGNYTGYPISTTTILKGIRSKVP 439

Query: 469 NVTYKTGCDDVACKSNNSIF 488
              +  GC  +  +   S F
Sbjct: 440 AARFVEGCGYIRNEIRQSHF 459



 Score =  118 bits (296), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 91/315 (28%), Positives = 144/315 (45%), Gaps = 68/315 (21%)

Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
           D+A KS  +    +  A  AD  + + G+   +E E +          DR  + LP  Q 
Sbjct: 592 DIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVDAPGFKGGDRTSIELPEAQR 651

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
           ++I  + +  K   ++V ++  G  +A         A+L A Y GE GG+A+ADV+FG +
Sbjct: 652 EVIRLLRQAGK---LVVFVNCSGGAVALVPEAEACDAVLQAWYAGEAGGQAVADVLFGDY 708

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY--PGRTYKFYNGPTLYPFGYGLSYTQ 645
           NP G+LP+T+Y  D         +P    D L Y   GRTY+++ G  L+PFG+GLSYT 
Sbjct: 709 NPSGKLPVTFYKSD-------ADLP----DFLDYRMTGRTYRYFRGTPLFPFGFGLSYTS 757

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
           F +              K ++   + Y                          V+  N G
Sbjct: 758 FAF-------------GKPRYENGMLY--------------------------VEVTNTG 778

Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
             DG++VV VY K PA+ A   +K + GF R+ ++AG  +R++      +     D  AN
Sbjct: 779 KRDGAEVVQVYVKNPAD-ADGPVKTLRGFARIDLKAGERRRVEIAMPR-ERFEGWDATAN 836

Query: 766 TL-LPAGEHTIFVGN 779
           T+ +  G H + VG+
Sbjct: 837 TMRVKPGNHLLMVGS 851


>gi|288927072|ref|ZP_06420962.1| beta-glucosidase [Prevotella buccae D17]
 gi|288336152|gb|EFC74543.1| beta-glucosidase [Prevotella buccae D17]
          Length = 866

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 157/440 (35%), Positives = 237/440 (53%), Gaps = 30/440 (6%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R +DL SR+TL+EK + + + +  +PRLG+PQ+EWWSEALHG++  G           AT
Sbjct: 35  RAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG----------FAT 84

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
            FP      AS+++ L   +  A S EA A  NL R         G++ W+PNIN+ RDP
Sbjct: 85  VFPQTTAMAASWDDELLYHVFCAASDEAVAKNNLARKSGDIKRYQGVSIWTPNINIFRDP 144

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAAYDV 233
           RWGR  ET GEDP++  R  +  V GLQ      +      RP   K  +C KHYA +  
Sbjct: 145 RWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDMRPFTERPRYYKTLACAKHYAVHSG 204

Query: 234 DNWKGVDRYHFDA-RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
             W   +R+ FD  R+ E+D+ ET+L  F+  V+EG+   VMC+Y R++G P C + + L
Sbjct: 205 PEW---NRHVFDVERLPERDLWETYLPAFKSLVQEGNVREVMCAYQRIDGSPCCGNTRYL 261

Query: 293 NQTVRGEWDLHGYIVADCDSIQ-VMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
           +Q +RGEW+ +G +V+DC +I     + H  + ++  +A A  ++AG D++CG  Y    
Sbjct: 262 HQILRGEWEYNGLVVSDCGAISDFYREGHHHVVETPAEASAMGVRAGTDVECGAVYATLP 321

Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAA 409
             AV+QG +    ID S+  L      +G FD      +   G + I S+ +  LA + A
Sbjct: 322 -RAVEQGLISREAIDTSVVRLLKARFEVGDFDSEKLVPWKLTGPEVIASETHRRLALDMA 380

Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF-SGYA 468
           RE + LL+N    LPL+   ++ +AV+GP+AN +V + GNY G P    + + G  S   
Sbjct: 381 RESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSVMLWGNYTGYPISTTTILKGIRSKVP 439

Query: 469 NVTYKTGCDDVACKSNNSIF 488
              +  GC  +  +   S F
Sbjct: 440 AARFVEGCGYIRNEIRQSHF 459



 Score =  116 bits (290), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 88/315 (27%), Positives = 140/315 (44%), Gaps = 68/315 (21%)

Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
           D+A KS  +    +  A  AD  + + G+   +E E +          DR  + LP  Q 
Sbjct: 592 DIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVDAPGFKGGDRTSIELPEAQR 651

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
           ++I  + +  K   ++V ++  G  +A         A+L A Y GE GG+A+ADV+FG +
Sbjct: 652 EVIRLLRQAGK---LVVFVNCSGGAVALVPETEACDAVLQAWYAGEAGGQAVADVLFGDY 708

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY--PGRTYKFYNGPTLYPFGYGLSYTQ 645
           NP G+LP+T+Y  D         +P    D L Y   GRTY+++ G  L+PFG+GLSYT 
Sbjct: 709 NPSGKLPVTFYKSD-------ADLP----DFLDYRMTGRTYRYFRGIPLFPFGFGLSYTS 757

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
           F +    +                                          +  V+  N G
Sbjct: 758 FAFGKPRYENG---------------------------------------KLYVEVTNTG 778

Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
             DG++VV VY K PA+ A   +K + GF R+ ++AG  +R++      +     D   N
Sbjct: 779 KRDGAEVVQVYVKNPAD-ADGPVKTLRGFARIDLKAGERRRVEIAMPR-ERFEGWDATTN 836

Query: 766 TL-LPAGEHTIFVGN 779
           T+ +  G H + VG+
Sbjct: 837 TMRVKPGNHLLMVGS 851


>gi|395803818|ref|ZP_10483061.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
 gi|395434089|gb|EJG00040.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
          Length = 875

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 159/425 (37%), Positives = 237/425 (55%), Gaps = 38/425 (8%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           F F ++ L +  RV++LVS++TL+EKV Q+ + A  +PRLG+P Y+WW+E LHGV+    
Sbjct: 27  FPFQNTDLTFEERVENLVSQLTLEEKVAQMLNAAPAIPRLGIPAYDWWNETLHGVAR--- 83

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN----LGRA-----GL 162
            T F       T FP  I   A+F+++   K+    + E RA+YN    L R      GL
Sbjct: 84  -TPFK-----TTVFPQAIAMAATFDKNSLFKMADYSALEGRAIYNKAVELNRTKERYLGL 137

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           TYW+PNIN+ RDPRWGR  ET GEDP++       +V+GLQ  +          + LK +
Sbjct: 138 TYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQGDD---------PKYLKAA 188

Query: 223 SCCKHYAAYDVDNWKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
           +C KHYA +      G +  R+ FD  VT  ++ +T+L  F+  V     + VMC+YN  
Sbjct: 189 ACAKHYAVHS-----GPESLRHTFDVDVTPYELWDTYLPAFKKLVTNSKVAGVMCAYNAF 243

Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD 340
              P CA   L+N  +R +W   GY+ +DC +I     NHK   D+   +    L  G D
Sbjct: 244 RTQPCCASDILMNDILRNQWKFTGYVTSDCWAIDDFFKNHKTHPDAASASADAVLH-GTD 302

Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICS 398
           +DCG         AV+ G++ E  ID S+K L+ +  RLG FD     +Y       + S
Sbjct: 303 IDCGTDAYKSLVQAVKNGQITEKQIDVSVKRLFMIRFRLGMFDPVSMVKYAQTPSSVLES 362

Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM 458
           +E+ E A + AR+ IVLLKN++NTLPL S K+K + V+GP+A+ +++++GNY G P +  
Sbjct: 363 EEHKEHALKMARQSIVLLKNEKNTLPL-SKKLKKIVVLGPNADNSISILGNYNGTPSKLT 421

Query: 459 SPIAG 463
           + + G
Sbjct: 422 TVLQG 426



 Score =  113 bits (282), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 99/318 (31%), Positives = 144/318 (45%), Gaps = 59/318 (18%)

Query: 475 GCDDVACKSNNSI---FA-ASEAAKTADATIILAGLDLSVEAESL----------DREDL 520
           G  +VA ++ N I   FA   E  K ADA I   G+   +E E +          DR  +
Sbjct: 580 GKAEVALQTGNFIKTDFANLIERHKNADAFIFAGGISPQLEGEEMPVDAPGFNGGDRTSI 639

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
            LP  QT+L+  +    K PV+ +IM+  G  IA      NI AIL   Y G+  G A A
Sbjct: 640 LLPEVQTRLLKALQSSGK-PVVFLIMT--GSAIAVPWEAENIPAILNIWYGGQSAGTASA 696

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
           DV+FG +NP GRLP+T+Y GD      L+S     +D+     +TY+++ G  LY FGYG
Sbjct: 697 DVIFGDYNPAGRLPVTFYKGDS----DLSSFVDYKMDN-----KTYRYFKGIPLYGFGYG 747

Query: 641 LSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
           LSYT+FKY+ L                     T D  K   P  +             V 
Sbjct: 748 LSYTEFKYSGLK--------------------TPDKIKKGQPVTI------------SVK 775

Query: 701 FQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIV 760
             N G  +G +V  +Y   P     + +K + GF+R  ++ G++  + F   + + L+ V
Sbjct: 776 VTNTGKMEGEEVAQLYLINPNTSIKSPLKSLKGFERFNLKPGQSTVVNFTL-SPEDLSYV 834

Query: 761 DYAANTLLPAGEHTIFVG 778
             + N     G+  I VG
Sbjct: 835 TESGNLKPYEGKIQIAVG 852


>gi|398386387|ref|ZP_10544389.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
 gi|397718418|gb|EJK79007.1| beta-glucosidase-like glycosyl hydrolase [Sphingobium sp. AP49]
          Length = 791

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 220/733 (30%), Positives = 338/733 (46%), Gaps = 101/733 (13%)

Query: 78  VQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNE 137
           V  L  +A    RLG+P   +  E LHG + VG           ATSFP  I   +S++ 
Sbjct: 126 VNALQKWAMTETRLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDP 173

Query: 138 SLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVN 197
           ++ +++ Q +  E RA     R      SP +++ARDPRWGRI ET GEDP++VG   V 
Sbjct: 174 TMLRQVNQVIGREIRA-----RGVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVA 228

Query: 198 YVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDMEET 256
            V GLQ  EG         RP  V +  KH   +   ++   V      A V+E+++ E 
Sbjct: 229 AVEGLQG-EGRSRLL----RPGHVFATLKHLTGHGQPESGTNVG----PAPVSERELREN 279

Query: 257 FLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM 316
           F  PFE  VK     +VM SYN ++G+PS A+  LL+  +R EW   G +V+D  ++  +
Sbjct: 280 FFPPFEQVVKRTGIEAVMASYNEIDGVPSHANRWLLDNVLRQEWGFRGAVVSDYSAVDQL 339

Query: 317 VDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT-GNAVQQGKVKETDIDKSLKYLYTV 375
           +  H  +A + E+A  + L AG+D D  +  +  T G  V++GKV E  +D +++ +  +
Sbjct: 340 MSIH-HIAANLEEAAMRALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRRMLEL 398

Query: 376 LMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAV 435
             R G F+      +       +DE   LA  AA+  I LLKND   LPL      T+AV
Sbjct: 399 KFRAGLFENPYADANAAAAITNNDEARALARTAAQRSITLLKND-GMLPLKPE--GTIAV 455

Query: 436 VGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGC---------DDVACK 482
           +GP  +A VA +G Y G P   +S + G        AN+ +  G          +D   K
Sbjct: 456 IGP--SAAVARLGGYYGQPPHSVSILEGIKARVGTKANIVFAQGVKITENDDWWEDKVVK 513

Query: 483 SNNS-----IFAASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQTQLIN 531
           S+ +     I  A EAA+  D  I+  G       E        DR  L L G Q +L +
Sbjct: 514 SDPAENRKLIAQAVEAARNVDRIILTLGDTEQSSREGWADNHLGDRPSLDLVGEQQELFD 573

Query: 532 QVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGG 591
            +  + K P+ +V+++  G   +  + +    AIL   Y GE+GG A+AD++FG  NPGG
Sbjct: 574 ALKALGK-PITVVLIN--GRPASTVKVSEQANAILEGWYLGEQGGNAVADILFGDVNPGG 630

Query: 592 RLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLL 651
           +LP+T      V  LP+    ++P        R Y F     LYPFG+GLSYT F  +  
Sbjct: 631 KLPVTVPRS--VGQLPMF-YNMKPSAR-----RGYLFDTTDPLYPFGFGLSYTNFSLS-- 680

Query: 652 SFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSD 711
                                         P +    +         VD +N G+ +G +
Sbjct: 681 -----------------------------APRLSATKIGTGGKTSVSVDVRNTGAREGDE 711

Query: 712 VVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAG 771
           VV +Y +         +K++ GFQRV ++ G ++ + F     ++L + +     ++  G
Sbjct: 712 VVQLYIRDKVSSVTRPVKELKGFQRVTLKPGESRTVTFTV-GPEALQMWNDQMRRVVEPG 770

Query: 772 EHTIFVGNGGVSF 784
           +  I  GN  V+ 
Sbjct: 771 DFEIMTGNSSVAL 783


>gi|315607027|ref|ZP_07882031.1| beta-glucosidase [Prevotella buccae ATCC 33574]
 gi|315251081|gb|EFU31066.1| beta-glucosidase [Prevotella buccae ATCC 33574]
          Length = 866

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 157/440 (35%), Positives = 237/440 (53%), Gaps = 30/440 (6%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R +DL SR+TL+EK + + + +  +PRLG+PQ+EWWSEALHG++  G           AT
Sbjct: 35  RAEDLCSRLTLEEKTKLMRNSSPAIPRLGIPQFEWWSEALHGIARNG----------FAT 84

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
            FP      AS+++ L  ++  A S EA A  NL R         G++ W+PNIN+ RDP
Sbjct: 85  VFPQTTAMAASWDDELLYRVFCAASDEAVAKNNLARKSGDIKRYQGVSIWTPNINIFRDP 144

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAAYDV 233
           RWGR  ET GEDP++  R  +  V GLQ      +      RP   K  +C KHYA +  
Sbjct: 145 RWGRGQETYGEDPYLTSRMGLAVVNGLQGQPFRRDMRPFTERPRYYKTLACAKHYAVHSG 204

Query: 234 DNWKGVDRYHFDA-RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
             W   +R+ FD  R+ E+D+ ET+L  F+  V+EG+   VMC+Y R++G P C + + L
Sbjct: 205 PEW---NRHVFDVERLPERDLWETYLPAFKSLVQEGNVREVMCAYQRIDGSPCCGNTRYL 261

Query: 293 NQTVRGEWDLHGYIVADCDSIQ-VMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
           +Q +RGEW  +G +V+DC +I     + H  + ++  +A A  ++AG D++CG  Y    
Sbjct: 262 HQILRGEWGYNGLVVSDCGAISDFYREGHHHVVETPAEASAMGVRAGTDVECGAVYATLP 321

Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAA 409
             AV+QG +    ID S+  L      +G FD      +   G + I S+ +  LA + A
Sbjct: 322 -RAVEQGLISREAIDTSVVRLLKARFEVGDFDSEKLVPWKLTGPEVIASETHRRLALDMA 380

Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF-SGYA 468
           RE + LL+N    LPL+   ++ +AV+GP+AN +V + GNY G P    + + G  S   
Sbjct: 381 RESMTLLQNRNRLLPLSKNGLR-IAVMGPNANDSVMLWGNYTGYPISTTTILKGIRSKVP 439

Query: 469 NVTYKTGCDDVACKSNNSIF 488
              +  GC  +  +   S F
Sbjct: 440 AARFVEGCGYIRNEIRQSHF 459



 Score =  116 bits (290), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 88/315 (27%), Positives = 140/315 (44%), Gaps = 68/315 (21%)

Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
           D+A KS  +    +  A  AD  + + G+   +E E +          DR  + LP  Q 
Sbjct: 592 DIARKSPITASEIAAQAGDADVVVFVGGISPRLEGEEMKVDAPGFNGGDRTSIELPEAQR 651

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
           ++I  + +  K   ++V ++  G  +A         A+L A Y GE GG+A+ADV+FG +
Sbjct: 652 EVIRLLRQAGK---LVVFVNCSGGAVALVPEAEACDAVLQAWYAGEAGGQAVADVLFGDY 708

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGY--PGRTYKFYNGPTLYPFGYGLSYTQ 645
           NP G+LP+T+Y  D         +P    D L Y   GRTY+++ G  L+PFG+GLSYT 
Sbjct: 709 NPSGKLPVTFYKSD-------ADLP----DFLDYRMTGRTYRYFRGTPLFPFGFGLSYTS 757

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
           F +    +                                          +  V+  N G
Sbjct: 758 FVFGTPRYENG---------------------------------------KLYVEVTNTG 778

Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
             DG++VV VY K PA+ A   +K + GF R+ ++AG  +R++      +     D   N
Sbjct: 779 KRDGAEVVQVYVKNPAD-ADGPVKTLRGFARIDLKAGERRRVEIAMPR-ERFEGWDATTN 836

Query: 766 TL-LPAGEHTIFVGN 779
           T+ +  G H + VG+
Sbjct: 837 TMRVKPGNHLLMVGS 851


>gi|325918730|ref|ZP_08180824.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325535054|gb|EGD06956.1| beta-glucosidase-like glycosyl hydrolase [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 391

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 165/397 (41%), Positives = 223/397 (56%), Gaps = 38/397 (9%)

Query: 45  LGLQMSSFLFC---DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE 101
           LGL +    F    D S     R   LV++M+ DEKV Q  + A  +PRL +P YEWWSE
Sbjct: 13  LGLCLPCIAFAAPADRSGTPEQRAAALVAQMSRDEKVAQAMNDAPAIPRLDIPAYEWWSE 72

Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG--- 158
            LHG++  G           AT FP  I   AS+N +L +++G  VSTEARA +N     
Sbjct: 73  GLHGIARNG----------YATVFPQAIGLAASWNTALMQQVGTVVSTEARAKFNQAGGP 122

Query: 159 ------RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
                  AGLT WSPNIN+ RDPRWGR  ET GEDPF+ G+ AV ++RGLQ         
Sbjct: 123 GKDHKRYAGLTIWSPNINIFRDPRWGRGMETYGEDPFLTGQLAVGFIRGLQ-------GD 175

Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
           DLN  P  +++  KH A   V +     R+ FD  V+ +DME T+   F   + +G A S
Sbjct: 176 DLN-HPRTIATP-KHIA---VHSGPEPGRHGFDVDVSPRDMEATYTPAFRAALVDGQAWS 230

Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
           VMC+YN ++G P+CA   LLN  VRG+W   G++V+DCD++  M   H F  D+   + A
Sbjct: 231 VMCAYNSLHGTPACAADWLLNGRVRGDWGFKGFVVSDCDAVDDMTQFHYFRPDNAGSSAA 290

Query: 333 QTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVS 390
             LKAG DL+CG  Y    G A+++G+V E  +D+SL  L+    RLG  +   +  Y  
Sbjct: 291 -ALKAGHDLNCGHAYREL-GTAIERGEVDEALLDQSLVRLFAARYRLGELEAPRKDPYAR 348

Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNS 427
           LG +D+ +  +  LA +AA E IVLLKN   TLPL +
Sbjct: 349 LGAKDVDNAAHRALALQAAAESIVLLKNTATTLPLKA 385


>gi|393788557|ref|ZP_10376684.1| hypothetical protein HMPREF1068_02964 [Bacteroides nordii
           CL02T12C05]
 gi|392654237|gb|EIY47885.1| hypothetical protein HMPREF1068_02964 [Bacteroides nordii
           CL02T12C05]
          Length = 859

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 224/819 (27%), Positives = 362/819 (44%), Gaps = 146/819 (17%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD------ 83
           S S +  C     S + +      +   +LP   RV DL+SRMTL+EKV Q+        
Sbjct: 6   SISRLLFCSSLFLSGISVFAQELPYKQPNLPIEERVNDLLSRMTLEEKVAQIRHIHSWNI 65

Query: 84  ---------------------FAHGVP---------------------RLGLPQYEWWSE 101
                                F  G P                     RLG+P +   +E
Sbjct: 66  FNGQTLDTEKLKAFSKGMSWGFVEGFPLTGANCRKNMQLVQKFMVENTRLGIPVFTV-AE 124

Query: 102 ALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTE--ARAMYNLGR 159
           +LHG            V  G+  +P  +   ++F+  L  +    ++ +  A+ M+ +  
Sbjct: 125 SLHG-----------SVHEGSVIYPQNVALGSTFSPELAYRKAAMITKDLHAQGMHQV-- 171

Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
                 +P I+V RD RWGR+ E+ GEDP + G + +  V+G  D     N         
Sbjct: 172 -----LAPCIDVVRDLRWGRVEESFGEDPILCGLFGIAEVKGYMD-----NG-------- 213

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
            +S   KHY  +  +   G++    +  +  +D+ E +L+PFEM ++     +VM +YN 
Sbjct: 214 -ISPMLKHYGPHG-NPLSGLNLASVECGL--RDLHEVYLKPFEMVIRNTSVLAVMSTYNS 269

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
            N IP+ A   LL + +R ++   GY+ +D  +I+++   H + A + E+A  Q   AGL
Sbjct: 270 WNRIPNSASHYLLTEVLRNQFGFKGYVYSDWGAIEMLKTLH-YTAHNSEEAAMQAFTAGL 328

Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSD 399
           D++          + +++GK+ E  +++S++ +  V  ++G F+  P         +   
Sbjct: 329 DVEASSNCYPLLADLIKEGKLDEEILNESVRRVLYVKFKMGLFE-DPYGEQYAHCKMHPQ 387

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY-- 457
           E ++L+ E A E +VLLKN+   LPLN+ K+++VAV+GP  NA     G+Y         
Sbjct: 388 EGVQLSKEIADESVVLLKNENGLLPLNAEKLRSVAVIGP--NADQVQFGDYTWSRNNKDG 445

Query: 458 MSPIAG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG-------- 505
           M+P+AG          V Y+ GC  V+  + + I  A E A+ ++  I+  G        
Sbjct: 446 MTPLAGIRQLLGDKVTVRYEKGCSLVSLDT-SGIKKAVEVARQSEVAIVFCGSASAALAR 504

Query: 506 -LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
               S   E  D  DL L G Q+QLI +V E    PV+LV+++     I++ +   +I A
Sbjct: 505 DYKSSTCGEGFDLNDLNLTGAQSQLIKEVYETGT-PVVLVLVTGKPFTISWEK--KHIPA 561

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSL 619
           IL   Y GE+ G +IAD++FGK +P GRL  ++         Y   LP      +   S 
Sbjct: 562 ILTQWYAGEQAGNSIADILFGKISPSGRLTFSFPQSTGHLPVYYDYLPSDKGFYKNPGSY 621

Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
             PGR Y F +   L+ FG+GL+YT F Y  +   K         +H             
Sbjct: 622 ETPGRDYVFSSPDPLWAFGHGLTYTSFVYKSMETDK---------EH------------- 659

Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
                        D    KVD +N G  DG +VV +Y +       T +KQ+  F++V V
Sbjct: 660 ---------YDPTDTIYVKVDIKNTGKRDGKEVVQLYVRDKVSTVVTPVKQLRDFEKVLV 710

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
            AG  + ++    A K L IVD     ++  GE  + VG
Sbjct: 711 EAGSTRTVRLKV-AVKDLYIVDAGDRRIVEPGEFELQVG 748


>gi|225873993|ref|YP_002755452.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
 gi|225791521|gb|ACO31611.1| beta-xylosidase B [Acidobacterium capsulatum ATCC 51196]
          Length = 894

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 166/468 (35%), Positives = 245/468 (52%), Gaps = 48/468 (10%)

Query: 39  PGRFSKLGLQM-SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYE 97
           P  F++   Q  S+  + + SLP  +R +DLVSRMTL EK  QL + A  +PRL +P Y 
Sbjct: 23  PSAFAQSQTQSPSTPAYLNPSLPPVVRARDLVSRMTLKEKASQLVNAARAIPRLKVPAYN 82

Query: 98  WWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
           WWSEALHGV+           + G T FP  I   A+F+     ++   + TE R +Y  
Sbjct: 83  WWSEALHGVA-----------VNGTTEFPEPIGLGATFDVPAIHEMAVDIGTEGRVVYEE 131

Query: 158 GRA--------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
                      GL +W+PN+N+ RDPRWGR  ET GEDPF+ G+  V +V G+Q      
Sbjct: 132 NEKDGSSKIFHGLDFWAPNLNIFRDPRWGRGQETYGEDPFLTGKMGVAFVSGMQGD---- 187

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
                N +  +V +  KH+   DV +     R+  D  V+  D  +T+   F   + +G 
Sbjct: 188 -----NPKYYRVIATPKHF---DVHSGPEPTRHFADVDVSLHDQLDTYEPAFRAAIMQGH 239

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           A SVMCSYN +NG P+CA+   L   +RG W   GY+V+DCD++  +   HK+   +   
Sbjct: 240 ADSVMCSYNAINGQPACANQFTLQHQLRGAWGFKGYVVSDCDAVHDIYSGHKYRP-TLAQ 298

Query: 330 AVAQTLKAGLDLDCGQY--------YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGF 381
           A A +++ G+D DC  +        Y  +  +AVQQG + +  +D +L  L+T  ++LG 
Sbjct: 299 AAAISMERGMDNDCADFAQPKGDDDYKAYI-DAVQQGYLSQQAMDTALVRLFTARIKLGL 357

Query: 382 FD--GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPH 439
           FD  G   Y      ++ S  +   A + A E +VLLKND  TLPL    V ++AVVGP 
Sbjct: 358 FDPKGMDPYADTPHSELNSPAHRAYARKLADESMVLLKND-GTLPLKPGSVHSIAVVGPL 416

Query: 440 ANATVAMIGNYAGIPCRYMSPIAGFSG-YAN--VTYKTGCDDVACKSN 484
           A+ T  ++GNY G+P   +S + G    Y N  +TY  G   ++  +N
Sbjct: 417 ADQTAVLLGNYNGVPTHTVSFLEGLRAEYPNTKITYVPGTQFLSDTAN 464



 Score =  116 bits (291), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 87/290 (30%), Positives = 135/290 (46%), Gaps = 55/290 (18%)

Query: 501 IILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
           I + G+   +E E +          DR +L +P  +  L+  VA+  K PV++V+M+   
Sbjct: 627 IAVVGITSKLEGEEMPVDQPGFLGGDRTNLQMPEPEEALVEAVAKTGK-PVVVVLMNGSA 685

Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
           + + +   + N  A+L A Y GEEGG AIAD + GK +P GRLP+T+Y    V  LP   
Sbjct: 686 LAVNWISQHAN--AVLEAWYSGEEGGAAIADTLSGKNDPAGRLPVTFYKS--VNQLPN-- 739

Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
                 +      RTY+++ G  LYPFGYGLSYT F+Y+ LS                  
Sbjct: 740 -----FEDYSMENRTYRYFKGKPLYPFGYGLSYTTFRYSDLSIPHA-------------- 780

Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
             T DA +                 E      N G   G +VV +Y K P    A  I  
Sbjct: 781 --TVDAGQP---------------VEASATVTNTGKVAGDEVVQLYLKFPKVDGAPDIA- 822

Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           + GFQR+ +  G+++++ F     + L++V      ++  G++T+ +G G
Sbjct: 823 LRGFQRIHLEPGQSQQVHFELKK-RDLSMVTALGQIIVAQGDYTLSIGGG 871


>gi|329957143|ref|ZP_08297710.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
 gi|328523411|gb|EGF50510.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
          Length = 803

 Score =  273 bits (697), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 212/724 (29%), Positives = 331/724 (45%), Gaps = 113/724 (15%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P  ++ +E +HG+++             AT  P  I   +++N+ L ++ G     
Sbjct: 142 RLGIP-VDFTNEGIHGLNHTK-----------ATPLPAPIAIGSTWNKELVRRAGVIAGQ 189

Query: 150 EARAMYNLGRAGLT-YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
           EA+A+      G T  ++P ++V RDPRWGR  E  GE+PF++       V G+Q  +G 
Sbjct: 190 EAKAL------GYTNVYAPILDVVRDPRWGRTLECYGEEPFLIAALGTEMVNGIQS-QG- 241

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
                       V++  KHYA Y V           D  V  +++ E FL PF+  ++  
Sbjct: 242 ------------VAATLKHYAVYSVPKGGRDGHCRTDPHVAPRELHELFLYPFKKVIQNS 289

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
               VM SYN  +G+P  A    L + +R E+   GY+V+D  +++ +   H  +AD+ +
Sbjct: 290 HPMGVMSSYNDWDGVPVSASYYFLTELLREEYGFDGYVVSDSQAVEFVESKH-HVADTYD 348

Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNA---------VQQGKVKETDIDKSLKYLYTVLMRL 379
           +AV Q L+AGL++      T+FT  +         +++ K+    IDK +  +  V  RL
Sbjct: 349 EAVRQVLEAGLNV-----RTHFTPPSDFILPIRRLLEEKKISMATIDKRVSEVLRVKFRL 403

Query: 380 GFFDGSPQYVSLGKQDIC--SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG 437
           G FD  P     G  D    +D N++   E  ++ +VLLKN+ N LPL+  ++K V V G
Sbjct: 404 GLFD-RPYVTDTGAADNVGGADRNMDFVKEMQQQALVLLKNENNILPLDKQRIKKVLVTG 462

Query: 438 PHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCDDVAC------------ 481
           P A+    M   Y       ++ +AG   Y    A V Y  GCD V              
Sbjct: 463 PLADEDNFMTSRYGPNGLETVTVLAGLRAYLQGVAEVDYAKGCDIVDAGWPATEILPVPM 522

Query: 482 --KSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG 539
             +    I  A   A  +D  I + G D     ES  R  L LPG Q QL+  +    K 
Sbjct: 523 NEREKRGIAEAVAKAGESDVVIAVLGEDEYRTGESRSRTSLDLPGRQQQLLEALHATGK- 581

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PVILV+++   + + +A  N  I AIL + +PG +GG  IA+ +FG+ NPGG+L +T+  
Sbjct: 582 PVILVLINGQPLTVNWA--NAYIPAILESWFPGCQGGTVIAETLFGEHNPGGKLTVTFPK 639

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT-----LYPFGYGLSYTQFKYNLLSFT 654
              V  + L + P +P  S G   ++    +G T     LYPFG+GLSYT F Y+ L  +
Sbjct: 640 S--VGQIEL-NFPFKP-GSHGSQPKSGPNGSGATRVIGELYPFGFGLSYTTFAYSDLEVS 695

Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
              Q                               R    +  KV+  N G   G +VV 
Sbjct: 696 PLRQ-------------------------------RTQGEYTVKVNVTNTGKRAGDEVVQ 724

Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
           +Y +       TY  Q+ GF+RV ++ G  +++ F     + L I+D   N  +  GE  
Sbjct: 725 LYVRDKVSSVITYDSQLRGFERVSLKPGETRQVTFSLKP-EDLQILDRNMNWTVEPGEFE 783

Query: 775 IFVG 778
           + +G
Sbjct: 784 VMIG 787


>gi|440747308|ref|ZP_20926567.1| Periplasmic beta-glucosidase [Mariniradius saccharolyticus AK6]
 gi|436484228|gb|ELP40232.1| Periplasmic beta-glucosidase [Mariniradius saccharolyticus AK6]
          Length = 763

 Score =  273 bits (697), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 227/785 (28%), Positives = 361/785 (45%), Gaps = 131/785 (16%)

Query: 60  PYSIRVKDLVSRMTLDEKVQQL-----GDFAHG------------------------VPR 90
           P+  RV  +++ MTL+EK+ QL     GDF  G                        V +
Sbjct: 30  PFRDRVDSVMALMTLEEKIGQLNLPAAGDFTTGQASSSNIAEKIKAGLVGGLFNIKSVAK 89

Query: 91  LGLPQYEWWSEALHGVSNVGPGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVST 149
           +   Q     E+  G+    P     DVI G  T FP  I  + S++ +L +K  +  + 
Sbjct: 90  IRDVQRVAVEESRLGI----PLIFAMDVIHGYETVFPIPIGMSCSWDMALMEKSARIAAQ 145

Query: 150 EARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
           EA A       G+ + +SP  +++RDPRWGR++E  GEDP++  + A   ++G Q  +  
Sbjct: 146 EASA------DGINWTFSPMTDISRDPRWGRMSEGSGEDPYLGAQIAKAMIKGYQGDDLS 199

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
            N T L        +C KH+A Y      G D    D  ++ Q M   +  P++  ++ G
Sbjct: 200 LNNTIL--------ACVKHFALYGAPE-AGRDYNTVD--MSRQRMFNEYFLPYQAAIEAG 248

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
              SVM S+N V+GIP+ A+  L+ + +R  W   G++V D  +I  M D+   L D ++
Sbjct: 249 -VGSVMTSFNDVDGIPASANKWLMTEVLRERWGFEGFVVTDYTAINEMTDHG--LGDLQQ 305

Query: 329 DAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ 387
              A  + AG+D+D  G+ +      +V++GKV E +ID + + + T   +LG FD   +
Sbjct: 306 -VSALAMNAGVDMDMVGEGFLTTLKKSVEEGKVSEAEIDAACRRILTAKFKLGLFDDPYR 364

Query: 388 Y--VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
           Y  V   K++I SD + ++A E A +  VLLKN+  TLPL   K  T+A+VGP A+ T  
Sbjct: 365 YCDVERAKREIFSDAHRKVAREIATQTFVLLKNENQTLPLK--KEGTIALVGPMADNTEN 422

Query: 446 MIGNYAGIPCRYMSPIAGFSGYAN-------VTYKTGCD---DVACKSNNSIFA------ 489
           M G ++ +  R+ + I+   G  N       + Y  G +   D   +S  SIF       
Sbjct: 423 MTGTWS-VAARFENSISLRKGLENALGDRAKIVYAKGSNIYPDSLLESRVSIFGKPTYRD 481

Query: 490 ----------ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG 539
                     A +AA+ A+  +   G    +  ES  R D+ +P  Q  L+  + +  K 
Sbjct: 482 NRPAQVLIQEALQAARNANVIVAAMGESAEMSGESSSRTDIEIPENQRALLEALLKTGK- 540

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV+LV+ +  G  +A      N+ AIL   + G E G AIADV+FG  NP G+L  T+  
Sbjct: 541 PVVLVLFT--GRPLAIKWEQENLHAILNVWFAGSEAGHAIADVLFGDVNPSGKLSATFPQ 598

Query: 600 GDYVQMLPL------TSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSF 653
              V  +P+      T  PL            Y   +   LYPFG+GLSYT F+Y  +  
Sbjct: 599 N--VGQVPIYYNHKSTGRPLAAGQWFQKFRTNYLDVSNDPLYPFGFGLSYTDFEYGEIKL 656

Query: 654 TKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVV 713
           +K+                               +L  D+     +D +N G  DG++VV
Sbjct: 657 SKS-------------------------------ELVGDERIRVSIDVKNAGGVDGAEVV 685

Query: 714 IVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEH 773
            +Y +         +K++ GF++VF++AG +K ++F     + L   +   N +   GE 
Sbjct: 686 QLYVRDIVASMTRPVKELKGFEKVFLKAGESKTVRFELGQ-EQLKFYNNDLNFIFEPGEF 744

Query: 774 TIFVG 778
            I VG
Sbjct: 745 EIMVG 749


>gi|294673871|ref|YP_003574487.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
 gi|294474367|gb|ADE83756.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
          Length = 782

 Score =  273 bits (697), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 220/728 (30%), Positives = 334/728 (45%), Gaps = 129/728 (17%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    EA HG   +G            T FPT     A++N +L +K G+ +  
Sbjct: 130 RLGIPLF-LAEEAPHGHMAIG-----------TTVFPTGFGMAATWNPALIEKTGEVIGQ 177

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R      + G   + P +++AR+PRW R+ ET GEDP + G      V+GL       
Sbjct: 178 EIRL-----QGGHISYGPVLDLAREPRWSRVEETMGEDPVLAGELGAAMVKGL------- 225

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVT---EQDMEETFLRPFEMCVK 266
               + S+P    +  KH+  Y      G      +  +T    ++++E+FL PF+  + 
Sbjct: 226 -GGGILSKPYSTIATLKHFIGY------GTTEAGQNGGITIAGARELQESFLPPFKKAIN 278

Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
            G A SVM SYN ++GIPS     LL   +R +W  +G++V+D  SI  +   H+ +A++
Sbjct: 279 AG-ALSVMTSYNSLDGIPSTCSKALLTDLLRTQWGFNGFVVSDLYSIDGIHGTHR-VAET 336

Query: 327 KEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
           K+ A    LKAG+D D G        +AVQ+G V E +ID ++K +  +   +G F+   
Sbjct: 337 KQQAGVMALKAGVDADLGALAFGRLEDAVQKGMVTEAEIDVAVKRILKMKFEMGLFEHPY 396

Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
              +  KQ + SD N  +A + ARE I LLKN  + LPL  +K + V V GP+A+    M
Sbjct: 397 VDAAQAKQLVRSDNNKAVALQVAREIITLLKNQNHVLPL--SKTQKVLVCGPNADNVYNM 454

Query: 447 IGNYA-----GIPCRYMSPIAGFSGYANVTYKTGC---DDVA------------------ 480
           +G+Y      G     ++ I      + VTY  GC   D  A                  
Sbjct: 455 LGDYTAPQEEGNVKTILAGIRSKLPASQVTYVKGCAVRDTTASNIAEAVAAAKQADVVVV 514

Query: 481 ------CKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVA 534
                  +   + +  + AA T   TI  + +D     E  DR  L   G+Q QL+  + 
Sbjct: 515 AVGGSSARDFKTSYKETGAAVTDSKTI--SDMDC---GEGFDRATLTPLGHQMQLLKALK 569

Query: 535 EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
            + K P+++V +    +D ++A  + +  A+L A YPG+EGG AIADV+FG +NP GRLP
Sbjct: 570 AIGK-PLVVVYIEGRPMDKSWAAQHAD--ALLTAYYPGQEGGTAIADVLFGDYNPAGRLP 626

Query: 595 ITWYNGDYVQMLPL--TSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS 652
           ++      V  +P+     P  P D +    R         LY FGYGLSYT FKY+ L 
Sbjct: 627 VSVPAN--VGQIPVYYNKKPPMPHDYVEMSAR--------PLYAFGYGLSYTTFKYDDL- 675

Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ--NVGSTDGS 710
                           N+  T D                    +FKV F   N G  DG 
Sbjct: 676 ----------------NIEETGDT-------------------QFKVTFNVTNTGDMDGD 700

Query: 711 DVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA 770
           +VV +Y        A  + Q+  F R+F+  G  K++ F   A + L IVD   N ++  
Sbjct: 701 EVVQLYLHDEFASTAQPMMQLKKFSRIFIPKGETKQVSFTLEA-EDLEIVDQEMNHVVET 759

Query: 771 GEHTIFVG 778
           G+ T+ +G
Sbjct: 760 GDFTVMIG 767


>gi|329923020|ref|ZP_08278536.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
 gi|328941793|gb|EGG38078.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
          Length = 763

 Score =  273 bits (697), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 222/735 (30%), Positives = 351/735 (47%), Gaps = 108/735 (14%)

Query: 76  EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
           E V  +  +A    RLG+P   +  E  HG   +G           AT FP  +   +++
Sbjct: 90  EAVNAIQRYAMEHSRLGIPIL-FGEECSHGHMAIG-----------ATVFPVPLTIGSTW 137

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
           N  L++ I +AV+ E RA     + G   +SP ++V RDPRWGR  ET GEDP +V  +A
Sbjct: 138 NTELFRSISRAVAAETRA-----QGGAATYSPVLDVVRDPRWGRTEETFGEDPHLVAEFA 192

Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGVDRYHFDARVTEQDME 254
           V  V+GLQ  E  ++ T L        +  KH+A Y   +  +     H   R    ++ 
Sbjct: 193 VAAVQGLQG-ERLDSHTSL-------LATLKHFAGYGASEGGRNGAPVHMGLR----ELH 240

Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
           E  L PF   V+ G A S+M +YN ++G+P  +   LL   +R  W   G+++ DC +I 
Sbjct: 241 EVDLLPFRKAVESG-ALSIMTAYNEIDGVPCTSSRYLLQNVLREAWGFDGFVITDCGAIH 299

Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
           ++   H   A S  +A  Q+LKAG+D++  G  +      A++QG + E D++++   + 
Sbjct: 300 MLACGHN-TAGSGVEAATQSLKAGVDMEMSGTMFRAHLQQALEQGLITEDDLNRAAGRVL 358

Query: 374 TVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
            +  RLG FD      +  +Q I   E+I LA +AA EGIVLLKN+ N LPL+S+   T+
Sbjct: 359 ELKFRLGLFDRPYVDPAWAEQVIGCKEHIALAYQAAAEGIVLLKNEGNLLPLDSSS-GTI 417

Query: 434 AVVGPHANATVAMIGNYAG--IPCRYMSPIAGFS---GYANVTYKTGCDDVACKSNNSIF 488
           AV+GP+A+     +G+Y     P + ++ + G     G + V Y  GC  +   S     
Sbjct: 418 AVIGPNAHTPYHQLGDYTSPQPPGQIVTVLDGIRRRLGDSRVLYAPGC-RIQGDSREGFP 476

Query: 489 AASEAAKTADATIILAG-----------LDLSVEA--------------ESLDREDLWLP 523
            A   A+ AD  +++ G           +DL   A              E +DR  L L 
Sbjct: 477 RALACAEQADVIVMVLGGSSARDFGEGTIDLRTGASVVTGDAKSDMECGEGIDRSTLTLM 536

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
           G Q +L+ ++ ++ K PVI+V ++  G  I     +  I AI+ A YPG+EGG AIAD++
Sbjct: 537 GVQLELLQELQKLGK-PVIVVYIN--GRPITEPWIDEFIPAIIEAWYPGQEGGGAIADML 593

Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
           FG  NP GRLP++      V  LP++    R        G+ Y   +    YPFG+GLSY
Sbjct: 594 FGDINPSGRLPLSIPK--EVGQLPISYNARR------TRGKRYLETDLAPRYPFGFGLSY 645

Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
           T+F+Y  L+    +            +    +A+                    ++D  N
Sbjct: 646 TEFRYGRLTVEPAV------------VPIGGEAT-------------------VRIDVTN 674

Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
            G+ DG++VV +Y    A       K + GF++VF++AG  + + F   + + L ++   
Sbjct: 675 AGARDGAEVVQLYVSDLAASVTRPEKALKGFRKVFLKAGETQEVTFTIGS-EQLELIGLD 733

Query: 764 ANTLLPAGEHTIFVG 778
              ++  GE  I VG
Sbjct: 734 LKPVVEPGEFRIQVG 748


>gi|380692997|ref|ZP_09857856.1| beta-glucosidase [Bacteroides faecis MAJ27]
          Length = 837

 Score =  272 bits (696), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 160/430 (37%), Positives = 246/430 (57%), Gaps = 41/430 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ + + P   RV+DL+S++T++EKV  L   + G+ R+G+ +Y   +EALHG+  + PG
Sbjct: 13  LYKNMNAPIHERVQDLLSKLTIEEKVSLLRATSPGIERMGIDKYYMGNEALHGI--IRPG 70

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
                     T FP  I   + +N  L   I   +S EARA +N    G          L
Sbjct: 71  KF--------TVFPQAIGLASMWNPELHHIIAGVISDEARARWNELERGKKQKDQFSDLL 122

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+WSP +N+ARDPRWGR  ET GEDP++ G     +V+GLQ           + R LK  
Sbjct: 123 TFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGTAFVKGLQGD---------HPRYLKAV 173

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           +  KH+AA + ++    +R++ DA +TE D+ E +   FE C++EG A S+M +YN +NG
Sbjct: 174 ATPKHFAANNEEH----NRFYCDAAITETDLREYYFPAFEKCIREGKAESIMTAYNAING 229

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P  A+  LLN+ ++ +W  +GYIV+DC +  +++ +H+++  + E A    +KAGLD++
Sbjct: 230 VPCTANNWLLNKVLKQDWGFNGYIVSDCGAPGLLMTDHRYVK-TPEAAAMIAIKAGLDVE 288

Query: 343 CGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
           CG Y + N   NA +Q  V   +ID +   +    MRLG FD   +  Y  L  + +   
Sbjct: 289 CGDYVFANPLLNAYKQYMVSAAEIDSAAYRVLRARMRLGMFDDPEKNPYNHLSPEIVGCK 348

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           ++ +LA EAAR+ IVLLKN QNTLPLN+ K+K++AVVG   NA     G+Y+G P    +
Sbjct: 349 KHHDLALEAARQSIVLLKNQQNTLPLNAQKIKSIAVVG--INAANCEFGDYSGTPVN--A 404

Query: 460 PIAGFSGYAN 469
           P++   G  N
Sbjct: 405 PVSVLDGIRN 414



 Score =  126 bits (316), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 92/289 (31%), Positives = 135/289 (46%), Gaps = 46/289 (15%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           AS+  + +D  I + G++ S+E E  DR  + LP  Q   I +  +    P  +V++ AG
Sbjct: 581 ASKIIRESDVVIAVMGINQSIEREGQDRNSIELPKDQQIFIREAYKA--NPNTIVVLVAG 638

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
              +A    + +I AI+ A YPGE+GG AIA+V+FG +NP GRLP+T+YN   ++ LP  
Sbjct: 639 S-SMAIGWMDQHIPAIIDAWYPGEQGGTAIAEVLFGDYNPAGRLPLTFYNS--IEDLPAF 695

Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
                  D      RTY ++ G  LY FGYGLSYT+F Y                   RN
Sbjct: 696 D------DYNVKNNRTYMYFEGKPLYAFGYGLSYTKFDY-------------------RN 730

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
           LN   D        V +N              +N G  +G +V  VY K P +   T +K
Sbjct: 731 LNIKQDTQ-----NVTLN-----------FSIKNSGKYNGDEVAQVYVKFPDQGIKTPLK 774

Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           Q+ GF+RV ++ G  ++I       +     D       P+G +   VG
Sbjct: 775 QLKGFKRVHIKKGATEQISIEIPKEELRLWDDQKKQFYTPSGTYHFMVG 823


>gi|146301622|ref|YP_001196213.1| glycoside hydrolase family 3 protein [Flavobacterium johnsoniae
           UW101]
 gi|146156040|gb|ABQ06894.1| Candidate beta-xylosidase; Glycoside hydrolase family 3
           [Flavobacterium johnsoniae UW101]
          Length = 875

 Score =  272 bits (696), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 158/423 (37%), Positives = 230/423 (54%), Gaps = 34/423 (8%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           F F + SL +  RV DLVSR+TL+EKV Q+ + +  + RLG+P Y+WW+E LHGV+    
Sbjct: 27  FQFQNPSLSFEQRVDDLVSRLTLEEKVSQMLNSSPEIARLGIPAYDWWNETLHGVARTPF 86

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN----LGRA-----GL 162
            T         T +P  I   A+F+++    +    + E RA+YN    L R      GL
Sbjct: 87  KT---------TVYPQAIGMAATFDKNSLFTMADYSALEGRAIYNKAVELKRTNERYLGL 137

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           TYW+PNIN+ RDPRWGR  ET GEDP++       +V+GLQ  +          + LK +
Sbjct: 138 TYWTPNINIFRDPRWGRGQETYGEDPYLTAVLGDAFVKGLQGDD---------PKYLKAA 188

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           +C KHYA   V +     R+ FD  VT  ++ +T+L  F   + E + + VMC+YN    
Sbjct: 189 ACAKHYA---VHSGPESLRHTFDVDVTPYELWDTYLPAFRKLITESNVAGVMCAYNAFRT 245

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
            P CA   L+N  +R EW   GY+ +DC +I     NHK   D+ E A A  +  G D+D
Sbjct: 246 QPCCASDILMNDILRKEWKFDGYVTSDCWAIDDFFKNHKTHPDA-ESAAADAVFHGTDID 304

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDE 400
           CG         AV+ GK+ E  ID S+K L+ +  RLG FD     +Y       + S E
Sbjct: 305 CGTDAYKALVQAVKNGKISEKQIDISVKRLFMIRFRLGMFDPVSMVKYAQTPSSVLESKE 364

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
           +   A + AR+ IVLLKN++N LPLN   +K + V+GP+A+  ++++GNY G P +  + 
Sbjct: 365 HQLHALKMARQSIVLLKNEKNILPLNK-NLKKIVVLGPNADNAISILGNYNGTPSKLTTV 423

Query: 461 IAG 463
           + G
Sbjct: 424 LQG 426



 Score =  109 bits (272), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 86/268 (32%), Positives = 117/268 (43%), Gaps = 54/268 (20%)

Query: 492 EAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPV 541
           E  K ADA I   G+   +E E +          DR  +  P  QT+L+  +    K PV
Sbjct: 601 EHHKNADAFIFAGGISPQLEGEEMPVDFPGFKGGDRTSILFPEVQTKLLKALQSSGK-PV 659

Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
           +  +M+  G  IA      NI AIL   Y G+  G A ADV+FG +NP GRLP+T+Y  D
Sbjct: 660 VFAMMT--GSAIAIPWEAENIPAILNIWYGGQSAGTAAADVIFGDYNPAGRLPVTFYKND 717

Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
                 L S     +D+     +TY+++ G  LY FGYGLSYT FKY+ L       V +
Sbjct: 718 S----DLPSFVDYKMDN-----KTYRYFKGTPLYGFGYGLSYTSFKYSDLK----TPVKI 764

Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
            K Q    L                            V   N G T+G +V  +Y     
Sbjct: 765 KKGQSVSIL----------------------------VKVANTGKTEGEEVAQLYLINQD 796

Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKF 749
               T +K + GF+R  ++ G NK I F
Sbjct: 797 TAIKTPLKSLKGFERFNLKPGENKTITF 824


>gi|218132023|ref|ZP_03460827.1| hypothetical protein BACEGG_03648 [Bacteroides eggerthii DSM 20697]
 gi|217985783|gb|EEC52123.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 762

 Score =  272 bits (696), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 219/750 (29%), Positives = 346/750 (46%), Gaps = 124/750 (16%)

Query: 60  PYSIRVKDLVSRMTLDEKVQQLGDFA-------------------HGVP-------RLGL 93
           P  +RV DL+ RMTL+EK+ Q+ D                      G+        RL +
Sbjct: 34  PVEVRVADLLKRMTLEEKIAQMQDLKFKDFSVDGKVDTVKMDSVLKGMSYASVFGSRLSV 93

Query: 94  PQYEWWSEALH---------GVSNVGPGTHFDDVI-PGATSFPTVILTTASFNESLWKKI 143
            Q +    A++         G+  +G       +I  GAT FP  I  +++FN  +  ++
Sbjct: 94  EQMQESMFAINKYMAEHNRLGIPVLGEAESLHGLIHDGATIFPQSIALSSTFNPDITHRV 153

Query: 144 GQAVSTEARAMYNLGRAGL-TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGL 202
              ++ EA+A       G+    SP +++AR+ RWGR+ ET GEDP++VGR  V YV   
Sbjct: 154 ATVIAQEAKA------TGVDQVLSPVLDLARELRWGRVEETYGEDPYLVGRMGVAYVSAF 207

Query: 203 QDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVT--EQDMEETFLRP 260
              EG             V +  KH+ A+      G++     A VT  E+D+   +L+P
Sbjct: 208 NK-EG-------------VMTTLKHFLAHGSPT-GGLNL----ASVTGCERDLRSLYLKP 248

Query: 261 FEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNH 320
           F+  ++E    SVM SYN    +P  A   +L+  +RGE    GYI +D  S++++   H
Sbjct: 249 FQDVMREAMPYSVMNSYNSYESVPVAASHWILDDILRGEMGFKGYISSDWGSVEMLRSLH 308

Query: 321 KFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRL 379
               D K DA  Q + AG+D++  G  Y     + V+ G + E +IDK +  + T    +
Sbjct: 309 HTAKD-KADAACQAVIAGVDVEVDGDCYETLD-SLVRSGVLPEKEIDKCVSRVLTAKFAM 366

Query: 380 GFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPH 439
           G FD      +   Q + + E +ELA  AARE  +L+KN+ + LPL++ K+++VAV+GP 
Sbjct: 367 GLFDKDYTKRANLSQTVHTPEAVELALVAARESAILVKNENSLLPLDANKLRSVAVIGP- 425

Query: 440 ANATVAMIGNYAGIPCRY--MSPIAGFS----GYANVTYKTGCDDVACKSNNSIFAASEA 493
            NA     G+Y         ++P+ G      G   + Y  GC ++  +  +    A  A
Sbjct: 426 -NAAQVQFGDYMWTNSNEYGITPLQGIEAVTQGKVKINYAKGC-EIHTQDRSGFSQAVTA 483

Query: 494 AKTADATIILAGL---------DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
           A+ +D  ++  G            SV  ES D  D+ LPG Q  LI  V    K P I+V
Sbjct: 484 ARNSDVALLFVGAMSGSPGRPWPNSVSGESFDLSDISLPGCQEALIRAVKATGK-PTIVV 542

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD--- 601
           +++     I + + N     + W  Y GE+ GRAIA+++FG+ NP GRL +++       
Sbjct: 543 LVAGKPFAIPWVKDNCEAVIVQW--YGGEQEGRAIAEILFGEVNPSGRLNVSFPQSTGHL 600

Query: 602 --YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
             +    P          +L  PGR Y F +   ++ FG+GLSYT FKY      K++Q+
Sbjct: 601 PVFYNYYPSDKGFYHDHGTLEKPGRDYVFSSPDPVWAFGHGLSYTTFKY------KSMQI 654

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
           +        N  +T                  DD  E  V+  N G  DG +VV +Y   
Sbjct: 655 S--------NKEFTD-----------------DDTCEITVEVANTGKRDGKEVVQLYVND 689

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
                 T +K++  F++VF+ AG  + +KF
Sbjct: 690 IVSSVVTPVKELRRFEKVFIPAGETRTVKF 719


>gi|325103214|ref|YP_004272868.1| glycoside hydrolase family protein [Pedobacter saltans DSM 12145]
 gi|324972062|gb|ADY51046.1| glycoside hydrolase family 3 domain protein [Pedobacter saltans DSM
           12145]
          Length = 866

 Score =  272 bits (696), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 165/434 (38%), Positives = 232/434 (53%), Gaps = 37/434 (8%)

Query: 43  SKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEA 102
           S++  Q   + F D+ LP+  RV DL+ R+T++EKV  + D +  + RLG+ QY WW+EA
Sbjct: 15  SQISAQNKLYPFQDNRLPFDKRVDDLLQRLTVEEKVLLMQDVSRPIERLGIKQYNWWNEA 74

Query: 103 LHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN------ 156
           LHGV+  G           AT FP  I   ASF+      +  AVS EARA +N      
Sbjct: 75  LHGVARAGL----------ATVFPQPIGMAASFDRDALFNVFNAVSDEARAKHNYHLSQG 124

Query: 157 -LGR-AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
             GR  GLT W+P IN+ RDPRWGR  ET GEDP++     V  V+GLQ           
Sbjct: 125 SYGRYEGLTMWTPTINIFRDPRWGRGIETYGEDPYLTAVMGVQAVKGLQGPS-------- 176

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFD-ARVTEQDMEETFLRPFEMCVKEGDASSV 273
           N +  K+ +C KH+A +    W   +R+ FD A + ++D+ ET+L  FE  VKE     V
Sbjct: 177 NGKYDKLHACAKHFAVHSGPEW---NRHSFDAANIKQRDLYETYLPAFEALVKEAKVQEV 233

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN--HKFLADSKEDAV 331
           MC+YNR  G P C   +LL Q +R +W   G +VADC +I        HK   D+   A 
Sbjct: 234 MCAYNRFEGDPCCGSDRLLQQILRKKWGFEGIVVADCGAIADFFKENAHKTHPDAAS-AS 292

Query: 332 AQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYV 389
           A  + +G DLDCG  Y   T  AV++G ++E DID S++ L     RLG  D      + 
Sbjct: 293 AAAVYSGTDLDCGSSYKALT-EAVKKGLIEEKDIDVSVRRLLMARFRLGEMDDQSLVPWS 351

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
            +    + S  + ++A + AR+ I LL+N  N LPL S  +K +AV+GP+A  +V   GN
Sbjct: 352 KISYNVVASKAHNQIALDMARKSITLLQNKNNILPLKSGGLK-IAVMGPNAQDSVMQWGN 410

Query: 450 YAGIPCRYMSPIAG 463
           Y G P   ++ + G
Sbjct: 411 YNGTPANTITILEG 424



 Score =  119 bits (298), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 85/311 (27%), Positives = 139/311 (44%), Gaps = 54/311 (17%)

Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
           D+  K   +I  + +    AD  + + G+  S+E E +          DR D+ LP  Q 
Sbjct: 584 DIGYKEEANINKSIKNIAGADLVVFVGGISPSLEGEEMGVKLPGFRGGDRTDIQLPTIQR 643

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
           Q +  + E  K    ++ ++  G  I  A+   N +AI+ A YPG+ GG+A+ADV+FGK+
Sbjct: 644 QFVKALKEAGKR---VIFINCSGSPIGLADEMANSEAIVQAWYPGQAGGQAVADVLFGKY 700

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
           NP GRLPIT+Y          T +P    ++    GRTY++     L+PFGYGLSYTQF+
Sbjct: 701 NPSGRLPITFYR-------DTTQLP--DFENYDMAGRTYRYMQDKPLFPFGYGLSYTQFQ 751

Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
           Y      + +  N   +Q                                 V   N G  
Sbjct: 752 YGNPILNQQVITNGQTIQ-------------------------------LTVPVTNTGKR 780

Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
            G +VV VY +   + A   +K +  F+R+   AG+ +++ F     +     + +    
Sbjct: 781 SGDEVVQVYLRKKGD-ATGPVKTLRDFRRLSFNAGQTQQVVFKITPKQLEWWNEQSKAMQ 839

Query: 768 LPAGEHTIFVG 778
           + +G++ + VG
Sbjct: 840 VQSGDYELLVG 850


>gi|393781363|ref|ZP_10369562.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676856|gb|EIY70278.1| hypothetical protein HMPREF1071_00430 [Bacteroides salyersiae
           CL02T12C01]
          Length = 863

 Score =  272 bits (696), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 167/450 (37%), Positives = 239/450 (53%), Gaps = 38/450 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           F +S LP   R +DL+ R+TL EKV  + D++  +PRLG+ +Y WW+EALHGV   G   
Sbjct: 24  FNNSDLPVEERAQDLLQRLTLQEKVLLMCDYSSPIPRLGIKRYNWWNEALHGVGRAGL-- 81

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR--------AGLTYW 165
                   AT FP  I   A+F++   ++  + VS EARA Y+            GLT+W
Sbjct: 82  --------ATVFPQAIGMAATFDDCAVRQAFECVSDEARAKYHHSENKEGSERYQGLTFW 133

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PN+N+ RDPRWGR  ET GEDP++  +  +  VRGLQ            S+  K+ +C 
Sbjct: 134 TPNVNIFRDPRWGRGQETYGEDPYLTSQMGLAVVRGLQGPS--------ESKYDKLHACA 185

Query: 226 KHYAAYDVDNWKGVDRYHFDA-RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KHYA +    W   +R+ FD   ++ +D+ ET+L  F+  V++G    VMC+YNR  G P
Sbjct: 186 KHYALHSGPEW---NRHSFDVDSISPRDLWETYLPAFKALVQQGGVKEVMCAYNRFEGEP 242

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ-VMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
            C   +LL   +R EW   G +V+DC +I    +  H     +KE AVA  +KAG DLDC
Sbjct: 243 CCGSNRLLYNILREEWGFDGLVVSDCGAISDFYLKGHHETHPTKEAAVAAAVKAGTDLDC 302

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDEN 401
           G  Y      AV++G + E  ID SL  L      LG  D      +  +    + S+++
Sbjct: 303 GVDYYALQ-KAVEEGIITEKQIDVSLFRLLKARFELGLMDEEHLVSWSDIPYTVVDSEKH 361

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
            E A E AR+ + LLKND  TLPL S     +AV+GP+AN +V M GNY G P   ++ +
Sbjct: 362 REKALEMARKSMTLLKNDHGTLPL-SKHCGKIAVIGPNANDSVMMWGNYNGFPSHTVTIL 420

Query: 462 AGFS---GYANVTYKTGCDDVACKSNNSIF 488
            G +   G   + Y  GC+     +  S+F
Sbjct: 421 EGITHKLGAEQIIYDKGCELTTGDTFVSLF 450



 Score =  119 bits (297), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 90/299 (30%), Positives = 139/299 (46%), Gaps = 58/299 (19%)

Query: 493 AAKTADATIIL--AGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGP 540
           AA+  DA +I+   G+   VE E L          DR  + LP  Q  L+ ++ +  K P
Sbjct: 594 AARVGDAEVIVFVGGISPKVEGEELPVSFPGFKGGDRTVIELPQVQRDLLQELHKTGK-P 652

Query: 541 VILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNG 600
           VIL++ S   + ++ AE +    AI+ A Y G+ GG A+ADV+FG +NP GRLP+T+Y  
Sbjct: 653 VILILCSGSAIGLS-AEVDL-ADAIIQAWYLGQAGGTAVADVLFGDYNPAGRLPVTFYKA 710

Query: 601 DYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVN 660
              + LP         +     GRTY+++ G  L+PFGYGLSYT F+             
Sbjct: 711 --TEQLP-------DFEDYSMQGRTYRYFEGEALFPFGYGLSYTSFEIG----------- 750

Query: 661 LNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP 720
                        +  SK R        +R ++    K+  +N G  DG +V+ +Y +  
Sbjct: 751 ------------KARLSKKR--------IRENESVSLKLTVENTGKLDGDEVIQIYIRKL 790

Query: 721 AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL-LPAGEHTIFVG 778
            +     +K +  F+R  +RAG  K + F        N  D  +NT+ +  GE+ I  G
Sbjct: 791 QDKEGP-LKTLRAFKRFHLRAGEKKDVTFHLQN-DHFNFFDTESNTMRVMPGEYEILYG 847


>gi|329956938|ref|ZP_08297506.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
 gi|328523695|gb|EGF50787.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
          Length = 944

 Score =  272 bits (695), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 210/719 (29%), Positives = 337/719 (46%), Gaps = 102/719 (14%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P  ++ +E + GV +             AT+FPT +    ++N  L +++G     
Sbjct: 153 RLGIP-VDFTNEGIRGVESYK-----------ATNFPTQLGLGHTWNRELIRQVGLITGR 200

Query: 150 EARAMYNLGRAGLT-YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
           EAR +      G T  ++P ++V RD RWGR  E  GE P++V    +  VRGLQ    H
Sbjct: 201 EARML------GYTNVYAPILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGLQ----H 250

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
            +         +V++  KH+AAY  +          D +++ +++E   + PF+  ++E 
Sbjct: 251 NH---------QVAATAKHFAAYSNNKGAREGMSRVDPQMSPREVENIHIYPFKRVIRET 301

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
               +M SYN  +GIP       L   +R E    GY+V+D D+++ +   H    D KE
Sbjct: 302 GLLGIMSSYNDYDGIPVQGSYYWLTTRLRQEMGFRGYVVSDSDAVEYLYTKHNTAKDMKE 361

Query: 329 DAVAQTLKAGLDLDCG----QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
            AV Q+++AGL++ C       +       V++G + E  I+  ++ +  V   +G FD 
Sbjct: 362 -AVRQSVEAGLNVRCTFRSPDSFVLPLRELVKEGGLSEEVINDRVRDILRVKFLIGLFD- 419

Query: 385 SPQYVSLGKQD--ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
           SP    L   D  +    N  +A +A+RE +VLLKN  NTLPLN  K+K +AV GP+A+ 
Sbjct: 420 SPYQTDLAGADNEVEKAANEAVALQASRESVVLLKNADNTLPLNIDKIKKIAVCGPNADE 479

Query: 443 TVAMIGNYAGIPCRYMSPIAGF----SGYANVTYKTGCDDVACKSNNS------------ 486
               + +Y  +     + + G      G A V Y  GCD V      S            
Sbjct: 480 EGYALTHYGPLAVEVTTVLEGIREKAQGKAEVLYTKGCDLVDAHWPESEIIEYPLTPDEQ 539

Query: 487 --IFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
             I  A+  A+ AD  +++ G       E+  R  L LPG+Q +L+  V    K PV+LV
Sbjct: 540 AEIDRAAANARQADVAVVVLGGGQRTCGENKSRTSLDLPGHQLKLLQAVQATGK-PVVLV 598

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
           +++   + + +A  +  + AIL A YPG +GG A+AD++FG +NPGG+L +T+     V 
Sbjct: 599 LINGRPLSVNWA--DKFVPAILEAWYPGSKGGTAVADILFGDYNPGGKLTVTFPK--TVG 654

Query: 605 MLPLTSMPLRP---VDSLGYPGR--TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
            +P  + P +P   +D    PG        NG  LYPFGYGLSYT F+Y+ L  +     
Sbjct: 655 QIPF-NFPCKPASQIDGGKNPGADGNMSRING-ALYPFGYGLSYTTFEYSDLEIS----- 707

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                 P V+  D +       ++   N G   G +VV +Y++ 
Sbjct: 708 ----------------------PKVITPDQKAT----VRLKVTNTGKRAGDEVVQLYTRD 741

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                 TY K + GF+R+ ++ G  K + F  +  K L +++     ++  GE  I  G
Sbjct: 742 ILSSITTYEKNLAGFERIRLKPGETKEVTFTLDR-KHLELLNADMKWIVEPGEFAIMAG 799


>gi|224538725|ref|ZP_03679264.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224519667|gb|EEF88772.1| hypothetical protein BACCELL_03619 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 942

 Score =  272 bits (695), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 231/810 (28%), Positives = 360/810 (44%), Gaps = 146/810 (18%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
           ++ D +     R++DL+S+MTL+EK  Q+    +G  R+    LP  EW    W      
Sbjct: 52  VYEDPNASLDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGA 110

Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
             E L+G    G                                  P    ++ I G   
Sbjct: 111 IDEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGVES 170

Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
             AT+FPT +    ++N  L ++IG     EAR +      G T  ++P ++V RD RWG
Sbjct: 171 YRATNFPTQLGLGHTWNRELIRQIGLITGREARML------GYTNVYAPILDVGRDQRWG 224

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  E  GE P++V    +  VRG+Q    H +         +V++  KH+ AY  +    
Sbjct: 225 RYEEVYGESPYLVAELGIEMVRGMQ----HNH---------QVAATGKHFVAYSNNKGAR 271

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
                 D +++ +++E   + PF+  +KE     VM SYN  +G+P       L   +RG
Sbjct: 272 EGMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRG 331

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
           E    GY+V+D D+++ +   H    D KE AV Q+++AGL++ C       Y       
Sbjct: 332 EMGFRGYVVSDSDAVEYLYTKHSTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLREL 390

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREG 412
           V++G + E  I+  ++ +  V   +G FD +P    L   D  +   EN  LA +A+RE 
Sbjct: 391 VKEGGLSEEVINDRVRDILRVKFLVGLFD-TPYQTDLAGADKEVEKAENESLALQASRES 449

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGYA 468
           +VLLKN+ N LPL+   VK +AV GP+A+     + +Y  +     + + G      G A
Sbjct: 450 LVLLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKAEGKA 509

Query: 469 NVTYKTGCDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEAES 514
            V Y  GCD V      S              I  A E A+ AD  +++ G       E+
Sbjct: 510 EVLYTKGCDLVDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVVLGGGQRTCGEN 569

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
             R  L LPG Q +L+  V    K PV+LV+++   + I +A  +  + AIL A YPG +
Sbjct: 570 KSRSSLELPGRQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSK 626

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGR--TYKFY 629
           GG A+ADV+FG +NPGG+L +T+     V  +P  + P +P   +D    PG        
Sbjct: 627 GGTAVADVLFGDYNPGGKLTVTFPKS--VGQIPF-NFPCKPSSQIDGGKNPGLDGNMSRV 683

Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
           NG  LY FGYGLSYT F+Y+ +  + K I  N      C+                    
Sbjct: 684 NG-ALYSFGYGLSYTTFEYSDIEISPKVITPNQKATVRCK-------------------- 722

Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
                         N G   G +VV +Y +       TY K + GF+R+ ++ G  K + 
Sbjct: 723 ------------VTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVV 770

Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           F  +  K L ++D     ++  G+ +I +G
Sbjct: 771 FTLDR-KQLELLDKHMEWVVEPGDFSIMIG 799


>gi|29350122|ref|NP_813625.1| periplasmic beta-glucosidase , xylosidase/arabinosidase
           [Bacteroides thetaiotaomicron VPI-5482]
 gi|29342034|gb|AAO79819.1| periplasmic beta-glucosidase precursor, xylosidase/arabinosidase
           [Bacteroides thetaiotaomicron VPI-5482]
          Length = 769

 Score =  271 bits (694), Expect = 7e-70,   Method: Compositional matrix adjust.
 Identities = 218/719 (30%), Positives = 336/719 (46%), Gaps = 110/719 (15%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    EA HG   +G            T FPT I   A+++ +L +++G  ++ 
Sbjct: 114 RLGIPLF-LAEEAPHGHMAIG-----------TTVFPTGIGMAATWSPTLIEEVGNVIAK 161

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R+     +     + P ++++RDPRW R+ ET GEDP + GR     + GL       
Sbjct: 162 EIRS-----QGAHISYGPVLDLSRDPRWSRVEETFGEDPVLSGRLGAAMILGL------- 209

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
            + DL+     +++  KH+ AY V        Y   A V  +D+ E FL PF   +  G 
Sbjct: 210 GSGDLSCEYATIATL-KHFLAYAVPEGGQNGNY---ASVGTRDLHENFLPPFREAIDAG- 264

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           A SVM SYN ++G+P  A+  LL Q +R EW   G++V+D  SI+ + ++H F+A + E+
Sbjct: 265 ALSVMTSYNSIDGVPCTANHYLLTQLLRNEWRFRGFVVSDLYSIEGVHESH-FVAPTIEE 323

Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
           A  Q + AG D+D G   + N T +AVQ GK+ E  ID ++  +  +   +G F+     
Sbjct: 324 AAMQAVSAGADIDLGGDAFMNLT-HAVQFGKISEAVIDTAVCRVLRMKFEIGLFEHPYVN 382

Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
                + + S ++I+LA + A+  IVLLKN+ + LPLN  K+K VAVVGP+A+    M+G
Sbjct: 383 PKTATKIVRSKDHIKLARKVAQSSIVLLKNENSILPLNK-KIKKVAVVGPNADNRYNMLG 441

Query: 449 NYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATII 502
           +Y        I       I+  S  + V Y  GC  +   + N I  A EAA  ++  I 
Sbjct: 442 DYTAPQEDENIKTVLDGVISKLSP-SKVEYVRGCA-IRDTTVNEIAEAVEAASRSEVIIA 499

Query: 503 LAGLDLSVE-----------------------AESLDREDLWLPGYQTQLINQVAEVAKG 539
           + G   + +                        E  DR  L L G Q  L+  +    K 
Sbjct: 500 VVGGSSARDFKTSYQETGAAIADEKSISDMECGEGFDRATLTLLGKQQDLLIALKATGK- 558

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           P+I+V +    +D  +A    +  A+L A YPG+EGG AIADV+FG +NP GRLP++   
Sbjct: 559 PLIVVYIEGRPLDKVWASEYAD--ALLTASYPGQEGGYAIADVLFGDYNPAGRLPVSIPR 616

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
              V  +P+      P +        Y       LY FGYGLSYT F+Y+ L   +    
Sbjct: 617 S--VGQIPVYYNKKAPRN------HDYVEQAASPLYTFGYGLSYTTFEYSDLQVIR---- 664

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                             K+ C            +FE     +N GS DG +V  +Y + 
Sbjct: 665 ------------------KSPC------------HFEVSFKVKNTGSYDGEEVAQLYLRD 694

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                   ++Q+  F+R F++ G  K I F     K L+I+D     ++  G+  I +G
Sbjct: 695 EYASVVQPLRQLKCFERFFLKRGEEKEIFFTLTE-KDLSIIDRNMKRVVETGDFRIMIG 752


>gi|197106390|ref|YP_002131767.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
 gi|196479810|gb|ACG79338.1| glucan 1,4-beta-glucosidase [Phenylobacterium zucineum HLK1]
          Length = 888

 Score =  271 bits (694), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 175/507 (34%), Positives = 257/507 (50%), Gaps = 68/507 (13%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+ LP   R  DLV+RMTL+EK +Q+G  A  +PRLG+P Y WW+E LHGV+  G   
Sbjct: 38  YRDTRLPAERRAADLVARMTLEEKSRQIGHTAPAIPRLGVPAYNWWNEGLHGVARAGI-- 95

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG---------RAGLTY 164
                   AT FP  I   A+++    +     + TE RA Y              GLT 
Sbjct: 96  --------ATVFPQAIGMAATWDVDRMRGTADVIGTEFRAKYAERVHPDGSTDWYRGLTV 147

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           WSPNIN+ RDPRWGR  ET GEDP++ GR  V ++RGLQ         D N    K  + 
Sbjct: 148 WSPNINIFRDPRWGRGQETYGEDPYLTGRMGVAFIRGLQ-------GQDPNF--FKTIAT 198

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
            KHYA   V +    +R+  D   +  D+E+T+L  F   V EG   +VMC+YN V+G+P
Sbjct: 199 AKHYA---VHSGPESNRHREDVHPSAYDLEDTYLPAFRAAVTEGKVQAVMCAYNAVDGVP 255

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCD-SIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           +CA   L++Q +R +W   G++V+DC  +  +  ++      + E+ + + L AG+DL C
Sbjct: 256 ACASEDLMDQRLRRDWGFSGHVVSDCGAAANIYREDSLAYVKTPEEGITRALNAGMDLVC 315

Query: 344 GQYYTNF------TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC 397
           G Y  ++      T +AV++G + ET +D +L  L+   +RLG FD  P  V   K    
Sbjct: 316 GDYRADWNTEAEATVSAVRKGMLDETVLDGALVRLFADRIRLGLFD-PPAEVPFSKITAA 374

Query: 398 SDENIE---LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
            ++  E   ++ E A+  + LLKND   LPL   + + +AVVGP+A++  A+IGNY G P
Sbjct: 375 QNDTPEHRAMSLEMAKASMTLLKND-GVLPLK-GEPRRIAVVGPNADSVDALIGNYYGTP 432

Query: 455 CRYMSPIAGFSGY---ANVTYKTG----------------CDDVACKS---NNSIF--AA 490
              ++ +AG       A V Y  G                C D AC++      +F   A
Sbjct: 433 SNPVTVLAGIRARFPKAEVVYAEGTGLVGPASLPVPDAVLCADAACRTKGLKQEVFEGVA 492

Query: 491 SEAAKTADATIILAGLDLSVEAESLDR 517
            E A     T+  A  D + + +S  R
Sbjct: 493 LEGAPVETRTVANATFDWTGDRQSSAR 519



 Score =  130 bits (326), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 86/293 (29%), Positives = 131/293 (44%), Gaps = 55/293 (18%)

Query: 498 DATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMS 547
           D  + + GL   VE E +          DR  L LP  Q  L+ ++    K PV+LV+M+
Sbjct: 613 DLVVFVGGLTARVEGEEMKLQVPGFAGGDRTSLDLPAPQQDLLRRLHATGK-PVVLVLMN 671

Query: 548 AGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP 607
              + + +A  + N+ AI+ A YPG EGG A+A ++ G ++P GRLP+T+Y         
Sbjct: 672 GSALSVNWA--DANLPAIVEAWYPGGEGGHAVAQLLAGDYSPAGRLPVTFYR-------- 721

Query: 608 LTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC 667
            ++  L P       GRTY+++ G  LYPFGYGLSYT+F Y                   
Sbjct: 722 -SAGDLPPFADYAMKGRTYRYFGGEVLYPFGYGLSYTRFSYG------------------ 762

Query: 668 RNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY 727
                         P +    +  D          N G  DG +VV +Y   P     T 
Sbjct: 763 -------------APQLSARSVSADGEITVTTQVTNTGGMDGEEVVQLYVSHPGR-DGTP 808

Query: 728 IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           I+ + GFQR+ ++ G  + + F     + L++VD   N  +  G   ++VG G
Sbjct: 809 IRALQGFQRIGLKRGETRPVSFTLKD-RQLSVVDAEGNRRVEPGRVEVWVGGG 860


>gi|427411073|ref|ZP_18901275.1| hypothetical protein HMPREF9718_03749 [Sphingobium yanoikuyae ATCC
           51230]
 gi|425710258|gb|EKU73280.1| hypothetical protein HMPREF9718_03749 [Sphingobium yanoikuyae ATCC
           51230]
          Length = 791

 Score =  271 bits (694), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 231/782 (29%), Positives = 355/782 (45%), Gaps = 110/782 (14%)

Query: 29  GSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGV 88
           G  SP F    G F++      SF       P  +  +D    + L   V  L  +A   
Sbjct: 86  GKLSPTFPSGIGHFTRPSDGRGSFS------PRVVPGRDPRRTVAL---VNGLQKWAMTQ 136

Query: 89  PRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVS 148
            RLG+P   +  E LHG + VG           ATSFP  I   +S++ ++ +++ Q ++
Sbjct: 137 TRLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDPAMLRQVNQVIA 184

Query: 149 TEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
            E RA     R      SP +++ARDPRWGRI ET GEDP++VG   V  V GLQ V   
Sbjct: 185 REIRA-----RGVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVAAVEGLQGV--- 236

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
                   +P  V +  KH   +      G +     A V+E+++ E F  PFE  VK  
Sbjct: 237 --GRSRTLQPNHVFATLKHLTGHGQPE-SGTN--IGPAPVSERELRENFFPPFEQVVKRT 291

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
              +VM SYN ++G+PS A+  LL+  +R EW   G +V+D  ++  ++  H  +A + E
Sbjct: 292 GIEAVMASYNEIDGVPSHANRWLLDNVLRQEWGFRGAVVSDYSAVDQLMSIH-HIAANLE 350

Query: 329 DAVAQTLKAGLDLDCGQYYTNFT-GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ 387
           +A  + L AG+D D  +  +  T G  V++GKV E  +D +++ +  +  R G F+ +P 
Sbjct: 351 EAAMRALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRRMLELKFRAGLFE-NPY 409

Query: 388 YVSLGKQDICSDENIE-LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
             +     I ++E+   LA  AA+  I LLKND   LPL      T+AV+GP  +A VA 
Sbjct: 410 ADANAAAAITNNEDARALARTAAQRSITLLKND-GMLPLKPE--GTIAVIGP--SAAVAR 464

Query: 447 IGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCD---------DVACKSNNS-----IF 488
           +G Y G P   +S + G        AN+ +  G           D   KS+ +     I 
Sbjct: 465 LGGYYGQPPHSVSILEGIKARVGTKANIVFAQGVKITEDDDWWADSVTKSDPAENRKLIA 524

Query: 489 AASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQTQLINQVAEVAKGPVI 542
            A EAA+  D  I+  G       E        DR  L L G Q +L + +  + K P+ 
Sbjct: 525 QAVEAARNVDRIILTLGDTEQSSREGWADNHLGDRPSLDLVGEQQELFDALKALGK-PIT 583

Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
           +V+++  G   +  + +    AIL   Y GE+GG A+AD++FG  NPGG+LP+T      
Sbjct: 584 VVLIN--GRPASTVKVSEQANAILEGWYLGEQGGNAVADILFGDVNPGGKLPVTVPRS-- 639

Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
              LPL    ++P        R Y F     LYPFG+GLSYT F  +             
Sbjct: 640 AGQLPLF-YNMKPSAR-----RGYLFDTTDPLYPFGFGLSYTSFSLS------------- 680

Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
                              P +    +         VD +N G+ +G +VV +Y +    
Sbjct: 681 ------------------APRLSATRIGTGGKTSVSVDVRNTGAREGDEVVQLYIRDKVS 722

Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGV 782
                +K++ GFQRV ++ G ++ I F     ++L + +     ++  G+  I  GN  V
Sbjct: 723 SVTRPVKELKGFQRVTLKPGESRTITFTV-GPEALQMWNDQMRRVVEPGDFEIMTGNSSV 781

Query: 783 SF 784
           + 
Sbjct: 782 AL 783


>gi|404406439|ref|ZP_10998023.1| glycoside hydrolase 3 [Alistipes sp. JC136]
          Length = 925

 Score =  271 bits (694), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 194/686 (28%), Positives = 327/686 (47%), Gaps = 88/686 (12%)

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWGRI 180
           AT+FP+ +    ++N  L +K G+ V  EAR +      G T  ++P ++V RD RWGR 
Sbjct: 180 ATNFPSQLGMGHTWNRELLRKTGRIVGREARLL------GYTNIYAPVLDVGRDQRWGRY 233

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            E  GE P++V    V    G+Q        TD      +V+S  KH+AAY  +      
Sbjct: 234 EEVFGESPYLVAELGVAMASGMQ--------TDY-----QVASTAKHFAAYSNNKGAREG 280

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
               D ++  +++E   L PF   ++      VM SYN  +G+P       L + +RGE 
Sbjct: 281 MSRVDPQMPPREVENIHLMPFREVIRRAGILGVMSSYNDYDGVPIQGSRYWLTERLRGEM 340

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAVQ 356
              GY+V+D  S++ + + H   A ++ DAV Q+++AGL++ C     + Y       ++
Sbjct: 341 GFRGYVVSDSGSVEYLHNKH-HTAVNQLDAVRQSIEAGLNVRCNFWHPETYVMPLRQLLR 399

Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY-VSLGKQDICSDENIELAAEAAREGIVL 415
           +G + E  +D  ++ +  V   +G FD   Q  ++   +++   E+ E+A +A+RE IVL
Sbjct: 400 EGLITEELLDSRVRDVLRVKFLVGLFDRPYQTDLAAADREVDGPEHNEVALQASRESIVL 459

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS----GYANVT 471
           LKN+ +TLPL++ K++ +AV+GP+A+A    +G+Y  +     S + G          + 
Sbjct: 460 LKNENSTLPLDARKIRRIAVLGPNADARGFALGHYGPLAVEVTSVLDGLKRNLGARCEIV 519

Query: 472 YKTGC--------------DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR 517
           Y+ GC              +++  +    I  A+EAA  +D  +++ G       E+  R
Sbjct: 520 YEKGCELVDAAWPLSEIFREEMTPEEKAGIRRAAEAASESDVAVVVLGGGSRTCGENCSR 579

Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
             L LPG Q +L+  V    K P +LV+++     I +A  + ++ AI+ A YPG  GG+
Sbjct: 580 SSLDLPGRQEELLRAVEATGK-PTVLVMINGRPNSINWA--DAHVDAIVEAWYPGAHGGQ 636

Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG-----YPGRTYKFYNGP 632
           A+ +V+FG++NPGG+L +T+    +V  +P  + P +P  +        PG      NG 
Sbjct: 637 AVYEVLFGEYNPGGKLTVTFPR--HVGQIPF-NFPYKPAANTDGGLTPGPGGNQTRING- 692

Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD 692
            LY FGYGLSYT F+Y  L                                     +R D
Sbjct: 693 ALYDFGYGLSYTTFEYADLRIEPQT-------------------------------IRQD 721

Query: 693 DYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFN 752
           + F    D  N G  DG +VV +Y         TY K + GF RV ++AG  +R+     
Sbjct: 722 EPFRVSFDVTNTGQRDGDEVVQLYIHDVLSSVTTYEKNLRGFDRVHLKAGETRRVTMQVR 781

Query: 753 ACKSLNIVDYAANTLLPAGEHTIFVG 778
             + L++++     ++  G+  + +G
Sbjct: 782 P-QDLSLLNERMERVVEPGDFDVLIG 806


>gi|365877135|ref|ZP_09416640.1| glycoside hydrolase family protein [Elizabethkingia anophelis Ag1]
 gi|442587941|ref|ZP_21006755.1| glycoside hydrolase family protein [Elizabethkingia anophelis R26]
 gi|365754995|gb|EHM96929.1| glycoside hydrolase family protein [Elizabethkingia anophelis Ag1]
 gi|442562440|gb|ELR79661.1| glycoside hydrolase family protein [Elizabethkingia anophelis R26]
          Length = 827

 Score =  271 bits (693), Expect = 9e-70,   Method: Compositional matrix adjust.
 Identities = 221/816 (27%), Positives = 361/816 (44%), Gaps = 158/816 (19%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQ-LGDFAHG-VPRLGLPQYEWWSEA-LHGVSNV 109
           +F D   P   RV++L+S+MTL EK  Q +  + +G + +   P  +W +E  +HG++N+
Sbjct: 66  IFEDRKEPIDKRVENLISQMTLQEKANQTVTLYGYGRILKDEQPTSQWKNEVWVHGLANI 125

Query: 110 G------------------------------------------PGTHFDDVIPG-----A 122
                                                      P    ++ I G     A
Sbjct: 126 DEMLNSLPYHKSAVTKYSYPYSNHTEALNNIQKWFIEETRLGIPVDFTNEGIHGLTHDRA 185

Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE 182
           T FP  I   +++++ L  KIG  +  EA   Y LG   +  ++P ++V+RDPRWGR+ E
Sbjct: 186 TPFPAPINIGSTWDKDLVGKIGNTIGKEA---YYLGYTNV--YAPILDVSRDPRWGRVVE 240

Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY 242
           T GEDPF++G Y    V+G+Q     +N          V+S  KHYA Y V         
Sbjct: 241 TYGEDPFMIGEYGKRMVKGIQ-----QNG---------VASTLKHYAVYSVPKGGRDGLA 286

Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
             D  V  ++M   +L PF+  +++     VM SYN  +G+P  +    L   +R E+  
Sbjct: 287 RTDPHVAPKEMHTMYLYPFKEVIRKEHPLGVMASYNDYDGVPVISSKYFLTDLLRKEYGF 346

Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG---------N 353
            GY+V+D D+++ +   H  +A   E+ + + L+AGLD+      TNFT          +
Sbjct: 347 DGYVVSDSDALEFLHGKH-HVAKDYEEGIQKALEAGLDVR-----TNFTQPKEYLTALMD 400

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAAEAAREG 412
           A++ GK+KE  +++ ++ +     RLG FD   + ++    + + + E+  L+ +  R  
Sbjct: 401 ALKSGKIKEEVLNERVRSVLKTKFRLGLFDEPIRNFIKEADRKVHTKEDEALSVDVNRRS 460

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA---- 468
           +VLLKN++ TLPL++ K+K + + GP A+A       Y        +   G   YA    
Sbjct: 461 VVLLKNEKQTLPLDTGKLKNILITGPLADAVNYTTSRYGPSNNPVTTIRKGIEDYASLHH 520

Query: 469 -NVTYKTGCD--------------DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE 513
            N +Y  G D              +   K  + I      A+ +D  I + G       E
Sbjct: 521 INTSYTKGVDVIDEGWPETEIIPVEPTEKEKSEISKTISMAEKSDVIIAVMGESEKEVGE 580

Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
           S  R  L LPG QT  + Q+ +  K P++LV+++   + I +   N  + AIL   + G 
Sbjct: 581 SRSRSSLNLPGKQTYFLQQLYKTRK-PIVLVLVNGRPLTINWE--NKYLPAILETWFLGP 637

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
           + G  +A+ +FG+ NPGG+LPI++     +  L + + P +P    G PG       GP 
Sbjct: 638 QSGNIVAETLFGENNPGGKLPISFPKS--IGQLEM-NFPTKPAAQAGQPG------TGPN 688

Query: 634 ----------LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
                     LYPFGYGLSYT F++   S                       +SK    G
Sbjct: 689 GSGSSRVTGFLYPFGYGLSYTNFEFTDFSL----------------------SSKKIKAG 726

Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
              N+L        K+   N G   G +VV +Y         TY   + GF+RV +  G 
Sbjct: 727 ---NELHA------KLKVTNTGKVKGDEVVQLYLSDLVSSVTTYEMDLRGFERVTLEPGE 777

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            K ++F  N  + + +++     ++  GE  + VGN
Sbjct: 778 AKEVQFTLNK-EHMQLLNDKMEWVVEPGEFRVSVGN 812


>gi|294675412|ref|YP_003576028.1| family 3 glycosyl hydrolase [Prevotella ruminicola 23]
 gi|294472176|gb|ADE81565.1| glycosyl hydrolase, family 3 [Prevotella ruminicola 23]
          Length = 875

 Score =  271 bits (693), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 167/468 (35%), Positives = 252/468 (53%), Gaps = 54/468 (11%)

Query: 47  LQMSSFL-FCDSS-----LPY-------SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGL 93
           L MS F+ FC ++     LPY       + R  DL+SR+TLDEKV  + D +  +PRLG+
Sbjct: 5   LMMSLFVGFCATAMDAQGLPYQNANLSAAQRADDLLSRLTLDEKVSLMMDTSPAIPRLGI 64

Query: 94  PQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA 153
           PQ++WW+EALHG+   G           AT FP  +   AS++++L  ++  AVS EAR 
Sbjct: 65  PQFQWWNEALHGIGRNG----------FATVFPITMAMAASWDDALLHQVFTAVSDEARV 114

Query: 154 MYNLGR--------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV 205
                +          L++W+PNIN+ RDPRWGR  ET GEDP++  +  +  VRGLQ V
Sbjct: 115 KAQQAKCTGDIKRYQSLSFWTPNINIFRDPRWGRGQETYGEDPYLTAKMGLAVVRGLQGV 174

Query: 206 EGHENATDLN-SRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEM 263
            G+ N  DL  S+  K+ +C KH+A +    W   +R+ F+   + E+D+ ET+L  F+ 
Sbjct: 175 -GY-NGEDLGVSKYRKLLACAKHFAVHSGPEW---NRHEFNIENLPERDLWETYLPAFKA 229

Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
            V+EG  + VMC+Y R++G   CA  +   Q +R EW   G I +DC +I+  +     +
Sbjct: 230 LVQEGKVAEVMCAYQRIDGQACCAQTRYEQQILRDEWGFDGLITSDCGAIRDFLPRWHNV 289

Query: 324 ADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
           +    +A A+ + AG D++CG  Y +    AV++G VKE DID+SL+ L      LG  D
Sbjct: 290 SKDGAEASAKAVLAGTDVECGSEYKHLP-EAVRRGDVKEADIDRSLRRLLIARFELGDMD 348

Query: 384 GSP--QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL------NSAKVKTVAV 435
                 +  + +  + S  + +LA + A + IVLL+N    LPL       +   K + V
Sbjct: 349 SDDLNAWTKIPETVVASQAHKDLALKMALKSIVLLQNKIKVLPLGNPLNAGAGSDKDIVV 408

Query: 436 VGPHANATVAMIGNYAGIPCRYMSPIAG-------FSGYANVTYKTGC 476
           +GP+AN +V M GNYAG P   ++ + G        S  A V +  GC
Sbjct: 409 MGPNANDSVMMWGNYAGYPTHTVTALDGITRMAKTLSPDATVRFIQGC 456



 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 81/292 (27%), Positives = 123/292 (42%), Gaps = 68/292 (23%)

Query: 501 IILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
           I + G+  ++E E +          DR  + LP  Q  L+  + +  K    ++ ++  G
Sbjct: 624 IFVGGISPNLEGEEMRVNEPGFKGGDRTSIELPQAQRDLLAVLHKAGKK---VIFVNCSG 680

Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
             +A A       AIL   Y GE+GG A+A  +FG   P G+LP+T+Y       LP   
Sbjct: 681 SAMALAPELETCDAILQWWYGGEQGGAALATTLFGMVAPSGKLPVTFYKS--TDELP--- 735

Query: 611 MPLRPVDSLGY--PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
                 D L Y    RTY++Y G  L+PFG+GL YT F     +  K I  N NK+Q   
Sbjct: 736 ------DFLDYTMKNRTYRYYEGEPLFPFGFGLGYTTF-----NIDKPIYKN-NKVQ--- 780

Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
                                         V  +N+G+T G++ V VY +  A+      
Sbjct: 781 ------------------------------VRVKNLGTTAGTETVQVYIRHLADKEGPK- 809

Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL-LPAGEHTIFVGN 779
           K +  +Q+V + A   K I       KS    D   NT+ +  G++ + VGN
Sbjct: 810 KSLRAYQQVTLNAAEAKTISIEL-PRKSFEGWDVKTNTMRVVPGKYEVMVGN 860


>gi|423301682|ref|ZP_17279705.1| hypothetical protein HMPREF1057_02846 [Bacteroides finegoldii
            CL09T03C10]
 gi|408471675|gb|EKJ90206.1| hypothetical protein HMPREF1057_02846 [Bacteroides finegoldii
            CL09T03C10]
          Length = 1365

 Score =  271 bits (693), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 230/797 (28%), Positives = 352/797 (44%), Gaps = 146/797 (18%)

Query: 54   FCDSSLPYSIRVKDLVSRMTLDEKVQQLGD---------------------------FAH 86
            +  + LP   RVKDL+ RMT +EK+ Q+                             F  
Sbjct: 536  YQRADLPIEERVKDLLQRMTPEEKLAQIRHIHSWEIFNGQALDERKLEEKAQGMSWGFVE 595

Query: 87   GVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
            G P                     RLG+P +   +E+LHGV           V  GAT F
Sbjct: 596  GFPLTAENCAKNMLAIQRFMVEKTRLGIPIFTV-AESLHGV-----------VHEGATVF 643

Query: 126  PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPG 185
            P  I   ++F+  L  +    ++ E  A+           SP I+V RD RWGR+ E+ G
Sbjct: 644  PQNIALGSTFDTDLAYRKTSMIADELHAV-----GMRQVLSPCIDVVRDLRWGRVEESFG 698

Query: 186  EDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFD 245
            EDP++ GR+ +  V+G  D                +S   KHY  +  +   G++    +
Sbjct: 699  EDPYLCGRFGIAEVKGYMDN--------------GISPMLKHYGPHG-NPLSGLNLASVE 743

Query: 246  ARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGY 305
              +  +D+ E +L+PFEM +K+    +VM +YN  N IP+ A   LL   +R EW   GY
Sbjct: 744  TSI--RDLHEVYLKPFEMVMKQAPTLAVMSAYNSWNRIPNSASHYLLTDVLRKEWGFKGY 801

Query: 306  IVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDI 365
            + +D  +I+ M+ N  F A + E+A  Q L AGLD++            +++G++    +
Sbjct: 802  VYSDWGAIE-MLKNFHFTARNSEEAALQALTAGLDVEASSDCYPAIPGLIERGELNREIV 860

Query: 366  DKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
            D++++ +     R+G FD  P      K  I S + I L+ + A E  VLLKND+  LPL
Sbjct: 861  DEAVRRVLYAKFRIGLFD-DPYGEKFAKGAIHSGKAIALSKKIADESTVLLKNDRQLLPL 919

Query: 426  NSAKVKTVAVVGPHANATVAMIGNYAGI-PCRY-MSPIAGFSGYA----NVTYKTGCDDV 479
            +  K+K++AV+GP  NA     G+Y      R+ ++P+ G   +A     V Y  GC  V
Sbjct: 920  SIGKLKSIAVIGP--NADQIQFGDYTWTRDNRFGVTPLQGIRKWAGTNVKVNYVKGCSLV 977

Query: 480  ACKSNNSIFAASEAAKTADATIILAG---------LDLSVEAESLDREDLWLPGYQTQLI 530
            +    + I  A EAA+ +D  ++  G            S   E  D  DL L G Q  LI
Sbjct: 978  SM-DESGIRQAVEAAEQSDVCVLFCGSASAALARDYKSSTCGEGFDLNDLTLTGAQPALI 1036

Query: 531  NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
              V    K PVILV+++  G   A      NI AIL   Y GE+ G +IAD++FGK +P 
Sbjct: 1037 KAVQATGK-PVILVLVT--GKPFAIPWEKKNIPAILVQWYAGEQSGNSIADILFGKVSPS 1093

Query: 591  GRLPITWYNGDYVQMLPLTSMPLR-------PVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
            GRL  ++   +    LP+    LR          S   PGR Y F     L+ FG+GL+Y
Sbjct: 1094 GRLTFSF--PESTGHLPVFYNHLRSDRGFYKSPGSYDSPGRDYVFSAPVPLWSFGHGLTY 1151

Query: 644  TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
            T F+Y+ L   +T  + LN   H R                              +D +N
Sbjct: 1152 TTFEYSNLQTDRTSYL-LNDTVHVR------------------------------IDLKN 1180

Query: 704  VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
             G  +G +VV +Y        A  + Q+  F++V ++AG  + ++        L I++  
Sbjct: 1181 TGKREGKEVVQLYVSDVYSSVAMPVHQLRDFRKVALQAGETQTVRLSI-PVSELTILNEK 1239

Query: 764  ANTLLPAGEHTIFVGNG 780
               ++  GE  I VG+ 
Sbjct: 1240 NEAIVEPGEFEIQVGSA 1256


>gi|224535195|ref|ZP_03675734.1| hypothetical protein BACCELL_00056 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224523186|gb|EEF92291.1| hypothetical protein BACCELL_00056 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 733

 Score =  271 bits (693), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 220/792 (27%), Positives = 376/792 (47%), Gaps = 126/792 (15%)

Query: 45  LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFA------------------- 85
           L ++    ++ D+  P   RVKDL++RMTL EKV QL  +                    
Sbjct: 16  LSVRSQKPVYKDAGQPVETRVKDLLNRMTLHEKVLQLNQYTFGENDNPNNIGTEVKNLPA 75

Query: 86  --------HGVPRL-GLPQYEWWSEALHGVSNVGPGTHFDDVIPG-ATSFPTVILTTASF 135
                   H  P+L    Q +   E+  G+    P     DVI G  T +P  +    SF
Sbjct: 76  EIGSLIYLHTDPKLRNRIQRKAMEESRLGI----PILFGFDVIHGLRTVYPISLAQACSF 131

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRY 194
           N  L   + QA    A+       +G+ + +SP I+VARDPRWGRI+E  GEDP+     
Sbjct: 132 NPDL---VTQACGMAAKESV---LSGIDWTFSPMIDVARDPRWGRISECYGEDPY----- 180

Query: 195 AVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
            +N V G+  V+G++   +  S P  +++C KHY  Y V    G D  + D  ++ Q + 
Sbjct: 181 -LNTVFGVASVKGYQG--EKLSDPYSIAACLKHYVGYGVSE-GGRDYRYTD--ISPQALW 234

Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
           ET+L P+E CVK G A+++M S+N ++G+P+ ++  +L + ++ +W   G++V+D ++I+
Sbjct: 235 ETYLPPYEACVKAG-AATLMSSFNDISGVPATSNHYILTEILKNKWRHDGFVVSDWNAIE 293

Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
            ++  ++ +A ++++A  +   AG+++D     Y  +    V + K++ + ID ++  + 
Sbjct: 294 QLI--YQGVAKNRKEAAYKAFHAGVEMDMRDNVYYEYLEQLVAEKKIEISQIDDAVARIL 351

Query: 374 TVLMRLGFFDGSPQYVSLGKQD-ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKT 432
            V  RLG FD  P    L +Q+     E+I LAA  A E +VLLKN++N LPL+S  VK 
Sbjct: 352 RVKFRLGLFD-EPYTKELTEQERYLQKEDIALAARLAEESMVLLKNEKNLLPLSST-VKR 409

Query: 433 VAVVGPHANATVAMIGNYA------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNS 486
           VA++GP       ++G +A       +   Y      F     + Y+ GC   A   N+ 
Sbjct: 410 VALIGPMVKDRSDLLGAWAFKGQAEDVETIYEGMQKEFGDKVRLDYEQGC---ALDGNDE 466

Query: 487 --IFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
               AA + A+ +D  ++  G       E+  R  + LP  Q +L+  + +  K P++LV
Sbjct: 467 SGFSAALKTAEASDVVVVCLGESKQWSGENASRSTIALPDIQEKLLLHLKQANK-PIVLV 525

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
           + S  G  +        ++AI+    PG  GG  +A ++ G+ NP G+L +T+       
Sbjct: 526 LSS--GRPLELIRLEPQVEAIIEMWQPGVAGGTPLAGILSGRVNPSGKLSVTF------- 576

Query: 605 MLPLTS--MPL--------RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
             PL++  +P+        RP D++G     Y+      LYPFGYGLSYT F Y+     
Sbjct: 577 --PLSTGQIPVYYNMRQSARPFDAMG----DYQDIPTEPLYPFGYGLSYTTFTYS----- 625

Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
                 L+ L+  +N   T++ + T                       N G  +G + V+
Sbjct: 626 ---DAKLSSLKIKKNQKITAEVTVT-----------------------NAGKVEGKETVL 659

Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
            Y   P    +  +K++  F++  ++ G ++  +F  +  + L+  D      L AGE  
Sbjct: 660 WYVSDPFCSISRPMKELKFFEKQSLKVGESRVFRFEIDPMRDLSYTDATGKRFLEAGEFI 719

Query: 775 IFVGNGGVSFPI 786
           + VG   ++F +
Sbjct: 720 VSVGGRKLTFEV 731


>gi|364284956|gb|AEW47953.1| GHF3 protein [uncultured bacterium D1_14]
 gi|364284964|gb|AEW47958.1| GHF3 protein [uncultured bacterium E2_1]
          Length = 752

 Score =  271 bits (693), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 218/733 (29%), Positives = 347/733 (47%), Gaps = 102/733 (13%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHG-----VPRL------GLPQYEWWSEALHGVSNVG-- 110
           RV+ L+++MTL+EK+ Q+   +       V RL      G    E   E ++ +  V   
Sbjct: 36  RVESLLTKMTLEEKIGQMNQVSFSGNIEEVSRLIKNGEVGSILNEVDPERVNALQRVAIE 95

Query: 111 ------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT 163
                 P     DVI G  T FP  +   ASFN  + +K  +  + EA ++      G+ 
Sbjct: 96  ESRLGIPILIGRDVIHGFKTIFPIPLGQAASFNPQIVEKGARVSAVEASSV------GVR 149

Query: 164 Y-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           + ++P I+++RDPRWGRI E+ GEDP++        V+G Q         D  + P  ++
Sbjct: 150 WTFTPMIDISRDPRWGRIAESCGEDPYLTSVMGAAMVKGFQG--------DSLNNPNSIA 201

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           +C KH+  Y         R +    +TE+ +   +L PFE  VK+G A+  M S+N  +G
Sbjct: 202 ACAKHFVGYGAAEG---GRDYNTTCITERQLRNVYLPPFEAAVKQGVAT-FMTSFNANDG 257

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           IPS  +P +L + +R EW   G++V+D  SI  MV  H F  D K DA  + + AG+D++
Sbjct: 258 IPSSGNPFILKKVLRDEWGFDGFVVSDWASIIEMVA-HGFCTDDK-DAAMKAVNAGVDME 315

Query: 343 CGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDEN 401
              Y Y N   +   + KV E  ID +++ +  V  RLG FD +P         I S EN
Sbjct: 316 MVSYTYMNHLKDLKNENKVSEETIDNAVRNILRVKFRLGLFD-NPYVDEKAPSPIYSKEN 374

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYMS 459
           + +A EAA +  +LLKND+  LP+N + VKT+AVVGP A+A    +G +A  G      +
Sbjct: 375 LAIAKEAAIQSAILLKNDKQILPINES-VKTIAVVGPMADAPYEQMGTWAFDGEKSMTQT 433

Query: 460 PIAGFSGY----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
           P+     +     N  ++ G      K+ + I  A  AA  AD  +   G +  +  E+ 
Sbjct: 434 PLMALRQFYGDKVNFIFEPGLAYTRDKNTSGISKAVSAANRADLVLAFVGEEAILSGEAH 493

Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
              +L L G Q+ LIN +A+  K P++ V+++  G  +   +     KA+L++ +PG  G
Sbjct: 494 CLANLNLQGAQSDLINALAKTGK-PIVTVVIA--GRPLTIGKEAELSKAVLYSFHPGTMG 550

Query: 576 GRAIADVVFGKFNPGGRLPITW---------YNGDYVQMLPLTSMPLRPVDSLGY-PGRT 625
           G AIAD++FGK  P G+ P+T+         Y   Y    P     +  +D++    G+T
Sbjct: 551 GPAIADLLFGKAVPSGKTPVTFPKEVGQIPIYYSHYNTGRPANRNEIL-LDNIAVGAGQT 609

Query: 626 ----YKFY---NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
                 FY       LYPFG+GLSYT F+Y+                   NL  +S    
Sbjct: 610 SLGNTSFYLDAGFDPLYPFGFGLSYTTFEYS-------------------NLKLSS---- 646

Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
                   N+L   D      D +N G+ +G++V  +Y +         +K++  F R+ 
Sbjct: 647 --------NELSAKDELTVTFDLKNTGNYEGAEVAQLYVRDMVGSVVRPVKELKRFNRIT 698

Query: 739 VRAGRNKRIKFVF 751
           ++ G  + +   F
Sbjct: 699 LKPGETRNVSMTF 711


>gi|423223731|ref|ZP_17210200.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638106|gb|EIY31959.1| hypothetical protein HMPREF1062_02386 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 854

 Score =  271 bits (692), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 164/434 (37%), Positives = 248/434 (57%), Gaps = 41/434 (9%)

Query: 46  GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
           G+  +  L+ D   P   R+ DL+SR+T++EK+  L   + G+ RL +P+Y   +EALHG
Sbjct: 20  GVAQAQELYKDEKAPMHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHG 79

Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG---- 161
           V  V PG          T FP  I   A++N  L  ++   +S EARA +N    G    
Sbjct: 80  V--VRPGRF--------TVFPQAIGLAATWNPELQLQVATVISDEARARWNELDQGREQK 129

Query: 162 ------LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLN 215
                 LT+WSP +N+ARDPRWGR  ET GEDP++ G     +V+GLQ   G ++     
Sbjct: 130 SQFSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQ---GDDD----- 181

Query: 216 SRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMC 275
            R LK+ S  KH+AA + ++    +R+  + +++E+ + E +L  FE CVK+G ++S+M 
Sbjct: 182 -RYLKIVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMS 236

Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTL 335
           +YN +N +P   +  LL + +R +W   GY+V+DC    ++V+ HK++  +KE A A ++
Sbjct: 237 AYNALNDVPCTLNAWLLTKVLRKDWGFKGYVVSDCGGPSLLVNAHKYVK-TKEAAAALSI 295

Query: 336 KAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLG 392
           KAGLDL+CG   Y     +A +Q  V + DID +   +    M LG FD   Q  Y  + 
Sbjct: 296 KAGLDLECGDDVYDQPLLSAYRQYMVTDADIDSAAYRVLRARMELGLFDSGEQNPYTKIS 355

Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
              I S E+ E+A  AARE IVLLKN +  LPLN+ KVK++AVVG   NA  +  G+Y+G
Sbjct: 356 PAVIGSAEHQEVALNAARECIVLLKNQKKMLPLNAKKVKSIAVVG--INAGSSEFGDYSG 413

Query: 453 IPCRYMSPIAGFSG 466
           +P   ++PI+   G
Sbjct: 414 LPV--IAPISVLQG 425



 Score =  146 bits (368), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 100/293 (34%), Positives = 147/293 (50%), Gaps = 54/293 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A +A +  +  + + G++ S+E E  DR D+ LP  Q + + ++ +V   P I+V++ AG
Sbjct: 595 AGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQQEFLQEIYKV--NPNIVVVLVAG 652

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
              +A    + +I AI+ A YPGE GG+A+A+V+FG +NPGGRLP+T+Y         L 
Sbjct: 653 S-SLAINWMDEHIPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-------LD 704

Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
            +P  P D      GRTYK++ G  LYPFGYGLSYT FKY+       +QV         
Sbjct: 705 ELP--PFDDYDITKGRTYKYFKGDVLYPFGYGLSYTTFKYS------NLQV--------- 747

Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ--NVGSTDGSDVVIVYSKPPAEIAAT 726
                                  D   E  V FQ  N G   G +V  VY K P      
Sbjct: 748 ----------------------ADGEEEINVSFQLKNSGKYAGDEVAQVYVKLPERDEVM 785

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVG 778
            IK++ GF+RV +++G NK++         L   D A +  + P+G++TI VG
Sbjct: 786 PIKELKGFERVTLKSGENKKVTLKLRK-DLLRYWDEAKDKFVCPSGDYTIMVG 837


>gi|389696043|ref|ZP_10183685.1| beta-glucosidase-like glycosyl hydrolase [Microvirga sp. WSM3557]
 gi|388584849|gb|EIM25144.1| beta-glucosidase-like glycosyl hydrolase [Microvirga sp. WSM3557]
          Length = 751

 Score =  271 bits (692), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 230/759 (30%), Positives = 354/759 (46%), Gaps = 114/759 (15%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG-VSNVGPG---------- 112
           RV +L+ RMTL+EKV QL   +HG P     ++E  SE   G V N              
Sbjct: 39  RVNELLGRMTLEEKVGQLNLVSHGPPL----RWEDISEGKAGAVLNFNSAEDVARAQALV 94

Query: 113 --THFD-------DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGL 162
             +H         DV+ G  T FP  +   A+F+  + +   +  + EA  +      G+
Sbjct: 95  RESHLKIPLLFGLDVLHGFRTQFPLPLGEAAAFSPRVSRLASEWAAREASYV------GV 148

Query: 163 TY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
            + ++P  +++RD RWGRI E  GEDP +        V G               R   +
Sbjct: 149 NWTFAPMADLSRDSRWGRIVEGFGEDPTLGAALTAARVEGF--------------RKGGL 194

Query: 222 SSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVN 281
           ++  KH+A Y         R +    +   +M +T+L PF   V+ G AS  M ++N +N
Sbjct: 195 AAAAKHFAGYGAPQG---GRDYDTTYIPRAEMYDTYLPPFRAAVEAGTAS-FMAAFNALN 250

Query: 282 GIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL 341
           G PS A+P LL   +R +W   G++ +D   I  +V NH   AD  E A  + + AG+D+
Sbjct: 251 GEPSTANPWLLTDVLRTQWGFDGFVTSDWVGIGELV-NHGIAADGAE-AARKAILAGVDM 308

Query: 342 DC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
           D  GQ Y N   + V+ G+V E+ ID+S++ +     RLG FD      S    +  S E
Sbjct: 309 DMMGQLYINHLPDEVRAGRVPESVIDESVRRVLRTKFRLGLFDRPDVDSSHLDSEFPSPE 368

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--------- 451
           + + A E ARE  VLL+N  + LP+ S KV+++AVVGP A+A    +G +A         
Sbjct: 369 SRQAAREVARETFVLLQNRDDVLPIPS-KVRSIAVVGPLADAPQDQMGPHAARGHKEDSV 427

Query: 452 ----GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLD 507
               GI  R  S  AG +    V +  GCD + C++ +++  A EAA+ +D  I + G  
Sbjct: 428 TILEGIRRRAQS--AGIA----VRHAPGCD-LFCRNTDALPGALEAARQSDFVIAVFGEP 480

Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
             +  E+  R ++ L G Q +++ ++A+  K PV LVIM  GG           I +IL 
Sbjct: 481 QELSGEAASRANMELNGKQIEVLEELAKTGK-PVALVIM--GGRPQVLGPVADRIPSILM 537

Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL--TSMPL-RPVDSLGYPGR 624
           A YPG E G A+ADV+FG  +P G+LP+TW        LPL    +P  RP  +      
Sbjct: 538 AWYPGTEAGPAVADVLFGDVSPSGKLPLTWPRA--TGQLPLYYNRLPTGRPTLANNRFTL 595

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
            Y   +   LYPFG+GLSYT F Y                         SDA       +
Sbjct: 596 HYIDESIAPLYPFGWGLSYTHFAY-------------------------SDAR------I 624

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
               L      E  +D +N G+ DG +VV +Y++ P    +  ++++  F+++ +++G  
Sbjct: 625 ASRQLDEGQVLEVSLDVKNTGARDGQEVVQLYTRDPVASRSRPLRELKAFEKIALKSGET 684

Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
           KR+       +SL         L+ AG   +FVG   ++
Sbjct: 685 KRVTLRV-PVESLGFHLDDGTYLVEAGAIQVFVGGSSLA 722


>gi|393787054|ref|ZP_10375186.1| hypothetical protein HMPREF1068_01466 [Bacteroides nordii
           CL02T12C05]
 gi|392658289|gb|EIY51919.1| hypothetical protein HMPREF1068_01466 [Bacteroides nordii
           CL02T12C05]
          Length = 958

 Score =  271 bits (692), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 223/809 (27%), Positives = 366/809 (45%), Gaps = 144/809 (17%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
           ++ D + P   R++DL+S+M L+EK  Q+    +G  R+    LP  EW    W      
Sbjct: 64  VYEDPTAPIDARIEDLLSQMNLNEKTCQMVTL-YGYKRVLKDALPTPEWKQMLWKDGMGA 122

Query: 101 --EALHGVSNVG-PGTHFDDVIPG------------------------------------ 121
             E L+G    G P +  ++V P                                     
Sbjct: 123 IDEHLNGFQQWGLPPSDNENVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGVES 182

Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
             AT+FPT +    ++N  L  ++G     EAR +      G T  ++P ++V RD RWG
Sbjct: 183 YKATNFPTQLGLGHTWNRKLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWG 236

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  E  GE P++V    +  V+G+Q                +V++  KH+ AY  +    
Sbjct: 237 RYEEVYGESPYLVAELGIEMVKGMQ-------------HNYQVAATGKHFIAYSNNKGAR 283

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
                 D +++ +++E   + PF+  ++E     VM SYN  +G+P  +    L   +RG
Sbjct: 284 EGMARVDPQMSPREVEMIHVYPFKRVIQEAGLLGVMSSYNDYDGLPVQSSYYWLMTRLRG 343

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
           +    GY+V+D D+++ +   H    D KE AV Q+++AGL++ C       Y       
Sbjct: 344 QMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLREL 402

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIE--LAAEAAREG 412
           VQ+G + E  I+  ++ +  V   +G FD +P    L   D   ++     +A +A+RE 
Sbjct: 403 VQEGGLSEEIINDRVRDILRVKFLVGLFD-TPYQTDLKGADEEVEKEENEIVALQASRES 461

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGYA 468
           IVLLKND+N LPL+ A ++ +AV GP+A+ T   + +Y  +     + ++G      G A
Sbjct: 462 IVLLKNDKNALPLDVASIRKIAVCGPNADETAYALTHYGPLAVDVTTVLSGIRQKVDGKA 521

Query: 469 NVTYKTGCDDVACK--------------SNNSIFAASEAAKTADATIILAGLDLSVEAES 514
            V Y  GC+ V                   N I  A   AK AD  +++ G       E+
Sbjct: 522 EVLYTKGCELVDANWPESEIIDYPLTNDEQNKIDKAVAQAKEADVAVVVLGGGQRTCGEN 581

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
             R  L LPG Q  L+  V    K PV+LV+++   + + +A+    + AI+ A YPG +
Sbjct: 582 KSRSSLDLPGRQLDLLKAVQATGK-PVVLVLINGRPLSVNWAD--KFVPAIIEAWYPGSK 638

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGRTYKF--Y 629
           GG A+ADV+FG +NPGG+L +T+     V  +P  + P +P   +D    PG        
Sbjct: 639 GGTAVADVLFGDYNPGGKLTVTFPKS--VGQIPF-NFPCKPSSQIDGGKNPGPKGNMSRV 695

Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
           NG  LYPFG+GLSYT F+Y+ +S +  +     K+Q                       +
Sbjct: 696 NG-ALYPFGHGLSYTTFEYSDISISPKVITPNQKVQ-----------------------V 731

Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
           RC           N G   G +VV +Y +       TY K + GF+R+ ++ G  K + F
Sbjct: 732 RC--------KITNTGKRAGDEVVQLYVRDILSSVTTYEKNLEGFERIHLQPGETKEVSF 783

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
             +  K+L +++   + ++  G+ +I +G
Sbjct: 784 TLDR-KALELLNAKNDWVVEPGDFSIMLG 811


>gi|315499711|ref|YP_004088514.1| beta-glucosidase [Asticcacaulis excentricus CB 48]
 gi|315417723|gb|ADU14363.1| Beta-glucosidase [Asticcacaulis excentricus CB 48]
          Length = 869

 Score =  271 bits (692), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 167/420 (39%), Positives = 229/420 (54%), Gaps = 40/420 (9%)

Query: 69  VSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTV 128
           ++RMT+++K  Q+ + A  +P  GL  YEWW+E LHGV+  G           AT FP  
Sbjct: 40  IARMTVEQKAAQMQNRAPDLPSAGLTAYEWWNEGLHGVARAGE----------ATVFPQA 89

Query: 129 ILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDPRWGRI 180
           I   A++N +L K++G  VSTEARA +N            GLT WSPNIN+ RDPRWGR 
Sbjct: 90  IGLAATWNPALLKQVGDVVSTEARAKFNSTDPAGDHQRYYGLTLWSPNINIFRDPRWGRG 149

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            ET GEDPF+  R A  +V GLQ  +             KV +  KH A   V +     
Sbjct: 150 QETYGEDPFLTSRLAEGFVTGLQGPDPQHP---------KVVASVKHLA---VHSGPEAG 197

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
           R+ F A V+  D+E T+L  F   V    A SVMC+YN V G+P+CA   LL   VR  W
Sbjct: 198 RHGFAASVSPYDLEMTYLPAFRYSVMTTKAQSVMCAYNAVGGVPACASDLLLKTYVREAW 257

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
              GY+V DCD+I  M   H +  +  E + A++LKAG+DL+CG  Y      AVQ+G +
Sbjct: 258 GFKGYVVTDCDAIYDMTRFHFYRLNDAESS-AESLKAGVDLNCGNAYAALP-EAVQKGLI 315

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSDENIELAAEAAREGIVLLKND 419
            E+ +D+SL  L  V  RLG  DG+P  +  +  + I + +   LA +AA + +VLLKN+
Sbjct: 316 PESLMDQSLNRLLDVRKRLG-IDGAPSPWARISPEAINTPQAQGLALQAAEQSLVLLKNN 374

Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYKTGC 476
              LPL     +TVAV+GP+A+    + GNY GI  + ++P+ G     G A V Y  G 
Sbjct: 375 -GVLPLKPG--QTVAVIGPNADTEETLRGNYNGIARQPVTPLTGLRAQLGAAKVLYAQGA 431



 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 88/298 (29%), Positives = 127/298 (42%), Gaps = 56/298 (18%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   +E E L          DR DL LP  Q  L+  V    K P+++V++S   V + 
Sbjct: 608 GLSPDIEGEELQILVPGFDRGDRTDLGLPRTQEDLLKAVKATGK-PLVVVLLSGSAVALN 666

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+ + +     W  YPGE GG AIA  + G+ NP GRLP+T+Y    VQ LP       
Sbjct: 667 WADAHADAVVAAW--YPGEAGGTAIARTLTGEANPSGRLPVTFYRS--VQDLP------- 715

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
           P       GRTY+++ G  LYPFG+GLSYTQF Y+ L    +                  
Sbjct: 716 PFIDYRMEGRTYRYFKGKPLYPFGHGLSYTQFSYSDLKLDTST----------------- 758

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                         L         V  +N G   G +VV +Y K P          +  F
Sbjct: 759 --------------LTAGQPLRVSVRVRNNGQRAGDEVVQLYVKRPDTFGLN--ASLAAF 802

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFNY 792
            RV ++AG ++ +    +  + L+ V       + AG + + VG G   F   LN ++
Sbjct: 803 ARVSLKAGESRTVVMTIDP-RDLSTVTLEGERAIRAGAYGLSVGGGQPGFAPTLNADF 859


>gi|71731103|gb|EAO33170.1| Beta-glucosidase [Xylella fastidiosa subsp. sandyi Ann-1]
          Length = 882

 Score =  271 bits (692), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 166/423 (39%), Positives = 238/423 (56%), Gaps = 40/423 (9%)

Query: 68  LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
           LV++MT  EK+ Q  + A  +PRLG+P Y+WWSE LHG++  G           AT FP 
Sbjct: 37  LVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YATVFPQ 86

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNL----GR-----AGLTYWSPNINVARDPRWG 178
            I   AS+N  L + +G   STEARA +NL    G+     AGLT WSPNIN+ RDPRWG
Sbjct: 87  AIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFRDPRWG 146

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  ET GEDP++ G+ AV+++RGLQ         D    P  +++  KH+A   V +   
Sbjct: 147 RGMETYGEDPYLTGQLAVSFIRGLQG--------DTPDHPRTIATP-KHFA---VHSGPE 194

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
             R+ FD  V+  D+E T+   F   + +G A SVMC+YN ++G P+CA   LLN  +R 
Sbjct: 195 QGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRN 254

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQG 358
           +W  +G++V+DCD+I+ M   H F  D+   A A  LK+G DL+CG  Y +    A+ +G
Sbjct: 255 DWGFNGFVVSDCDAIEDMTRFHFFRQDNAS-ASAAALKSGDDLNCGNTYRDLN-QAIARG 312

Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLL 416
            + E+ +D++L  L+T   RLG         Y ++G + I +  +  LA +AA + +VLL
Sbjct: 313 DIDESTLDQALIRLFTARQRLGTLQPREHDPYAAIGIKHIDTPAHRALALQAAAQSLVLL 372

Query: 417 KNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYK 473
           KN  NTLPL      T+AV+GP A++  A+  NY G     ++P+ G     G A V Y 
Sbjct: 373 KNSGNTLPL--PPETTLAVLGPDADSLTALEANYQGTSSTPVTPLTGLRTRFGTAKVHYA 430

Query: 474 TGC 476
            G 
Sbjct: 431 QGA 433



 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 99/301 (32%), Positives = 140/301 (46%), Gaps = 55/301 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A  A   ADA +   GL   VE E L          DR  + LP  Q  L+  V    K 
Sbjct: 604 AERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK- 662

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           P+I+V+MS   V + +A+ + +  AIL A YPG+ GG AIA  + G  NPGGRLP+T+Y 
Sbjct: 663 PLIVVLMSGSAVALNWAQHHAD--AILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFYR 720

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
               Q LP       P  S    GRTY+++ G  LYPFGYGLSYTQF Y           
Sbjct: 721 S--TQDLP-------PYISYDMTGRTYRYFKGQPLYPFGYGLSYTQFAYE---------- 761

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                 P +    L+  +        +N G+  G +VV +Y +P
Sbjct: 762 ---------------------APQLSTATLKAGNTLTVTAHVRNTGTRAGDEVVQLYLEP 800

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           P    A  ++ ++GF+RV +R G ++ + F  +A + L+ V       + AG + +FVG 
Sbjct: 801 PYSPQAP-LRSLVGFKRVTLRPGESRLLTFTLDA-RQLSGVQQTGQRSVEAGHYHLFVGG 858

Query: 780 G 780
           G
Sbjct: 859 G 859


>gi|285018984|ref|YP_003376695.1| beta-glucosidase [Xanthomonas albilineans GPE PC73]
 gi|283474202|emb|CBA16703.1| putative beta-glucosidase protein [Xanthomonas albilineans GPE
           PC73]
          Length = 904

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 171/446 (38%), Positives = 237/446 (53%), Gaps = 43/446 (9%)

Query: 45  LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
           LGL +S     D +     R   LV++MT  EK+ Q  + A  +PRLG+P YEWWSE LH
Sbjct: 39  LGLLVSPLAHADDA---EDRATALVAKMTRAEKIAQAMNDAPAIPRLGIPAYEWWSEGLH 95

Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA---- 160
           G++  G           AT FP  I   AS+N  L   +G   STEARA +NL       
Sbjct: 96  GIARNGE----------ATVFPQAIGLAASWNTDLLHAVGTVTSTEARAKFNLAGGPGKN 145

Query: 161 -----GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLN 215
                GLT WSPNIN+ RDPRWGR  ET GEDP++ G+ AV ++ GLQ         D  
Sbjct: 146 HARYGGLTIWSPNINIFRDPRWGRGMETYGEDPYLTGQLAVGFIHGLQG--------DDP 197

Query: 216 SRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMC 275
           + P  +++  KH A   V +     R+ FD  V+  D E T+   F   + EG A SVMC
Sbjct: 198 THPRTIATP-KHLA---VHSGPESGRHGFDVDVSPHDFEATYSPAFRAAIVEGHAGSVMC 253

Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTL 335
           +YN ++GIP+CA   L++  VRG W   G++V+DCD+I  M   H +       + A  L
Sbjct: 254 AYNALHGIPACAADWLIDGRVRGNWGFKGFVVSDCDAIDDMTQFH-YYRADNAGSAAAAL 312

Query: 336 KAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGK 393
           KAG DL+CG  Y +  G A+ +G+ +E  +D+SL  L+    RLG      +  Y  LG 
Sbjct: 313 KAGHDLNCGYAYRDL-GTALDRGEAEEAMLDRSLVRLFAARYRLGELQPRSKDPYARLGA 371

Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
           +DI S  +  LA +AA++ +VLL+N  +TLPL       +AV+GP+A+A  A+  NY G 
Sbjct: 372 KDIDSPTHRALALQAAQQSLVLLQNRNDTLPLRPG--LRLAVIGPNADALAALEANYQGT 429

Query: 454 PCRYMSPIAGFS---GYANVTYKTGC 476
               ++P+ G     G   V Y  G 
Sbjct: 430 SVAPVTPLQGLRARFGTTQVHYTQGA 455



 Score =  137 bits (344), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 92/286 (32%), Positives = 136/286 (47%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL   VE E L          DR DL LP  Q  L+ + A+ +  P+I+V+MS   V + 
Sbjct: 641 GLSPDVEGEELRIDVPGFDGGDRNDLSLPAAQQALLER-AKASGKPLIVVLMSGSAVALN 699

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +A+ + +  AIL A YPG+ GG AIA  + G  NPGGRLP+T+Y          ++  L 
Sbjct: 700 WAKQHAD--AILAAWYPGQSGGTAIAQALAGDINPGGRLPVTFYR---------STKDLP 748

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
           P  S    GRTY+++ G  L+PFGYGLSYT F Y     + T                  
Sbjct: 749 PYVSYDMKGRTYRYFKGEALFPFGYGLSYTHFAYTAPQLSSTT----------------- 791

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                         L+  D        +N G+  G +VV VY + P   A + ++ ++GF
Sbjct: 792 --------------LQAGDTLHVTTTVRNTGARAGDEVVQVYLQYPPR-AQSPLRALVGF 836

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV ++ G  + + F     + L+ VD +    + AG++ +FVG G
Sbjct: 837 QRVSLQPGEARTLSFALEP-RQLSDVDRSGQRAVEAGDYRLFVGGG 881


>gi|189467715|ref|ZP_03016500.1| hypothetical protein BACINT_04107 [Bacteroides intestinalis DSM
           17393]
 gi|189435979|gb|EDV04964.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 943

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 229/809 (28%), Positives = 360/809 (44%), Gaps = 144/809 (17%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
           ++ D +     R++DL+S+MTL+EK  Q+    +G  R+    LP  EW    W      
Sbjct: 52  VYEDPNASLDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGA 110

Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
             E L+G    G                                  P    ++ I G   
Sbjct: 111 IDEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGIES 170

Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
             AT+FPT +    ++N  L +++G     EAR +      G T  ++P ++V RD RWG
Sbjct: 171 YRATNFPTQLGLGHTWNRELIRQVGLITGREARIL------GYTNVYAPILDVGRDQRWG 224

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  E  GE P++V    +  VRG+Q    H +         +V++  KH+ AY  +    
Sbjct: 225 RYEEVYGESPYLVAELGIEMVRGMQ----HNH---------QVAATGKHFVAYSNNKGAR 271

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
                 D +++ +++E   + PF+  +KE     VM SYN  +G+P       L   +RG
Sbjct: 272 EGMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRG 331

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
           E    GY+V+D D+++ +   H    D KE AV Q+++AGL++ C       Y       
Sbjct: 332 EMGFRGYVVSDSDAVEYLYTKHSTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLREL 390

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG-KQDICSDENIELAAEAAREGI 413
           V++G + E  I+  ++ +  V   +G FD   Q    G  +++   EN  LA +A+RE +
Sbjct: 391 VKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAGADREVEKAENESLALQASRESL 450

Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGYAN 469
           VLLKN+ N LPL+   VK +AV GP+A+     + +Y  +     + + G      G A 
Sbjct: 451 VLLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKSEGKAE 510

Query: 470 VTYKTGCDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEAESL 515
           V Y  GCD V      S              I  A E A+ AD  +++ G       E+ 
Sbjct: 511 VLYTKGCDLVDANWPESELIDYPMTDNEQAEIDKAVENARQADVAVVVLGGGQRTCGENK 570

Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
            R  L LPG Q +L+  V    K PV+LV+++   + I +A  +  + AIL A YPG +G
Sbjct: 571 SRSSLDLPGRQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPAILEAWYPGSKG 627

Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGR--TYKFYN 630
           G A+ADV+FG +NPGG++ +T+     V  +P  + P +P   +D    PG        N
Sbjct: 628 GTAVADVLFGDYNPGGKMTVTFPKS--VGQIPF-NFPCKPSSQIDGGKNPGLDGNMSRVN 684

Query: 631 GPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
           G  LY FGYGLSYT F+Y+ +  + K I  N      C+                     
Sbjct: 685 G-ALYSFGYGLSYTTFEYSGIEISPKVITPNQKATVRCK--------------------- 722

Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
                        N G   G +VV +Y +       TY K + GF+R+ ++ G  K + F
Sbjct: 723 -----------VTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVVF 771

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
             +  K L ++D     ++  G+ +I VG
Sbjct: 772 TLDR-KQLELLDKHMEWVVEPGDFSIMVG 799


>gi|365121914|ref|ZP_09338824.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363643627|gb|EHL82934.1| hypothetical protein HMPREF1033_02170 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 1073

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 167/442 (37%), Positives = 248/442 (56%), Gaps = 48/442 (10%)

Query: 45  LGLQMSSFL-------FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYE 97
           L LQ+SSF        F D++L +  R+KDL+SR+ + EK+  L   +  +PRLG+ +Y 
Sbjct: 13  LLLQISSFAVAQINYPFRDTTLSHHERIKDLLSRLNVSEKISLLRATSPAIPRLGIDKYY 72

Query: 98  WWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
             +EALHGV  V PG          T FP  I   + +N    +++  A+S EAR  +N 
Sbjct: 73  HGNEALHGV--VRPGKF--------TVFPQAIGLASMWNPDFLQEVSTAISDEARGRWNE 122

Query: 158 GRAG----------LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG 207
              G          LT+WSP IN+ARDPRWGR  ET GEDPF+ G     +VRGLQ   G
Sbjct: 123 LNQGKDQTAGASDLLTFWSPTINMARDPRWGRTPETYGEDPFLTGTLGTAFVRGLQ---G 179

Query: 208 HENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKE 267
           ++       + +KV S  KH+AA + ++    +R   +A ++E+D+ E +   FE C+KE
Sbjct: 180 ND------PKYIKVVSTPKHFAANNEEH----NRASGNAVISERDLREYYFPAFEKCIKE 229

Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSK 327
           G A SVM +YN VNGIP   +  LL   +R +W   GY+V+DC + + +V  H ++ D+ 
Sbjct: 230 GQAQSVMSAYNAVNGIPCTLNKWLLTDVLRDDWGFDGYVVSDCSAPEYIVSQHHYV-DTY 288

Query: 328 EDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
           E+A +  +KAGLDL+CG   Y     NA  +G V  ++ID +   +    MRLG FD   
Sbjct: 289 EEAASLCIKAGLDLECGDNVYITPLLNAYNRGMVTMSEIDSAAYRVLRGRMRLGLFDDPN 348

Query: 387 Q--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
           +  Y  +    +  +++ ELA EAAR+ +VLLKND++ LP+ +  +K++AVVG   NA  
Sbjct: 349 ENPYNKISPSIVGCEKHRELALEAARQSLVLLKNDKDMLPIQTDNIKSIAVVG--INAAN 406

Query: 445 AMIGNYAGIPCRYMSPIAGFSG 466
              G+Y+G P    +PI+   G
Sbjct: 407 CEFGDYSGTPVN--TPISVLEG 426



 Score =  124 bits (311), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 84/258 (32%), Positives = 121/258 (46%), Gaps = 46/258 (17%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A E  + +D TI + G+D ++E E  DR  + LP  Q   I +  +     V++++    
Sbjct: 736 AGEIIRGSDLTIAVLGIDRTIEREGQDRSTIELPEDQQIFIEEAYKANPNTVVVLV---A 792

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
           G  +A    + NI A+L A YPGE+GG A+A+ +FG +NPGGRLP+T+YN        L+
Sbjct: 793 GSSLAINWIDQNIPAVLDAWYPGEQGGTAVAEALFGDYNPGGRLPLTFYNS-------LS 845

Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
            +P    D      RTY ++ G  LYPFGYGLSYT F Y                   R 
Sbjct: 846 DLPAFD-DYNVRNNRTYMYFEGKPLYPFGYGLSYTDFAY-------------------RG 885

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
           L+ T D                      K    N G+ DG +V  VY + P +     +K
Sbjct: 886 LDVTQDEENVTV----------------KFFVSNTGNYDGDEVAQVYIQFPDQGTTLPLK 929

Query: 730 QVIGFQRVFVRAGRNKRI 747
           Q+ GF+RV +  G+   I
Sbjct: 930 QLKGFKRVHISKGQETEI 947


>gi|383115356|ref|ZP_09936112.1| hypothetical protein BSGG_2769 [Bacteroides sp. D2]
 gi|313695234|gb|EFS32069.1| hypothetical protein BSGG_2769 [Bacteroides sp. D2]
          Length = 735

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 218/775 (28%), Positives = 352/775 (45%), Gaps = 111/775 (14%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ D+  P   R+ DL+SRMTL+EKV QL  +  G         E   E     S +G  
Sbjct: 29  LYKDAKAPIEKRIDDLISRMTLEEKVLQLNQYTLGRNNNVNNVGE---EVKKVPSEIGSL 85

Query: 113 THFD---------------------------DVIPG-ATSFPTVILTTASFNESLWKKIG 144
            +FD                           D I G  T +P  +    S+N  L ++  
Sbjct: 86  IYFDINPELRNSMQKKAMEESRLGIPIIFGYDAIHGFRTIYPISLGQACSWNPGLVEQAC 145

Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
              + EAR       +G+ + +SP I+VARDPRWGR+ E  GEDP+  G +A   VRG Q
Sbjct: 146 AVSAQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASVRGYQ 199

Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
                    D  S   ++++C KHY  Y         R +    ++ Q + +T+L P+EM
Sbjct: 200 G--------DDMSAENRIAACLKHYIGYGASE---AGRDYVYTEISAQTLWDTYLLPYEM 248

Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
            VK G A+++M S+N ++G+P  A+   +   ++  W   G+IV+D  +++ +   ++ L
Sbjct: 249 GVKAG-AATLMSSFNDISGVPGSANHYTMTAILKERWKHDGFIVSDWGAVEQL--KNQGL 305

Query: 324 ADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
           A +K+DA      AGL++D   + Y       V++GKV    +D+S++ +  V  RLG F
Sbjct: 306 AATKKDAAWYAFNAGLEMDMMSHAYDRHLKELVEEGKVTMAQVDESVRRVLRVKFRLGLF 365

Query: 383 DGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
           +     V+  K      +++ +AA+ A E +VLLKND   LPL +   K +AVVGP A  
Sbjct: 366 ERPYTPVTNEKDRFFRPQSMAVAAQLAAESMVLLKNDNQILPLTNK--KRIAVVGPMAKN 423

Query: 443 TVAMIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA-ASEAAK 495
              ++G++ G      +   Y    A F G A + Y  GC      ++ S FA A +  +
Sbjct: 424 GWDLLGSWCGHGKDTDVEMLYDGLTAEFGGEAELRYAMGCKPQG--NDRSGFAGALDVVR 481

Query: 496 TADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAF 555
            +D  I+  G  L+   E+  R  + LP  Q +L+ ++ E  K P+ILV+  + G  +  
Sbjct: 482 WSDVVIVCLGEMLTWSGENASRSTIALPQIQEELVKELKEAGK-PIILVL--SNGRPLEL 538

Query: 556 AETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP 615
                   AIL    PG  G R++A ++ G+ NP G+L IT          P ++  +  
Sbjct: 539 NRMEPLCDAILEIWQPGINGARSMAGILSGRINPSGKLAIT---------FPYSTGQIPI 589

Query: 616 VDSLGYPGRTYK-FYNGPT---LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN 671
             +    GR ++ FY   T    Y FGYGLSYT+F+Y +++ + T      KL       
Sbjct: 590 YYNRRKSGRWHQGFYKDITSDPFYSFGYGLSYTEFQYGVVTPSSTTVKRGEKLS------ 643

Query: 672 YTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQV 731
                                     +V   N G  DG++ V  +   P       +K++
Sbjct: 644 -------------------------VEVTVTNAGKRDGAETVHWFISDPYCSITRPVKEL 678

Query: 732 IGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPI 786
             F++ F++ G  +  +F  +  + L  VD      L AGE+ I+V +  V   +
Sbjct: 679 KHFEKQFIKVGETRTFRFDVDLERDLGFVDGNGKRFLEAGEYNIWVQDQKVKIEL 733


>gi|333380553|ref|ZP_08472244.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332826548|gb|EGJ99377.1| hypothetical protein HMPREF9455_00410 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 957

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 228/754 (30%), Positives = 358/754 (47%), Gaps = 107/754 (14%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + + +LP   RV+DL+S MT+++K++ L  G    G+P LG+P      EA+HG S    
Sbjct: 170 YMNPNLPLESRVEDLLSVMTVEDKMELLREGWGIPGIPHLGVPAIHK-VEAIHGFSYGS- 227

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
                    GAT FP  I   A++N+ L +    A+  E      +    +  WSP ++V
Sbjct: 228 ---------GATIFPQSIGMGATWNKRLIEAAAMAIGDET-----VSANAVQAWSPVLDV 273

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
           A+D RWGR  ET GEDP +V      +++G Q       +  L + P       KH+AA+
Sbjct: 274 AQDARWGRCEETYGEDPVLVTEIGGAWIKGYQ-------SKGLMTTP-------KHFAAH 319

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
                 G D +  D  ++E++M E  L PF    K+    S+M SY+   G+P     +L
Sbjct: 320 GAP-LGGRDSH--DIGLSEREMREIHLVPFRDIYKKYKYQSIMMSYSDFLGVPVAKSKEL 376

Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN-F 350
           L   +R EW   G+IV+DC +I  +     + A  K +A  Q L AG+  +CG  Y +  
Sbjct: 377 LKGILRDEWGFDGFIVSDCGAIGNLTARKHYTAVDKVEAARQALAAGIATNCGDTYNDPD 436

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-QYVSLGK--QDICSDENIELAAE 407
              A ++G++   D+D + K L   L R G F+ +P + +   K      S E+  LA +
Sbjct: 437 VIAAAKRGELNMDDLDFTCKTLLRTLFRNGLFENNPCKPLDWNKIYPGWNSPEHQALARK 496

Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP--CRYMSPIAGFS 465
            A+E IVLL+N  N LPL S  +KT+AV+GP A+      G+Y   P   +  S + G  
Sbjct: 497 TAQESIVLLENKGNILPL-SKSLKTIAVIGPGADNL--QPGDYTSKPQPGQLKSVLTGIK 553

Query: 466 GYAN----VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA--------- 512
              N    V Y+ GC  +  +  + I  A +AA+ AD  +++ G   + EA         
Sbjct: 554 AAVNSSTKVLYEEGCRFIGTEGTD-IAKAVKAAENADVAVLVLGDCSTSEALKGITNTSG 612

Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
           E+ D   L LPG Q +L+  V +  K PV+L++ +    ++++A  N     + W   PG
Sbjct: 613 ENHDLATLILPGEQQKLLEAVCKTGK-PVVLILQAGRPYNLSYAAENCQAVLVNW--LPG 669

Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
           +EGG A ADV+FG +NP GRLP+T+         P  +  L    +    GR Y + + P
Sbjct: 670 QEGGYATADVLFGDYNPAGRLPMTF---------PRDAAQLPLYYNFKTSGRVYDYVDMP 720

Query: 633 --TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
              LY FGYGLSYT F Y+ L+      ++L K     N N + +A+ T           
Sbjct: 721 YYPLYQFGYGLSYTSFNYSDLN------ISLEK-----NGNVSVNATVT----------- 758

Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
                       N G   G +VV +Y         T + ++  F RV++  G +K++ FV
Sbjct: 759 ------------NTGKVAGDEVVQLYITDMYASVKTRVMELKDFDRVYLNPGESKKVSFV 806

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSF 784
               + L++++   + ++  G   I VG    S+
Sbjct: 807 LTPYQ-LSLLNDEMDRVVEKGLFKIMVGGKSPSY 839


>gi|300773468|ref|ZP_07083337.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
           33861]
 gi|300759639|gb|EFK56466.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
           33861]
          Length = 777

 Score =  270 bits (691), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 211/724 (29%), Positives = 330/724 (45%), Gaps = 117/724 (16%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    EA HG   +G            T FPT I   +++N +L +K+   V+ 
Sbjct: 126 RLGIPVF-LAEEAPHGHMAIG-----------TTVFPTGIGQASTWNPALLQKMSATVAK 173

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R      +     + P ++++RDPRW R+ E+ GEDP + G  A   VRGL    G  
Sbjct: 174 EVRQ-----QGAHISYGPVLDLSRDPRWSRVEESYGEDPVLTGTLAAAIVRGL----GSG 224

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
           N +D    P       KH+ AY +            A V E+++ E FL PF+  V  G 
Sbjct: 225 NLSD----PFATIPTLKHFVAYGIPEG---GHNGSAASVGERELREYFLPPFQSAVAAG- 276

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           A SVM +YN V+GIP  ++  LL   +R EW  +G+ V+D  SI+ +  +H+   D K+ 
Sbjct: 277 AKSVMAAYNSVDGIPCSSNKFLLTDILRKEWSFNGFTVSDLGSIEGIKGSHRVAKDHKQA 336

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV 389
           A+   ++AGLD D G         AV+QG+V+E  ID+++  +  +   +G F+     V
Sbjct: 337 AIL-AIEAGLDADLGGNAYVRLIEAVKQGEVQENSIDQAVSRILALKFEMGLFEKPFVDV 395

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
              K+++ ++ NI L+ + ARE IVLL+N  N LPL   K   +A+VGP+A+    M+G+
Sbjct: 396 KTAKKEVKTESNIALSRQVARESIVLLENKNNILPLR--KDVKIAIVGPNADNVYNMLGD 453

Query: 450 YA-----GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
           Y      G        I+     A V+Y  GC  +   +N+ I AA  AA+ +D  + + 
Sbjct: 454 YTAPQPDGAVTTVRQAISARLPKAQVSYVKGC-AIRDTTNSDIPAAVTAARQSDIIVAVV 512

Query: 505 G----LDLSVE-------------------AESLDREDLWLPGYQTQLINQVAEVAKGPV 541
           G     D   E                    E  DR  L L G Q +L+  + +  K P+
Sbjct: 513 GGSSARDFKTEYISTGAAVASDKSVSDMESGEGFDRSTLDLLGRQMELLKALKQTGK-PL 571

Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
           +++ +    +++ +A T  +  A+L A YPG+EGG AIADV+FG +NP G++P++     
Sbjct: 572 VVIYIQGRPLNMNWAATQAD--ALLCAWYPGQEGGHAIADVLFGDYNPAGKMPLSVPRS- 628

Query: 602 YVQMLPL-----TSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT 656
            V  +P+     +S+  R V+    P           LY FGYG SY+ F+Y  L   K 
Sbjct: 629 -VGQIPVHYNRKSSLDHRYVEEAATP-----------LYAFGYGKSYSDFEYKDLKIQK- 675

Query: 657 IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY 716
                       N +Y                              N G  DG +V  +Y
Sbjct: 676 -----------ENTDY-----------------------HVSFTLTNTGKYDGDEVPQLY 701

Query: 717 SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
            +      +  ++Q+  F+R+ ++ G +K + FV  A     I       L P     I 
Sbjct: 702 IRNQYASVSQPVQQLKHFERIHLKTGESKTVSFVLTAGDFSVINTQMKKVLEPGSSFKIR 761

Query: 777 VGNG 780
           VG+ 
Sbjct: 762 VGSA 765


>gi|189464325|ref|ZP_03013110.1| hypothetical protein BACINT_00666 [Bacteroides intestinalis DSM
           17393]
 gi|189438115|gb|EDV07100.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 935

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 226/760 (29%), Positives = 357/760 (46%), Gaps = 111/760 (14%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHG 105
           + +S  + D +LP   RV+ L+S MT ++K++ +  G    G+P L +P      EA+HG
Sbjct: 145 EKTSLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHG 203

Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
            S             GAT FP  +   A++N+ L +++  AV  E      L    +  W
Sbjct: 204 FSYGS----------GATIFPQALAMGATWNKKLTEEVAMAVGDE-----TLSAGTMQAW 248

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SP ++VA+D RWGR  ET GEDP +V +    +++G Q               + + +  
Sbjct: 249 SPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQS--------------MGLYTTP 294

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+  +      G D +  D  ++E++M E  L PF   ++  D  S+M +Y+   G+P 
Sbjct: 295 KHFGGHGAP-LGGRDSH--DIGLSEREMREVHLVPFRHVIRNYDCQSLMMAYSDFLGVPV 351

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
               +LL+  +R EW   G+IV+DC +I  +     + A +K +A  Q L AG+  +CG 
Sbjct: 352 AKSRELLHNILREEWGFSGFIVSDCGAIGNLTARKHYTAKNKIEAANQALAAGIATNCGD 411

Query: 346 YYTNF-TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDE 400
            Y +     A + G++   ++D+  + +  ++ R   F+ +P    L    I     SD 
Sbjct: 412 TYNDKEVIQAAKDGRINMENLDEVCRTMLRMMFRNELFEKAPNK-PLDWNKIYPGWNSDS 470

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI--PCRYM 458
           + E+A +AARE IVLL+N  N LPL S  ++T+AV+GP AN      G+Y     P +  
Sbjct: 471 HKEMARQAARESIVLLENKDNILPL-SKDMRTIAVLGPGANDLQP--GDYTPKLQPGQLK 527

Query: 459 SPIAGFS----GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-- 512
           S + G          V Y+ GCD  +   NN I  A + A  +D  +++ G   + EA  
Sbjct: 528 SVLTGIKQAVGKQTKVIYEQGCDFTSLGENN-IAKAVKVASQSDVVLLVLGDCSTSEATT 586

Query: 513 -------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
                  E+ D   L LPG Q +L+  V    K PVIL++ +  G     ++ +   KAI
Sbjct: 587 DVYKTSGENHDYATLILPGKQQELLEAVCATGK-PVILILQA--GRPYNLSKASELCKAI 643

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           L    PG+EGG A ADV+FG +NP GRLP+T+    +V  LPL         +    GR 
Sbjct: 644 LVNWLPGQEGGPATADVLFGDYNPAGRLPMTFPR--HVGQLPLYY-------NFKTSGRR 694

Query: 626 YKFYNGP--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
           Y++ +     LY FGYGLSYT F+Y+ L           K+Q   N N T  A+      
Sbjct: 695 YEYSDMEYYPLYYFGYGLSYTSFEYSGL-----------KIQEKENGNITVQAT------ 737

Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
                             +N+G   G +VV +Y         T I ++  F R+ ++ G 
Sbjct: 738 -----------------VKNIGQRAGDEVVQLYVTDMYASVKTRITELKDFTRIHLKPGE 780

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
            K + F     + L++++   + ++  G   I V  GGVS
Sbjct: 781 AKTVSFELTPYE-LSLLNDHMDRVVEKGAFKILV--GGVS 817


>gi|427387416|ref|ZP_18883472.1| hypothetical protein HMPREF9447_04505 [Bacteroides oleiciplenus YIT
           12058]
 gi|425725577|gb|EKU88448.1| hypothetical protein HMPREF9447_04505 [Bacteroides oleiciplenus YIT
           12058]
          Length = 733

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 220/792 (27%), Positives = 374/792 (47%), Gaps = 126/792 (15%)

Query: 45  LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFA------------------- 85
           L ++    ++ D+  P   RVKDL+ RMTL EKV QL  +                    
Sbjct: 16  LSVRSQKPVYKDAGQPVETRVKDLLKRMTLHEKVLQLNQYTFGENDNPNNIGTEVKNLPA 75

Query: 86  --------HGVPRL-GLPQYEWWSEALHGVSNVGPGTHFDDVIPG-ATSFPTVILTTASF 135
                   H  P+L    Q +   E+  G+    P     DVI G  T +P  +    SF
Sbjct: 76  EIGSLIYLHTDPKLRNQIQRKAMEESRLGI----PILFGFDVIHGLRTVYPISLAQACSF 131

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRY 194
           N  L   + QA    A+       +G+ + +SP I+VARDPRWGRI+E  GEDP+     
Sbjct: 132 NPDL---VTQACGMAAKESV---LSGIDWTFSPMIDVARDPRWGRISECYGEDPY----- 180

Query: 195 AVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
            +N V G+  V+G++   +  S P  +++C KHY  Y      G D  + D  ++ Q + 
Sbjct: 181 -LNTVFGVASVQGYQG--EKLSDPYSIAACLKHYVGYGASE-GGRDYRYTD--ISPQALW 234

Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
           ET+L P+E CVK G A+++M S+N ++G+P+ ++  +L + ++ +W   G++V+D ++I+
Sbjct: 235 ETYLPPYEACVKAG-AATLMSSFNDISGVPATSNHYILTEILKNKWRHDGFVVSDWNAIE 293

Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
            ++  ++ +A  +++A  +   AG+++D     Y  +    V + K++ + ID ++  + 
Sbjct: 294 QLI--YQGVAKDRKEAAYKAFHAGVEMDMRDNIYYEYLEQLVAEKKIQMSQIDDAVARIL 351

Query: 374 TVLMRLGFFDGSPQYVSLGKQD-ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKT 432
            V  RLG FD  P    L +Q+     E+I LAA  A E +VLLKN+ N LPL+S  VK 
Sbjct: 352 RVKFRLGLFD-EPYTKELTEQERYLQKEDIALAARLAEESMVLLKNENNLLPLSST-VKR 409

Query: 433 VAVVGPHANATVAMIGNYA------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNS 486
           VA++GP A  +  ++G +A       +   Y      F     + Y+ GC   A   N+ 
Sbjct: 410 VALIGPMAKDSANLLGAWAFKGHAEDVETIYEGMQKEFGDKVQLDYEQGC---ALDGNDE 466

Query: 487 --IFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
               AA + A+ +D  ++  G       E+  R  + LP  Q +L+  + +  K P++LV
Sbjct: 467 SGFSAALKTAEASDVVVVCLGESKQWSGENASRSTIALPDIQEKLLLHLKQANK-PIVLV 525

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
           + S  G  +        ++AI+    PG  GG  +A ++ G+ NP G+L +T+       
Sbjct: 526 LSS--GRPLELIRLEPQVEAIIEMWQPGVAGGTPLAGILSGRVNPSGKLSVTF------- 576

Query: 605 MLPLTS--MPL--------RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
             PL++  +P+        RP D++G     Y+      LYPFG+GLSYT F Y+     
Sbjct: 577 --PLSTGQIPVYYNMRQSARPFDAMG----DYQDIPTKPLYPFGHGLSYTTFVYS----- 625

Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
                 L+ L+  +N   T++ + T                       N G  +G + V+
Sbjct: 626 ---DAKLSSLKIRKNQKITAEVTVT-----------------------NAGKMEGKETVL 659

Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
            Y   P    +  +K++  F++  + AG ++  +F  +  + L+  D      L AGE  
Sbjct: 660 WYVSDPFCSISRPMKELKFFEKHSLNAGESRVFRFEIDPMRDLSYTDATGKRFLEAGEFI 719

Query: 775 IFVGNGGVSFPI 786
           + VG   ++F +
Sbjct: 720 VSVGGRKLTFEV 731


>gi|386819249|ref|ZP_10106465.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
           19592]
 gi|386424355|gb|EIJ38185.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
           19592]
          Length = 878

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 165/446 (36%), Positives = 246/446 (55%), Gaps = 46/446 (10%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           Q   + F ++ LP   RV DL++R+T+DEK+ QL   +  + RLG+P Y WW+E+LHGV+
Sbjct: 20  QSEKYPFQNTELPEDERVNDLINRLTVDEKIAQLLYQSPAIERLGIPAYNWWNESLHGVA 79

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGRA----- 160
             G           AT FP  I   AS+++ L  ++   +S EARA ++  L R      
Sbjct: 80  RAG----------YATVFPQSITIAASWDDELVAEVANVISDEARAKHHEYLRRGQHDIY 129

Query: 161 -GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT+WSPNIN+ RDPRWGR  ET GEDP++ G     YV+GLQ           N++ L
Sbjct: 130 QGLTFWSPNINIFRDPRWGRGHETYGEDPYLTGVLGTEYVKGLQGN---------NAKYL 180

Query: 220 KVSSCCKHYAAYDVDNWKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
           KV +  KH+A +      G +  R+ FD   +++D+ ET+L  F   VK+G+  S+M +Y
Sbjct: 181 KVVATAKHFAVHS-----GPEPLRHEFDVAPSQRDLWETYLPAFRTLVKDGNVYSIMTAY 235

Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
           NR+ G  + A   L +  +R +W  +GY+V+DC +I  M   H    D+ E A A  +K 
Sbjct: 236 NRIYGEAASASNSLYS-ILRDKWGFNGYVVSDCGAIADMWKTHHVAKDAAE-ASAMAVKE 293

Query: 338 GLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC 397
           G DL+CG  Y   T +A+Q G + E D+D +L  L     +LG FD S + V   K    
Sbjct: 294 GCDLNCGNSYEKLT-DALQDGLITEADLDVALHRLMRARFKLGMFD-SDEKVPYAKIPFS 351

Query: 398 SDENIE---LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
            + N +   LA +AA++ IVLLKN+   LPL S  +K +AV+GP+A+   ++ GNY G+P
Sbjct: 352 VNNNPKHKVLALKAAQKSIVLLKNENAILPL-SKNLKNIAVIGPNADNIQSLWGNYNGMP 410

Query: 455 CRYMSPIAGFS----GYANVTYKTGC 476
              ++ + G         NV ++ G 
Sbjct: 411 KNPVTVLEGIKNKVGAQVNVHFEEGA 436



 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 97/322 (30%), Positives = 156/322 (48%), Gaps = 55/322 (17%)

Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQL 529
           +  + N +  A  AA  +D  ++  GL+  +E E +          DR  L LP  Q +L
Sbjct: 582 SIPTENQLEKAVLAANKSDVVVLALGLNERLEGEEMKVEVEGFADGDRTSLNLPKKQVEL 641

Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
           + +V    K PV+LV+++   + I +A  + NI AI+ AGYPG+EGG AIA+V+FG +NP
Sbjct: 642 MKEVVATGK-PVVLVLLNGSALSINWA--SENIPAIISAGYPGQEGGNAIANVLFGDYNP 698

Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
            GRLP+T+Y    V  LP       P +     GRTYK++    LYPFGYGLSYT+FKY+
Sbjct: 699 AGRLPVTYYKS--VDDLP-------PFEDYNMDGRTYKYFKKEPLYPFGYGLSYTKFKYS 749

Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
            L     I++N                                +  +  V   N G  DG
Sbjct: 750 NLEIPLEIKIN--------------------------------EPIKVSVQVANEGDFDG 777

Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
            +VV +Y +         I +++GF+R+ ++ G  ++++F     + L +++     ++ 
Sbjct: 778 DEVVQLYVRDEEGSTPRPICELVGFKRIHLKKGARQKVEFTIQP-RELAMINKDDKFVIE 836

Query: 770 AGEHTIFVGNGGVSFPIHLNFN 791
            G  +I VG    +F  + + N
Sbjct: 837 PGWFSISVGGSQPNFTENKHIN 858


>gi|293372493|ref|ZP_06618877.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|299144770|ref|ZP_07037838.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|292632676|gb|EFF51270.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|298515261|gb|EFI39142.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 735

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 214/765 (27%), Positives = 362/765 (47%), Gaps = 109/765 (14%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG--------------VP-RLGLPQYE 97
           L+ D   P   RV DL+SRMTL+EKV QL  +  G              VP  +G   Y 
Sbjct: 29  LYKDPKAPIEKRVNDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVKKVPAEIGSLIYF 88

Query: 98  WWSEALHGV--------SNVGPGTHFD-DVIPG-ATSFPTVILTTASFNESLWKKIGQAV 147
             + AL           S +G    F  D I G  T +P  +    S+N  L ++     
Sbjct: 89  ETNPALRNSMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQACAVS 148

Query: 148 STEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
           + EAR       +G+ + +SP I+VARDPRWGR+ E  GEDP+  G +    V+G Q   
Sbjct: 149 AQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFGAASVKGYQ--- 199

Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
                 DL++   ++++C KHY  Y         R +    +++Q + +T+L P+EM VK
Sbjct: 200 ----GDDLSAEN-RMAACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYEMGVK 251

Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
            G A+++M S+N ++G+P  A+P ++ + ++  W   G+IV+D  +I+ +   ++ LA +
Sbjct: 252 AG-AATLMSSFNDISGVPGSANPYIMTEILKKRWGHDGFIVSDWGAIEQL--KNQGLAAT 308

Query: 327 KEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
           K++A      AGL++D   + Y       V++G+V    +D++++ +  +  RLG F+  
Sbjct: 309 KKEAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLFERP 368

Query: 386 PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
               +  K+     +++++AA  A E +VLLKN+  TLPL     K +AV+GP A     
Sbjct: 369 YTPATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLTDK--KKIAVIGPMAKNGWD 426

Query: 446 MIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNS--IFAASEAAKTA 497
           ++G++ G      +   Y      F+G A + Y  GC   A K +N      A EAA+ +
Sbjct: 427 LLGSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGC---ATKGDNKEGFAEALEAARWS 483

Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
           D  ++  G  ++   E+  R  + LP  Q +L  ++ +  K P++LV+++   +++   E
Sbjct: 484 DVVVLCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPLELNRLE 542

Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL-- 613
             ++  AIL    PG  G   +A ++ G+ NP G+L +T          P ++  +P+  
Sbjct: 543 LISD--AILEIWQPGVNGALPMAGILSGRINPSGKLAMT---------FPYSTGQIPIYY 591

Query: 614 -RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
            R     G+ G  YK      LYPFG+GLSYT+FKY  ++ +                  
Sbjct: 592 NRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGTVTPS------------------ 632

Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
                        V  ++  D    +V   NVG+ DG++ V  +   P       +K++ 
Sbjct: 633 -------------VTKVKRGDRLSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELK 679

Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
            F++  +RAG  K  +F  +  +    V+      L AGE+ I V
Sbjct: 680 HFEKQLIRAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724


>gi|313204584|ref|YP_004043241.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312443900|gb|ADQ80256.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 727

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 219/740 (29%), Positives = 349/740 (47%), Gaps = 107/740 (14%)

Query: 47  LQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGV 106
           +  ++F F ++ LP + R+ +L+S MTLDEKV  L     GVPRLG+ +    SE LHG+
Sbjct: 20  VSQTTFPFQNTGLPDNERLDNLLSLMTLDEKVNALST-NLGVPRLGI-RNTGHSEGLHGM 77

Query: 107 SNVGPGTHFDDVIPGATSFPTVILTTA-----SFNESLWKKIGQAVSTEAR---AMYNLG 158
           +  GPG         A ++PT I   A     +++  L +K+    +TE R      NL 
Sbjct: 78  ALGGPGNWGGSERGVAKTYPTTIFPQAYGLGETWDTELIQKVADIEATEIRFYAQNANLQ 137

Query: 159 RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
           + G+   +PN ++ARDPRWGR  E+ GED F+  R  V +V+GLQ   G++       + 
Sbjct: 138 KGGMVMRAPNADLARDPRWGRTEESYGEDAFLGSRLTVAFVKGLQ---GND------PKY 188

Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
            K +S  KH+ A   ++ +     +FD R+      E +  PF   + EG + + M SYN
Sbjct: 189 WKSASLMKHFLANSNEDGRDSTSSNFDERL----FREYYSFPFYKGITEGGSRAFMASYN 244

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
             NG+P   +P +L +  R EW  +G I  D  ++ ++V+ H       E A A  +KA 
Sbjct: 245 AWNGVPMTVNP-ILKKIARDEWGNNGIICTDGGALSLLVNAHHAFPTLTEGAAA-VVKAS 302

Query: 339 LDLDCGQYYTNFTG---NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ---YVSLG 392
           +    GQ+  NF      A+++G + E +ID  ++  + V ++LG  D       Y  +G
Sbjct: 303 V----GQFLDNFRSYIYEALKKGLLTEKNIDNVIRGNFYVALKLGLLDADQSKVPYTGIG 358

Query: 393 KQDICSDENIE----LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
             D  S  N +       +   + +VLLKN    LPLN +K+K++AV+GP AN    ++ 
Sbjct: 359 VTDTVSPWNKQDTKAFVRKVTAKSVVLLKNTAGLLPLNKSKIKSIAVIGPRANE--VLLD 416

Query: 449 NYAGIPCRYMSPIAGFSGYANVTYKTGCD-DVACKSNNSIFAASEAAKTADATIILAGLD 507
            Y+G P   +S + G      +    G D +V    ++ +  A+ AA+ AD  I+  G  
Sbjct: 417 WYSGTPPYAVSILQG------IKNAVGKDIEVFYAPSDEMDKATLAARKADVAIVCVGNH 470

Query: 508 -------------LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
                         S   E++DR+ + L   Q  L+  V + A    ++V++S      A
Sbjct: 471 PYGTDARWKISPVPSDGREAVDRKSITLE--QEDLVKLVMQ-ANPKTVMVLVS--NFPFA 525

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
              +  N+ AIL      +E G  +ADV+FG  +P GR   TW       + P+    +R
Sbjct: 526 INWSQENVPAILHVTNNSQELGNGLADVIFGDVSPAGRTTQTWVK-SITDLPPMMDYDIR 584

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
                   GRTY+++    LYPFG+GLSYT F+Y+ L                     TS
Sbjct: 585 -------HGRTYQYFKSKPLYPFGFGLSYTSFEYSGLE--------------------TS 617

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
           + + T             D     V  +N+G  DG +V+ +Y   P       +KQ+ GF
Sbjct: 618 NPTLT-------------DSIFVSVKVKNIGKRDGDEVIQLYVSYPDSKVERPMKQLKGF 664

Query: 735 QRVFVRAGRNKRIKFVFNAC 754
           +RVF+ AG++K ++    A 
Sbjct: 665 KRVFIPAGKSKTVEIPLKAS 684


>gi|423222018|ref|ZP_17208488.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392644204|gb|EIY37946.1| hypothetical protein HMPREF1062_00674 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 942

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 230/810 (28%), Positives = 359/810 (44%), Gaps = 146/810 (18%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
           ++ D +     R++DL+S+MTL+EK  Q+    +G  R+    LP  EW    W      
Sbjct: 52  VYEDPNASLDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEWKQMLWKDGIGA 110

Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
             E L+G    G                                  P    ++ I G   
Sbjct: 111 IDEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPVDFTNEGIRGVES 170

Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
             AT+FPT +    ++N  L +++G     EAR +      G T  ++P ++V RD RWG
Sbjct: 171 YRATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWG 224

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  E  GE P++V    +  VRG+Q    H +         +V++  KH+ AY  +    
Sbjct: 225 RYEEVYGESPYLVAELGIEMVRGMQ----HSH---------QVAATGKHFVAYSNNKGAR 271

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
                 D +++ +++E   + PF+  +KE     VM SYN  +G+P       L   +RG
Sbjct: 272 EGMARVDPQMSPREVEMIHVYPFKRVIKEAGLLGVMSSYNDYDGVPIQGSYYWLTTRLRG 331

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
           E    GY+V+D D+++ +   H    D KE AV Q+++AGL++ C       Y       
Sbjct: 332 EMGFRGYVVSDSDAVEYLYTKHSTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLREL 390

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREG 412
           V++G + E  I+  ++ +  V   +G FD +P    L   D  +   EN  LA +A+RE 
Sbjct: 391 VKEGGLSEEVINDRVRDILRVKFLVGLFD-TPYQTDLAGADKEVEKAENESLALQASRES 449

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF----SGYA 468
           +VLLKN+ N LPL+   VK +AV GP+A+     + +Y  +     + + G      G A
Sbjct: 450 LVLLKNENNVLPLDINNVKKIAVCGPNADEEGYALTHYGPLAVEVTTVLEGIRQKAEGKA 509

Query: 469 NVTYKTGCDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEAES 514
            V Y  GCD V      S              I  A E A+ AD  +++ G       E+
Sbjct: 510 EVLYTKGCDLVDANWPESELIDYPMTDSEQAEIDKAVENARQADVAVVVLGGGQRTCGEN 569

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
             R  L LPG Q +L+  V    K PV+LV+++   + I +A  +  +  IL A YPG +
Sbjct: 570 KSRSSLDLPGRQLKLLQAVQATGK-PVVLVLINGRPLSINWA--DKFVPVILEAWYPGSK 626

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGR--TYKFY 629
           GG A+ADV+FG +NPGG+L +T+     V  +P  + P +P   +D    PG        
Sbjct: 627 GGTAVADVLFGDYNPGGKLTVTFPKS--VGQIPF-NFPCKPSSQIDGGKNPGLDGNMSRV 683

Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFT-KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
           NG  LY FGYGLSYT F+Y+ +  + K I  N      C+                    
Sbjct: 684 NG-ALYSFGYGLSYTTFEYSDIEISPKVITPNQKATVRCK-------------------- 722

Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
                         N G   G +VV +Y +       TY K + GF+R+ ++ G  K + 
Sbjct: 723 ------------VTNTGKRAGDEVVQLYVRDILSSVTTYEKNLAGFERIHLQPGETKEVV 770

Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           F  +  K L ++D     ++  G+ +I VG
Sbjct: 771 FTLDR-KQLELLDKHMEWVVEPGDFSIMVG 799


>gi|189464583|ref|ZP_03013368.1| hypothetical protein BACINT_00926 [Bacteroides intestinalis DSM
           17393]
 gi|189436857|gb|EDV05842.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 879

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 172/459 (37%), Positives = 239/459 (52%), Gaps = 47/459 (10%)

Query: 52  FLFCDSSLPY-------SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
           FL C S  PY         R  DLV R+TL+EK   + + +  +PRLG+  Y+WW+EALH
Sbjct: 34  FLSC-SQPPYKNPALSPEERANDLVGRLTLEEKAALMQNTSPAIPRLGIKAYDWWNEALH 92

Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-------L 157
           GV   G           AT FP  I   ASFN  L   +  A+S EARA          L
Sbjct: 93  GVGRAGL----------ATVFPQAIGMGASFNNELLYDVFTAISDEARAKNTEFSKEGGL 142

Query: 158 GR-AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
            R  GLT W+PNIN+ RDPRWGR  ET GEDP++  +  +  VRGLQ  EG +       
Sbjct: 143 KRYQGLTMWTPNINIFRDPRWGRGQETYGEDPYLTSQMGMAVVRGLQGPEGEKYD----- 197

Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMC 275
              K+ +C KHYA +    W   +R+ F+A  +  +D+ ET+L  F+  V++     VMC
Sbjct: 198 ---KLHACAKHYAVHSGPEW---NRHSFNAENIDPRDLWETYLPAFKDLVQKAHVKEVMC 251

Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD-SKEDAVAQT 334
           +YNR  G P C   +LL   +R EW     +V+DC +I    +      D  K+ A A+ 
Sbjct: 252 AYNRFEGEPCCGSNRLLMHILRDEWGYKEIVVSDCWAISDFYNKGAHETDPDKQHASAKA 311

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLG 392
           + +G D++CG  Y +    AV++G + E  ID SLK L      LG  D   Q  +  + 
Sbjct: 312 VLSGTDIECGDSYGSLP-EAVKEGLIDEKQIDISLKRLMKARFELGEMDEPSQVSWAQIP 370

Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
              + S E+ ELA   ARE +VLL+N+Q+ LPLN  K   VAVVGP+AN +V   GNY G
Sbjct: 371 YSVVDSKEHRELALRMARESLVLLQNNQSLLPLN--KNLKVAVVGPNANDSVMQWGNYNG 428

Query: 453 IPCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
            P   ++ + G   Y   + + Y+ GCD  +  +  S+F
Sbjct: 429 FPSHTITLLEGIREYLPESQIIYEPGCDLTSDVTLQSVF 467



 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 85/295 (28%), Positives = 126/295 (42%), Gaps = 56/295 (18%)

Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
           K AD  I   G+  +VE E +          DRE + LP  Q++L+   AE+ K    +V
Sbjct: 614 KEADVIIFAGGISPAVEGEEMHVNIPGFKGGDRETIELPSIQSRLL---AELKKAGKKIV 670

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
            ++  G  IA    +    AIL A YPG+ GG AIA+V+FG +NP GRLP+T+Y      
Sbjct: 671 FVNFSGSAIALTPESKTCDAILQAWYPGQAGGTAIANVLFGDYNPAGRLPVTFYK----- 725

Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
               ++  L   +      RTY++     L+PFG+GLSYT F+Y   S            
Sbjct: 726 ----STSQLPGFEDYSMKERTYRYMTEAPLFPFGHGLSYTTFRYGDASL----------- 770

Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
               N     D  +T    +L             +   NVG  DG +VV VY + P +  
Sbjct: 771 ----NTQEVKDGEQT----ILT------------IPVSNVGEYDGEEVVQVYLRRPGDKE 810

Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVG 778
                 +  F+R  +  G    +    +  +     D   NT+ P  G++ I  G
Sbjct: 811 GPS-HALRAFKRANIAKGATSNVTVSLSK-EDFEWFDTETNTMRPIEGDYEILYG 863


>gi|346226088|ref|ZP_08847230.1| glycoside hydrolase family 3 domain protein [Anaerophaga
           thermohalophila DSM 12881]
          Length = 749

 Score =  270 bits (689), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 223/731 (30%), Positives = 343/731 (46%), Gaps = 106/731 (14%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGL---PQYEWWSEALHGV 106
            S+ F +  L    R+ DL+SRMTLDEKV  L      VPRLG+   P  E +    HGV
Sbjct: 50  ESYPFQNPELDSEARIDDLLSRMTLDEKVSALSTDP-SVPRLGVKGAPHIEGY----HGV 104

Query: 107 SNVGPGT---HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN---LGRA 160
           +  GP       D+ +P  T+FP      A++N  L +  G+  S EAR ++    + + 
Sbjct: 105 AMGGPANWAPKGDEAVP-TTTFPQAYGMGATWNPELIRLAGEIESIEARYIFQNPEIAKG 163

Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
           GL   +PN ++ RDPRWGR  E  GEDPF+VG  A  + +GLQ           + +  +
Sbjct: 164 GLVVRAPNADLGRDPRWGRTEECFGEDPFLVGTSATAFTKGLQGD---------DDQYWR 214

Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
            +S  KH+ A   +N +      FD ++  +    +F R F     EG +++ M +YN +
Sbjct: 215 TASLLKHFLANSNENGRESSSSDFDMQLYHEYYGASFRRAF----IEGGSNAYMAAYNAI 270

Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD 340
           NG+P+     +  +     W + G    D    Q++V  HK+  D    A    +KAGL+
Sbjct: 271 NGVPAHVH-DMHKEITERMWGVDGIKCTDGGGYQLLVYGHKYY-DDLYLAAEGVIKAGLN 328

Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQD- 395
                Y     G A+  G + E DID+ L+ +Y V+++LG  D  PQ    Y ++G+   
Sbjct: 329 QFLDNYREGVYG-ALAHGYITEADIDEVLRGVYRVMIKLGQLD--PQEKVPYSAIGRDGK 385

Query: 396 ---ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
                + ++ + A   ARE IVLLKN+  TLPLN+ K+  VAV+G  A+    ++  Y+G
Sbjct: 386 PAPWTTQKHKDAALRMARESIVLLKNNNKTLPLNADKLNKVAVIGYLAD--TVLLDWYSG 443

Query: 453 IPCRYMSPIAGFSGYANVTYKTGCDD-VACKSNNSIFAASEAAKTADATIILAG------ 505
           +P   ++P+ G      +  K G D  V    +N   AA EAA  AD  I++ G      
Sbjct: 444 LPPYRITPLEG------IREKLGNDSKVLYAPDNDYNAAVEAASEADVAIVILGNYPTCN 497

Query: 506 -------LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAET 558
                   D  +  E++DR+ L L      L+  V E A    I V+ S+    I +++ 
Sbjct: 498 SEIWADCPDPGMGREAIDRKTLRLT--DEYLVKLVME-ANPNTIFVLQSSFPYAINWSQ- 553

Query: 559 NTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDS 618
             N+ AIL   + G+E G A+ADV+FG +NPGG+L  TW   +  Q+  +    +R    
Sbjct: 554 -QNVPAILHLTHNGQETGSALADVLFGDYNPGGKLTQTWPKSE-DQLPDMMEYDIR---- 607

Query: 619 LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
               G TY ++    LYPFG+GLSYT F +  +S  K +               ++D   
Sbjct: 608 ---KGHTYMYFEDKPLYPFGHGLSYTTFAWEDISINKPV--------------VSAD--- 647

Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
                        D+     V  +N G   G +VV +Y+  P        K + GF+RV 
Sbjct: 648 -------------DEEVIITVKLKNTGDVKGDEVVQLYASFPESTVRRPAKALKGFKRVT 694

Query: 739 VRAGRNKRIKF 749
           +  G  K+I+ 
Sbjct: 695 LEPGEKKKIEI 705


>gi|29347190|ref|NP_810693.1| beta-glucosidase [Bacteroides thetaiotaomicron VPI-5482]
 gi|29339089|gb|AAO76887.1| periplasmic beta-glucosidase precursor [Bacteroides
           thetaiotaomicron VPI-5482]
          Length = 950

 Score =  269 bits (688), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 228/749 (30%), Positives = 354/749 (47%), Gaps = 109/749 (14%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + D+SLP   RV+ L++ MT ++K++ +  G    G+P L +P      EA+HG S    
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 223

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
                    GAT FP  +   A++N  L +++   +  E  A  N  +A    WSP ++V
Sbjct: 224 ---------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
           A+D RWGR  ET GEDP +V +    +++G Q            SR L  +   KH+  +
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SRGLFTTP--KHFGGH 315

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
                 G D +  D  ++E++M E  L PF   ++  D  S+M +Y+   G+P     +L
Sbjct: 316 GAP-LGGRDSH--DIGLSEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKEL 372

Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF- 350
           L Q +R EW  +G+IV+DC +I  +     + A  K +A  Q L AG+  +CG  Y N  
Sbjct: 373 LQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKE 432

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDENIELAA 406
              A + G++   D+D   + +   + R   F+ +P    L  + I     SD + E+A 
Sbjct: 433 VIQAAKDGRINMEDLDNVCRTMLGTMFRNELFEKNP-CKPLDWKKIYPGWNSDSHKEMAR 491

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYMSPIAGF 464
           +AARE IV+L+N  N LPL S  ++T+AV+GP A+      G+Y    +P +  S + G 
Sbjct: 492 QAARESIVMLENKDNLLPL-SKTLRTIAVLGPGADDLQP--GDYTPKLLPGQLKSVLTGI 548

Query: 465 SG----YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-------- 512
            G       V Y+ GCD       N I  A +AA  +D  I++ G   + EA        
Sbjct: 549 KGAVGKQTKVLYEQGCDFTNPDETN-IPKAVKAASQSDVVIMVLGDCSTSEATNDVRKTC 607

Query: 513 -ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
            E+ D   L LPG Q +L+  V    K PVIL++ +    DI  A  +   KAIL    P
Sbjct: 608 GENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDILKA--SEMCKAILVNWLP 664

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+EGG A+ADV+FG +NP GRLP+T+    +V  LPL         +    GR Y++ + 
Sbjct: 665 GQEGGPAMADVLFGDYNPAGRLPMTFPR--HVGQLPLYY-------NFKTSGRRYEYVDM 715

Query: 632 P--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
               LY FG+GLSYT F+Y+ L           K+Q   N N                  
Sbjct: 716 EYYPLYRFGFGLSYTSFEYSNL-----------KIQEKANGN------------------ 746

Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
                 E +   +NVGS  G +V  +Y         T + ++  F R+ ++ G +K + F
Sbjct: 747 -----VEVQATVKNVGSRAGDEVAQLYVTDMYASVKTRVMELKDFARIHLQPGESKTVSF 801

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                  +++++   + ++  GE  I VG
Sbjct: 802 EMTPY-DISLLNDRMDRVVEKGEFKIMVG 829


>gi|423300729|ref|ZP_17278753.1| hypothetical protein HMPREF1057_01894 [Bacteroides finegoldii
           CL09T03C10]
 gi|408472616|gb|EKJ91142.1| hypothetical protein HMPREF1057_01894 [Bacteroides finegoldii
           CL09T03C10]
          Length = 735

 Score =  269 bits (688), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 212/767 (27%), Positives = 362/767 (47%), Gaps = 113/767 (14%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ D+  P   RV DL+SRMTL+EKV QL  +  G         E   E     + +G  
Sbjct: 29  LYKDAKAPIEKRVDDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGE---EVKKVPAEIGSL 85

Query: 113 THFD---------------------------DVIPG-ATSFPTVILTTASFNESLWKKIG 144
            +F+                           D I G  T +P  +    S+N  L ++  
Sbjct: 86  IYFETNPELRNNMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQAC 145

Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
              + EAR       +G+ + +SP I+VARDPRWGR+ E  GEDP+  G +    VRG Q
Sbjct: 146 AVSAQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYANGVFGAASVRGYQ 199

Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
                +N +  N    +V++C KHY  Y         R +    +++Q + +T+L P++M
Sbjct: 200 G----DNMSAEN----RVAACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYKM 248

Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
            VK G A+++M S+N ++G+P  A+P  + + ++  W   G+IV+D  +I+ +   ++ L
Sbjct: 249 GVKAG-AATLMSSFNDISGVPGSANPYTMTEILKNRWRHDGFIVSDWGAIEQL--KNQGL 305

Query: 324 ADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
           A +K++A      AGL++D   + Y       V++GKV    +D++++ +  +  RLG F
Sbjct: 306 AATKKEAARHAFTAGLEMDMMSHAYDRHLQELVEEGKVSMAQVDEAVRRVLLLKFRLGLF 365

Query: 383 DGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
           +     V+  K+     +++++AA  A E +VLLKN+ N LPL  A  K +AV+GP A  
Sbjct: 366 ERPYTPVTTEKERFLRPQSMDIAARLAAESMVLLKNENNVLPL--ADKKKIAVIGPMAKN 423

Query: 443 TVAMIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA-ASEAAK 495
              ++G++ G      +   Y    A F+G A + Y  GC+      N   FA A  AA+
Sbjct: 424 GWDLLGSWRGHGKDTDVVMLYDGLAAEFAGKAELRYALGCNTKG--DNREGFAEALGAAR 481

Query: 496 TADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAF 555
            +D  ++  G  ++   E+  R  + LP  Q +L  ++ +V K PV+L++++   +++  
Sbjct: 482 WSDVVVLCLGEMMTWSGENASRSSIALPQMQEELAKELKKVGK-PVVLILVNGRPLELNR 540

Query: 556 AETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL 613
            E  ++  AIL    PG  G   +A ++ G+ NP G+L +T+         P ++  +P+
Sbjct: 541 LEPVSD--AILEIWQPGVNGALPMAGILSGRINPSGKLAMTF---------PYSTGQIPI 589

Query: 614 ---RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
              R     G+ G  YK      LYPFG+GLSYT+FKY       T+  +  K++    L
Sbjct: 590 YYNRRKSGRGHQG-FYKDMTSDPLYPFGHGLSYTEFKYG------TVTPSATKVKRGEKL 642

Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
           +                          +V   N+G+ DG++ V  +   P       +K+
Sbjct: 643 SA-------------------------EVTVTNIGARDGAETVHWFISDPYCSITRPVKE 677

Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
           +  F++  ++AG  K  +F  +  +    V+      L  GE+ I V
Sbjct: 678 LKHFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLETGEYNIHV 724


>gi|333381842|ref|ZP_08473521.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
           BAA-286]
 gi|332829771|gb|EGK02417.1| hypothetical protein HMPREF9455_01687 [Dysgonomonas gadei ATCC
           BAA-286]
          Length = 861

 Score =  269 bits (688), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 162/441 (36%), Positives = 246/441 (55%), Gaps = 39/441 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D++L    R +DL+SR+TL EKV  +GD +  V RLG+ ++ WWSEALHGV+N G   
Sbjct: 23  YKDANLTPEERAQDLLSRLTLKEKVGLMGDNSIEVTRLGVKKFAWWSEALHGVANQG--- 79

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA---------GLTY 164
                  G T FP  I   ASFN+ L   +  A+S EARA ++             GL+ 
Sbjct: 80  -------GVTVFPEPIGMAASFNDELLYHVFDAISDEARARFHFREKKGDERRQDNGLSV 132

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           W+PN+N+ RDPRWGR  ET GEDP++  R  ++ V GLQ  +        +++  K+ +C
Sbjct: 133 WTPNVNIFRDPRWGRGQETYGEDPYLTSRMGISVVNGLQGPK--------DAKYKKLLAC 184

Query: 225 CKHYAAYDVDNWKGVDRYHFDA-RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
            KHYA +    W   +R+  +   +  + + ET++  F++ V++ D S VMC+Y+R +  
Sbjct: 185 AKHYAVHSGPEW---NRHVLNLNNLDNRHLWETYMPAFQVLVQKADVSQVMCAYHRQDDD 241

Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           P C +  LL + +R EW     +V+DC +I     +HK  +D+   AV   L AG D++C
Sbjct: 242 PCCGNNHLLKRILRDEWGFKRMVVSDCGAIADFYTSHKVSSDALHSAVKGVL-AGTDVEC 300

Query: 344 GQYYT-NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDE 400
           G  YT +   +AV +G + E DIDKS+  L T   RLG FD +    + ++    I   +
Sbjct: 301 GFGYTYHELVDAVSRGLIYEADIDKSVLRLLTERFRLGDFDDNSIVPWANIPDTIINCKK 360

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
           +  LA E AR+ + LL+N  N LPL+S   K +AV+GP+A+    M GNY GIP + ++ 
Sbjct: 361 HQALALEMARQSMTLLQNKNNILPLSSK--KKIAVIGPNADDAKLMWGNYNGIPVKTVTI 418

Query: 461 IAGFSGYA--NVTYKTGCDDV 479
           + G    A  ++ Y+ GCD V
Sbjct: 419 LEGIKSIAGKDIFYEKGCDIV 439



 Score = 93.6 bits (231), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 55/162 (33%), Positives = 82/162 (50%), Gaps = 22/162 (13%)

Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
           K  D  +   G+   +E E +          DR D+ LP  Q   I  + +  K    ++
Sbjct: 597 KDIDVVVFAGGISGELEGEEMPIEMPGFKGGDRTDIELPASQRNCIKALKKAGKR---VI 653

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
           +++  G  I     + + +AIL A Y G+ GG+AIA+V+FGK+NP G+LPIT+Y    + 
Sbjct: 654 MVNCSGSAIGLMPESESCEAILQAWYGGQSGGQAIAEVLFGKYNPSGKLPITFYKN--ID 711

Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
            LP         +     GRTY++     L+PFGYGLSYT F
Sbjct: 712 QLP-------DFEEYDMKGRTYRYLEDKPLFPFGYGLSYTTF 746


>gi|237721771|ref|ZP_04552252.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
 gi|229448640|gb|EEO54431.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
          Length = 735

 Score =  269 bits (688), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 213/765 (27%), Positives = 362/765 (47%), Gaps = 109/765 (14%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG--------------VP-RLGLPQYE 97
           L+ D   P   RV DL+SRMTL+EK+ QL  +  G              VP  +G   Y 
Sbjct: 29  LYKDPKAPIEKRVNDLLSRMTLEEKMMQLNQYTLGRNNNVNNVGEEVKKVPAEIGSLIYF 88

Query: 98  WWSEALHGV--------SNVGPGTHFD-DVIPG-ATSFPTVILTTASFNESLWKKIGQAV 147
             + AL           S +G    F  D I G  T +P  +    S+N  L ++     
Sbjct: 89  ETNPALRNSMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQACAVS 148

Query: 148 STEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
           + EAR       +G+ + +SP I+VARDPRWGR+ E  GEDP+  G +    V+G Q   
Sbjct: 149 AQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFGAASVKGYQ--- 199

Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
                 DL++   ++++C KHY  Y         R +    +++Q + +T+L P+EM VK
Sbjct: 200 ----GDDLSAEN-RMAACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYEMGVK 251

Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
            G A+++M S+N ++G+P  A+P ++ + ++  W   G+IV+D  +I+ +   ++ LA +
Sbjct: 252 AG-AATLMSSFNDISGVPGSANPYIMTEILKKRWGHDGFIVSDWGAIEQL--KNQGLAAT 308

Query: 327 KEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
           K++A      AGL++D   + Y       V++G+V    +D++++ +  +  RLG F+  
Sbjct: 309 KKEAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLFERP 368

Query: 386 PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
               +  K+     +++++AA  A E +VLLKN+  TLPL     K +AV+GP A     
Sbjct: 369 YTPATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLTDK--KKIAVIGPMAKNGWD 426

Query: 446 MIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNS--IFAASEAAKTA 497
           ++G++ G      +   Y      F+G A + Y  GC   A K +N      A EAA+ +
Sbjct: 427 LLGSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGC---ATKGDNKEGFAEALEAARWS 483

Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
           D  ++  G  ++   E+  R  + LP  Q +L  ++ +  K P++LV+++   +++   E
Sbjct: 484 DVVVLCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPLELNRLE 542

Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL-- 613
             ++  AIL    PG  G   +A ++ G+ NP G+L +T          P ++  +P+  
Sbjct: 543 LISD--AILEIWQPGVNGALPMAGILSGRINPSGKLAMT---------FPYSTGQIPIYY 591

Query: 614 -RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
            R     G+ G  YK      LYPFG+GLSYT+FKY  ++ +                  
Sbjct: 592 NRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGTVTPS------------------ 632

Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
                        V  ++  D    +V   NVG+ DG++ V  +   P       +K++ 
Sbjct: 633 -------------VTKVKRGDRLSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELK 679

Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
            F++  +RAG  K  +F  +  +    V+      L AGE+ I V
Sbjct: 680 HFEKQLIRAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724


>gi|363583088|ref|ZP_09315898.1| b-glucosidase [Flavobacteriaceae bacterium HQM9]
          Length = 779

 Score =  269 bits (688), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 223/774 (28%), Positives = 357/774 (46%), Gaps = 118/774 (15%)

Query: 63  IRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ---YEWWSEALHGVSNVGPGTHFD--- 116
           ++++ L+++MTLD+KV QL           LP+    E     +    NV    + D   
Sbjct: 62  LKIEALIAKMTLDQKVGQLSLRGTSSRTKLLPEALKKEVKQGKIGAFLNVMNRAYVDELQ 121

Query: 117 -----------------DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG 158
                            DVI G  T FP  +   AS++    K   +  + EA +     
Sbjct: 122 RIAVEESPLGIPLIFARDVIHGFKTIFPIPLGLAASWDAETAKAAARVSAIEASSY---- 177

Query: 159 RAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSR 217
             G+ + ++P +++ +D RWGRI E+PGEDP++    A  YV G QD        DL S+
Sbjct: 178 --GIRWTFAPMLDITQDSRWGRIAESPGEDPYLASVLAKAYVEGFQD-------NDL-SK 227

Query: 218 PLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
              +++C KH+  Y         R +  A + E  +  T+L+PFE  +  G A++VM S+
Sbjct: 228 STSLAACAKHFIGYGAAIG---GRDYNTAIIHEPLLRNTYLKPFEAAIDAG-AATVMTSF 283

Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
           N +NG+P+  +  LLN+ +R E   HG++V+D +SI  M+  H + A++++ A A  + A
Sbjct: 284 NELNGVPASGNKWLLNEVLRKELGFHGFVVSDWNSITEMIA-HSY-AENEKHAAALGINA 341

Query: 338 GLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDI 396
           GLD++   + Y N+    +++ K+ ET +D  +  +  V  RL  F+  P  +     + 
Sbjct: 342 GLDMEMTSKSYENYIKQLLKEKKITETQLDFLVSNILRVKFRLNLFE-KPYRLKKHTGNF 400

Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIP 454
            S E+++LA  AA    VLLKN+Q  LPLN  K+  VAV+GP ANA    +G +   G  
Sbjct: 401 YSQEHMDLAKNAAIRSSVLLKNNQGLLPLN--KLTKVAVIGPLANAPHEQLGTWTFDGDQ 458

Query: 455 CRYMSPIAGF-SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE 513
              ++P+  F +   N  +         +S  +   A   A+++D  +   G +  +  E
Sbjct: 459 AYSVTPLQAFKNNKVNFNFAETLTYSRDQSTKAFDKALRTAQSSDVILFFGGEEAILSGE 518

Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
           +  R  + LPG Q  LI  +A+  K P++ VIM+  G  I   +    + AIL   +PG 
Sbjct: 519 AHSRAHINLPGQQEALIKALAKTGK-PIVFVIMA--GRPITLTKVIDQVDAILMTWHPGT 575

Query: 574 EGGRAIADVVFGKFNPGGRLPITW----------YN----------GDYVQMLPLTSMPL 613
            GG AI ++++GK  PGGRLPITW          YN            +VQM    S+P+
Sbjct: 576 MGGEAIYEMLWGKNEPGGRLPITWPKTSGQLPLFYNHKNTGRPPSIKSFVQM---DSIPV 632

Query: 614 RP-VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
                SLG           P  +PFGYGL YT FKY+ +  + T                
Sbjct: 633 GAWQSSLGNTSHYLDVGFTPQ-FPFGYGLGYTTFKYSDVKISTT---------------- 675

Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
                           +  ++  E  V   N G   G+++V +Y +         +K++ 
Sbjct: 676 ---------------SITKNESLEVSVTLTNTGDRAGAELVQLYVQDVVGSLTRPVKELK 720

Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA---GEHTIFVGNGGVS 783
           GF+ + +  G +  +KF  NA    N + +  NTL P    GE  IFVG+   S
Sbjct: 721 GFKHIHLDKGASTIVKFTLNA----NDLMFVNNTLKPVLEKGEFNIFVGSSSQS 770


>gi|336417087|ref|ZP_08597416.1| hypothetical protein HMPREF1017_04524 [Bacteroides ovatus
           3_8_47FAA]
 gi|335936712|gb|EGM98630.1| hypothetical protein HMPREF1017_04524 [Bacteroides ovatus
           3_8_47FAA]
          Length = 954

 Score =  269 bits (687), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 226/749 (30%), Positives = 358/749 (47%), Gaps = 109/749 (14%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + D+SLP   RV+ L++ MT ++K++ +  G    G+P L +P      EA+HG S    
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 227

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
                    GAT FP  +   A++N  L +++   +  E  A  N  +A    WSP ++V
Sbjct: 228 ---------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
           A+D RWGR  ET GEDP +V +    +++G Q            SR L  +   KH+  +
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SRGLFTTP--KHFGGH 319

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
                 G D +  D  ++E++M E  L PF   ++  D  S+M +Y+   GIP     +L
Sbjct: 320 GAP-LGGRDSH--DIGLSEREMREVHLVPFRHAIRNYDCQSLMMAYSDYMGIPVAKSTEL 376

Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF- 350
           L Q +R EW  +G+IV+DC +I  +     + A  K +A  Q L AG+  +CG  Y N  
Sbjct: 377 LQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQALAAGIATNCGDTYNNKE 436

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDENIELAA 406
              A + G++   ++D   + + + + R   F+ +P    L  + I     SD + E+A 
Sbjct: 437 VIQAAKDGRIDMENLDNVCRTMLSTMFRNELFEKNP-CKPLDWKKIYPGWNSDSHKEMAR 495

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYMSPIAGF 464
           +AARE IV+L+N +N LPL +  ++T+AV+GP A+      G+Y    +P +  S + G 
Sbjct: 496 QAARESIVMLENKENLLPL-TKNLRTIAVLGPGADDLQP--GDYTPKLLPGQLKSVLTGI 552

Query: 465 S----GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-------- 512
                    V Y+ GCD       N I  A +AA  +D  +++ G   + EA        
Sbjct: 553 KEAVGKQTKVLYEQGCDFTNPDETN-IPKAVKAASQSDVVVMVLGDCSTSEATNDVRKTC 611

Query: 513 -ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
            E+ D   L LPG Q +L+  V    K PVIL++ +    DI  A  +   KAIL    P
Sbjct: 612 GENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDILKA--SEMCKAILVNWLP 668

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+EGG A+ADV+FG +NPGGRLP+T+    +V  LPL         +    GR Y++ + 
Sbjct: 669 GQEGGPAMADVLFGDYNPGGRLPMTFPR--HVGQLPLYY-------NFKTSGRRYEYVDM 719

Query: 632 P--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
               LY FG+GLSYT F+Y+ L           K+Q   N N T  A+            
Sbjct: 720 EYYPLYRFGFGLSYTSFEYSDL-----------KIQEKPNGNVTVQAT------------ 756

Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
                       +N+GS  G +V  +Y         T + ++  F R++++ G +K + F
Sbjct: 757 -----------VKNIGSRAGDEVAQLYVTDMYASVKTRVMELKDFDRIYLQPGESKTVSF 805

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                  +++++   + ++  GE  I VG
Sbjct: 806 ELTPY-DISLLNDHMDRVVEKGEFKICVG 833


>gi|383113360|ref|ZP_09934132.1| hypothetical protein BSGG_3064 [Bacteroides sp. D2]
 gi|382948727|gb|EFS32364.2| hypothetical protein BSGG_3064 [Bacteroides sp. D2]
          Length = 954

 Score =  269 bits (687), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 226/749 (30%), Positives = 358/749 (47%), Gaps = 109/749 (14%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + D+SLP   RV+ L++ MT ++K++ +  G    G+P L +P      EA+HG S    
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 227

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
                    GAT FP  +   A++N  L +++   +  E  A  N  +A    WSP ++V
Sbjct: 228 ---------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
           A+D RWGR  ET GEDP +V +    +++G Q            SR L  +   KH+  +
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SRGLFTTP--KHFGGH 319

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
                 G D +  D  ++E++M E  L PF   ++  D  S+M +Y+   GIP     +L
Sbjct: 320 GAP-LGGRDSH--DIGLSEREMREVHLVPFRHAIRNYDCQSLMMAYSDYMGIPVAKSTEL 376

Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF- 350
           L Q +R EW  +G+IV+DC +I  +     + A  K +A  Q L AG+  +CG  Y N  
Sbjct: 377 LQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQALAAGIATNCGDTYNNKE 436

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDENIELAA 406
              A + G++   ++D   + + + + R   F+ +P    L  + I     SD + E+A 
Sbjct: 437 VIQAAKDGRINMENLDNVCRTMLSTMFRNELFEKNP-CKPLDWKKIYPGWNSDSHKEMAR 495

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYMSPIAGF 464
           +AARE IV+L+N +N LPL +  ++T+AV+GP A+      G+Y    +P +  S + G 
Sbjct: 496 QAARESIVMLENKENLLPL-TKNLRTIAVLGPGADDLQP--GDYTPKLLPGQLKSVLTGI 552

Query: 465 S----GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-------- 512
                    V Y+ GCD       N I  A +AA  +D  +++ G   + EA        
Sbjct: 553 KEAVGKQTKVLYEQGCDFTNPDETN-IPKAVKAASQSDVVVMVLGDCSTSEATNDVRKTC 611

Query: 513 -ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
            E+ D   L LPG Q +L+  V    K PVIL++ +    DI  A  +   KAIL    P
Sbjct: 612 GENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDILKA--SEMCKAILVNWLP 668

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+EGG A+ADV+FG +NPGGRLP+T+    +V  LPL         +    GR Y++ + 
Sbjct: 669 GQEGGPAMADVLFGDYNPGGRLPMTFPR--HVGQLPLYY-------NFKTSGRRYEYVDM 719

Query: 632 P--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
               LY FG+GLSYT F+Y+ L           K+Q   N N T  A+            
Sbjct: 720 EYYPLYRFGFGLSYTSFEYSDL-----------KIQEKPNGNVTVQAT------------ 756

Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
                       +N+GS  G +V  +Y         T + ++  F R++++ G +K + F
Sbjct: 757 -----------VKNIGSRAGDEVAQLYVTDMYASVKTRVMELKDFDRIYLQPGESKTVSF 805

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                  +++++   + ++  GE  I VG
Sbjct: 806 ELTPY-DISLLNDHMDRVVEKGEFKICVG 833


>gi|423226659|ref|ZP_17213124.1| hypothetical protein HMPREF1062_05310 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392628186|gb|EIY22220.1| hypothetical protein HMPREF1062_05310 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 750

 Score =  269 bits (687), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 219/765 (28%), Positives = 359/765 (46%), Gaps = 114/765 (14%)

Query: 64  RVKDLVSRMTLDEKVQQLGDF-AHGVPRLGLPQYEWWSEALHGV---------------- 106
           R++ L+ +MTL+EK+ Q+        P L     +    ++  +                
Sbjct: 35  RIEALLGKMTLEEKIGQMNQLHCENFPYLKTETRKGRVGSVMSITDPNIFNEVQRIAVED 94

Query: 107 SNVG-PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
           S +G P  +  DVI G  T FP  +   ASFN  + +   +  +TEA A      AG+ +
Sbjct: 95  SRLGIPLINARDVIHGFKTIFPIPLGQAASFNPEIAETGARIAATEASA------AGIRW 148

Query: 165 -WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
            ++P I++  DPRWGRI E  GEDP +V +  V  ++G Q        + LN  P  +++
Sbjct: 149 TFAPMIDITHDPRWGRIAEGFGEDPLLVSQMGVAAIKGFQ-------GSSLN-HPTSIAA 200

Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
           C KH+A Y         R +    +TE+     +LRPFE  V  G A+++M ++N  +GI
Sbjct: 201 CAKHFAGYGASEG---GRDYNSTYITERQFRNLYLRPFEAAVNAG-AATLMTAFNDNDGI 256

Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD- 342
           PS A+P LL   +R EW+  G +V+D  S+  M+  H F  D KE A+  T  AG D++ 
Sbjct: 257 PSSANPFLLKDVLRNEWNYRGTVVSDWASVSEMI-RHGFCEDEKEAALKAT-NAGTDIEM 314

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
             + Y       +++GKV    ID +++ +  +  RLG F+  P      K+     + +
Sbjct: 315 VSETYIKHLPQLIKEGKVSMETIDNAVRNILRLKFRLGLFE-HPYIADQRKETFYRPDFL 373

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG--------NYAGIP 454
           E A  AA +  VLLKN++ TLP+ S  +KT+ V GP A+A    +G        +Y+  P
Sbjct: 374 EAAQTAAEQSAVLLKNERGTLPIQS-NIKTILVTGPLADAPHEQLGTWVFDGDASYSQTP 432

Query: 455 CRYMSPIAGFSGYANVTYKTGCD---DVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
            + +  I+G S    V Y  G +   D A    N +    E A+ AD  +   G +  + 
Sbjct: 433 LQALRRISGDS--IKVLYAPGLNYSRDTATSQFNKVV---ELAREADLILAFVGEEAILS 487

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
            E+    +L L G Q++L+++++E  K P++ V+M+   + I   E N +  A+L+A +P
Sbjct: 488 GEAHCLANLNLQGAQSRLLHRLSETGK-PLVTVVMAGRPLTIG-REVNIS-DALLYAFHP 544

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLR---------PV 616
           G  GG A+A+++FGK  P G+LP+T+        +P+      T  P           PV
Sbjct: 545 GTMGGPALANLLFGKVVPSGKLPVTF--PKETGQIPIYYNHTSTGRPASGSEKNIFTIPV 602

Query: 617 DSLGYPGRTYKFY---NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT 673
            +         FY       L+PFGYGLSYT F Y+ L  + T        Q+ RN    
Sbjct: 603 GAEQTSLGNTSFYLDAGKDPLFPFGYGLSYTTFAYSNLQLSST--------QYTRN---- 650

Query: 674 SDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIG 733
                              +      D  N G TDG+++  +Y +  A      +K++  
Sbjct: 651 -------------------EVIIITFDLTNTGKTDGTEIAQLYFRDLAASVTRPVKELAA 691

Query: 734 FQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           F+R+ ++AG  + I+      K L+  +YA +  +  G+  +++G
Sbjct: 692 FERIHLKAGETRHIRMEL-PVKQLSFWNYAMDYCVEPGKFDLWIG 735


>gi|423303577|ref|ZP_17281576.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
           CL03T00C23]
 gi|423307700|ref|ZP_17285690.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
           CL03T12C37]
 gi|392687941|gb|EIY81232.1| hypothetical protein HMPREF1072_00516 [Bacteroides uniformis
           CL03T00C23]
 gi|392689569|gb|EIY82846.1| hypothetical protein HMPREF1073_00440 [Bacteroides uniformis
           CL03T12C37]
          Length = 942

 Score =  269 bits (687), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 227/808 (28%), Positives = 358/808 (44%), Gaps = 142/808 (17%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
           ++ D S P   R+++L+ +MTLDEK  Q+    +G  R+    LP  EW    W      
Sbjct: 52  VYEDPSAPLEARIENLLQQMTLDEKTCQMVTL-YGYKRVLKDDLPTPEWKELLWKDGIGA 110

Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
             E L+G    G                                  P    ++ I G   
Sbjct: 111 IDEHLNGFQQWGLPPSDNAYVWPASRHAWALNEVQRFFVEDTRLGIPVDFTNEGIRGVES 170

Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
             AT+FPT +    ++N  L +++G     EAR +      G T  ++P ++V RD RWG
Sbjct: 171 YRATNFPTQLGLGHTWNRELIRQVGLITGREARML------GYTNVYAPILDVGRDQRWG 224

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  E  GE P++V    +  VRGLQ    H +         +V++  KH+AAY  +    
Sbjct: 225 RYEEVYGESPYLVAELGIEMVRGLQ----HNH---------QVAATGKHFAAYSNNKGAR 271

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
                 D +++ +++E   + PF+  ++E     VM SYN  +GIP       L   +RG
Sbjct: 272 EGMARVDPQMSPREVENIHIYPFKRVIREAGMLGVMSSYNDYDGIPVQGSYYWLTTRLRG 331

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
           E    GY+V+D D+++ +   H    D KE AV Q+++AGL++ C       +       
Sbjct: 332 EMGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSFVLPLREL 390

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG-KQDICSDENIELAAEAAREGI 413
           V++G + E  I+  ++ +  V   +G FD   Q    G  +++  +EN  +A +A+ E +
Sbjct: 391 VKEGGLSEEVINDRVRDILRVKFLIGLFDAPYQTDLAGADREVEKEENEAIALQASHESV 450

Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS----GYAN 469
           VLLKN    LPL+    K +AV GP+AN     + +Y  +     + + G        A 
Sbjct: 451 VLLKNADELLPLDINSTKKIAVCGPNANEEGYALTHYGPLAVEVTTVLEGIQEKTKSKAE 510

Query: 470 VTYKTGCDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEAESL 515
           V Y  GCD V      S              I  A E A+ AD  +++ G       E+ 
Sbjct: 511 VLYTKGCDLVDAHWPESEIIDYPLTDDEQAEIDKAVENARQADVAVVVLGGGQRTCGENK 570

Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
            R  L LPG Q QL+  +    K PV+L++++   + I +A  +  + AIL A YPG +G
Sbjct: 571 SRTSLDLPGRQLQLLQAIQATGK-PVVLILINGRPLSINWA--DKFVPAILEAWYPGSKG 627

Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGRTYKF--YN 630
           G A+AD++FG +NPGG+L +T+     V  +P  + P +P   +D    PG T      N
Sbjct: 628 GTALADILFGDYNPGGKLTVTFPK--TVGQIPF-NFPCKPSSQIDGGKNPGPTGNMSRIN 684

Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
           G  LYPFGYGLSYT F+Y+ L  T  +               T + S T           
Sbjct: 685 G-ALYPFGYGLSYTTFEYSDLDITPRV--------------ITPNESAT----------- 718

Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
                  ++   N G   G +VV +Y +       TY K + GFQR+ +  G  + + F 
Sbjct: 719 ------VRLKVTNTGKRAGDEVVQLYIRDVLSSITTYEKNLAGFQRIHLEPGEAQELSFT 772

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVG 778
            +  K L ++D     ++  G+  +  G
Sbjct: 773 IDR-KHLELLDADMKWVVEPGDFVLMAG 799


>gi|299149395|ref|ZP_07042452.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298512582|gb|EFI36474.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 950

 Score =  269 bits (687), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 226/749 (30%), Positives = 358/749 (47%), Gaps = 109/749 (14%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + D+SLP   RV+ L++ MT ++K++ +  G    G+P L +P      EA+HG S    
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 223

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
                    GAT FP  +   A++N  L +++   +  E  A  N  +A    WSP ++V
Sbjct: 224 ---------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
           A+D RWGR  ET GEDP +V +    +++G Q            SR L  +   KH+  +
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SRGLFTTP--KHFGGH 315

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
                 G D +  D  ++E++M E  L PF   ++  D  S+M +Y+   GIP     +L
Sbjct: 316 GAP-LGGRDSH--DIGLSEREMREVHLVPFRHAIRNYDCQSLMMAYSDYMGIPVAKSTEL 372

Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF- 350
           L Q +R EW  +G+IV+DC +I  +     + A  K +A  Q L AG+  +CG  Y N  
Sbjct: 373 LQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAQDKIEAANQALAAGIATNCGDTYNNKE 432

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDENIELAA 406
              A + G++   ++D   + + + + R   F+ +P    L  + I     SD + E+A 
Sbjct: 433 VIQAAKDGRINMENLDNVCRTMLSTMFRNELFEKNP-CKPLDWKKIYPGWNSDSHKEMAR 491

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYMSPIAGF 464
           +AARE IV+L+N +N LPL +  ++T+AV+GP A+      G+Y    +P +  S + G 
Sbjct: 492 QAARESIVMLENKENLLPL-TKNLRTIAVLGPGADDLQP--GDYTPKLLPGQLKSVLTGI 548

Query: 465 S----GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-------- 512
                    V Y+ GCD       N I  A +AA  +D  +++ G   + EA        
Sbjct: 549 KEAVGKQTKVLYEQGCDFTNPDETN-IPKAVKAASQSDVVVMVLGDCSTSEATNDVRKTC 607

Query: 513 -ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
            E+ D   L LPG Q +L+  V    K PVIL++ +    DI  A  +   KAIL    P
Sbjct: 608 GENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDILKA--SEMCKAILVNWLP 664

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+EGG A+ADV+FG +NPGGRLP+T+    +V  LPL         +    GR Y++ + 
Sbjct: 665 GQEGGPAMADVLFGDYNPGGRLPMTFPR--HVGQLPLYY-------NFKTSGRRYEYVDM 715

Query: 632 P--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
               LY FG+GLSYT F+Y+ L           K+Q   N N T  A+            
Sbjct: 716 EYYPLYRFGFGLSYTSFEYSDL-----------KIQEKPNGNVTVQAT------------ 752

Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
                       +N+GS  G +V  +Y         T + ++  F R++++ G +K + F
Sbjct: 753 -----------VKNIGSRAGDEVAQLYVTDMYASVKTRVMELKDFDRIYLQPGESKTVSF 801

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                  +++++   + ++  GE  I VG
Sbjct: 802 ELTPY-DISLLNDHMDRVVEKGEFKICVG 829


>gi|441498970|ref|ZP_20981160.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
 gi|441437215|gb|ELR70569.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
          Length = 752

 Score =  269 bits (687), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 218/775 (28%), Positives = 364/775 (46%), Gaps = 127/775 (16%)

Query: 64  RVKDLVSRMTLDEKVQQL----GDFAHGVPRLGLPQYEWWSEAL-----------HGVSN 108
           +++ L+ +MTL+EKV QL    GD  +  P +   + + + + +           HG + 
Sbjct: 32  KIEALIRQMTLEEKVGQLNFYVGDLFNTGPTVRTTESDKFDQLIREGKLTGLFNVHGAAY 91

Query: 109 VG--------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARA 153
            G              P     DVI G  T FP  + + AS++    +K  +  + E+ A
Sbjct: 92  TGRLQKIAVEESRLGIPLLFGADVIHGFKTVFPIPLASAASWDLEAIEKAERVAAIESTA 151

Query: 154 MYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
                 AG+ + ++P ++++RDPRWGRI E  GEDPF+    A   VRG Q+    ++ T
Sbjct: 152 ------AGINFNFAPMVDISRDPRWGRIAEGAGEDPFLGSEVAKARVRGFQE----QSLT 201

Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
           D    P  +++C KH+AAY   +  G D    D  ++E+ + E +L P++  +  G A++
Sbjct: 202 D----PQTMAACVKHFAAYGAPD-GGRDYNTVD--MSERLLREMYLPPYKAGIDAG-AAT 253

Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
           +M S+N +NGI +     LL   +R EW   G +V+D  S+  MV +    A +  +A  
Sbjct: 254 IMTSFNELNGIAASGSQFLLRDILRKEWGFKGMVVSDWQSVNEMVAHGN--AANNAEAAM 311

Query: 333 QTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSL 391
             LKAG+D+D  G  Y       V +GK+    +D++++ +  +   LG FD   +Y   
Sbjct: 312 MALKAGVDMDMMGDVYLEEVPRLVNEGKLDIKFVDEAVRNVLKLKYDLGLFDDPYRYSDT 371

Query: 392 --GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
              K +I + E++E A + A++ IVLLKN +  LPL  + + T+AV+GP A+    M G 
Sbjct: 372 IREKNNIRAVEHLEAARDVAKKSIVLLKNKEKLLPLKKS-IGTIAVIGPLADNQADMNGT 430

Query: 450 Y-----AGIPCRYMSPIA-GFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
           +     A  P  ++  I    SG + V Y  GC+ +  +S +    A   AK AD  I+ 
Sbjct: 431 WSFFGEAQHPITFLQGIKDAVSGQSRVLYAEGCN-LYDRSKDKFAEAVNIAKKADVVILA 489

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
            G    +  E+  R D+ LPG Q +L+ ++A+  K PV+ ++MS   +D+++   + NI 
Sbjct: 490 VGESAVMNGEAGSRSDIRLPGIQPELVMEIAKTGK-PVVALVMSGRPLDLSW--LDENIP 546

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW-------------------YNGDYVQ 604
           AIL     G E G A ADV+FG +NP G+LP+T+                   Y GDY +
Sbjct: 547 AILEVWTLGSEAGNAAADVLFGDYNPSGKLPVTFPRNVGQVPIYYNHKNTGRPYEGDYSE 606

Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
                     P+    Y  +     N P LYPFGYGLSY+ F+Y+ ++ +          
Sbjct: 607 ----------PLSERIYRSKYRDVQNSP-LYPFGYGLSYSTFEYSDITLS---------- 645

Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
                                 + L   +     V   N G  DG +VV +Y +      
Sbjct: 646 ---------------------ADTLNAGESITASVSITNEGPYDGEEVVQLYIRDLVGSV 684

Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
              +K++ GF+++ ++ G   ++ F   +   L+   +     +  G+  IF+G+
Sbjct: 685 TRPVKELKGFKKLMIKNGETVKVDFTL-SSDDLSFYRHDMTYGIEPGDFQIFIGS 738


>gi|386821036|ref|ZP_10108252.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
           19592]
 gi|386426142|gb|EIJ39972.1| beta-glucosidase-like glycosyl hydrolase [Joostella marina DSM
           19592]
          Length = 725

 Score =  269 bits (687), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 228/730 (31%), Positives = 343/730 (46%), Gaps = 104/730 (14%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS---N 108
           + F +  +    RV +L+S MT+DEKV  L      VPRLG+ +     E LHG++    
Sbjct: 30  YPFQNPKIATEKRVDNLLSLMTIDEKVNALSTNPE-VPRLGV-KGTGHVEGLHGLALGGP 87

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR-AMYNLGRAGLTYWSP 167
            G G    + +P  T+FP       +++  L K+I +    EAR A+   GR GL   +P
Sbjct: 88  AGWGGKGKEPLP-TTTFPQAYGLGETWDTELLKEIAKIEGYEARYALQKYGRGGLVIRAP 146

Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
           N ++ARDPRWGR  E+ GED F  G+  V +V+GLQ  +             + +S  KH
Sbjct: 147 NADLARDPRWGRTEESYGEDAFFNGKMTVAFVKGLQGSD---------KTYWQTASLMKH 197

Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
           + A   ++ +      FD R+      E +  PF+M V EG + + M +YN+VNGIP+  
Sbjct: 198 FLANSNEDGRTYTSSDFDERL----WREYYALPFKMGVVEGGSRAYMAAYNKVNGIPAMV 253

Query: 288 DPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYY 347
            P L + TV  EW  +G I  D  + ++++ +HK+  D K    A T+KAG++    Q+ 
Sbjct: 254 HPMLKDITV-DEWGQNGIICTDGGAYKLLLSDHKYYKD-KYLGAAATIKAGIN----QFL 307

Query: 348 TNFTG---NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD--- 399
            +FT     A+  G + E D+D+ L+  Y V+++LG  D S    Y  +G +    D   
Sbjct: 308 DDFTEGVYGALANGYLTEADLDEVLRGNYRVMIKLGMLDSSANNPYAKIGAEADSMDPWE 367

Query: 400 --ENIELAAEAAREGIVLLKND--QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
              + +LA EA  + IVLLKND  +  LPL   KVK +A++G +A+A   ++  Y+G P 
Sbjct: 368 LEAHKKLALEATEKSIVLLKNDPAKRLLPLQKKKVKKIAIIGEYADAV--LLDWYSGTPP 425

Query: 456 RYMSPIAGFSGYANVTYKTGCD-DVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-- 512
             +SP+ G      +  K G + +V    NN+   A E AK AD  I+  G   +  A  
Sbjct: 426 YTISPLQG------IKNKVGENVEVLFAKNNADGKAVEIAKNADVAIVFIGNHPTCNAGW 479

Query: 513 ----------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
                     E++DR+ L      ++  + V  V K     V+            T  NI
Sbjct: 480 AQCPVPSNGKEAVDRQAL-----NSEYEDLVKLVYKANPNTVVGLISSFPYTINWTQENI 534

Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
            AI       +E G AIA+V+FG +NP GRL  TW   D   + PL    +R        
Sbjct: 535 PAIFHVTQNSQELGTAIANVLFGAYNPAGRLTQTWVK-DISDLPPLMDYNIR-------N 586

Query: 623 GRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP 682
           GRTY ++ G  LY FG+GLSYT FKY  +   K I+ N                      
Sbjct: 587 GRTYMYFKGKPLYAFGHGLSYTTFKYKDMEIPKQIKEN---------------------- 624

Query: 683 GVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
                     +    KV+  N G  DG +VV +Y K         IK++  F+R+ ++AG
Sbjct: 625 ----------EEVSVKVNITNAGEVDGDEVVQLYVKHINSTVERPIKELKSFKRIHIKAG 674

Query: 743 RNKRIKFVFN 752
             K +  + N
Sbjct: 675 ETKTVSLLLN 684


>gi|383125188|ref|ZP_09945842.1| hypothetical protein BSIG_4348 [Bacteroides sp. 1_1_6]
 gi|382983435|gb|EES66611.2| hypothetical protein BSIG_4348 [Bacteroides sp. 1_1_6]
          Length = 954

 Score =  268 bits (686), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 228/749 (30%), Positives = 354/749 (47%), Gaps = 109/749 (14%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + D+SLP   RV+ L++ MT ++K++ +  G    G+P L +P      EA+HG S    
Sbjct: 170 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 227

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
                    GAT FP  +   A++N  L +++   +  E  A  N  +A    WSP ++V
Sbjct: 228 ---------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
           A+D RWGR  ET GEDP +V +    +++G Q            SR L  +   KH+  +
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SRGLFTTP--KHFGGH 319

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
                 G D +  D  ++E++M E  L PF   ++  D  S+M +Y+   G+P     +L
Sbjct: 320 GAP-LGGRDSH--DIGLSEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKEL 376

Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF- 350
           L Q +R EW  +G+IV+DC +I  +     + A  K +A  Q L AG+  +CG  Y N  
Sbjct: 377 LQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKE 436

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDENIELAA 406
              A + G++   D+D   + +   + R   F+ +P    L  + I     SD + E+A 
Sbjct: 437 VIQAAKDGRINMEDLDNVCRTMLGTMFRNELFEKNP-CKPLDWKKIYPGWNSDSHKEMAR 495

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYMSPIAGF 464
           +AARE IV+L+N  N LPL S  ++T+AV+GP A+      G+Y    +P +  S + G 
Sbjct: 496 QAARESIVMLENKDNLLPL-SKTLRTIAVLGPGADDLQP--GDYTPKLLPGQLKSVLTGI 552

Query: 465 SG----YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-------- 512
            G       V Y+ GCD       N I  A +AA  +D  I++ G   + EA        
Sbjct: 553 KGAVGKQTKVLYEQGCDFTNPDETN-IPKAVKAASQSDVVIMVLGDCSTSEATNDVRKTC 611

Query: 513 -ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
            E+ D   L LPG Q +L+  V    K PVIL++ +    DI  A  +   KAIL    P
Sbjct: 612 GENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDILKA--SEMCKAILVNWLP 668

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+EGG A+ADV+FG +NP GRLP+T+    +V  LPL         +    GR Y++ + 
Sbjct: 669 GQEGGPAMADVLFGDYNPAGRLPMTFPR--HVGQLPLYY-------NFKTSGRRYEYVDM 719

Query: 632 P--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
               LY FG+GLSYT F+Y+ L           K+Q   N N                  
Sbjct: 720 EYYPLYRFGFGLSYTSFEYSNL-----------KIQEKANGN------------------ 750

Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
                 E +   +NVGS  G +V  +Y         T + ++  F R+ ++ G +K + F
Sbjct: 751 -----VEVQATVKNVGSRAGDEVAQLYVTDMYASVKTRVMELKDFARIHLQPGESKTVSF 805

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                  +++++   + ++  GE  I VG
Sbjct: 806 EMTPY-DISLLNDRMDRVVEKGEFKIMVG 833


>gi|336412663|ref|ZP_08593016.1| hypothetical protein HMPREF1017_00124 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942709|gb|EGN04551.1| hypothetical protein HMPREF1017_00124 [Bacteroides ovatus
           3_8_47FAA]
          Length = 735

 Score =  268 bits (686), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 215/765 (28%), Positives = 363/765 (47%), Gaps = 109/765 (14%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG--------------VP-RLGLPQYE 97
           L+ D   P   RV DL+SRMTL+EKV QL  +  G              VP  +G   Y 
Sbjct: 29  LYKDPKAPIEKRVNDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVKKVPAEIGSLIYF 88

Query: 98  WWSEALHGV--------SNVGPGTHFD-DVIPG-ATSFPTVILTTASFNESLWKKIGQAV 147
             + AL           S +G    F  D I G  T +P  +    S+N  L ++     
Sbjct: 89  ETNPALRNSMQKKAMEKSRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQACAVS 148

Query: 148 STEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
           + EAR       +G+ + +SP I+VARDPRWGR+ E  GEDP+  G +    V+G Q   
Sbjct: 149 AQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFGAASVKGYQ--- 199

Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
                 DL++   ++++C KHY  Y         R +    +++Q + +T+L P+EM VK
Sbjct: 200 ----GDDLSAEN-RMAACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYEMGVK 251

Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
            G A+++M S+N ++G+P  A+P ++ + ++  W   G+IV+D  +I+ +   ++ LA +
Sbjct: 252 AG-AATLMSSFNDISGVPGSANPYIMTEILKKRWGHDGFIVSDWGAIEQL--KNQGLAAT 308

Query: 327 KEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
           K++A      AGL++D   + Y       V++G+V    +D++++ +  +  RLG F+  
Sbjct: 309 KKEAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLFERP 368

Query: 386 PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
               +  K+     +++++AA  A E +VLLKN+  TLPL     K +AV+GP A     
Sbjct: 369 YTPATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLTDK--KKIAVIGPMAKNGWD 426

Query: 446 MIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNN--SIFAASEAAKTA 497
           ++G++ G      +   Y      F+G A + Y  GC   A K +N      A EAA+ +
Sbjct: 427 LLGSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGC---ATKGDNREGFAEALEAARWS 483

Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
           D  ++  G  ++   E+  R  + LP  Q +L  ++ +  K P++LV+++   +++   E
Sbjct: 484 DVVVLCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPLELNRLE 542

Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL-- 613
             ++  AIL    PG  G   +A ++ G+ NP G+L +T          P ++  +P+  
Sbjct: 543 PISD--AILEIWQPGVNGALPMAGILSGRINPSGKLAMT---------FPYSTGQIPIYY 591

Query: 614 -RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
            R     G+ G  YK      LYPFG+GLSYT+FKY  +                     
Sbjct: 592 NRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGTV--------------------- 629

Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
           T  A+K          ++  D    +V   NVG+ DG++ V  +   P       +K++ 
Sbjct: 630 TPSATK----------VKRGDRLSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELK 679

Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
            F++  ++AG  K  +F  +  +    V+      L AGE+ I V
Sbjct: 680 HFEKQLIKAGETKTFRFDIDMERDFGFVNEDGKRFLEAGEYHILV 724


>gi|345881765|ref|ZP_08833275.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
 gi|343918424|gb|EGV29187.1| hypothetical protein HMPREF9431_01939 [Prevotella oulorum F0390]
          Length = 1552

 Score =  268 bits (686), Expect = 6e-69,   Method: Compositional matrix adjust.
 Identities = 220/758 (29%), Positives = 334/758 (44%), Gaps = 131/758 (17%)

Query: 54   FCDSSLPYSIRVKDLVSRMTLDEKVQQL--------------------GDFAHGVPRLGL 93
            + +++LP +IRV DL+ RMTLDEK+ Q+                     ++ H +     
Sbjct: 721  YQNAALPSAIRVHDLLQRMTLDEKLAQMRHIHFKHYNTDGHVDLTKLRNNYTHSMSFGCF 780

Query: 94   PQYEW----WSEALHGVS-NVGPGTHFD-DVIP-----------GATSFPTVILTTASFN 136
              + +    + +A+  +  N    T F   VIP           G T FP  I   A+FN
Sbjct: 781  EAFPYSSTQYRQAVSTIQQNAADSTRFGIPVIPVIEGIHGIVQDGCTIFPQAIAQGATFN 840

Query: 137  ESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAV 196
              L  ++ Q + TE RA+           +P++++AR+ RWGR+ ET GEDP+++ R   
Sbjct: 841  PQLVFRMAQHIGTEMRAI-----GARQVLAPDLDIAREQRWGRVEETFGEDPYLISRMGY 895

Query: 197  NYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-------DVDNWKGVDRYHFDARVT 249
            NYV+G+Q   G                  KH+ A+       ++ + KG  R  FD    
Sbjct: 896  NYVKGIQSRGG--------------IPTLKHFVAHGTPQGGLNLASVKGGQRELFD---- 937

Query: 250  EQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVAD 309
                   +++PFE  ++   A SVM  Y+  +     + P  L   +R      GYI +D
Sbjct: 938  ------VYVKPFEYVIRHTKAGSVMNCYSAYDNEAITSSPFFLRTLLRDSLHFKGYIYSD 991

Query: 310  CDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSL 369
              SI ++   H   ADS+ +A  Q + AG+DL+ G  Y       + QG + +  ID + 
Sbjct: 992  WGSIPMLRYFHH-TADSETEAAQQAINAGVDLEAGSDYYRTAPTLIAQGLLDKARIDSAA 1050

Query: 370  KYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAK 429
             ++       G FD         +Q I + E + +A + A E +VLL+N  + LPL+  +
Sbjct: 1051 AHVLYTKFEAGLFDELASDTLHWRQQIHTPEAVAVAKQLADESLVLLENRNHFLPLDLNR 1110

Query: 430  VKTVAVVGPHANATVAMIGNYAGIP-CRY-MSPIAGFSGYA----NVTYKTGCDDVACKS 483
            + ++AVVGP  NA     G+Y+     R+ ++P+AG    A     V Y  GCD    ++
Sbjct: 1111 LHSIAVVGP--NAAQVQFGDYSWTADNRHGITPLAGIQQVAGMRTKVRYVKGCD-YYSQN 1167

Query: 484  NNSIFAASEAAKTADATIILAGLDL---------SVEAESLDREDLWLPGYQTQLINQVA 534
             +SI  A   AK +D T+++ G            S   E  D  DL LPG Q QLI ++A
Sbjct: 1168 TDSIDEAVALAKQSDVTVVVVGTQSMLLARPSQPSTSGEGYDLSDLILPGVQQQLIERIA 1227

Query: 535  EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
              A G   +V+M  G   +  A  N    A+L   Y GE+ G ++A  +FG+ NP GRLP
Sbjct: 1228 --ATGKPFIVVMVTGRPLLTEAFKN-KADALLVQWYGGEQAGLSLAQALFGQLNPSGRLP 1284

Query: 595  ITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
            I++         Y   LP          +   PGR Y F +    YPFGYGLSYT FKY+
Sbjct: 1285 ISFPKATGQLPVYYNHLPTDKGYYNKKGTPDKPGRDYVFADPYPAYPFGYGLSYTTFKYS 1344

Query: 650  LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
             L+ +K  Q N N                              D        QN G   G
Sbjct: 1345 QLALSKK-QTNEN------------------------------DTIAVTFRVQNTGKRAG 1373

Query: 710  SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
             +V  +Y +      AT IKQ+ GF++  ++ G  K I
Sbjct: 1374 KEVAQLYIRDMKSSVATPIKQLFGFEKCALQPGETKTI 1411


>gi|227536644|ref|ZP_03966693.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
           33300]
 gi|227243445|gb|EEI93460.1| possible beta-glucosidase [Sphingobacterium spiritivorum ATCC
           33300]
          Length = 777

 Score =  268 bits (686), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 207/721 (28%), Positives = 330/721 (45%), Gaps = 111/721 (15%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    EA HG   +G            T FPT I   +++N +L +K+   V+ 
Sbjct: 126 RLGIPVF-LAEEAPHGHMAIG-----------TTVFPTGIGQASTWNPALLQKMSATVAK 173

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R      +     + P ++++RDPRW R+ E+ GEDP + G  A   V GL    G  
Sbjct: 174 EVRQ-----QGAHISYGPVLDLSRDPRWSRVEESYGEDPVLTGTLAAAIVTGL----GSG 224

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
           N +D    P       KH+ AY +            A + E+++ E FL PF+  V  G 
Sbjct: 225 NLSD----PFATIPTLKHFVAYGIPEG---GHNGSAASIGERELREYFLPPFQSAVAAG- 276

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           A SVM +YN V+GIP  ++  LL   +R EW+ +G+ V+D  SI+ +  +H+   D K+ 
Sbjct: 277 AKSVMAAYNSVDGIPCSSNKFLLTDILRKEWNFNGFTVSDLGSIEGIKGSHRVAKDHKQA 336

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV 389
           A+   ++AGLD D G         AV+QG+V+E  ID+++  +  +   +G F+      
Sbjct: 337 AIL-AIEAGLDADLGGNAYVRLIEAVKQGEVQENSIDQAVSRVLALKFEMGLFEKPFVDA 395

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
              K+++ ++ NI L+ + ARE IVLL+N  N LPL   K   +A++GP+A+    M+G+
Sbjct: 396 KTAKKEVKTEANIALSRQVARESIVLLENKNNILPLR--KDVKIAIIGPNADNIYNMLGD 453

Query: 450 YA-----GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
           Y      G        I+     A V+Y  GC  +   +N+ I AA  AA+ +D  + + 
Sbjct: 454 YTAPQPDGAVTTVRQAISARLPKAQVSYVKGC-SIRDTTNSDIPAAVTAAQQSDIIVAVV 512

Query: 505 G----LDLSVE-------------------AESLDREDLWLPGYQTQLINQVAEVAKGPV 541
           G     D   E                    E  DR  L L G Q +L+  + +  K P+
Sbjct: 513 GGSSARDFKTEYISTGAAVASDKSVSDMESGEGFDRSTLDLLGRQMELLKALKQTGK-PL 571

Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
           +++ +    +++ +A T+ +  A+L A YPG+EGG AIADV+FG +NP G++P++     
Sbjct: 572 VVIYIQGRPLNMNWAATHAD--ALLCAWYPGQEGGHAIADVLFGDYNPAGKMPLSVPRS- 628

Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
            V  +P+      P+D        Y       LY FGYG SY+ F+Y  L   K      
Sbjct: 629 -VGQIPVHYNRKSPLD------HRYVEEAATPLYAFGYGKSYSDFEYKDLKIQK------ 675

Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF--QNVGSTDGSDVVIVYSKP 719
                                          D  +++V F   N G  DG +V  +Y + 
Sbjct: 676 -------------------------------DNKDYRVSFTLTNTGKYDGDEVAQLYIRN 704

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
                +  ++Q+  F+R+ ++ G +K + FV  A     I       L P     I VG+
Sbjct: 705 QYASVSQPVQQLKHFERIHLKTGESKTVSFVLTAGDLSVINTQMKKVLEPGSSFKIRVGS 764

Query: 780 G 780
            
Sbjct: 765 A 765


>gi|408369545|ref|ZP_11167326.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
 gi|407745291|gb|EKF56857.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
          Length = 881

 Score =  268 bits (686), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 168/432 (38%), Positives = 238/432 (55%), Gaps = 43/432 (9%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
             + F +  L  S RV DL+ R+T++EK+ QL   +  + RLG+P+Y WW+E+LHGV+  
Sbjct: 25  QQYPFQNPELDDSARVADLLERLTVEEKIDQLLYTSPAIERLGIPEYNWWNESLHGVARA 84

Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGRA------G 161
           G           AT FP  I   A+++  L K++  A+S EARA ++  + R       G
Sbjct: 85  G----------YATVFPQSITIAAAWDSDLLKEVADAISDEARAKHHEYIRRGQRGIYQG 134

Query: 162 LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
           LT+WSPNIN+ RDPRWGR  ET GEDP++ G+  + YV+GLQ         D N   LK+
Sbjct: 135 LTFWSPNINIFRDPRWGRGHETYGEDPYLTGQLGIAYVKGLQ-------GNDPNY--LKL 185

Query: 222 SSCCKHYAAYDVDNWKGVD--RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
            +  KH+A +      G +  R+ FD   +++D+ ET+L  F   VK+GD  SVM +YNR
Sbjct: 186 VATAKHFAVH-----SGPEPLRHEFDVSPSKRDLWETYLPAFRYLVKQGDVKSVMTAYNR 240

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
           V G  + A   L    +R  WD  GY+V+DC +I  +   HK   D+ E A A  +  G 
Sbjct: 241 VYGEAASASDTLFT-ILRDYWDFDGYVVSDCFAISDIWKYHKIAKDAAE-ASAMAVIEGC 298

Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDIC 397
           DL+CG  Y      A QQG V E DID +L  L    ++LG FD      Y  +      
Sbjct: 299 DLNCGDSYEKLN-QAYQQGMVTEKDIDIALSRLMEARIKLGMFDPEQLVPYAQIPFNVNT 357

Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY 457
           S+++ +LA +AA+E IVLLKN  + LPL S  +K+VAV+GP+A+   ++ GNY G P   
Sbjct: 358 SEKHNQLALKAAKESIVLLKNQGDLLPL-SKDLKSVAVIGPNADNIQSLWGNYNGNP--- 413

Query: 458 MSPIAGFSGYAN 469
             PI    G  N
Sbjct: 414 KDPITVLQGIQN 425



 Score =  136 bits (342), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 88/290 (30%), Positives = 141/290 (48%), Gaps = 48/290 (16%)

Query: 503 LAGLDLSVEAESL---DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
           L G ++ V  E     DR  L LP  Q  L+ +VA+  K P++LV+++   + I +A  N
Sbjct: 615 LEGEEMDVVVEGFAGGDRTALDLPASQRTLLKEVAKTGK-PIVLVLLNGSALSINWAAEN 673

Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
             I AI+ AGY G++GG A+A+V+FG +NP  RLP+T+Y    V+ LP         +  
Sbjct: 674 --IPAIMTAGYAGQQGGNAVAEVLFGDYNPAARLPVTYYKS--VEDLP-------DFEDY 722

Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
              GRTY+++    LYPFGYGLSYT F Y+       I +N                   
Sbjct: 723 NMDGRTYRYFEKEPLYPFGYGLSYTTFDYSKFQLPSKIDMN------------------- 763

Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
                        +  E  V+  N G+ DG +VV VY           I++++GF+R+ +
Sbjct: 764 -------------ESIELSVEVTNTGAYDGDEVVQVYLTDEKGSTPRPIRELVGFKRIHL 810

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLN 789
           + G +++++F     + L+++D   + ++  G  +I VG     F   LN
Sbjct: 811 KKGESQKVQFTIEP-RQLSMIDDKGDLVIEPGVFSISVGGEQPGFNAKLN 859


>gi|319953334|ref|YP_004164601.1| beta-glucosidase [Cellulophaga algicola DSM 14237]
 gi|319421994|gb|ADV49103.1| Beta-glucosidase [Cellulophaga algicola DSM 14237]
          Length = 756

 Score =  268 bits (686), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 216/734 (29%), Positives = 347/734 (47%), Gaps = 105/734 (14%)

Query: 76  EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
           EK++   DFA    RLG+P + + S+ +HG                 T+FP  +  ++S+
Sbjct: 83  EKIKTAQDFAVKKTRLGIPLF-FGSDIIHGYK---------------TTFPIPLGLSSSW 126

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRY 194
           +  L K+  Q  + EA A       G+ + +SP ++++RDPRWGRI+E  GEDP++  + 
Sbjct: 127 DMELLKRTAQVAALEATA------DGINWNFSPMVDISRDPRWGRISEGAGEDPYLGSQI 180

Query: 195 AVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
           A   V G Q         DL ++   +++  KH+A Y      G D    D  ++   M 
Sbjct: 181 AKAMVTGYQ-------GEDLMAKNTMLATV-KHFALYGAAE-AGRDYNSVD--MSRLKMY 229

Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
             +L P++  +  G   SVM S+N ++GIP+  +  LL   +R +W  +G++V+D  S+ 
Sbjct: 230 NEYLPPYKAAIDAG-VGSVMSSFNDIDGIPASGNKWLLTDLLRDDWKFNGFVVSDYTSVN 288

Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
            M+ +   L D +    A +LKAGLD+D  G+ +      ++ +GKV   +I  + + + 
Sbjct: 289 EMIAHG--LGDLQA-VSALSLKAGLDMDMVGEGFLTTLKKSLDEGKVTAEEITTACRRIL 345

Query: 374 TVLMRLGFFDGSPQYVSLGK--QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK 431
               +LG FD   +Y+   +  +DI  DEN  LA EAA++  VLLKND   LP+N  K  
Sbjct: 346 EAKFKLGLFDDPYKYIDKKRPAKDILKDENRALAREAAKKSFVLLKNDTKNLPIN--KSS 403

Query: 432 TVAVVGPHANATVAMIGNYA--GIPCRYMSPIAGFSGYA---NVTYKTGC---DDVACKS 483
            +A++G  AN+   M+G +A  G P   +S + GF   A    +T+  G    DD A   
Sbjct: 404 KIALIGDLANSKDNMLGTWAPTGDPQLSVSILQGFKNVAPNAQITHAKGANITDDAALAK 463

Query: 484 NNSIFA----------------ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
             ++F                 A E AK +D  + + G       ES  R D+ +P  Q 
Sbjct: 464 KINVFGERVTIDKRSAEEMLNEAVELAKKSDIIVAVVGEATEFTGESSSRTDISIPQSQK 523

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
           +LI  +A   K P++LV+MS  G  +   E      +IL   +PG E G AIADVVFG +
Sbjct: 524 KLIRALAATGK-PLVLVLMS--GRPLVLEEELALSASILQVWFPGVEAGNAIADVVFGDY 580

Query: 588 NPGGRLPITW-YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT--LYPFGYGLSYT 644
           NP G+L  TW  N   + +        RP  +  +   T  + + P   L PFGYGLSYT
Sbjct: 581 NPSGKLTATWPRNVGQIPIYHSIKNTGRPQLTSEFEKFTSNYLDAPNTPLLPFGYGLSYT 640

Query: 645 QFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNV 704
           +F+Y+ L+   + Q+N N+                  P ++             V   N 
Sbjct: 641 EFEYSNLNVNAS-QINQNE------------------PLIVT------------VSVTNT 669

Query: 705 GSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAA 764
           G+ DG +VV +Y +         +KQ+ GF++V ++ G  K++         L   +   
Sbjct: 670 GNFDGEEVVQLYLRDVVRSITQPVKQLKGFKKVMLKKGETKQVTLTLTP-DDLKFYNSNL 728

Query: 765 NTLLPAGEHTIFVG 778
           + +   G+  I+VG
Sbjct: 729 DFVAEPGDFEIYVG 742


>gi|189467437|ref|ZP_03016222.1| hypothetical protein BACINT_03826 [Bacteroides intestinalis DSM
           17393]
 gi|189435701|gb|EDV04686.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 863

 Score =  268 bits (686), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 158/441 (35%), Positives = 229/441 (51%), Gaps = 32/441 (7%)

Query: 35  FVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLP 94
            +C    FS      ++  F +  LP   RV DLV R+TL+EK+ Q+ + A  + RLG+P
Sbjct: 7   LICSLLLFSVTVAGQATCKFLNPELPIVERVNDLVGRLTLEEKISQMLNNAPAIDRLGIP 66

Query: 95  QYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAM 154
            Y WW+E LHGV+            P  TSFP  I   A+++     ++    S E RA+
Sbjct: 67  AYNWWNECLHGVAR--------SPYP-VTSFPQAIAMAATWDTESVHQMAVYASDEGRAI 117

Query: 155 YNLGRA--------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
           Y+            GLTYWSPNIN+ RDPRWGR  ET GEDPF+     V++V+GLQ   
Sbjct: 118 YHDATRKGTPGIFRGLTYWSPNINIFRDPRWGRGQETYGEDPFLTASIGVSFVKGLQ--- 174

Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
           G +         LK S+C KHYA +    W   +R+ +DA+V   D+ +T+L  F+  V 
Sbjct: 175 GDDPVY------LKSSACAKHYAVHSGPEW---NRHTYDAKVNNHDLWDTYLPAFKELVV 225

Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
           EG  + VMC+YN   G P C +  L+   +R  W   GY+ +DC +++   + HK   D+
Sbjct: 226 EGKVTGVMCAYNSFFGQPCCGNDLLMMDILRNHWKFGGYVTSDCGAVEDFYNTHKTHQDA 285

Query: 327 KEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
              +    L  G D +CG        +AV +G + E  ID+SLK L+ +  RLG FD   
Sbjct: 286 AAASADAVLH-GTDCECGNGAYRALADAVLRGLITEKQIDESLKKLFEIRFRLGMFDPDD 344

Query: 387 Q--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
           +  Y ++    +  D +   A + AR+ IVLLKN    LPLN  K+K +AVVGP+A+   
Sbjct: 345 RVPYSNIPLSVLECDAHKAHALKIARQSIVLLKNQDQLLPLNKNKIKKIAVVGPNADDKS 404

Query: 445 AMIGNYAGIPCRYMSPIAGFS 465
            ++ NY G P    + + G  
Sbjct: 405 VLLANYYGYPSHITTALEGIQ 425



 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 89/298 (29%), Positives = 147/298 (49%), Gaps = 57/298 (19%)

Query: 493 AAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVI 542
           A K AD  I + GL   VE E +          DR  + +P  Q  L+ ++    K PV+
Sbjct: 595 AVKDADVIIFVGGLSAKVEGEEMGVEIEGFKRGDRTSISIPSVQQNLLKELYATGK-PVV 653

Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
            V+M+   + + +   + ++ AIL A Y G+ GG+AIADV+FG +NP GRLP+T+Y    
Sbjct: 654 FVMMTGSALGLEWE--SAHLPAILNAWYGGQAGGQAIADVLFGDYNPSGRLPLTFYKS-- 709

Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
           V  LP         +      RTY+++ G  +YPFGYGLSYT F+Y+ L     +Q + +
Sbjct: 710 VNDLP-------DFEDYSMENRTYRYFTGTPVYPFGYGLSYTTFQYSSLK----LQPSPD 758

Query: 663 KLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAE 722
           K    R++  T+  +                         N G  +G +V  +Y   P +
Sbjct: 759 K----RSVKVTAKIT-------------------------NTGKMEGEEVAQLYVSNPRD 789

Query: 723 IAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
              T I+ + GF+R+ ++ G ++ ++FV  + K L++VD +  ++   G+  I +G G
Sbjct: 790 F-VTPIRALKGFKRINLKPGESQTVEFVLTS-KELSVVDISGKSVPMKGKVQISLGGG 845


>gi|224537403|ref|ZP_03677942.1| hypothetical protein BACCELL_02281 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224520981|gb|EEF90086.1| hypothetical protein BACCELL_02281 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 750

 Score =  268 bits (685), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 218/765 (28%), Positives = 359/765 (46%), Gaps = 114/765 (14%)

Query: 64  RVKDLVSRMTLDEKVQQLGDF-AHGVPRLGLPQYEWWSEALHGV---------------- 106
           R++ L+ +MTL+EK+ Q+        P L     +    ++  +                
Sbjct: 35  RIEALLGKMTLEEKIGQMNQLHCENFPYLKTETRKGRVGSVMSITDPNIFNEVQRIAVED 94

Query: 107 SNVG-PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
           S +G P  +  DVI G  T FP  +   ASFN  + +   +  +TEA A      AG+ +
Sbjct: 95  SRLGIPLINARDVIHGFKTIFPIPLGQAASFNPEIAETGARIAATEASA------AGIRW 148

Query: 165 -WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
            ++P I++  DPRWGRI E  GEDP +V +  V  ++G Q        + LN  P  +++
Sbjct: 149 TFAPMIDITHDPRWGRIAEGFGEDPLLVSQMGVAAIKGFQ-------GSSLN-HPTSIAA 200

Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
           C KH+A Y         R +    +TE+     +LRPFE  V  G A+++M ++N  +GI
Sbjct: 201 CAKHFAGYGASEG---GRDYNSTYITERQFRNLYLRPFEAAVNAG-AATLMTAFNDNDGI 256

Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD- 342
           PS A+P LL   +R EW+  G +V+D  S+  M+  H F  D KE A+  T  AG D++ 
Sbjct: 257 PSSANPFLLKDVLRNEWNYRGTVVSDWASVSEMI-RHGFCEDEKEAALKAT-NAGTDIEM 314

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
             + Y  +    +++GKV    ID +++ +  +  RLG F+  P      K+     + +
Sbjct: 315 VSETYIKYLPQLIKEGKVSMETIDNAVRNILRLKFRLGLFE-HPYIADQRKETFYRPDFL 373

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG--------NYAGIP 454
           E A  AA +  VLLKN++ TLP+ S  +KT+ V GP A+A    +G        +Y+  P
Sbjct: 374 EAAQTAAEQSAVLLKNERGTLPIQS-NIKTILVTGPLADAPHEQLGTWVFDGDASYSQTP 432

Query: 455 CRYMSPIAGFSGYANVTYKTGCD---DVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
            + +   +G S    V Y  G +   D A    N +    E A+ AD  +   G +  + 
Sbjct: 433 LQALRRTSGDS--IKVLYAPGLNYSRDTATSQFNKVV---ELAREADLILAFVGEEAILS 487

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
            E+    +L L G Q++L+++++E  K P++ V+M+   + I   E N +  A+L+A +P
Sbjct: 488 GEAHCLANLNLQGAQSRLLHRLSETGK-PLVTVVMAGRPLTIG-REVNIS-DALLYAFHP 544

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLR---------PV 616
           G  GG A+A+++FGK  P G+LP+T+        +P+      T  P           PV
Sbjct: 545 GTMGGPALANLLFGKVVPSGKLPVTF--PKETGQIPIYYNHTSTGRPASGSEKNIFTIPV 602

Query: 617 DSLGYPGRTYKFY---NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT 673
            +         FY       L+PFGYGLSYT F Y+ L  + T        Q+ RN    
Sbjct: 603 GAEQTSLGNTSFYLDAGKDPLFPFGYGLSYTTFAYSNLQLSST--------QYTRN---- 650

Query: 674 SDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIG 733
                              +      D  N G TDG+++  +Y +  A      +K++  
Sbjct: 651 -------------------EVIIITFDLTNTGKTDGTEIAQLYFRDLAASVTRPVKELAA 691

Query: 734 FQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           F+R+ ++AG  + I+      K L+  +YA +  +  G+  +++G
Sbjct: 692 FERIHLKAGETRHIRMEL-PVKQLSFWNYAMDYCVEPGKFDLWIG 735


>gi|255693560|ref|ZP_05417235.1| periplasmic beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260620625|gb|EEX43496.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 770

 Score =  268 bits (685), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 222/818 (27%), Positives = 383/818 (46%), Gaps = 148/818 (18%)

Query: 36  VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
           +C     S L  Q  S  + D +LP S RV  L+S+MTL+EKV Q+  +  G+  +   +
Sbjct: 10  ICCAIGISTLACQDKSKDYTDPTLPVSERVSSLMSQMTLEEKVAQMCQYV-GLEHMKKAE 68

Query: 96  YEWWSEALHGVSNVG--PGTHFDDV----------------------------------I 119
            +  +E L    + G  P  H  DV                                  I
Sbjct: 69  KDMSAEDLKHSHSQGFYPNLHSSDVEEMTKKGLISSFLHVVKAEEANYLQSLAQQSRLKI 128

Query: 120 P---------------GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
           P               G+T +PT I   A+F+ +L +++ +  + E RA      +G+ +
Sbjct: 129 PLLIGIDAIHGNGLYRGSTIYPTPIGQAATFDPALVERMSRETAIEMRA------SGMHW 182

Query: 165 -WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ--DVEGHENATDLNSRPLKV 221
            ++PN+ VARD RWGR+ ET GEDP++VG+     VRG Q  D  G++          KV
Sbjct: 183 TFTPNVEVARDARWGRVGETFGEDPYLVGQMGAATVRGFQTKDFTGND----------KV 232

Query: 222 SSCCKHY--AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
            +C KH    +   +   G       A ++E+ ++E F  PF+ C++ G   +VM ++N 
Sbjct: 233 IACAKHLVGGSQPANGINGAP-----AELSERTLQEVFFPPFKDCLEAG-VFTVMTAHNE 286

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
           +NGIP   +  L+ + +R +W   G++V+D   I+ M D H  +A++ +DA   ++ AG+
Sbjct: 287 LNGIPCHGNKYLMTEVLRNQWKFDGFVVSDWMDIERMHDYHN-VAETLKDAYQISVDAGM 345

Query: 340 DLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--I 396
            +   G  +       V++G + E  ID ++  +  V  RLG F+    ++ L K+D  +
Sbjct: 346 GMHMHGPEFYEAIIECVKEGSIPEKQIDAAVSKILEVKFRLGLFENP--FIDLKKKDEIV 403

Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA-GIPC 455
            ++++ + A E AR+ IVLLKN+ N LPL+++K K V V G +AN   +++G++A   P 
Sbjct: 404 FNEKHQQTALEGARKSIVLLKNEGNMLPLDASKYKKVFVTGHNAN-NQSILGDWAMEQPE 462

Query: 456 RYMSPIAGFSGYANVTYKTGCD------DVACKSNNSIFAASEAAKTADATIILAG---- 505
            +++ +    G   ++ +T  +      +V   S+N I  A + A+++D  I++ G    
Sbjct: 463 EHVTTV--LKGLKAISPETNYNFLDLGWNVRLLSDNQIKEAVQQARSSDLAILVVGENSM 520

Query: 506 ---LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
               +     E+ DR +L LPG Q +L+  VA      V++++    G  +     + N+
Sbjct: 521 RYHWNEKTCGENSDRYELSLPGRQQELVEAVAATGVPTVVILV---NGRPLTTEWIDENM 577

Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
             I+ A  PG  GG+A+A++++GK NP G+LPIT         +P ++  ++ + +  + 
Sbjct: 578 PCIIEAWEPGVAGGQALAEILYGKVNPSGKLPIT---------IPRSTGQIQCMYNHKFT 628

Query: 623 GRTYKFYNGPTL--YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
              + +  G +L  Y FGYGLSYT +KY  L  ++                       T 
Sbjct: 629 NHWFPYATGNSLPLYEFGYGLSYTTYKYENLKLSEA----------------------TI 666

Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
            P         D   +  VD  N G  DG + V +Y +     A   +K++  F R+ ++
Sbjct: 667 TP---------DKSVKVTVDVTNTGKMDGEETVQLYIRDEYSSATRPVKELKDFARIPLK 717

Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           AG  K + F     + L+  D   +  +  G   I VG
Sbjct: 718 AGETKEVSFTLTP-EMLSYYDANMHYGVEKGTFKIMVG 754


>gi|347736643|ref|ZP_08869226.1| xylosidase/arabinosidase [Azospirillum amazonense Y2]
 gi|346919803|gb|EGY01181.1| xylosidase/arabinosidase [Azospirillum amazonense Y2]
          Length = 775

 Score =  268 bits (685), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 218/725 (30%), Positives = 335/725 (46%), Gaps = 109/725 (15%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P   +  E LHG   +GP           TSFP  I   +S++  L +++   V+ 
Sbjct: 121 RLGIPVL-FHEEGLHGYPAIGP-----------TSFPQAIAQASSWDPDLIREVDSVVAR 168

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R      R      SP ++VARDPRWGRI ET GEDP++ G   V  V+GLQ      
Sbjct: 169 EIRV-----RGVSLVLSPVVDVARDPRWGRIEETFGEDPYLAGEMGVAAVQGLQG----- 218

Query: 210 NATDLNSRPL---KVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
                +S PL   KV +  KH   +   ++   V      A V E+ + E F  PFE  +
Sbjct: 219 -----DSLPLADGKVFATLKHLTGHGQPESGTNVG----PASVGERTLREMFFPPFEQVI 269

Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
              +  +VM SYN ++G+PS  +  LL+  +RGEW   G I++D  +I  +V  H  + D
Sbjct: 270 HRTNVRAVMASYNEIDGVPSHVNTWLLHDILRGEWGYKGSIISDYSAIDQLVSIHHVVPD 329

Query: 326 SKEDAVAQTLKAGLDLDC--GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
               A+ + ++AG+D D   G+ Y +   ++V+ GK+KE  ID++++ +  +  + G F+
Sbjct: 330 LPSAAI-RAIQAGVDADLPDGESYASLA-DSVRAGKIKEEVIDRAVRRILELKFQAGLFE 387

Query: 384 GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
                    +    + E   +A +AA++ +VLLKND   LPL+ AKVKT+AV+GP  NA 
Sbjct: 388 HPYADADKAEALTANGEARAVALKAAQKSVVLLKND-GVLPLDMAKVKTLAVIGP--NAA 444

Query: 444 VAMIGNYAGIPCRYMSPIAGFS----GYANVTYKTGC----DD--------VACKSNNS- 486
            A +G Y+G P + +S + G          VTY  G     DD        +A  + N+ 
Sbjct: 445 KAHLGGYSGEPKQTVSILDGIKAKVGARVKVTYAEGVRITKDDDWYGDTVELADPAENAR 504

Query: 487 -IFAASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQTQLINQVAEVAKG 539
            I  A   AKTAD  +++ G +     E        DR+ L L G Q  L   +  + K 
Sbjct: 505 LIQQAVAVAKTADHIVLVIGDNEQTSREGWANNHLGDRDSLDLVGQQNDLAKALFALGK- 563

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PV++V+ +  G  ++  +      A++   Y G+EGG A+ADV+FG  NPGG+LP+T   
Sbjct: 564 PVVVVLQN--GRPLSVVDVAARANALVEGWYLGQEGGTAMADVLFGDVNPGGKLPVTVAR 621

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
              V  LP+        +      R Y F     L+PFGYGLSYT F             
Sbjct: 622 S--VGQLPMF------YNKKPSARRGYLFDTTDPLFPFGYGLSYTTFDVG---------- 663

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                 P +    +  D      VD +N G   G +VV +Y   
Sbjct: 664 ---------------------SPRLSTPTIAKDGAITVAVDVRNTGKRAGDEVVQLYLHQ 702

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
                   +K++ GFQR+ +  G ++ + F  +  K+L + +     ++  G   I VG+
Sbjct: 703 QVASVTRPVKELKGFQRITLAPGESRTVTFTVDG-KALALWNQDMKRVVEPGAFDIMVGD 761

Query: 780 GGVSF 784
             V  
Sbjct: 762 NSVDL 766


>gi|380696432|ref|ZP_09861291.1| beta-glucosidase [Bacteroides faecis MAJ27]
          Length = 954

 Score =  268 bits (685), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 226/749 (30%), Positives = 355/749 (47%), Gaps = 109/749 (14%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + D SLP   RV+ L++ MT ++K++ +  G    G+P L +P      EA+HG S    
Sbjct: 170 YMDVSLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 227

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
                    GAT FP  +   A++N+ L +++   +  E  A  N  +A    WSP ++V
Sbjct: 228 ---------GATIFPQALAMGATWNKKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 273

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
           A+D RWGR  ET GEDP +V +    +++G Q            SR L  +   KH+  +
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQMGGAWIKGYQ------------SRGLFTTP--KHFGGH 319

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
                 G D +  D  ++E++M E  L PF   ++  D  S+M +Y+   G+P     +L
Sbjct: 320 GAP-LGGRDSH--DIGLSEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKEL 376

Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF- 350
           L Q +R EW  +G+IV+DC +I  +     + A  K +A  Q L AG+  +CG  Y N  
Sbjct: 377 LQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKE 436

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDENIELAA 406
              A + G++   D+D   + + + + R   F+ +P    L  + I     SD + E+A 
Sbjct: 437 VIQAAKDGRINMEDLDNVCRTMLSTMFRNELFEKNP-CKPLDWKKIYPGWNSDSHKEMAR 495

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYMSPIAGF 464
           +AARE IV+L+N +N LPL S  ++T+AVVGP A+      G+Y    +P +  S + G 
Sbjct: 496 QAARESIVMLENKENLLPL-SKTLRTIAVVGPGADDLQP--GDYTPKLLPGQLKSVLTGI 552

Query: 465 SG----YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-------- 512
                    V Y+ GCD     + N I  A + A  +D  I++ G   + EA        
Sbjct: 553 KSAVGKQTKVLYEQGCDFTNPDATN-IPKAVKTASQSDVVIMVLGDCSTSEATNDVRKTC 611

Query: 513 -ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
            E+ D   L LPG Q +L+  V    K PVIL++ +    DI  A  +   KAIL    P
Sbjct: 612 GENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDILKA--SEMCKAILVNWLP 668

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+EGG A+ADV+FG +NP GRLP+T+    +V  LPL         +    GR Y++ + 
Sbjct: 669 GQEGGPAMADVLFGDYNPAGRLPMTFPR--HVGQLPLYY-------NFKTSGRRYEYVDM 719

Query: 632 P--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
               LY FG+GLSYT F+Y+ L           K+Q   N N                  
Sbjct: 720 EYYPLYRFGFGLSYTSFEYSNL-----------KIQEKANGN------------------ 750

Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
                 E +   +NVGS  G +V  +Y         T + ++  F R+ ++ G +K + F
Sbjct: 751 -----VEVQATVKNVGSCAGDEVAQLYVTDMYASVKTRVMELKDFTRIHLQPGESKTVSF 805

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                  +++++   + ++  GE  I +G
Sbjct: 806 EMTPY-DISLLNDRMDRVVEKGEFKIMIG 833


>gi|423217451|ref|ZP_17203947.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
           CL03T12C61]
 gi|392628610|gb|EIY22636.1| hypothetical protein HMPREF1061_00720 [Bacteroides caccae
           CL03T12C61]
          Length = 946

 Score =  268 bits (685), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 235/819 (28%), Positives = 371/819 (45%), Gaps = 146/819 (17%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS------ 100
           + D + P   R++DL+S+MTL+EK  Q+    +G  R+    LP  EW    W       
Sbjct: 53  YEDPTAPIDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTSEWKNQLWKDGIGAI 111

Query: 101 -EALHGVSNVG-PGTHFDDVIPG------------------------------------- 121
            E L+G    G P +  + V P                                      
Sbjct: 112 DEHLNGFQQWGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 171

Query: 122 -ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWGR 179
            AT+FPT +    ++N  L  ++G     EAR +      G T  ++P ++V RD RWGR
Sbjct: 172 KATNFPTQLGLGHTWNRQLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWGR 225

Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
             E  GE P++V    +  VRG+Q    H +         +V++  KH+ AY  +     
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQ----HNH---------QVAATGKHFIAYSNNKGARE 272

Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
                D +++ +++E     PF+  ++E     VM SYN  +G P  +    L   +RGE
Sbjct: 273 GMARVDPQMSPREVEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGE 332

Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAV 355
               GY+V+D D+++ +   H    D KE AV Q+++AGL++ C       Y       V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREGI 413
           ++G + E  I+  ++ +  V   +G FD +P    L   D  +   EN E+A +A+RE I
Sbjct: 392 KEGGLSEEVINDRVRDILRVKFLVGLFD-TPYQTDLKGADEEVEKKENEEVALQASRESI 450

Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----FSGYAN 469
           VLLKN++N LPL+ +K++ +AV GP+A+     + +Y  +     S + G        A+
Sbjct: 451 VLLKNEKNVLPLDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKDKAD 510

Query: 470 VTYKTGCDDVAC--------------KSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
           V Y  GCD V                +    I  A   AK AD  I++ G       E+ 
Sbjct: 511 VLYTKGCDLVDANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIVVLGGGQRTCGENK 570

Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
            R  L LPG Q  L+  V    K PV+LV+++   + I +A+    + AIL A YPG +G
Sbjct: 571 SRSSLDLPGRQLDLLKAVVATGK-PVVLVLINGRPLSINWAD--KFVPAILEAWYPGSKG 627

Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGR--TYKFYN 630
           G A+AD++FG +NPGG+L +T+     V  +P  + P +P   +D    PG        N
Sbjct: 628 GIAVADILFGDYNPGGKLTVTFPK--TVGQIPF-NFPCKPSSQIDGGKNPGPDGNMSRAN 684

Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
           G  LYPFGYGLSYT F+Y+ L  +                           P ++  + +
Sbjct: 685 G-ALYPFGYGLSYTTFEYSDLKIS---------------------------PAIITPNQK 716

Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
              Y   KV   N G   G +V+ +Y +       TY K ++GF+RV ++ G  K I F 
Sbjct: 717 A--YVTCKV--TNTGKRSGDEVIQLYVRDVLSSVTTYEKNLVGFERVHLKPGETKEITFP 772

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLN 789
            +  K+L +++   + ++  G+ T+ +  G  S  I LN
Sbjct: 773 IDR-KALELLNADMHWVVEPGDFTLML--GASSTDIRLN 808


>gi|329850151|ref|ZP_08264997.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
 gi|328842062|gb|EGF91632.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
          Length = 877

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 163/450 (36%), Positives = 243/450 (54%), Gaps = 47/450 (10%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           +++  + D++L    R  DLVSRMTL+EK  QLG  A  +PRLG+P+Y WW+E LHGV+ 
Sbjct: 18  VAAMAYRDTALDPKARAADLVSRMTLEEKAAQLGHTAPAIPRLGVPKYNWWNEGLHGVAR 77

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-------- 160
            G           AT FP  I   A+++E +   +G  VSTE RA Y + R         
Sbjct: 78  AGV----------ATVFPQAIGMAATWDEPMMTTVGDVVSTEFRAKY-VERVHPDGGTDW 126

Query: 161 --GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
             GLT WSPNIN+ RDPRWGR  ET GEDP++  R  + Y+ GLQ           + + 
Sbjct: 127 YRGLTVWSPNINIFRDPRWGRGQETYGEDPYLTSRIGIGYIHGLQGN---------DPKF 177

Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
            K  +  KH+A   V +    +R+  D   ++ D+E+T+L  F   V EG A SVMC YN
Sbjct: 178 FKTVATSKHFA---VHSGPESNRHKEDVYPSKFDLEDTYLPAFRATVTEGKAYSVMCVYN 234

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCD-SIQVMVDNHKFLADSKEDAVAQTLKA 337
            V G+P CA   L+ + +R  W   G++V+DC  +  +  ++      + E+ VA  LKA
Sbjct: 235 AVYGVPGCASDFLMEEKLRQNWGFPGFVVSDCGAAANIFREDALHYTKTAEEGVAVGLKA 294

Query: 338 GLDLDCGQYYTNFTG------NAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYV 389
           G+DL CG Y    +       NAV+ G++    +D++L  L+   +RLG FD   S  + 
Sbjct: 295 GMDLICGDYRNKMSTEVQPIINAVKAGQLPIAVVDQALVRLFEGRIRLGMFDPPASLPFA 354

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
            +   D  +  +  +A + A++ +VLLKND   LPL  A+ KT+AV+GP+A++  A++GN
Sbjct: 355 HITADDSDTPAHHAVALDMAKKSMVLLKND-GLLPLK-AEPKTIAVIGPNADSLDALVGN 412

Query: 450 YAGIPCRYMSPIAGFSGY---ANVTYKTGC 476
           Y G P + ++ + G       A + Y  G 
Sbjct: 413 YYGKPSKPVTVLDGIRARFPTAKIVYAEGT 442



 Score =  122 bits (305), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 96/309 (31%), Positives = 144/309 (46%), Gaps = 73/309 (23%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A + AKTAD  + + GL   VE E +          DR  + LP  Q QL+ +V    K 
Sbjct: 591 AVDVAKTADFVVFVGGLSARVEGEEMKVEAEGFAGGDRTSIDLPKPQQQLLEKVIGTGK- 649

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           P +LV+MS   + + +A  + ++ AI+ A YPG EGG A+A ++ G ++P GRLP+T+Y 
Sbjct: 650 PTVLVLMSGSALGVNWA--DKHVPAIIEAWYPGGEGGHAVAQLIAGDYSPAGRLPVTFY- 706

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPG--------RTYKFYNGPTLYPFGYGLSYTQFKYNLL 651
                         R VD+L  PG        RTY+++NG  LYPFG+GLSYT F Y   
Sbjct: 707 --------------RSVDAL--PGFSDYTMKNRTYRYFNGEVLYPFGHGLSYTTFAYA-- 748

Query: 652 SFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSD 711
                                         P V    +         VD  N G+ D  +
Sbjct: 749 -----------------------------NPKVSAASVAAGSSVTVSVDVSNSGAMDSDE 779

Query: 712 VVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAG 771
           VV +Y   P     T I+ + GFQRV ++ G  K ++F  +  ++L++VD      + AG
Sbjct: 780 VVQLYVSHP---GGTAIRSLQGFQRVSLKKGETKTVQFKLDD-RALSVVDEHGGRKVQAG 835

Query: 772 EHTIFVGNG 780
           +  +++G G
Sbjct: 836 QVDLWIGGG 844


>gi|170731072|ref|YP_001776505.1| beta-glucosidase [Xylella fastidiosa M12]
 gi|167965865|gb|ACA12875.1| Beta-glucosidase [Xylella fastidiosa M12]
          Length = 882

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 164/423 (38%), Positives = 238/423 (56%), Gaps = 40/423 (9%)

Query: 68  LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
           LV++MT  EK+ Q  + A  +PRLG+P Y+WWSE LHG++  G           AT FP 
Sbjct: 37  LVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YATVFPQ 86

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNL----GR-----AGLTYWSPNINVARDPRWG 178
            I   AS+N  L + +G   STEARA +NL    G+     AGLT WSPNIN+ RDPRWG
Sbjct: 87  AIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFRDPRWG 146

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  ET GEDP++  + AV+++RGLQ         ++   P  +++  KH+A   V +   
Sbjct: 147 RGMETYGEDPYLTSQLAVSFIRGLQG--------NIPDHPRTIATP-KHFA---VHSGPE 194

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
             R+ FD  V+  D+E T+   F   + +G A SVMC+YN ++G P+CA   LLN  +R 
Sbjct: 195 PGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRN 254

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQG 358
           +W  +G++V+DCD+I+ M   H F  D+   A A  LK+G DL+CG  Y +    A+ +G
Sbjct: 255 DWGFNGFVVSDCDAIEDMTRFHFFRQDNAS-ASAAALKSGDDLNCGNTYRDLN-QAIARG 312

Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLL 416
            + E+ +D++L  L+T   RLG         Y ++G + I +  +  LA +AA + +VLL
Sbjct: 313 DIDESTLDQALIRLFTARQRLGTLQPREHDPYAAIGIKHIDTPAHRALALQAAAQSLVLL 372

Query: 417 KNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYK 473
           KN  NTLPL      T+AV+GP A++  A+  NY G     ++P+ G     G A V Y 
Sbjct: 373 KNSGNTLPLTPG--TTLAVLGPDADSLTALEANYQGTSSTPVTPLIGLRTRFGTAKVHYA 430

Query: 474 TGC 476
            G 
Sbjct: 431 QGA 433



 Score =  146 bits (368), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 100/301 (33%), Positives = 140/301 (46%), Gaps = 55/301 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A  A   ADA +   GL   VE E L          DR  + LP  Q  L+  V    K 
Sbjct: 604 AERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK- 662

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           P+I+V+MS   V + +A+ + +  AIL A YPG+ GG AIA  + G  NPGGRLP+T+Y 
Sbjct: 663 PLIVVLMSGSAVALNWAQHHAD--AILAAWYPGQSGGTAIAQALAGDVNPGGRLPMTFYR 720

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
               Q LP       P  S    GRTY+++ G  LYPFGYGLSYTQF Y           
Sbjct: 721 S--TQDLP-------PYISYDMTGRTYRYFKGQPLYPFGYGLSYTQFAYE---------- 761

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                 P +    L+  D        +N G+  G +VV +Y +P
Sbjct: 762 ---------------------APQLSTATLKAGDTLTVTAHVRNTGTRAGDEVVQLYLEP 800

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           P    A  ++ ++GF+RV +R G ++ + F  +A + L+ V       + AG + +FVG 
Sbjct: 801 PHSPQAP-LRNLVGFKRVTLRPGESRLLTFTLDA-RQLSSVQQTGQRSVEAGHYHLFVGG 858

Query: 780 G 780
           G
Sbjct: 859 G 859


>gi|261880507|ref|ZP_06006934.1| xylosidase [Prevotella bergensis DSM 17361]
 gi|270332847|gb|EFA43633.1| xylosidase [Prevotella bergensis DSM 17361]
          Length = 948

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 223/806 (27%), Positives = 350/806 (43%), Gaps = 140/806 (17%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQL-------------------------------G 82
           + D+S P + R+ DL+ +MT++EK  Q+                                
Sbjct: 61  YEDASAPLNDRINDLLEQMTIEEKTNQMVTLYGYKRVLEDDLPNAGWKQKLWKDGIGAID 120

Query: 83  DFAHGVPRLGLPQYE--W-WSEALHG----------VSNVGPGTHFDDVIPG-------- 121
           +  +G  + GLP  +  W W  + H           V     G   D    G        
Sbjct: 121 EHLNGFVQWGLPPSDNPWVWPASKHAWAINEVQRFFVEETRLGIPVDFTNEGIRGIESYK 180

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWGRI 180
           AT+FPT +    ++N  L +++G     EAR +      G T  ++P ++V RD RWGR 
Sbjct: 181 ATNFPTQLGLGTTWNRQLIRQVGYITGREARLL------GYTNVYAPILDVGRDQRWGRY 234

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            E  GE PF+V    +   RGLQ        TD      +V+S  KH+AAY  +      
Sbjct: 235 EEIYGESPFLVAELGIQMTRGLQ--------TDF-----QVASTAKHFAAYSNNKGGREG 281

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
               D ++  +++E   L P+E  V+E      M SYN  +GIP       L + +R  +
Sbjct: 282 MSRVDPQMPPREVENIHLYPWERVVQEAGLLGAMSSYNDYDGIPIQGSYHWLTEVLRHRF 341

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAVQ 356
              GYIV+D D+++ +   H   AD KE AV Q + AGL++ C       +       ++
Sbjct: 342 GFRGYIVSDSDALEYLFSKHHTAADMKE-AVYQAVMAGLNVRCTFRSPDSFVLPLRELIR 400

Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY-VSLGKQDICSDENIELAAEAAREGIVL 415
           +G++  + ID+ +  +  V    G FD   Q  +    Q++ S+ N  +A +A+R+ IVL
Sbjct: 401 EGRIPMSVIDRLVGDILRVKFITGIFDNPYQMNLKAADQEVNSERNQAVALQASRQSIVL 460

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA----NVT 471
           LKN    LPL+ +K++ + V GP+A+     + +Y  +     + + G          V+
Sbjct: 461 LKNQDRLLPLDRSKLRRILVCGPNADDASYALTHYGPLAVDVTTVLEGIRDKVENNIEVS 520

Query: 472 YKTGCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDR 517
           Y  GCD V                +    I  A   AK +D  I++ G +     E+  R
Sbjct: 521 YAKGCDVVDPHWPESEIIGYPMTSQEQQDIDHAVALAKESDVAIVVLGGNSRTCGENKSR 580

Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
             L LPG Q  L+  V    K PV+LV+++   + + +A+    I AI+ A YPG +GG 
Sbjct: 581 SSLDLPGRQLDLLKAVQATGK-PVVLVLINGRPLSVNWAD--RFIPAIVEAWYPGSQGGT 637

Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLT--SMPLRPVD---SLGYPGRTYKFYNGP 632
           A+ADV+FG +NPGG+L +T+     V  +P    S P   VD    LG  G   +  NG 
Sbjct: 638 AVADVLFGDYNPGGKLTVTFPKS--VGQIPFNFPSKPASQVDGGNKLGLQGNASRI-NG- 693

Query: 633 TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCD 692
            LY FG+GLSYT FKY+ L  +K                                 +  +
Sbjct: 694 ALYSFGHGLSYTTFKYSNLRLSKET-------------------------------MTLN 722

Query: 693 DYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFN 752
           D      D  N G  +G +VV +Y +       TY K + GF R+ ++ G  K + F   
Sbjct: 723 DSINISCDVSNTGDREGDEVVQLYIRDVISSVTTYEKNLRGFDRIHLKPGETKTLTFTIK 782

Query: 753 ACKSLNIVDYAANTLLPAGEHTIFVG 778
             + L +V+     ++  GE  I +G
Sbjct: 783 P-EHLKLVNKDFEKVVEPGEFKIMIG 807


>gi|313204470|ref|YP_004043127.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312443786|gb|ADQ80142.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 746

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 223/762 (29%), Positives = 356/762 (46%), Gaps = 111/762 (14%)

Query: 68  LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG----------------- 110
           L+ +MTLDEK+ QL  ++      G    E   E       VG                 
Sbjct: 32  LIRQMTLDEKIGQLNQYSSDWESTGKITAEGDKETQIRQGKVGSMLNVTGVDKTRKLQEL 91

Query: 111 --------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG 161
                   P     DVI G  T+FP  +  TAS++ +L +K  +  +TEA A       G
Sbjct: 92  AMQSRLHIPMIFGLDVIHGFRTTFPIPLGETASWDLALIEKSARIAATEASAY------G 145

Query: 162 LTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ-DVEGHENATDLNSRPL 219
           + + ++P +++ARDPRWGR+ E  GED ++    A   V G Q +  G+ +A        
Sbjct: 146 VQWTFAPMVDIARDPRWGRVMEGAGEDTYLGSLVAKARVHGFQGNGLGNVDA-------- 197

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
            + +C KH+AAY      G D    D  + +  + ET+L PF+  V E + ++ M S+N 
Sbjct: 198 -IMACAKHFAAYGA-AIGGRDYNSVDMSLRQ--LNETYLPPFKAAV-EANVATFMNSFND 252

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
           +NGIP+ A+  +    ++G+W+  G++V+D  SI  M+  H +  DS  DA  + + AG 
Sbjct: 253 INGIPATANKYIQRDILKGQWNFKGFVVSDWGSIGEMI-AHGYAKDSY-DAAMKAINAGS 310

Query: 340 DLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICS 398
           D+D   + Y N     VQ GKV  + ID+++K +      LG FD   ++ +  ++   +
Sbjct: 311 DMDMESRCYRNNLKQLVQDGKVDISVIDEAVKRILVKKFELGLFDDPYRFCNAAREKKQT 370

Query: 399 D--ENIELAAEAAREGIVLLKND-----QNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
           +  EN   A E  ++ IVLLKN+     +  LPL S + KTVA++GP   AT A  G ++
Sbjct: 371 NNPENRAFAREIGKKSIVLLKNEPLSNGKTLLPL-SKQTKTVALIGPLFKATKANHGFWS 429

Query: 452 -GIPCRYMSPIAGFSGYAN-------VTYKTGCDDVACKSNNSIFAASEAAKTADATIIL 503
              P      I+ + G  N       + Y  GC+ +          A  AAK+AD  I+ 
Sbjct: 430 IAFPDDSTRIISQYQGIKNQLDKSSSIVYAKGCN-INDNDKTGFAEAINAAKSADVVIMS 488

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
            G    +  E+  + +L LPG Q +L+ ++ +  K PV+L++ +  G  + F   + NI 
Sbjct: 489 LGEAADMSGEAKSKSNLQLPGVQEELLKEIYKTGK-PVVLLLNA--GRPLIFNWASDNIP 545

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPVD 617
           +IL+  + G E G AIADV+FG +NP G+LPI++   +    +P+      T  P +  +
Sbjct: 546 SILYTWWLGTEAGNAIADVLFGDYNPAGKLPISFPRTE--GQIPIYYNHFNTGRPAKDEN 603

Query: 618 SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDAS 677
              Y        N P  YPFGYGLSYT+F                      NL  +SD  
Sbjct: 604 DKNYVSAYIDLQNSPK-YPFGYGLSYTKF-------------------DISNLKLSSDK- 642

Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
                      L   +     VD  N G+ DG +VV +Y +         +K++ GFQ++
Sbjct: 643 -----------LSSGNKLTVTVDIANTGNYDGEEVVQLYVRDLVGSVVRPVKELKGFQKL 691

Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            ++ G  K++ F     + L   +     +  AG++ +FVGN
Sbjct: 692 MLKKGETKQLTFTLTP-EDLKFFNNEIQYINEAGDYELFVGN 732


>gi|383115617|ref|ZP_09936373.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
 gi|313694979|gb|EFS31814.1| hypothetical protein BSGG_2514 [Bacteroides sp. D2]
          Length = 946

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 238/831 (28%), Positives = 375/831 (45%), Gaps = 149/831 (17%)

Query: 42  FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW 98
           F+K G++    ++ D S P   R++DL+ +MTL+EK  Q+    +G  R+    LP  EW
Sbjct: 44  FNKNGMKD---IYEDPSAPVDARIEDLLKQMTLEEKTCQMVTL-YGYKRVLKDDLPTPEW 99

Query: 99  ----WS-------EALHGVSNVG-PGTHFDDVIPG------------------------- 121
               W        E L+G    G P +  + V P                          
Sbjct: 100 KNQLWKDGIGAIDEHLNGFQQWGLPPSDNEYVWPASRHAWALNEVQRFFIEETRLGIPTD 159

Query: 122 -------------ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSP 167
                        AT+FPT +    ++N  L +++G     EAR +      G T  ++P
Sbjct: 160 FTNEGIRGVESYKATNFPTQLGLGHTWNRELIRQVGVITGREARML------GYTNVYAP 213

Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
            ++V RD RWGR  E  GE P++V    +  VRG+Q             +  +V++  KH
Sbjct: 214 ILDVGRDQRWGRYEEVYGESPYLVAELGIEMVRGMQ-------------QDYQVAATGKH 260

Query: 228 YAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCA 287
           + AY  +          D +++ +++E   + PF+  ++E     VM SYN  +G P  +
Sbjct: 261 FIAYSNNKGGREGMSRVDPQMSPREVEMVHVYPFKRVIREAGLLGVMSSYNDYDGFPIQS 320

Query: 288 DPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG--- 344
               L   +RGE    GY+V+D D+++ +   H    D KE AV Q+++AGL++ C    
Sbjct: 321 SYYWLTTRLRGEMGFRGYVVSDSDAVEYLYTKHNTAKDMKE-AVRQSVEAGLNVRCTFRS 379

Query: 345 -QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDEN 401
              Y       V++G + E  I+  ++ +  V   +G FD  P    L   D  +   EN
Sbjct: 380 PDSYVLPLRELVKEGGLSEEVINDRVRDILRVKFLVGLFD-HPYQTDLKGADEEVEKAEN 438

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
            E+A +A+RE IVLLKNDQ+ LPL+ + +K +AV GP+A+     +G+Y  +     S +
Sbjct: 439 EEVALQASRESIVLLKNDQDVLPLDISGIKKIAVCGPNADECSYALGHYGPLAVEVTSVL 498

Query: 462 AGFS----GYANVTYKTGCDDVAC--------------KSNNSIFAASEAAKTADATIIL 503
            G      G   V Y  GC+ V                +    I  A   AK AD  +++
Sbjct: 499 KGIQEKTDGKVEVLYSKGCELVDANWPESELIDFPLTEEEQKEIDRAVSQAKEADVAVVV 558

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
            G       E+  R  L LPG Q  L+  V    K PV+LV+++   + I +A  +  + 
Sbjct: 559 LGGGQRTCGENKSRSSLDLPGRQLDLLKAVVATGK-PVVLVLINGRPLSINWA--DKFVP 615

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLG 620
           AIL A YPG +GG+A+ADV+FG +NPGG+L +T+     V  +P  + P +P   +D   
Sbjct: 616 AILEAWYPGAKGGKAVADVLFGDYNPGGKLTVTFPK--TVGQIPF-NFPCKPSSQIDGGK 672

Query: 621 YPGR--TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
            PG        NG  LY FG+GLSYT F+Y+ L  T                        
Sbjct: 673 NPGMDGNMSRANG-ALYAFGHGLSYTSFEYSDLKIT------------------------ 707

Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
              P V+  + +   Y   KV   N G   G +VV +Y +       TY K + GF+R+ 
Sbjct: 708 ---PAVITPNQKT--YVTCKV--TNTGKRAGDEVVQLYVRDVLSSVTTYEKNLAGFERIH 760

Query: 739 VRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLN 789
           ++ G  K + F  +  K+L +++   + ++  G+ T+ V  G  S  I LN
Sbjct: 761 LKPGETKEVFFPIDR-KALELLNADMHWVVEPGDFTLMV--GASSTDIRLN 808


>gi|298387489|ref|ZP_06997041.1| periplasmic beta-glucosidase [Bacteroides sp. 1_1_14]
 gi|298259696|gb|EFI02568.1| periplasmic beta-glucosidase [Bacteroides sp. 1_1_14]
          Length = 950

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 228/749 (30%), Positives = 354/749 (47%), Gaps = 109/749 (14%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + D+SLP   RV+ L++ MT ++K++ +  G    G+P L +P      EA+HG S    
Sbjct: 166 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 223

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
                    GAT FP  +   A++N  L +++   +  E  A  N  +A    WSP ++V
Sbjct: 224 ---------GATIFPQALAMGATWNRKLTEEVAMVIGDETVAA-NTKQA----WSPVLDV 269

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
           A+D RWGR  ET GEDP +V +    +++G Q            SR L  +   KH+  +
Sbjct: 270 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SRGLFTTP--KHFGGH 315

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
                 G D +  D  ++E++M E  L PF   ++  D  S+M +Y+   G+P     +L
Sbjct: 316 GAP-LGGRDSH--DIGLSEREMREIHLVPFRHAIRNYDCQSLMMAYSDYMGVPVAKSKEL 372

Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF- 350
           L Q +R EW  +G+IV+DC +I  +     + A  K +A  Q L AG+  +CG  Y N  
Sbjct: 373 LQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKE 432

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDENIELAA 406
              A + G++   D+D   + +   + R   F+ +P    L  + I     SD + E+A 
Sbjct: 433 VIQAAKDGRINMEDLDNVCRTMLGTMFRNELFEKNP-CKPLDWKKIYPGWNSDSHKEMAR 491

Query: 407 EAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYMSPIAGF 464
           +AARE IV+L+N +N LPL S  + T+AV+GP A+      G+Y    +P +  S + G 
Sbjct: 492 QAARESIVMLENKENLLPL-SKTLCTIAVLGPGADDLQP--GDYTPKLLPGQLKSVLTGI 548

Query: 465 SG----YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-------- 512
            G       V Y+ GCD       N I  A +AA  +D  I++ G   + EA        
Sbjct: 549 KGAVGKQTKVLYEQGCDFTNPDETN-IPKAVKAASQSDVVIMVLGDCSTSEATNDVRKTC 607

Query: 513 -ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
            E+ D   L LPG Q +L+  V    K PVIL++ +    DI  A  +   KAIL    P
Sbjct: 608 GENNDWATLILPGKQQELLEAVCATGK-PVILILQAGRPYDILKA--SEMCKAILVNWLP 664

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+EGG A+ADV+FG +NP GRLP+T+    +V  LPL         +    GR Y++ + 
Sbjct: 665 GQEGGPAMADVLFGDYNPAGRLPMTFPR--HVGQLPLYY-------NFKTSGRRYEYVDM 715

Query: 632 P--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
               LY FG+GLSYT F+Y+ L           K+Q   N N                  
Sbjct: 716 EYYPLYRFGFGLSYTSFEYSNL-----------KIQEKANGN------------------ 746

Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
                 E +   +NVGS  G +V  +Y         T + ++  F R+ ++ G +K + F
Sbjct: 747 -----VEVQATVKNVGSRAGDEVAQLYVTDMYASVKTRVMELKDFARIHLQPGESKTVSF 801

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                  +++++   + ++  GE  I VG
Sbjct: 802 EMTPY-DISLLNDRMDRVVEKGEFKIMVG 829


>gi|358342292|dbj|GAA27551.2| probable beta-D-xylosidase 7 [Clonorchis sinensis]
          Length = 826

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 219/831 (26%), Positives = 373/831 (44%), Gaps = 157/831 (18%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF--------AHGVPRLGLPQYEWWSEALHG 105
           F + SLP + RV DL++R+T +E +QQ+ +         A G+ RL +  Y+W       
Sbjct: 29  FRNPSLPANFRVDDLLARLTNEELIQQVSNGGAGPQHGPAPGIARLNISAYQW------- 81

Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY- 164
            +N G G          T FP  +   A+F+     ++ +A   E RA +N  +A  TY 
Sbjct: 82  RTNPGDGR--------ITPFPQPVNLGATFDVHTVYRVARATGLEMRARWNRAKAKKTYR 133

Query: 165 -------WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE----NATD 213
                  ++P +N+ R P WGR  ET GEDPF++G+ A  +VRGL   +  E    +  +
Sbjct: 134 DGNGIHLFAPVVNLLRHPLWGRNQETFGEDPFMIGKLARTFVRGLGGWKNAEPQSLDEQN 193

Query: 214 LNSRP--LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDAS 271
           L+S+P  L V + CKH+A +       V R  F+A VT+ D+ +T+L  F  C++ G A 
Sbjct: 194 LSSQPDVLLVGANCKHFAVHTGPEDFPVSRLSFEANVTDVDLWQTYLPAFRACLEAG-AV 252

Query: 272 SVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAV 331
           SVMC+Y+ +NG P C +  LL + +R +W   G++V DC ++Q ++  H+      E A+
Sbjct: 253 SVMCAYSGINGTPDCINHWLLTELLRQKWKFKGFVVTDCGALQFVIWKHQIFNHYNETAM 312

Query: 332 AQTLKAGLDLDCGQYYT----NFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF---DG 384
           A  ++AG++L+    Y     +   + +  G +    + +  + L+   +  G F   + 
Sbjct: 313 A-AVRAGVNLENSVVYATEVFSTLPHLLASGSLSRDQLIEMARPLFLTRLMQGEFNPVEM 371

Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNS------AKVKTVAVVGP 438
            P  +   ++ I ++++  +A       IVLL+N    LPL +        ++ +A+VGP
Sbjct: 372 DPYRLLAPEEAILNEDHRRVALATTARSIVLLQNRDRFLPLKNNMSDSGGPLRHIAIVGP 431

Query: 439 HANATVAMIGNYAGIPCRYMS-PIAGFSGYANVTYKTGCDDVA-----CKS-NNSIFAAS 491
            A +   + G+Y   P   +  P++   G + ++ +    D+      C S N+    ++
Sbjct: 432 FATSVTELYGHYRTAPEPEIEVPLS--KGLSQLSRRMHASDICTDGGRCSSLNDDALHST 489

Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG-----------P 540
                 D  ++  G    VE E++DR+++ LPG Q +L+ +  +++ G           P
Sbjct: 490 LGYDDLDLIVLSLGTGSEVEGENVDRQNITLPGKQPELLEETLKLSSGLGNSGLSKRTVP 549

Query: 541 VILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK-------------- 586
           +IL++ SAG ++I+ A  N N+KAI W G+PG   G A+  ++ G               
Sbjct: 550 IILLVFSAGPINISRAVENENVKAIFWCGFPGPLVGDAMRHLLLGSSGELFGPSKPISVG 609

Query: 587 -------------------FNPGGRLPITWYNG-DYVQMLPLTSMPLRPVDSLGYPGRTY 626
                              + P  RLP TWY   D +  + +  M            +TY
Sbjct: 610 FHSFQEAYRWDVTPDDGYWWIPAARLPFTWYESIDQLANITVYEM----------TNQTY 659

Query: 627 KFYNG-----------PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
           ++              P LYPFGYGLSY    +NL   +  +  +L              
Sbjct: 660 RYLPTQCHMSSEDCKIPVLYPFGYGLSY---NFNLSGASGFVYSDL-------------- 702

Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK-----Q 730
                 P   V+    +    F V  QN G     +VV VY+K          +     Q
Sbjct: 703 ----IAPSSAVSS---NQRIVFYVTVQNEGPIACEEVVQVYTKWLNRTENDNSRNGPLIQ 755

Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGG 781
           + GF+RV +  G  K++KF     + L +   + NT++P G   + +  GG
Sbjct: 756 LAGFERVRLDVGEYKQLKFTLIPSEHLAVWSLSENTMIP-GRGVLQISVGG 805


>gi|255689951|ref|ZP_05413626.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260624557|gb|EEX47428.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 735

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 215/779 (27%), Positives = 363/779 (46%), Gaps = 119/779 (15%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG----VPRLGLPQYEWWSEALHGVSN 108
           L+ D+ +P   RV DL+SRMTL+EK+ QL  +  G    V  +G        E     + 
Sbjct: 29  LYKDAKVPIEKRVDDLLSRMTLEEKILQLNQYTMGRNNNVNNIG-------EEVKKVPAE 81

Query: 109 VGPGTHFD---------------------------DVIPG-ATSFPTVILTTASFNESLW 140
           +G   ++D                           D I G  T +P  +    S+N  L 
Sbjct: 82  IGSLIYYDTNPTLRNNVQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLGQACSWNPELV 141

Query: 141 KKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYV 199
           +K     + EAR       +G+ + +SP I+VARDPRWGR+ E  GEDP+  G +A   V
Sbjct: 142 EKACAVTAQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFAAASV 195

Query: 200 RGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLR 259
           RG Q         D+++   ++++C KHY  Y         R +    ++ Q + +T+L 
Sbjct: 196 RGYQ-------GDDMSAED-RIAACLKHYIGYGASE---AGRDYVYTEISRQTLWDTYLL 244

Query: 260 PFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN 319
           P+EM VK G A+++M S+N ++GIP  A+   + + ++  W   G+IV+D  +I+ +   
Sbjct: 245 PYEMGVKAG-AATLMSSFNDISGIPGSANHYTMTEILKERWGHDGFIVSDWGAIEQL--K 301

Query: 320 HKFLADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMR 378
           ++ LA +K++A      AGL++D   + Y  +    V++GK+    +D+S++ +  V  R
Sbjct: 302 NQGLAANKKEAAVYAFNAGLEMDMMSHAYDRYMKELVEEGKITMAQVDESVRRVLRVKFR 361

Query: 379 LGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGP 438
           LG F+     V+  K+     +++++AA+ A E +VLLKN+   LPL     K +AVVGP
Sbjct: 362 LGLFERPYTPVTSEKERFFRPQSMDIAAQLAAESMVLLKNENQILPLTDK--KKIAVVGP 419

Query: 439 HANATVAMIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASE 492
            A     ++G++ G      +   Y      F G A + Y  GC      +      A E
Sbjct: 420 MAKNGWDLLGSWCGHGKDTDVVMLYNGLATEFVGKAELRYALGC-RTQGDNRKGFEEALE 478

Query: 493 AAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVD 552
           AA+ +D  ++  G  ++   E+  R  + LP  Q +L  ++ +V K P++LV+++   ++
Sbjct: 479 AARWSDVVVLCLGEMMTWSGENASRSSIALPQIQEELAKELKKVGK-PIVLVLVNGRPLE 537

Query: 553 IAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT--WYNGDYVQMLPLTS 610
           +   E  ++  AIL    PG  G   +A ++ G+ NP G+L +T  + NG          
Sbjct: 538 LNRLEPISD--AILEIWQPGVNGALPMAGILSGRINPSGKLAMTFPYSNG---------Q 586

Query: 611 MPL---RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC 667
           +P+   R     G+ G  YK      LYPFG+GLSYT+FKY +++ +             
Sbjct: 587 IPIYYNRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGVVTLS------------- 632

Query: 668 RNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY 727
                   ASK          ++  +    +V   N G  DG + V  +   P       
Sbjct: 633 --------ASK----------VKRGEKLSAEVTVTNTGKRDGLETVHWFISDPYCSITRP 674

Query: 728 IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPI 786
           +K++  F++  ++AG  K  +F  +  + L  VD      L AGE+ I V +  V   +
Sbjct: 675 VKELKYFEKQSIKAGETKIFRFDIDLERDLGFVDGNGKRFLEAGEYYIQVKDQKVKIEL 733


>gi|383115540|ref|ZP_09936296.1| hypothetical protein BSGG_2590 [Bacteroides sp. D2]
 gi|313695055|gb|EFS31890.1| hypothetical protein BSGG_2590 [Bacteroides sp. D2]
          Length = 770

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 222/818 (27%), Positives = 382/818 (46%), Gaps = 148/818 (18%)

Query: 36  VCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
           +C     S L  Q  S  + D +LP S RV  L+S+MTL+EKV Q+  +  G+  +   +
Sbjct: 10  ICCAIGISTLACQDKSKDYTDPTLPVSERVSSLMSQMTLEEKVAQMCQYV-GLEHMKKAE 68

Query: 96  YEWWSEALHGVSNVG--PGTHFDDV----------------------------------I 119
            +  +E L    + G  P  H  DV                                  I
Sbjct: 69  KDMSAEDLKHSHSQGFYPNLHSSDVEEMTKKGLISSFLHVVKAEEANYLQSLAQQSRLKI 128

Query: 120 P---------------GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
           P               G+T +PT I   A+F+ +L +++ +  + E RA      +G+ +
Sbjct: 129 PLLIGIDAIHGNGLYRGSTIYPTPIGQAATFDPALVERMSRETAIEMRA------SGMHW 182

Query: 165 -WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ--DVEGHENATDLNSRPLKV 221
            ++PN+ VARD RWGR+ ET GEDP++VG+     VRG Q  D  G++          KV
Sbjct: 183 TFTPNVEVARDARWGRVGETFGEDPYLVGQMGAATVRGFQTKDFTGND----------KV 232

Query: 222 SSCCKHY--AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
            +C KH    +   +   G       A ++E+ ++E F  PF+ C++ G   +VM ++N 
Sbjct: 233 IACAKHLVGGSQPANGINGAP-----AELSERTLQEVFFPPFKDCLEAG-VFTVMTAHNE 286

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
           +NGIP   +  L+ + +R +W   G++V+D   I+ M D H  +A++ +DA   ++ AG+
Sbjct: 287 LNGIPCHGNKYLMTEVLRNQWKFDGFVVSDWMDIERMHDYHN-VAETLKDAYRISVDAGM 345

Query: 340 DLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--I 396
            +   G  +       V++G + E  ID ++  +  V  RLG F+    ++ L K+D  +
Sbjct: 346 GMHMHGPEFYEAIIECVKEGSIPEKQIDAAVSKILEVKFRLGLFENP--FIDLKKKDEIV 403

Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA-GIPC 455
            ++++ + A E AR+ IVLLKN+ N LPL+++K K V V G +AN   +++G++A   P 
Sbjct: 404 FNEKHQQTALEGARKSIVLLKNEGNMLPLDASKYKKVFVTGHNAN-NQSILGDWAMEQPE 462

Query: 456 RYMSPIAGFSGYANVTYKTGCD------DVACKSNNSIFAASEAAKTADATIILAG---- 505
            +++ +    G   ++ +T  +      +V   S+N I  A + A+ +D  I++ G    
Sbjct: 463 EHVTTV--LKGLKAISPETNYNFLDLGWNVRLLSDNQIKEAVQQARNSDLAILVVGENSM 520

Query: 506 ---LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
               +     E+ DR +L LPG Q +L+  VA      V++++    G  +     + N+
Sbjct: 521 RYHWNEKTCGENSDRYELSLPGRQQELVKAVAATGVPTVVILV---NGRPLTTEWIDENM 577

Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
             I+ A  PG  GG+A+A++++GK NP G+LPIT         +P ++  ++ + +  + 
Sbjct: 578 PCIIEAWEPGVAGGQALAEILYGKVNPSGKLPIT---------IPRSTGQIQCMYNHKFT 628

Query: 623 GRTYKFYNGPTL--YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
              + +  G +L  Y FGYGLSYT +KY  L  ++                       T 
Sbjct: 629 NHWFPYATGNSLPLYEFGYGLSYTTYKYENLKLSEA----------------------TI 666

Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
            P         D   +  VD  N G  DG + V +Y +     A   +K++  F R+ ++
Sbjct: 667 TP---------DKSVKVTVDVTNTGKMDGEETVQLYIRDEYSSATRPVKELKDFARIPLK 717

Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           AG  K + F     + L+  D   +  +  G   I VG
Sbjct: 718 AGETKEVSFTLTP-EMLSYYDANMHYGVEKGTFKIMVG 754


>gi|28199699|ref|NP_780013.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
 gi|182682443|ref|YP_001830603.1| beta-glucosidase [Xylella fastidiosa M23]
 gi|417557804|ref|ZP_12208815.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
 gi|28057820|gb|AAO29662.1| family 3 glycoside hydrolase [Xylella fastidiosa Temecula1]
 gi|182632553|gb|ACB93329.1| Beta-glucosidase [Xylella fastidiosa M23]
 gi|338179587|gb|EGO82522.1| Beta-glucosidase [Xylella fastidiosa EB92.1]
          Length = 882

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 165/423 (39%), Positives = 237/423 (56%), Gaps = 40/423 (9%)

Query: 68  LVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPT 127
           LV++MT  EK+ Q  + A  +PRLG+P Y+WWSE LHG++  G           AT FP 
Sbjct: 37  LVAQMTRQEKIAQTMNAAPAIPRLGIPAYDWWSEGLHGIARNG----------YATVFPQ 86

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNL----GR-----AGLTYWSPNINVARDPRWG 178
            I   AS+N  L + +G   STEARA +NL    G+     AGLT WSPNIN+ RDPRWG
Sbjct: 87  AIGLAASWNTDLLQHVGTVTSTEARAKFNLTGGPGKDHPRYAGLTLWSPNINIFRDPRWG 146

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  ET GEDP++  + AV+++RGLQ         D    P  +++  KH+A   V +   
Sbjct: 147 RGMETYGEDPYLTSQLAVSFIRGLQG--------DTPDHPRTIATP-KHFA---VHSGPE 194

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
             R+ FD  V+  D+E T+   F   + +G A SVMC+YN ++G P+CA   LLN  +R 
Sbjct: 195 QGRHSFDVDVSAYDLEATYTPAFRAAIVDGHAGSVMCAYNALHGTPACASDWLLNTRLRN 254

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQG 358
           +W  +G++V+DCD+I+ M   H F  D+   A A  LK+G DL+CG  Y +    A+ +G
Sbjct: 255 DWGFNGFVVSDCDAIEDMTRFHFFRQDNAS-ASAAALKSGNDLNCGNTYRDLN-QAIARG 312

Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLL 416
            + E+ +D++L  L+T   RLG         Y ++G + I +  +  LA +AA + +VLL
Sbjct: 313 DIDESTLDQALIRLFTARQRLGTLQPREHDPYAAIGIKHIDTPAHRALALQAAAQSLVLL 372

Query: 417 KNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVTYK 473
           KN  NTLPL      T+AV+GP A++  A+  NY G     ++P+ G     G A V Y 
Sbjct: 373 KNSGNTLPL--PPETTLAVLGPDADSLTALEANYQGTSSTPVTPLTGLRTRFGTAKVHYA 430

Query: 474 TGC 476
            G 
Sbjct: 431 QGA 433



 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 99/301 (32%), Positives = 140/301 (46%), Gaps = 55/301 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A  A   ADA +   GL   VE E L          DR  + LP  Q  L+  V    K 
Sbjct: 604 AERAVAHADAIVAFVGLSPEVEGEELHIDTPGFSGGDRTTIDLPATQETLLQHVKTTGK- 662

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           P+I+V+MS   V + +A+ + +  AIL A YPG+ GG AIA  + G  NPGGRLP+T+Y 
Sbjct: 663 PLIVVLMSGSAVALNWAQHHAD--AILAAWYPGQSGGTAIAQALAGDVNPGGRLPVTFYR 720

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
               Q LP       P  S    GRTY+++ G  LYPFGYGLSYTQF Y           
Sbjct: 721 S--TQDLP-------PYISYDMTGRTYRYFKGQPLYPFGYGLSYTQFAYE---------- 761

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                 P +    L+  +        +N G+  G +VV +Y +P
Sbjct: 762 ---------------------APQLSTATLKAGNTLTVTTHVRNTGTRAGDEVVQLYLEP 800

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           P    A  ++ ++GF+RV +R G ++ + F  +A + L+ V       + AG + +FVG 
Sbjct: 801 PYSPQAP-LRSLVGFKRVTLRPGESRLLTFTLDA-RQLSSVQQTGQRSVEAGHYHLFVGG 858

Query: 780 G 780
           G
Sbjct: 859 G 859


>gi|223936933|ref|ZP_03628842.1| Beta-glucosidase [bacterium Ellin514]
 gi|223894502|gb|EEF60954.1| Beta-glucosidase [bacterium Ellin514]
          Length = 774

 Score =  268 bits (684), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 242/808 (29%), Positives = 370/808 (45%), Gaps = 162/808 (20%)

Query: 64  RVKDLVSRMTLDEKVQQL---------------GDF-----------AHGVPRLGLPQ-- 95
           RVKDL++RMTL+EK  Q+               G+F            HG+ ++G P   
Sbjct: 21  RVKDLLARMTLEEKAAQMMCVWQEKAAKLLDGNGNFDPAKAKAAFKKGHGLGQVGRPSDA 80

Query: 96  ------------YEWWSEALHGV-------SNVG-PGTHFDDVIPG-----ATSFPTVIL 130
                           +E  + +       S +G P    ++ + G      TSFP  I 
Sbjct: 81  GSDPATPANGKTARGMAELTNAIQKFFLEHSRLGIPVMFHEECLHGHAARDGTSFPQPIG 140

Query: 131 TTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFV 190
             A+FN +L +K+    + E R      R G    +P ++VARD RWGR+ ET GEDPF+
Sbjct: 141 LGATFNPALVEKLYAMTAHETRV-----RGGHQALTPVVDVARDARWGRVEETYGEDPFL 195

Query: 191 VGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV----DNWKGVDRYHFDA 246
             +  +  VRG Q   G  +  D       V +  KH+AA+       N   V+      
Sbjct: 196 NTQLGIAAVRGFQ---GDASFKDKKH----VIATLKHFAAHGQPESGQNCAPVN------ 242

Query: 247 RVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYI 306
            V+E+ + ETFL PF  C+K+G A SVM SYN ++G+PS A   LL   +R EW   G++
Sbjct: 243 -VSERLLRETFLHPFRDCLKKGGAISVMASYNEIDGVPSHASRWLLRDVLRKEWGFKGFV 301

Query: 307 VADCDSIQVMV---DNH-KFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNAVQQ 357
           V+D  +I  +    D+H   +A  K++A    +KAG+++     DC ++        V++
Sbjct: 302 VSDYYAIWELSHRPDSHGHHVAADKKEACVLAVKAGVNIEFPEPDCYRHLVEL----VRK 357

Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
             + ET++D+ +  +     ++G FD          + +  + + ELA+EAARE I LLK
Sbjct: 358 KVLHETELDELIAPMLLWKFKMGLFDDPYVDPEEAARVVGCEVHRELASEAARETITLLK 417

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----FSGYANVTYK 473
           N+ + LPLN AK+KTVAV+GP+AN +  ++G Y+G+P   ++ + G      G   V + 
Sbjct: 418 NENDLLPLNPAKLKTVAVIGPNANRS--LLGGYSGVPAHNVTVLDGIKARLGGAVKVVHA 475

Query: 474 TGC----------DDV----ACKSNNSIFAASEAAKTADATIILAGLD--LSVEAESL-- 515
            GC          D+V      +    I  A + A +AD  I+  G +   S EA SL  
Sbjct: 476 EGCKITVGGSWQQDEVLASDPAEDRKQIDEAVKVAWSADVVIVAIGGNEQTSREAWSLKH 535

Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
             DR  L L G+Q +LI  +    K PV+ ++ +  G  +A      N+ AIL   Y G+
Sbjct: 536 MGDRTSLDLIGHQDELIRALLATGK-PVVALVFN--GRPLAINHVAQNVPAILECWYLGQ 592

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG--RTYKFYNG 631
           E G A+A V+FG  NPGG+LPI+         +P +   L PV     P   R + +   
Sbjct: 593 ECGSAVAAVLFGDHNPGGKLPIS---------IPRSVGQL-PVFYNHKPSARRGFLWDEA 642

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
             L+PFG+GLSYT+F +  +   K I                   S+T    V       
Sbjct: 643 TPLFPFGFGLSYTKFTFKNVRLAKKI------------------ISRTGSTHV------- 677

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
                  VD  N G   G++VV VY +         +K++  FQ++ +  G  K +    
Sbjct: 678 ------SVDVTNAGKRAGTEVVQVYVRDLISSVTRPVKELKVFQKITLAPGETKTVSLDL 731

Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGN 779
              +SL   D     ++  GE  I VGN
Sbjct: 732 TP-ESLAFYDVNMKYVVEPGEFEIMVGN 758


>gi|317503000|ref|ZP_07961085.1| beta-glucosidase, partial [Prevotella salivae DSM 15606]
 gi|315665888|gb|EFV05470.1| beta-glucosidase [Prevotella salivae DSM 15606]
          Length = 770

 Score =  267 bits (683), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 231/816 (28%), Positives = 368/816 (45%), Gaps = 166/816 (20%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGV------------------- 88
           +M   L+ + +   + RV DL+ RMTL+EKV Q+     G+                   
Sbjct: 20  KMEKPLYKNPNASVAQRVDDLLRRMTLEEKVGQMNQLV-GIEHFKTNSITMSAEELATNT 78

Query: 89  -----PRLGLPQYEWW------SEALHGVSNVGPGTHFD----------------DVIPG 121
                P + + + E+W      S  LH V  +    +                  D I G
Sbjct: 79  ATAFYPGVTVSEIEYWVRRGWVSSFLH-VLTLEEANYLQKLSMQSRLQIPLIIGIDAIHG 137

Query: 122 A------TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS--PNINVAR 173
                  T +PT I   +SF+  L  KI +  + E RAM         +W+  PN+ VAR
Sbjct: 138 NAKCKNNTVYPTNIGLASSFDVDLAYKIARQTAEEMRAMN-------MHWNFNPNVEVAR 190

Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY--AAY 231
           D RWGR  ET GEDP++V +  V   +G Q     +N +D       V  C KH+   +Y
Sbjct: 191 DGRWGRCGETFGEDPYLVMQMGVATNKGYQ--RNLDNTSD-------VLGCVKHFVGGSY 241

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
            ++   G         V+E+ + E F  PF+  +++G   +VM S+N +NGIP   +  L
Sbjct: 242 SINGTNGAP-----CDVSERTLREVFFPPFKATLQQGGDWNVMMSHNELNGIPCHTNRWL 296

Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNF 350
           +   +R EW   G+IV+D   I+  VD H    D+KE A  Q++ AG+D+   G  +   
Sbjct: 297 MTDVLRKEWGFQGFIVSDWMDIEHCVDQHHTAKDNKE-AFYQSIMAGMDMHMHGPEWQKD 355

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAR 410
               V++G++ E+ ID+S++ + TV  RLG F+     V    + I    + + A +A+R
Sbjct: 356 VVELVREGRIPESRIDESVRRILTVKFRLGLFEHPYSDVKTRDRVINDPVHKQTALDASR 415

Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC--RYMSPIAGF---S 465
           E IVLLKN++  LPL+  K K V V G +AN    M G+++ +    +  + + G    S
Sbjct: 416 ESIVLLKNEKQLLPLDEQKYKKVLVTGINANDQNIM-GDWSELQPEDKVWTVLKGLKLVS 474

Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG-------LDLSVEAESLDRE 518
            + +  +     D    S + + AA EAAK +D  I+  G        +     E  DR+
Sbjct: 475 PHTDFRFVDQGWDPRNMSQSQVDAAVEAAKESDLNIVCCGEYMMRFRWNERTSGEDTDRD 534

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
           +L L G Q QLI ++ E  K P IL+I+S   + + +A    ++ AI+ A  PG+ GG+A
Sbjct: 535 NLELVGLQEQLIRRLNETGK-PTILIIISGRPLSVRYAA--DHVPAIVNAWEPGQYGGQA 591

Query: 579 IADVVFGKFNPGGRLPIT----------WYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF 628
           IA++++GK NP  +L +T          WYN                        R+  F
Sbjct: 592 IAEILYGKINPSAKLAMTIPRHVGQISSWYNHK----------------------RSAYF 629

Query: 629 Y------NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP 682
           +      N P LYPFGYGLSYT+FKY+ L  + T+  N  K                   
Sbjct: 630 HPAVCADNTP-LYPFGYGLSYTKFKYSNLVLSDTVIENDGK------------------- 669

Query: 683 GVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
                        + ++  +N+G+ +G++V  +Y        A  +K++  F+RV ++AG
Sbjct: 670 ----------SAIKAQITIENIGNREGTEVCQLYINDIVSSVARPVKELKDFRRVTLKAG 719

Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
             + I+F+    K L   D      +  GE  + +G
Sbjct: 720 EKQTIEFIITPDK-LAFYDVDMKLKIEPGEFKVMIG 754


>gi|146298537|ref|YP_001193128.1| glycoside hydrolase family 3 protein [Flavobacterium johnsoniae
           UW101]
 gi|146152955|gb|ABQ03809.1| Candidate beta-glycosidase; Glycoside hydrolase family 3
           [Flavobacterium johnsoniae UW101]
          Length = 745

 Score =  267 bits (683), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 237/797 (29%), Positives = 355/797 (44%), Gaps = 141/797 (17%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD---FAH-GVPRLGLPQYEWWSEAL 103
           Q   ++  + S  +   +  L+S+MTL+EK+  L     FA+ GV RLG+P+ +     L
Sbjct: 33  QTEEYVGKEISTDHDAEIDKLISQMTLEEKIGMLHGNSMFANAGVKRLGIPELKMADGPL 92

Query: 104 HGV------SNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
            GV       N  P    +D    AT +P      A++N  +    G ++  E RA    
Sbjct: 93  -GVREEISRDNWAPAGWTNDF---ATYYPAGGALAATWNAEMAHTFGTSLGEELRA---- 144

Query: 158 GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSR 217
            R      SP IN+ R P  GR  E   EDPF+  + AV  V GLQ+ +           
Sbjct: 145 -RDKDMLLSPAINMVRTPLGGRTYEYMSEDPFLNKKIAVPLVVGLQEKD----------- 192

Query: 218 PLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
              V +C KHYAA    N +  +R   D ++ E+ + E +L  FE  VKE  A S+M +Y
Sbjct: 193 ---VMACVKHYAA----NNQETNRDFVDVQIDERTLREIYLPAFEATVKEAKAYSIMGAY 245

Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
           N+  G   C +  +LN+ +R EW   G +V+D  ++                + A++LK 
Sbjct: 246 NKFRGEYLCENDYMLNKILRDEWGFKGVVVSDWAAVH---------------STAKSLKN 290

Query: 338 GLDLDCGQ-------YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVS 390
           GLD++ G        +  +    AV+ G+V E +ID  +K +  VL ++    G  +   
Sbjct: 291 GLDIEMGTPKPFNEFFLADKLIAAVKSGEVSEKEIDLHVKRILRVLFQVKAMGGGER--- 347

Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
             K  I ++ + + A + A E I+LLKN+ N LPL    VK++AV+G +A    A+ G  
Sbjct: 348 -AKGSIATEAHYQDAYKIAAEAIILLKNENNALPLKLDGVKSIAVIGNNATKKNALGGFG 406

Query: 451 AGIPC-RYMSPIAGF-------------SGY------------ANVTYKTGCDDVACKSN 484
           AG+   R ++P+ G               GY             N+T  TG   +     
Sbjct: 407 AGVKTKREVTPLEGLKNRLPSSVKINYAEGYLEKYEEKNKGNLGNIT-STGPVTIDKLDP 465

Query: 485 NSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILV 544
             +  A EAAK +D  II AG +   E E+ DR DL LP  Q +LI +V E    P  +V
Sbjct: 466 AKVQEAVEAAKKSDVAIIFAGSNRDYETEASDRRDLHLPFGQEELIKKVIEA--NPKTIV 523

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
           +M AG       E +    A++W+ + G EGG A+ADV+ GK NP G+LP  W     ++
Sbjct: 524 VMIAGA-PFDLNEVSQKSSALVWSWFNGSEGGNALADVILGKVNPSGKLP--WTMPKQLK 580

Query: 605 MLPLTSMPLRPVDS---------LGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTK 655
             P  +    P D          +GY  R +   N   LYPFGYGLSYT F         
Sbjct: 581 DSPAHATNSFPGDKAVNYAEGILIGY--RWFDTKNVAPLYPFGYGLSYTTFAL------- 631

Query: 656 TIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIV 715
                              D +KT       ND+      E  VD +N G  DG +VV +
Sbjct: 632 -------------------DNAKTDKDSYAQNDV-----IEVTVDVKNTGKVDGKEVVQL 667

Query: 716 YSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN--TLLPAGEH 773
           Y+           +++ GF++  V+AG +++I       K L   D AA   T+ P G++
Sbjct: 668 YTSKSDSKITRAAQELKGFKKADVKAGGSEKITIKV-PVKELAYYDVAAKKWTVEP-GKY 725

Query: 774 TIFVGNGGVSFPIHLNF 790
           TI +G         +NF
Sbjct: 726 TIKLGTSSRDIKKEINF 742


>gi|365122063|ref|ZP_09338970.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363643257|gb|EHL82578.1| hypothetical protein HMPREF1033_02316 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 819

 Score =  267 bits (683), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 228/823 (27%), Positives = 354/823 (43%), Gaps = 151/823 (18%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WSEALHG 105
           +F +   P   RV+DL+S+M LDEK  QL    +G  R+    LP  EW    W + +  
Sbjct: 52  VFENPKQPIEKRVQDLLSQMNLDEKTCQLATL-YGYKRVMSDSLPTPEWKNKIWKDGIAN 110

Query: 106 V----SNVGPGTHF-----------------------------------DDVIPG----- 121
           +    + VG G                                      ++ I G     
Sbjct: 111 IDEQLNGVGRGAKIAQDLIYPFSKHAEAINKTQKWFIEETRLGIPVDFSNETIHGLNHTK 170

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
           AT  P  I   +++N  L  K G     EA+A   LG   +  ++P +++ARDPRWGR+ 
Sbjct: 171 ATPLPAPIGIGSTWNAPLVYKAGSIAGKEAKA---LGYTNI--YAPILDLARDPRWGRVL 225

Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
           E  GEDPF+V       V+G+Q+ +G             V++  KH+A Y V        
Sbjct: 226 ECYGEDPFLVATLGTQMVKGIQE-QG-------------VAATLKHFAVYSVPKGGRDGS 271

Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
              D  V  ++M +  L PF+  +++     VM SYN  +G+P  A    L Q +R E+ 
Sbjct: 272 VRTDPHVAPREMHQMHLYPFKKVIQDAHPMGVMSSYNDWDGVPVTASYYFLTQLLRQEFG 331

Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG--------- 352
             GY+V+D D+++ + + H  +A++ E+AV   L+AGL++      T F           
Sbjct: 332 FDGYVVSDSDAVEYVYNKH-HVAETYEEAVRMVLEAGLNV-----RTTFAAPDIFILPAR 385

Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAR 410
             V++G++    ID+ +  +  V  RLG FD  P        D  + +D+N +   +  R
Sbjct: 386 KLVKEGRLSMKVIDERVADVLRVKFRLGLFD-QPFVADPKAADKIVGADKNKDFVLDIQR 444

Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYAN- 469
           + +VLLKN+ N LPL+  K+  + + GP A     M+  Y       ++   G   Y   
Sbjct: 445 QSLVLLKNENNLLPLDKNKLSRILITGPLAKEENYMVSRYGPQELENITVYEGIKNYLGN 504

Query: 470 ---VTYKTGC--------------DDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
              V Y  GC                +  +    I  A E AK +D  I + G D     
Sbjct: 505 KVAVDYALGCKVKDAKWPESEIIHSPLTTEEQQEIQNAVEKAKLSDIVIAVLGEDEESTG 564

Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
           ES  R  L LPG Q QL+  +    K PV+LV+++   + I +A  +  I AIL A +PG
Sbjct: 565 ESKSRSGLDLPGRQQQLLEALYATGK-PVVLVLINGQPLTINWA--DRYIPAILEAWFPG 621

Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP-----GRTYK 627
           + GG AIA+ +FG +NPGG+LP+T+     +  + L + P +P      P     G    
Sbjct: 622 QMGGTAIAETLFGDYNPGGKLPVTF--PKTLGQIEL-NFPFKPASQSKQPEAGPNGYGKT 678

Query: 628 FYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
             NG  LYPFG+GLSYT F+Y+ L  +   Q     +Q                      
Sbjct: 679 RVNG-ALYPFGFGLSYTTFEYSNLKVSPERQGPKGDIQ---------------------- 715

Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
                       D  N G   G ++V +Y K       +Y   + GF+RV ++ G  K I
Sbjct: 716 ---------VSFDITNTGKRAGDEIVQLYVKDKVSSVISYESLLRGFERVSLQPGETKNI 766

Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNF 790
           +F  +  + L I+D   N  +  GE  + +G       +  +F
Sbjct: 767 QFTLHP-EDLEILDINMNWNVEPGEFEVRIGASSEDIKLKKSF 808


>gi|224536377|ref|ZP_03676916.1| hypothetical protein BACCELL_01251 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224522015|gb|EEF91120.1| hypothetical protein BACCELL_01251 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 954

 Score =  267 bits (683), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 228/760 (30%), Positives = 358/760 (47%), Gaps = 111/760 (14%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHG 105
           + +S  + D +LP   RV+ L+S MT ++K++ +  G    G+P L +P      EA+HG
Sbjct: 164 EKTSLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHG 222

Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
            S             GAT FP  +   A++N+ L + +  AV  E      L    +  W
Sbjct: 223 FSYGS----------GATIFPQALAMGATWNKKLTEDVAMAVGDE-----TLAAGTMQAW 267

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SP ++VA+D RWGR  ET GEDP +V +    +++G Q       +  L + P       
Sbjct: 268 SPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ-------SKGLFTTP------- 313

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+  +      G D +  D  ++E++M E  L PF   ++  D  SVM +Y+   G+P 
Sbjct: 314 KHFGGHGAP-LGGRDSH--DIGLSEREMREVHLVPFRHVIRNYDCQSVMMAYSDYLGVPV 370

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
               +LL+  +R EW   G+IV+DC +I  +     + A  K +A  Q L AG+  +CG 
Sbjct: 371 AKSRELLHSILREEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGD 430

Query: 346 YYTNF-TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDE 400
            Y +     A + G++   ++D+  + +  ++ R   F+ +P    L    I     SD 
Sbjct: 431 TYNDKEVIQAAKDGRINMENLDEVCRTMLRMMFRNELFEKTPNK-PLDWNKIYPGWNSDS 489

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYM 458
           + E+A +AARE IV+L+N  N LPL +  ++T+AVVGP A+      G+Y    +P +  
Sbjct: 490 HKEMARQAARESIVMLENKDNILPL-AKDMRTIAVVGPGADDLQP--GDYTPKLLPGQLK 546

Query: 459 SPIAGFS----GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-- 512
           S + G          V Y+ GCD  +    N I  A +AA  +D  +++ G   + E+  
Sbjct: 547 SVLTGIKQAVGKQTKVVYEQGCDFTSSNGTN-IPKAVKAASQSDVVVLVLGDCSTSESTT 605

Query: 513 -------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
                  E+ D   L LPG Q +L+  V    K PVIL++ +  G     ++ +   KAI
Sbjct: 606 DVYKTSGENHDYATLILPGKQQELLEAVCATGK-PVILILQA--GRPYNLSKASELCKAI 662

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           L    PG+EGG A ADV+FG +NP GRLP+T+    +V  LPL         +    GR 
Sbjct: 663 LVNWLPGQEGGPATADVLFGDYNPAGRLPMTFPR--HVGQLPLYY-------NFKTSGRR 713

Query: 626 YKFYNGP--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
           Y++ +     LY FGYGLSYT F+Y+ L           K+Q   N N    A+      
Sbjct: 714 YEYSDMEFYPLYYFGYGLSYTSFEYSGL-----------KIQEKDNGNVAIQAT------ 756

Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
                             +NVG   G +VV +Y         T I ++  F RV ++ G 
Sbjct: 757 -----------------VKNVGQRAGDEVVQLYITDMYASVKTRITELKDFTRVHLQPGE 799

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
           +K + F     + L++++   + ++  GE  I V  GGVS
Sbjct: 800 SKIVSFELTPYE-LSLLNDRMDRVVEKGEFKILV--GGVS 836


>gi|225872720|ref|YP_002754177.1| xylan 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
 gi|225793233|gb|ACO33323.1| xylann 1,4-beta-xylosidase [Acidobacterium capsulatum ATCC 51196]
          Length = 721

 Score =  267 bits (683), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 223/731 (30%), Positives = 343/731 (46%), Gaps = 100/731 (13%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + F + +L    R+ DL+SRMTL EK+Q LGD   GVPRLG+P      E LHG +  GP
Sbjct: 24  YPFQNPALSPDQRIDDLLSRMTLQEKIQALGDDP-GVPRLGIPG-ALTEEGLHGAAIGGP 81

Query: 112 GTHFDD----VIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR-AMYNLGRAGLTYWS 166
             H++     V+P  T FP       +++ +L +K     + E R A+      GL   +
Sbjct: 82  A-HWEGRGRAVVP-TTQFPQNHGLGQTWDPALLQKAANVEAYETRWAVNKYHDGGLIVRA 139

Query: 167 PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCK 226
           PN N++RDPRWGR  E+ GEDP++VG  AV +++GLQ           N R  + ++  K
Sbjct: 140 PNANLSRDPRWGRTEESYGEDPYLVGTLAVAWIKGLQGN---------NPRYWETAALMK 190

Query: 227 HYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
           H+ AY  +  +     +F  R+      E +  PF M +++G + + M SYN  NGIP  
Sbjct: 191 HFDAYSNEANRDGSSSNFGKRL----FYEYYSVPFRMGIEQGHSDAFMTSYNAWNGIPMT 246

Query: 287 ADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
           A+P +L   V  +W  +G I  D  ++  MV +  +     E A A  + AG++    +Y
Sbjct: 247 ANP-VLKSVVMKKWGFNGIICTDAGALSNMVTHFHYYKTMPE-AAAGAVHAGINQFLDRY 304

Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLG-------KQDIC 397
                  A+QQ  + E  ID+ LK +Y V++RLG  D S    Y  +G       K D  
Sbjct: 305 QQPVE-EALQQKLLTEQQIDQDLKGVYRVVLRLGLMDPSSMSPYSMIGLTNDNPAKGDPW 363

Query: 398 S-DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
               +I L  +   E IVLLKN  + LPL++ K+ ++AV+GP AN  +  +  Y+G P  
Sbjct: 364 DWPSHIALDRKVTDESIVLLKNQNHALPLDAKKLHSIAVIGPWAN--IVALDWYSGTPPF 421

Query: 457 YMSPIAGFSGYANVTYKTGCD-DVACKSNNSIFAASEAAKTADATIILAGLDLSVEA--- 512
            ++P+ G      +  + G D  V     +++ AA+  AK +D  I++ G   + +A   
Sbjct: 422 GVTPVEG------IRQRVGPDVKVTFNDGSNLQAAAALAKQSDEAIVIIGNHPTCDAGWG 475

Query: 513 ---------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
                    E+ DR  L LP    + I +    A    ++V+ ++      +  T  +I 
Sbjct: 476 KCALPSEGKEAFDRTALNLP---DESIAKAVYAANPHTVVVLQTSFPYTTDW--TQAHIP 530

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
           AIL   +  EE G A+ADV+FG ++P GRL  TW      Q+ P+    +R        G
Sbjct: 531 AILEMAHNSEEQGTALADVLFGDYDPAGRLAQTWV-ASIGQLPPMMDYNIR-------DG 582

Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
           RTY +     LYPFG+GLSYT FKY+                   NL  +S         
Sbjct: 583 RTYMYLKSKPLYPFGFGLSYTTFKYS-------------------NLRLSS--------- 614

Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
              + L         VD  N G  +G +VV +Y K      +  ++ + GF RV +  G+
Sbjct: 615 ---HTLPAGGQLTVSVDVTNTGKYNGDEVVQMYVKHLDSKVSRPLEALKGFDRVSIPVGQ 671

Query: 744 NKRIKFVFNAC 754
            + +     A 
Sbjct: 672 TRTVTLPLKAS 682


>gi|393786908|ref|ZP_10375040.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
           CL02T12C05]
 gi|392658143|gb|EIY51773.1| hypothetical protein HMPREF1068_01320 [Bacteroides nordii
           CL02T12C05]
          Length = 854

 Score =  267 bits (683), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 158/427 (37%), Positives = 244/427 (57%), Gaps = 41/427 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           ++ D + P   R+ DL+S++T++EK+  L   + G+PRL + +Y   +EALHGV  V PG
Sbjct: 27  VYLDMNAPRHERILDLLSKLTIEEKISLLRATSPGIPRLHIDKYYHGNEALHGV--VRPG 84

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
                     T FP  I   A +N  L  +I   +S EARA +N    G          L
Sbjct: 85  NF--------TVFPQAIGLAAMWNPQLLNEISTVISDEARARWNELEQGKKQLGQFSDLL 136

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+WSP +N+ARDPRWGR  ET GEDPF+ G+  V++V+GLQ  +          R LK+ 
Sbjct: 137 TFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVSFVKGLQGDD---------PRYLKIV 187

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S  KH+AA + ++    +R+  +  ++E+D+ E +L  FE C+ EG A+S+M +YN +N 
Sbjct: 188 STPKHFAANNEEH----NRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAYNAIND 243

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P   +  LL + +R +W   GY+V+DC     +V +HK++  + E A A +++AGLDL+
Sbjct: 244 VPCTLNNWLLKKVLRHDWGFDGYVVSDCGGPSFLVTHHKYVK-TLEAAAALSIQAGLDLE 302

Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSD 399
           CG + Y     NA +Q  V E +ID +  ++    MRLG FD      Y  +    +  +
Sbjct: 303 CGDEVYMEPLLNAYKQYMVSEAEIDSAAYHVLRARMRLGLFDDPALNPYNKISPSIVGCE 362

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           ++ +LA EAAR+ IVLLKN++  LPL+S K+K++AVVG   NA  +  G+Y+G P     
Sbjct: 363 KHSKLALEAARQSIVLLKNEKKFLPLDSKKIKSIAVVG--INAGNSEFGDYSGTPVN--Q 418

Query: 460 PIAGFSG 466
           P++   G
Sbjct: 419 PVSILEG 425



 Score =  114 bits (286), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 83/292 (28%), Positives = 132/292 (45%), Gaps = 50/292 (17%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A +  +  D T+ + G++ S+E E  DR  + LP  Q   I +  ++    V++++    
Sbjct: 595 AGDIMRKCDLTVAVLGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPNTVVVLV---A 651

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP-L 608
           G  +A    + +I AI+ A YPGE GG A+A+V+FG +NPGG+LP+T+Y    +  LP  
Sbjct: 652 GSSLAINWMDEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYYRS--LDELPAF 709

Query: 609 TSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
               +R        GRTY+F+ G  LY FG+GLSYT F Y  LS                
Sbjct: 710 DDYDIR-------KGRTYQFFEGDPLYAFGHGLSYTTFSYKKLSI--------------- 747

Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY- 727
                 DA+               D        +N G  +G +V  +Y K     +    
Sbjct: 748 ------DAA--------------GDVVSVSFTLKNTGKYEGDEVAQLYVKYQGSDSQVKL 787

Query: 728 -IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
            +KQ+ GF+R+ ++ G +K+I       +     +       PAG++   VG
Sbjct: 788 PLKQLKGFERIHLKKGESKQINLTVPKSELRFWNEEKGEFYTPAGDYLFMVG 839


>gi|224536087|ref|ZP_03676626.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224522306|gb|EEF91411.1| hypothetical protein BACCELL_00952 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 791

 Score =  267 bits (683), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 227/818 (27%), Positives = 362/818 (44%), Gaps = 153/818 (18%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEAL--HGVS 107
           ++ D S P   RV DL+S+MTL+EK+ Q+    +G  R+    LP+ E W +AL   G+ 
Sbjct: 46  IYEDPSAPMEERVNDLLSQMTLEEKICQMATL-YGSGRVLEDALPE-EHWKQALWKDGIG 103

Query: 108 NV-----GPGTHFDDV-------------------------IP--------------GAT 123
           N+     G GT   +                          IP               AT
Sbjct: 104 NIDEEHNGLGTFGSEYSFPYNKHVKAKHEIQRWFVEETRLGIPVDFTNEGIRGLCHDRAT 163

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITET 183
            FP+     +++N+ L  +IG+  + EA A   LG   +  +SP +++ +DPRWGR  E 
Sbjct: 164 FFPSQSGQGSTWNKELIARIGEVEAKEAIA---LGYTNI--YSPILDICQDPRWGRSVEC 218

Query: 184 PGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYH 243
            GEDP++VG+     ++ LQ                ++ S  KH+A Y +       +  
Sbjct: 219 YGEDPYLVGQLGKQMIQSLQK--------------HRLVSTVKHFAVYSIPVGGRDGKTR 264

Query: 244 FDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLH 303
            D  V+ ++M   +L PF     E  A  VM SYN  +G P  +    L + +R E+   
Sbjct: 265 TDPHVSPREMRTLYLEPFRRAFCEAGALGVMSSYNDYDGEPITSSHHFLTEILRQEYGFK 324

Query: 304 GYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG---------NA 354
           GY+V+D ++++ +   H  +++  E  VAQ + AGL++      T+FT           A
Sbjct: 325 GYVVSDSEAVEFITTKHHVVSNEVE-GVAQAVNAGLNIR-----THFTKPEDFVLPLRQA 378

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD---ICSDENIELAAEAARE 411
           +++GKV    I+  +  +  +   LG FD    Y    KQ+   +   E+ ++A EAAR+
Sbjct: 379 IKEGKVSPETINSRVADILRIKFWLGLFDNP--YRGDEKQEEKIVHCKEHQQVALEAARQ 436

Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY--AGIPCRYM-SPIAGFSGYA 468
            +VLLKN+   LPL    VK+VAV+GP+AN    +I  Y  A  P + +   I       
Sbjct: 437 SLVLLKNENQLLPLKKT-VKSVAVIGPNANEQTQLICRYGPANAPIKTVYQGIKELLPET 495

Query: 469 NVTYKTGCD--------------DVACKSNNSIFAASEAAKTADATI-ILAGLDLSVEAE 513
            V Y+ GC+              +   +    +  A  AA+ A+  + +L G +L+V  E
Sbjct: 496 EVVYRKGCEIIDSHFPESEILPFEKTTEEQQMLDEAVAAARNAEVVVLVLGGSELTVR-E 554

Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
              R  L LPG+Q +L+  +    K P +LV++      I +A  N  I AIL A +PGE
Sbjct: 555 DRSRTSLDLPGHQQELMQAIHATGK-PTVLVLLDGRAATINYA--NQYIPAILHAWFPGE 611

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
             G A+A+ +FG +NPGGRL +T+     V  +P  + P +P          Y       
Sbjct: 612 FAGTAVAEALFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDEPCETAVYG-----A 663

Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
           LYPFGYGLSYT+F Y  L  T   Q    ++                        + C  
Sbjct: 664 LYPFGYGLSYTKFSYKNLQITPEEQGPQGEI-----------------------TVSC-- 698

Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
                 +  N+G   G +VV +Y +       TY+K + GF+R+ +  G  K++ F+   
Sbjct: 699 ------EVTNIGDRTGDEVVQLYLRDEVSSVTTYMKVLRGFERITLNPGETKKVTFILTP 752

Query: 754 CKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
            + L + D     ++  G   + +G       +   FN
Sbjct: 753 -QDLGLWDKNNKFVVEPGMFKVMIGAASTDIRLEGKFN 789


>gi|393789624|ref|ZP_10377744.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
           CL02T12C05]
 gi|392650340|gb|EIY44009.1| hypothetical protein HMPREF1068_04024 [Bacteroides nordii
           CL02T12C05]
          Length = 855

 Score =  267 bits (683), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 167/435 (38%), Positives = 243/435 (55%), Gaps = 40/435 (9%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           Q  + LF D + P   R+ DL++R+T++EKV  L + A  +PRL + +Y   +EALHGV 
Sbjct: 23  QNKTELFRDMTAPQHERILDLLNRLTVEEKVSLLVNDAREIPRLNIDKYNHGNEALHGV- 81

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL---------- 157
            V PG          T FP  I   A++N +L  ++  A+S EAR  +            
Sbjct: 82  -VRPGEF--------TVFPQAIGLAATWNPNLIFRVSTAISDEARGRWKELDYGKKQIAG 132

Query: 158 GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSR 217
           G   LT+WSP +N+ARDPRWGR  ET GEDPF+ GR    +V+GLQ           N R
Sbjct: 133 GSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGRIGCEFVKGLQGD---------NPR 183

Query: 218 PLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
            LK  S  KH+AA + ++    +R   +AR++E+D+ E +L  FE C+ +G A S+M +Y
Sbjct: 184 YLKTVSTPKHFAANNEEH----NRSSCNARMSERDLREYYLPAFERCIVDGKAQSIMMAY 239

Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
           N VN +P   +  L+ + +RG+W+ +GYIV+DC + + MV  HK++ +  E A    LKA
Sbjct: 240 NAVNDVPCTVNIYLIKKVLRGDWNFNGYIVSDCSAPEWMVTKHKYVKNL-EAAATLALKA 298

Query: 338 GLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQ 394
           GLDL+CG + YT     A  +  V E +ID +  ++    M LG FD   Q  Y  +   
Sbjct: 299 GLDLECGDRVYTAPLLKAYNEYMVSEAEIDSAAYHILRGRMLLGLFDDPSQNPYNKIEPS 358

Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
            I   E+ ELA E AR+ +VLLKN +N LPLN  K++++AVVG   +A     G+Y+G P
Sbjct: 359 VIGCKEHQELALETARQSMVLLKNQKNFLPLNRKKIRSIAVVG--ISAAHCEFGDYSGNP 416

Query: 455 CRY-MSPIAGFSGYA 468
               +S + G   YA
Sbjct: 417 KNTPVSVLDGIKKYA 431



 Score =  140 bits (354), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 94/299 (31%), Positives = 138/299 (46%), Gaps = 49/299 (16%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A + AK  D T+ + G++ S+E E  DR  L LP  Q + I ++ +V    V++++    
Sbjct: 596 AGKVAKECDVTVAVLGINKSIEREGQDRYSLELPIDQQEFIKELYKVNPNTVVVLV---A 652

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
           G  +A    + N+ AIL A YPGE+GG A+A+V+FG +NPGGRLP+T+YN        L 
Sbjct: 653 GSSMAINWMDENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYYNS-------LD 705

Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
            +P    D      RTY+++ G  LY FGYGLSYT FKY   S                 
Sbjct: 706 ELP--AFDDYSVKNRTYQYFEGKPLYEFGYGLSYTNFKYKKKSI---------------- 747

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
                              ++ +D  +   +  NVG  DG +V  VY + P       +K
Sbjct: 748 -------------------MQSNDTVDITFNLSNVGKYDGDEVAQVYVRYPETGTYMPLK 788

Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVGNGGVSFPIH 787
           Q+ GF RV ++ G++  I       K L   D      + P GE+   VG    +  I 
Sbjct: 789 QLKGFSRVHLKKGKSADITISIPK-KELRYWDEKTRQFVTPTGEYVFQVGGSSENISIE 846


>gi|423212854|ref|ZP_17199383.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392694712|gb|EIY87939.1| hypothetical protein HMPREF1074_00915 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 782

 Score =  267 bits (682), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 218/727 (29%), Positives = 340/727 (46%), Gaps = 124/727 (17%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    EA HG   +G           AT FPT I   A+++  L K++GQ ++ 
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R+     + G   + P +++ RDPRW R+ ET GEDP + G    + V GL    G  
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGL----GGG 227

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
           N     S+     +  KH+ AY V        Y   A V  +D+ + FL PF   +  G 
Sbjct: 228 NL----SQKYATIATLKHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDAG- 279

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           A SVM SYN ++GIP  ++  LL Q +R EW   G++V+D  SI+ + ++H F+A +KE+
Sbjct: 280 ALSVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FVAPTKEN 338

Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
           A  Q++ AG+D+D G   YTN   +AVQ G++ +T ID ++  +  +   +G F+     
Sbjct: 339 AAIQSVTAGVDVDLGGDAYTNLC-HAVQSGQMDKTVIDTAVCRVLRMKFEMGLFEHPYVD 397

Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
             +  + +   E+IELA + A+  I LLKN+ + LPL S  +  VAV+GP+A+    M+G
Sbjct: 398 PKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPL-SKTINKVAVIGPNADNRYNMLG 456

Query: 449 NYA-------------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAK 495
           +Y              GI  + +SP         V Y  GC  +   + N I  A EAA+
Sbjct: 457 DYTAPQEDSNVKTVLDGILTK-LSPF-------RVEYVRGCA-IRDTTVNEIEQAIEAAR 507

Query: 496 TAD----------------------ATIILAGLDLSVE-AESLDREDLWLPGYQTQLINQ 532
            ++                      A +   G    +E  E  DR  L L G Q +L+  
Sbjct: 508 RSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLES 567

Query: 533 VAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGR 592
           + +  K P+I+V +    ++  +A    +  A+L A YPG+EGG AIADV+FG +NP GR
Sbjct: 568 LQKTGK-PLIVVYIEGRPLEKNWASEYAD--ALLTAYYPGQEGGNAIADVLFGDYNPSGR 624

Query: 593 LPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS 652
           LPI+      V  +P+      P +        Y   +   LY FGYG+SYT F+Y+ L 
Sbjct: 625 LPISVPRS--VGQIPVYYNKKAPRN------HDYVEVSSSPLYSFGYGMSYTTFEYSDLQ 676

Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDV 712
                                          V+    RC   FE     +N G  DG +V
Sbjct: 677 -------------------------------VVQKSARC---FEVSFKVKNTGKYDGEEV 702

Query: 713 VIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
             +Y +         +KQ+  F+R  ++ G  K++ FV    +   +V+Y    ++ +G 
Sbjct: 703 SQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGT 761

Query: 773 HTIFVGN 779
             + +G+
Sbjct: 762 FQVMIGS 768


>gi|317474225|ref|ZP_07933501.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316909535|gb|EFV31213.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 858

 Score =  267 bits (682), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 160/427 (37%), Positives = 247/427 (57%), Gaps = 41/427 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ +   P   R+ DL+SR+T++EK+  L   + G+ RL +P+Y   +EALHGV  V PG
Sbjct: 28  LYKNEKAPIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPG 85

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
                     T FP  I   A++N  L K++   +S EARA +N    G          L
Sbjct: 86  RF--------TVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQFSDLL 137

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+WSP +N+ARDPRWGR  ET GEDP++ G     +V+GLQ   G+      +SR LK+ 
Sbjct: 138 TFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQ---GN------DSRYLKIV 188

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S  KH+AA + ++    +R+  + +++E+ + E +L  FE CVKEG ++S+M +YN +N 
Sbjct: 189 STPKHFAANNEEH----NRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAYNALND 244

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P   +  LL + +R +W   GY+V+DC    ++V+ HK++  +KE A   ++KAGLDL+
Sbjct: 245 VPCTLNAWLLTKVLREDWGFKGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKAGLDLE 303

Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
           CG   Y     NA +Q  V + DID +   +    M+LG FD      Y  +  + I S 
Sbjct: 304 CGDDVYDAPLLNAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGENNPYTKISPKVIGSK 363

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           E+ ++A +AARE IVLLKN    LPL++ K+K++AVVG   NA  +  G+Y+G+P   ++
Sbjct: 364 EHQKVALDAARECIVLLKNQNKMLPLDAKKIKSIAVVG--INAGRSEFGDYSGLPV--IA 419

Query: 460 PIAGFSG 466
           P++   G
Sbjct: 420 PVSILQG 426



 Score =  135 bits (340), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 91/293 (31%), Positives = 143/293 (48%), Gaps = 54/293 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A    +  +  + + G++ ++E E  DR D+ LP  Q + + ++ +V   P I+V++ AG
Sbjct: 596 AGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKV--NPNIVVVLVAG 653

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
              ++    + +I AI+ A YPGE GG+A+A+V+FG +NPGGRLP+T+Y         L 
Sbjct: 654 S-SLSINWMDEHIPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-------LD 705

Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
            +P  P D      GRTY+++ G  LYPFGYGLSYT FKY+ L  T+  Q          
Sbjct: 706 ELP--PFDDYDITKGRTYQYFKGNVLYPFGYGLSYTSFKYSDLQVTEGNQ---------- 753

Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF--QNVGSTDGSDVVIVYSKPPAEIAAT 726
                                      E  V F  +NVG   G +V  +Y K P      
Sbjct: 754 ---------------------------EVNVSFCLKNVGKYAGDEVAQIYVKLPERDKIM 786

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVG 778
            IK++ GF+R+ ++ G ++++         L   D      + P+G++TI VG
Sbjct: 787 PIKELKGFERISLKRGGSRKVTIRLKK-DLLRYWDEEKGCFVHPSGDYTIMVG 838


>gi|218130696|ref|ZP_03459500.1| hypothetical protein BACEGG_02285 [Bacteroides eggerthii DSM 20697]
 gi|217987040|gb|EEC53371.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           eggerthii DSM 20697]
          Length = 858

 Score =  267 bits (682), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 160/427 (37%), Positives = 247/427 (57%), Gaps = 41/427 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ +   P   R+ DL+SR+T++EK+  L   + G+ RL +P+Y   +EALHGV  V PG
Sbjct: 28  LYKNEKAPIHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPG 85

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
                     T FP  I   A++N  L K++   +S EARA +N    G          L
Sbjct: 86  RF--------TVFPQAIGLAATWNPVLQKQVATVISDEARARWNELDQGREQNSQFSDLL 137

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+WSP +N+ARDPRWGR  ET GEDP++ G     +V+GLQ   G+      +SR LK+ 
Sbjct: 138 TFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQ---GN------DSRYLKIV 188

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S  KH+AA + ++    +R+  + +++E+ + E +L  FE CVKEG ++S+M +YN +N 
Sbjct: 189 STPKHFAANNEEH----NRFVCNPQISEKQLREYYLPAFEACVKEGKSASIMSAYNALND 244

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P   +  LL + +R +W   GY+V+DC    ++V+ HK++  +KE A   ++KAGLDL+
Sbjct: 245 VPCTLNAWLLTKVLREDWGFKGYVVSDCGGPALLVNAHKYVK-TKEAAATLSIKAGLDLE 303

Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
           CG   Y     NA +Q  V + DID +   +    M+LG FD      Y  +  + I S 
Sbjct: 304 CGDDVYDAPLLNAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGENNPYTKISPKVIGSK 363

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           E+ ++A +AARE IVLLKN    LPL++ K+K++AVVG   NA  +  G+Y+G+P   ++
Sbjct: 364 EHQKVALDAARECIVLLKNQNKMLPLDAKKIKSIAVVG--INAGRSEFGDYSGLPV--IA 419

Query: 460 PIAGFSG 466
           P++   G
Sbjct: 420 PVSILQG 426



 Score =  135 bits (340), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 90/293 (30%), Positives = 142/293 (48%), Gaps = 54/293 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A    +  +  + + G++ ++E E  DR D+ LP  Q + + ++ +V   P I+V++ AG
Sbjct: 596 AGRVVRECEKVVAVLGINKAIEREGQDRSDIQLPADQREFLKEIYKV--NPNIVVVLVAG 653

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
              ++    + +I AI+ A YPGE GG+A+A+V+FG +NPGGRLP+T+Y         L 
Sbjct: 654 S-SLSINWMDEHIPAIINAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-------LD 705

Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
            +P  P D      GRTY+++ G  LYPFGYGLSYT FKY+ L  T   Q          
Sbjct: 706 ELP--PFDDYDITKGRTYQYFKGNVLYPFGYGLSYTSFKYSDLQVTDGNQ---------- 753

Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF--QNVGSTDGSDVVIVYSKPPAEIAAT 726
                                      E  V F  +NVG   G +V  +Y K P      
Sbjct: 754 ---------------------------EVNVSFCLKNVGKYAGDEVAQIYVKLPERDKIM 786

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVG 778
            IK++ GF+R+ ++ G ++++         L   D      + P+G++TI +G
Sbjct: 787 PIKELKGFERISLKRGESRKVTIRLKK-DLLRYWDEEKECFVHPSGDYTIMIG 838


>gi|404487205|ref|ZP_11022392.1| hypothetical protein HMPREF9448_02853 [Barnesiella intestinihominis
           YIT 11860]
 gi|404335701|gb|EJZ62170.1| hypothetical protein HMPREF9448_02853 [Barnesiella intestinihominis
           YIT 11860]
          Length = 860

 Score =  267 bits (682), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 223/802 (27%), Positives = 348/802 (43%), Gaps = 144/802 (17%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD------------------------ 83
           +  S  + + +LP   RV+DL++RMT+DEK+ Q+                          
Sbjct: 23  KAQSLPYKNKNLPIEERVEDLLNRMTVDEKIAQIRHIHSSKIFNGQELDMKKLTDWAGNT 82

Query: 84  ---FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVI 119
              F  G P                     RLG+P +   +E+LHG            V 
Sbjct: 83  SWGFVEGFPLTGDNCAKSMYLIQKYMVEKTRLGIPIFTV-AESLHGA-----------VH 130

Query: 120 PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGR 179
            GAT +P  I   ++FN  L +K  Q +S +  +M           SP I+V RD RWGR
Sbjct: 131 DGATIYPQNIALGSTFNPELARKKTQMISDDLHSM-----GFRQVLSPCIDVVRDLRWGR 185

Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGH-ENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           + E+ GEDP++ G +      G+++V G+ EN          +S   KHY  +  +   G
Sbjct: 186 VEESYGEDPYLCGLF------GIEEVSGYLENG---------ISPMLKHYGPHG-NPLSG 229

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
           ++    +  +  +D+ E +L+PFEM VK     +VM +YN  N IP+ A   LL   +R 
Sbjct: 230 LNLASVECGL--RDLHEIYLKPFEMVVKNTGILAVMSTYNSWNHIPNSASHYLLTDILRD 287

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQG 358
           EW   GY+ +D  +I+++   H F A +  +A  Q + AGLD +       F    +++G
Sbjct: 288 EWGFKGYVYSDWGAIEMLKTLH-FTARNSSEAAIQAISAGLDAEASSKCYPFLKGLIEKG 346

Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKN 418
           +  E  +D +++ +      +G F+  P   +   +   S E+++LA   A E  VLLKN
Sbjct: 347 QFDEKILDTAVRRVLFAKFAMGLFE-DPYGKTFKNRKRHSPESVKLAKTIADESTVLLKN 405

Query: 419 DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY--MSPIAGFSGYAN----VTY 472
           +   LPL++  +K++A++GP  NA     G+Y         ++P+ G     N    + Y
Sbjct: 406 ENQLLPLDAKSLKSIAIIGP--NADQVQFGDYTWSRNNKDGVTPLQGIKNRVNKNTAIHY 463

Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAG---------LDLSVEAESLDREDLWLP 523
             GC  +     + I  A EAAK ++  +I  G            S   E  D  DL L 
Sbjct: 464 AKGCS-LTSLDTSGIAEAVEAAKNSEVAVIFGGSASAALARDYKSSTCGEGFDLNDLNLT 522

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
           G Q+QLI +V      PVILV+++     I + + N  + AIL   Y GE+ G +IAD++
Sbjct: 523 GAQSQLIREVYRTGT-PVILVLVTGKPFVIEWEKNN--LPAILVQWYAGEQAGNSIADIL 579

Query: 584 FGKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           FG+  P GRL  ++         Y   LP      +   S   PGR Y F     LY FG
Sbjct: 580 FGEVVPSGRLTFSFPRSTGHLPVYYNYLPSDRGFYKNPGSYDSPGRDYVFSAPSALYSFG 639

Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
           YGLSYT F Y  LS  K                               +    +D     
Sbjct: 640 YGLSYTSFVYKNLSTDK-------------------------------DKYELNDTIHAT 668

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
           V+ +N G   G +VV +Y +  A    T +KQ+  F+++ +  G  + ++        L 
Sbjct: 669 VEVKNTGKYTGKEVVQLYVRDKASTYVTPVKQLRDFKKIELAPGETRTVQLQV-PISDLY 727

Query: 759 IVDYAANTLLPAGEHTIFVGNG 780
           +VD      + AGE  + VG  
Sbjct: 728 LVDEKNQRFVEAGEFILEVGQA 749


>gi|427384377|ref|ZP_18880882.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727638|gb|EKU90497.1| hypothetical protein HMPREF9447_01915 [Bacteroides oleiciplenus YIT
           12058]
          Length = 1050

 Score =  267 bits (682), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 159/427 (37%), Positives = 248/427 (58%), Gaps = 41/427 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ + + P   R+ DL+SR+T++EK+  L   + G+ RL +P+Y   +EALHGV  V PG
Sbjct: 28  LYKNENAPTHERIMDLLSRLTVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPG 85

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
                     T FP  I   A++N  L +++   +S EARA +N    G          L
Sbjct: 86  RF--------TVFPQAIGLAATWNPVLQEQVATVISDEARARWNELDQGREQKSQFSDLL 137

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+WSP +N+ARDPRWGR  ET GEDP++ G     +V+GLQ   G+      +SR LK+ 
Sbjct: 138 TFWSPTVNMARDPRWGRTPETYGEDPYLSGIMGTAFVKGLQ---GN------DSRYLKIV 188

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S  KH+AA + ++    +R+  + +++E+ + E +L  FE CVK+G ++S+M +YN +N 
Sbjct: 189 STPKHFAANNEEH----NRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMSAYNALND 244

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P   +  LL + +R +W   GY+V+DC    ++V+ HK++  +KE A   ++KAGLDL+
Sbjct: 245 VPCTLNAWLLTKVLRNDWGFKGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSIKAGLDLE 303

Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
           CG   Y     +A +Q  V + DID +   +    M+LG FD   +  Y  +    I S 
Sbjct: 304 CGDDVYDEPLLSAYRQYMVTDADIDSAAYRVLRARMQLGLFDSGEKNPYTKISPAVIGSK 363

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           E+ E+A  AARE IVLLKN +  LPLN+ K+K++AVVG   NA  +  G+Y+G+P   ++
Sbjct: 364 EHQEVALNAARECIVLLKNQKKMLPLNAKKIKSIAVVG--INAGSSEFGDYSGLPV--IA 419

Query: 460 PIAGFSG 466
           P++   G
Sbjct: 420 PVSVLQG 426



 Score =  140 bits (352), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 96/293 (32%), Positives = 144/293 (49%), Gaps = 54/293 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A +A +  +  + + G++ S+E E  DR D+ LP  Q + + ++ +V   P I+V++ AG
Sbjct: 596 AGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQREFLQEIYKV--NPNIVVVLVAG 653

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
              +A    + ++ AI+ A YPGE GG+A+A+V+FG +NPGGRLP+T+Y         L 
Sbjct: 654 S-SLAVNWMDEHVPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-------LD 705

Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
            +P  P D      GRTYK++ G  LYPFGYGLSYT FKY+       +QV         
Sbjct: 706 ELP--PFDDYDITKGRTYKYFKGDVLYPFGYGLSYTSFKYS------NLQV--------- 748

Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ--NVGSTDGSDVVIVYSKPPAEIAAT 726
                                  D   E  V FQ  N G   G +V  VY K P      
Sbjct: 749 ----------------------ADGEEEVSVSFQLKNTGRYAGDEVAQVYVKLPEREEVM 786

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVG 778
            +K++ GF+RV +++G +K++         L   D A    + P+G + I VG
Sbjct: 787 PVKELKGFERVSLKSGESKKVTIKLRK-DLLRYWDEAKGKFIYPSGNYNIMVG 838


>gi|189468358|ref|ZP_03017143.1| hypothetical protein BACINT_04755 [Bacteroides intestinalis DSM
           17393]
 gi|189436622|gb|EDV05607.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 865

 Score =  267 bits (682), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 160/450 (35%), Positives = 245/450 (54%), Gaps = 38/450 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + + +L  S R  DL+ RMTL+EK+ Q+ + +  + RLG+P Y+WW+EALHGV+  G   
Sbjct: 25  YRNPNLSPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAYDWWNEALHGVARAGK-- 82

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
                   AT FP  I   A+F+     +    VS EARA Y+         G  GLT+W
Sbjct: 83  --------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERGGYKGLTFW 134

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN+ RDPRWGR  ET GEDP++     +  V+GLQ         +   +  K  +C 
Sbjct: 135 TPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQG--------NGAGKYDKAHACA 186

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KHYA +    W   +R+ FD++ ++++D+ ET+L  F+  V EG    VMC+YNR  G P
Sbjct: 187 KHYAVHSGPEW---NRHSFDSKNISQRDLWETYLPAFKTLVTEGKVKEVMCAYNRFEGEP 243

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSI-QVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
            C++ +LL + +R +W     +V+DC +I      NH     S E A A  + +G DL+C
Sbjct: 244 CCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPSAEAASADAVVSGTDLEC 303

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDEN 401
           G  Y++    AV++G + E  I++S+  L     +LG FD      +  +    + S E+
Sbjct: 304 GGSYSSLN-EAVKKGLITEDKINESVFRLLRARFQLGMFDDDTLVSWSEIPYSVVESKEH 362

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
           ++ A E AR+ +VLL N  N+LPL S  ++ VAV+GP+AN +V +  NY G P + ++ +
Sbjct: 363 VDKALEMARKSMVLLTNKNNSLPL-SKSIRKVAVLGPNANDSVMLWANYNGFPTKSVTIL 421

Query: 462 AGFSGY---ANVTYKTGCDDVACKSNNSIF 488
            G         V Y+ GCD V+ ++  S F
Sbjct: 422 EGIRSKLPEGAVYYEKGCDFVSTQTLFSDF 451



 Score =  119 bits (298), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 80/263 (30%), Positives = 123/263 (46%), Gaps = 54/263 (20%)

Query: 501 IILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
           I + GL  ++E E +          DR ++ LP  Q +++  + +  K PVI V+ S  G
Sbjct: 605 IFVGGLSSALEGEEMPVDLPGFKKGDRTNIDLPRVQEEMLKALKKTGK-PVIFVVCS--G 661

Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
             +A      N+ A+L A YPG++GG A+ADV+FG +NP GRLP+T+Y  D       + 
Sbjct: 662 STLALPWEAENLDAMLEAWYPGQQGGTAVADVLFGDYNPAGRLPLTFYASD-------SD 714

Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
           +P    +      RTY+++ G  L+PFGYGLSYT F Y      K               
Sbjct: 715 LP--DFEDYNMSNRTYRYFKGKPLFPFGYGLSYTTFDYGKAKVDK--------------- 757

Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
                             ++  D     +  +N G  DG +VV VY + PA+     IK 
Sbjct: 758 ----------------KSIKTGDSMTLTIPLKNTGKMDGDEVVQVYLRNPADKEGP-IKM 800

Query: 731 VIGFQRVFVRAGRNKRIKFVFNA 753
           +  F+RV ++AG+ + I+    A
Sbjct: 801 LRAFRRVSLKAGQAENIQIELPA 823


>gi|371776218|ref|ZP_09482540.1| beta-glucosidase [Anaerophaga sp. HS1]
          Length = 774

 Score =  266 bits (681), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 220/783 (28%), Positives = 364/783 (46%), Gaps = 111/783 (14%)

Query: 64  RVKDLVSRMTLDEKVQQL----GDFAHGVPRLGLPQYEWWSEALHGVS-NVGPGTHFD-- 116
           +V  +++ MTLDEK+ QL    G+FA           E+  + + G + NV    H    
Sbjct: 43  KVDSVLNLMTLDEKIGQLNQYSGNFAVTGEVTDTKSGEYLKKGMIGSTFNVFGADHVRML 102

Query: 117 ------------------DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
                             DVI G  T+FP  +    S++  L +K  +  + EA A    
Sbjct: 103 QEQNLKYSRLKIPMLFAADVIHGLETTFPIPLAEACSWDLQLMEKSARIAAEEATA---- 158

Query: 158 GRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
             +G+ + ++P ++++RDPRWGRI E  GEDPF+    A   VRG Q ++ +++     S
Sbjct: 159 --SGVAWNFAPMVDISRDPRWGRIMEGAGEDPFLGSLIARARVRGFQGIDSYKDF----S 212

Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
           +P  + +C KH+  Y      G D +  D  ++E+ + ET+L PF+  V EG  +S M +
Sbjct: 213 KPNTMMACAKHFVGYGAAQ-AGRDYHTVD--ISERTLFETYLPPFKAAVDEG-VASFMTA 268

Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
           +N +NG+P   +  +    +R +W+ +G +V D  +IQ MV  H F  D K+ A    + 
Sbjct: 269 FNELNGVPCTGNKYIFQDILRHQWNFNGMVVTDYTAIQEMV-AHGFAKDLKQ-ASKLAID 326

Query: 337 AGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY--VSLGK 393
           AG+D+D   + +  +    V++G+V E  ID ++  +  +   LG FD   +Y      K
Sbjct: 327 AGIDMDMISEGFVTYLKELVEEGQVSEKQIDVAVARILEMKFLLGLFDDPYKYCDAEREK 386

Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA-- 451
           + + + ++++ A E A+  IVLLKN+ N LPL     K VA++GP      ++ G +A  
Sbjct: 387 EVLMNPQHLQAAREVAQRSIVLLKNENNVLPLRKDIPKRVALIGPFVKERESLNGEWAIK 446

Query: 452 -----------GIPCRYMSPIAGFSGYANVTYKTGCD----DVACKS--NNSIFA-ASEA 493
                      G+  +Y      F+ YA  T     D     V+ +   + S FA A   
Sbjct: 447 GDRSKSVTLWEGLQEKYADTPVRFN-YAKGTSLPLIDGATRHVSLEQGFDKSGFAEALRV 505

Query: 494 AKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDI 553
           AKT+D  ++  G       E+  R D+ LPG Q +L+ ++ +  K P++LV+ +   +D+
Sbjct: 506 AKTSDLILVAMGEHYHWSGEAASRTDITLPGNQRELLKELKKTGK-PIVLVLFNGRPLDL 564

Query: 554 AFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL----- 608
           ++     N+ AI+ A YPG   G A+ADV+ G +NP  RL +T+     V  +P+     
Sbjct: 565 SWEA--ENVDAIVEAWYPGIMAGHAVADVLSGDYNPSARLVVTFPRN--VGQIPIFYNMK 620

Query: 609 -TSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC 667
            T  P        Y        N P L+PFG+GLSYT F+Y+                  
Sbjct: 621 NTGRPFDENHPADYKSSYIDSPNSP-LFPFGFGLSYTSFQYD------------------ 661

Query: 668 RNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY 727
              N T  + K    G L+            VD  N G+ DG +VV +Y           
Sbjct: 662 ---NATISSQKLTKGGSLI----------VSVDVTNTGNVDGEEVVQLYIHDKVGSVTRP 708

Query: 728 IKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIH 787
           +K++ GF+++F++ G  K ++F  N  + L + +   + +   GE  ++V         H
Sbjct: 709 VKELKGFKKIFLKKGETKTVEFTINE-EMLKMYNINMDWVAEPGEFDVWVACNSADESNH 767

Query: 788 LNF 790
           L F
Sbjct: 768 LEF 770


>gi|294647557|ref|ZP_06725134.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294807095|ref|ZP_06765914.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345508184|ref|ZP_08787819.1| periplasmic beta-glucosidase [Bacteroides sp. D1]
 gi|292637099|gb|EFF55540.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294445794|gb|EFG14442.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345455214|gb|EEO50370.2| periplasmic beta-glucosidase [Bacteroides sp. D1]
          Length = 783

 Score =  266 bits (681), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 210/717 (29%), Positives = 323/717 (45%), Gaps = 107/717 (14%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    EA HG   +G           AT FPT I   A+++  L +++G+A+  
Sbjct: 131 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 178

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R      + G   + P +++ARDPRW R+ ET GEDP + G      V GL       
Sbjct: 179 EIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVEGL------- 226

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
              DL S P    +  KH+ AY +          F      +++ E FL PF   +  G 
Sbjct: 227 GGGDL-SHPYSTLATLKHFLAYGISESGQNGNPSFAGI---RELHENFLPPFRQAIDAG- 281

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           A SVM SYN ++G+P  A+  LL + +R EW   G +V+D  SI+ +  +H F+A + E+
Sbjct: 282 ALSVMTSYNSMDGVPCTANHSLLTELLRNEWKFRGIVVSDLYSIEGIHQSH-FVAPTMEE 340

Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
           A    L AG+D+D G   Y N   NAV  G++ +T +D S+  +  +   +G F+     
Sbjct: 341 AAILALSAGVDVDLGGDAYMNLM-NAVNTGRISKTALDASVARVLRLKFEMGLFENPYVD 399

Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
               K+++ S+E++ LA   A+  I LLKN+ + LPLN  K + VA++GP+A+    M+G
Sbjct: 400 PEKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLN--KNRKVALIGPNADNRYNMLG 457

Query: 449 NYAGIPCR-----YMSPIAGFSGYANVTYKTGC---DDVACKSNNSIFAASE-------- 492
           +Y            +  I      + V Y  GC   D V      ++ AA          
Sbjct: 458 DYTAPQEEENIKTVLDGIRAKLSSSQVEYVKGCSIRDTVTTDIEQAVAAAQRSEVIIAVV 517

Query: 493 ---AAKTADATIILAGLDLSVE--------AESLDREDLWLPGYQTQLINQVAEVAKGPV 541
              +A+    +    G  ++ E         E  DR  L L G Q +L+  +    K P+
Sbjct: 518 GGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELLKALKATGK-PL 576

Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
           I+V +    +D  +A  N +  A+L A YPG+EGG AIADV+FG FNP GRLP +     
Sbjct: 577 IVVYIEGRPLDKNWASENAD--AVLTAYYPGQEGGIAIADVLFGDFNPAGRLPFSVPRS- 633

Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
            V  +PL      P          Y   +   LYPFGYGLSYT F Y+ L  +  +  + 
Sbjct: 634 -VGQIPLYYNKKAP------QSHDYVEMSASPLYPFGYGLSYTSFDYSDLHLSALMPRS- 685

Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
                                            FE     +N G  DG +V  +Y +   
Sbjct: 686 ---------------------------------FEISFKVRNTGKYDGEEVAQLYLRDEY 712

Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                 +KQ+  F R +++ G  + +KF+ +  +  ++VD     ++  G   I +G
Sbjct: 713 ASVVQPLKQLKHFARFYLKRGEEREVKFILSE-EDFSLVDRNLKKIVEPGTFQIMIG 768


>gi|237709184|ref|ZP_04539665.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
 gi|229456880|gb|EEO62601.1| glycoside hydrolase family 3 protein [Bacteroides sp. 9_1_42FAA]
          Length = 864

 Score =  266 bits (681), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 166/453 (36%), Positives = 235/453 (51%), Gaps = 40/453 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + +S+L    R +DL+ ++TL+EKV  + D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   ASF       I  AVS EARA      A        GLT W
Sbjct: 82  --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +P +N+ RDPRWGR  ET GEDP++     VN V+GLQ        TD N +  K+ +C 
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ-------CTDANQKYDKIHACA 186

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ F+A  +  +D+ ET+L PFE  VKEG    VMC+YNR+ G P
Sbjct: 187 KHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDP 243

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +R EW   G +++DC +I        HK   D+ E A A  + +G DL+
Sbjct: 244 CCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDA-ESASAAAVLSGTDLE 302

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDE 400
           CG  Y     +A ++G + E DID S+K L      LG  D     ++  +    +CS E
Sbjct: 303 CGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYSVVCSAE 361

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
           +  L+ + AR+ + LL N  N LPL     +T+AV+GP+AN +V   GNY G P   ++ 
Sbjct: 362 HDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTITL 420

Query: 461 IAGFSGYA----NVTYKTGCDDVACKSNNSIFA 489
           + G          + Y+ GC  V      S+F+
Sbjct: 421 LEGIRSAMGENDKLIYEQGCSWVERSLIRSVFS 453



 Score =  126 bits (317), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 104/326 (31%), Positives = 144/326 (44%), Gaps = 62/326 (19%)

Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
           FSG A + +     D+  K   +I       K AD  I   G+  S+E E +        
Sbjct: 574 FSGDAQLNF-----DLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFR 628

Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
             DR D+ LP  Q +LI  + +  K  VI V  S  G  IA        +AIL A YPG+
Sbjct: 629 KGDRTDIELPAVQRELIKALCDAGK-KVIFVNFS--GSPIAMEPETKYCQAILQAWYPGQ 685

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
            GG+A A+V+FG +NP GRLP+T+Y    +  LP         +     GRTY+++ G  
Sbjct: 686 SGGKAAAEVLFGDYNPAGRLPVTFYRN--IAQLP-------DFEDYNMTGRTYRYFKGDP 736

Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
           L+PFGYGLSYT F Y+ +   +TI+V               + +K   P           
Sbjct: 737 LFPFGYGLSYTTFNYDNIKLDQTIKV--------------GETAKMVIP----------- 771

Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
                    N G+ DG +VV VY K   E A    K +  F+RV + AG+   ++     
Sbjct: 772 -------VTNAGNRDGEEVVQVYLK-KQEDAEGPAKTLRAFKRVQIPAGKTVNVELELTP 823

Query: 754 CKSLNIVDYAANTLLP-AGEHTIFVG 778
            K L   D   NT+   AG   I VG
Sbjct: 824 -KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|329956868|ref|ZP_08297436.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
 gi|328523625|gb|EGF50717.1| glycosyl hydrolase family 3 protein [Bacteroides clarus YIT 12056]
          Length = 864

 Score =  266 bits (680), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 171/466 (36%), Positives = 246/466 (52%), Gaps = 40/466 (8%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSN 108
           ++  ++ D+S   + R +DLV ++TL+EKV  + D +  V RLG+  Y WW+EALHGV+ 
Sbjct: 19  LAQSIYKDNSYSPAERAEDLVKQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVAR 78

Query: 109 VGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA-------- 160
            G           AT FP  I   ASF+         AVS EARA      A        
Sbjct: 79  SG----------WATVFPQPIGMAASFSPEALHTAFVAVSDEARAKNAAYSAEGSYKRYQ 128

Query: 161 GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLK 220
           GLT W+P +N+ RDPRWGR  ET GEDP++     V+ V+GLQ         D N +  K
Sbjct: 129 GLTIWTPTVNIYRDPRWGRGIETYGEDPYLASVMGVSVVKGLQ-------CLDENEKYDK 181

Query: 221 VSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           V +C KH+A +    W   +R+ F+A  ++ +D+ ET+L PFE  VKEG    VMC+YNR
Sbjct: 182 VHACAKHFAVHSGPEW---NRHSFNAENISPRDLYETYLPPFEALVKEGKVKEVMCAYNR 238

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN--HKFLADSKEDAVAQTLKA 337
             G P C   +LLN  +R EW   G +VADC +I    ++  HK  AD+   + A  L +
Sbjct: 239 FEGEPCCGSNRLLNHILRREWGYDGIVVADCSAISDFHNDKGHKTHADAASASSAAVL-S 297

Query: 338 GLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQD 395
           G DL+CG  Y + T   V++G + E DID+S+K L      LG  D   Q  +  +    
Sbjct: 298 GTDLECGSNYRSLT-EGVKKGFIDEADIDRSVKRLLQARFELGEMDEPDQVRWAQIPYSV 356

Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
           +CSD++  L+ + AR+ + LL N  N LPL      T+AV+GP+AN +V   GNY G+P 
Sbjct: 357 VCSDKHDSLSLDMARKSMTLLLNKNNALPLERGGT-TIAVMGPNANDSVMQWGNYNGLPK 415

Query: 456 RYMSPIAGFSGYA----NVTYKTGCDDVACKSNNSIFAASEAAKTA 497
           R ++ + G          + Y+ GC  V      S+F   ++ + A
Sbjct: 416 RTITILDGIRSAMGKDDKLIYEQGCSWVERTLIRSVFNQCKSKEGA 461



 Score =  131 bits (329), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 89/300 (29%), Positives = 132/300 (44%), Gaps = 56/300 (18%)

Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
           D+  K    I  +    K AD  I   G+   +E E +          DR D+ LP  Q 
Sbjct: 583 DLGFKEEADIQRSVAKVKDADVVIFAGGISPQLEGEEMGVKLPGFRGGDRTDIELPAVQR 642

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
           ++I  + +  K    ++ ++  G  IA        +AIL A YPG+ GG+A+A+V+FG +
Sbjct: 643 EMIKALHDAGKK---VIFVNCSGSPIAMEPETEYCQAILQAWYPGQSGGKAVAEVLFGDY 699

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
           NP GRLP T+Y         L  +P    +     G TY+F+NG  L+PFGYGLSYT FK
Sbjct: 700 NPAGRLPATFYRN-------LAQLP--DFEDYNMAGHTYRFFNGEPLFPFGYGLSYTTFK 750

Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
           Y  +    + Q                                 D+  +  V   N GS 
Sbjct: 751 YGKIQLKSSAQT--------------------------------DETVKITVPVTNTGSR 778

Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
           +G +VV VY K   E     +K +  F+RV++ AG+  +++      K L   D A NT+
Sbjct: 779 NGEEVVQVYLKKQGETDGP-VKTLRAFKRVYIPAGKTVKVELELTP-KQLEWWDSATNTM 836


>gi|94497563|ref|ZP_01304132.1| xylosidase/arabinosidase [Sphingomonas sp. SKA58]
 gi|94422980|gb|EAT08012.1| xylosidase/arabinosidase [Sphingomonas sp. SKA58]
          Length = 774

 Score =  266 bits (680), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 228/738 (30%), Positives = 348/738 (47%), Gaps = 111/738 (15%)

Query: 78  VQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNE 137
           V  L  +A    RLG+P   +  E LHG + VG           ATSFP  I   +S++ 
Sbjct: 108 VNALQKWAMTQTRLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIALASSWDP 155

Query: 138 SLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVN 197
            L +++   ++ E R      R      SP +++ARDPRWGRI ET GEDP++VG   V 
Sbjct: 156 HLVQQVNSVIAREIRV-----RGVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVA 210

Query: 198 YVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDMEET 256
            V GLQ   G   + DL  RP KV +  KH   +   ++   V      A ++E+++ E 
Sbjct: 211 AVEGLQ---GEGRSHDL--RPGKVFATLKHLTGHGQPESGTNVG----PAPISERELREN 261

Query: 257 FLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM 316
           F  PFE  VK    ++VM SYN ++G+PS  +  LL+  +RGEW   G +V+D   +  +
Sbjct: 262 FFPPFEQVVKRTGINAVMASYNEIDGVPSHMNRWLLDDVLRGEWGFRGAVVSDYSGVDQL 321

Query: 317 VDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT-GNAVQQGKVKETDIDKSLKYLYTV 375
           ++ H  +A S ++A  + L AG+D D  +  +  T G+ V+ GKV E  +DK+++ +  +
Sbjct: 322 MNIH-HVAGSLDEAARRALDAGVDADLPEGLSYATLGDQVRAGKVSEAQVDKAVRRMLEL 380

Query: 376 LMRLGFFD----GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK 431
             R G F+     + Q V+L        E   LA  AA+  I LLKND   LPL   KV+
Sbjct: 381 KFRAGLFEHPYADAAQAVAL----TNDAEARALARTAAQRSITLLKND-GMLPL---KVE 432

Query: 432 -TVAVVGPHANATVAMIGNYAGIPCRYMSPIAG------------FSGYANVT-----YK 473
            ++AV+GP  +A VA +G Y G P   +S + G            F+    +T     + 
Sbjct: 433 GSIAVIGP--SAAVARLGGYYGQPPHVVSILDGIKARVGDRVRIVFAQGVKITQDDDWWA 490

Query: 474 TGCDDVACKSNNSIFA-ASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQ 526
              D      N  + A A EAA+  D  ++  G       E        DR  L L G Q
Sbjct: 491 DKVDKADPAENRRLIAQAVEAARNVDRIVLTLGDTEQSSREGWAANHLGDRPSLDLVGEQ 550

Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
            +L + +  + K P+ +V+++  G   +  + +    A+L   Y GE+GG A+AD++FG 
Sbjct: 551 QELFDALKTLGK-PITVVLIN--GRPASTVKVSEEANALLEGWYLGEQGGHAVADILFGD 607

Query: 587 FNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
            NPGG+LP+T      V  LP     ++P       GR Y F     LYPFG+GLSYT F
Sbjct: 608 VNPGGKLPVTVPRS--VGQLP-AFYNVKP-----SAGRGYLFDTNAPLYPFGFGLSYTNF 659

Query: 647 KYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGS 706
               LS  +  Q ++                    PG   +           VD +N G+
Sbjct: 660 T---LSPPRLAQSSIG-------------------PGGTTS---------VTVDVRNDGA 688

Query: 707 TDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANT 766
            DG +VV +Y           IK++ GF+RV ++ G  + ++F     +SL + +   + 
Sbjct: 689 RDGDEVVQLYIHDKVSSVTRPIKELKGFERVSLKPGEVRTVRFTIT-PESLQMWNDKMHR 747

Query: 767 LLPAGEHTIFVGNGGVSF 784
           ++  GE  I  GN  V+ 
Sbjct: 748 VVEPGEFEIMTGNSSVAL 765


>gi|375254464|ref|YP_005013631.1| glycosyl hydrolase family 3, C-terminal domain-containing protein
           [Tannerella forsythia ATCC 43037]
 gi|363407375|gb|AEW21061.1| glycosyl hydrolase family 3, C-terminal domain protein [Tannerella
           forsythia ATCC 43037]
          Length = 775

 Score =  266 bits (680), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 222/754 (29%), Positives = 356/754 (47%), Gaps = 122/754 (16%)

Query: 76  EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
           E +  L  +A    RLG+P + +  E +HG   +G            T FPT I   +++
Sbjct: 107 EALNALQKYAMENTRLGIPIF-FAEECMHGHMAIG-----------TTVFPTSIGQASTW 154

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
           N +L +K+G A++ E R+     +     + P +++AR+PRW R+ ET GEDP + G   
Sbjct: 155 NRTLIEKMGAAIAHETRS-----QGAHIAYGPVLDLAREPRWSRVEETFGEDPVLSGILG 209

Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEE 255
             +VRGLQ   G + A   ++      S  KH AAY +       R    A++  +++  
Sbjct: 210 SAFVRGLQ---GKDFADGRHTY-----STLKHLAAYGIPVGGHNGR---QAQIGARELIA 258

Query: 256 TFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQV 315
             L PFEM VK G A SVM SYN V+G+P  ++  +L + +RGEWD +G++V+D  SI+ 
Sbjct: 259 EHLLPFEMAVKAG-AQSVMTSYNAVDGVPCTSNTYILKKILRGEWDFNGFVVSDLGSIEG 317

Query: 316 MVDNHKFLADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYT 374
           +   H+   D K  A A  L AG+++D G   YT     A     +  ++ID ++  +  
Sbjct: 318 IATTHRVAPDIKH-AAAMALNAGVEMDLGGVAYTRNMEQAHTDSLISMSEIDDAVSRILR 376

Query: 375 VLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVA 434
           +   +G F+      S   + I S E+  LA + A E IVLLKN+ N LPL S  + ++A
Sbjct: 377 LKFEMGLFESPYVQPSRTTEIIRSKEHNRLARKVAEESIVLLKNNANLLPL-SKNIGSIA 435

Query: 435 VVGPHANATVAMIGNY-AGIPCRYMSPIAGFSGYAN-------VTYKTGCDDVACKSNNS 486
           V+GP+A+     +G+Y A  P  ++  I    G  N       + Y  GC  V   + ++
Sbjct: 436 VIGPNADNLYNQLGDYTAPQPEEHIVTI--LEGIRNAVSPTTVIRYVKGC-AVRDTTQSN 492

Query: 487 IFAASEAAKTADATIILAG-------------------------LDLSVEA-ESLDREDL 520
           I  A  AA  ++A +++ G                         L   +E+ E  DR+ L
Sbjct: 493 IDEAVRAANASNAVVLVVGGSSARDFHTKYIETGAATVSSRENELIPDMESGEGYDRKSL 552

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
            L G+Q +LI  +A   K P+I+V +    +++  A+   +  A+L A YPGEEGG A+A
Sbjct: 553 TLLGHQEKLIESIAATGK-PLIMVYIQGRPLNMNLADKKAS--ALLTAWYPGEEGGNAVA 609

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYG 640
           +V+FG  NP GRLPI+         +P ++  L    SLG      +  + P LY FGYG
Sbjct: 610 NVIFGDVNPSGRLPIS---------VPRSTGQLPVYYSLGKSNDYVEGTSTP-LYAFGYG 659

Query: 641 LSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
           LSYT F+Y  L+ ++               N T   + T                     
Sbjct: 660 LSYTAFEYGNLTISR------------EGGNITVSCTVT--------------------- 686

Query: 701 FQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI--GFQRVFVRAGRNKRIKFVFNACKSLN 758
             N G+TDG +VV +Y +    +A+  +  V+   F ++ ++ G + R+ FV    + L 
Sbjct: 687 --NTGNTDGDEVVQLYLRD--HVASVSVPPVLLKDFAKISLKKGESARVNFVLTP-EQLA 741

Query: 759 IVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFNY 792
             +     ++  GE T+ +G       +  +F Y
Sbjct: 742 FFNTDLKRVVEPGEFTVMIGAASNDIRLKESFVY 775


>gi|262405113|ref|ZP_06081663.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_22]
 gi|262355988|gb|EEZ05078.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_22]
          Length = 769

 Score =  266 bits (680), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 210/717 (29%), Positives = 323/717 (45%), Gaps = 107/717 (14%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    EA HG   +G           AT FPT I   A+++  L +++G+A+  
Sbjct: 117 RLGIPLF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPQLIREVGKAIGK 164

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R      + G   + P +++ARDPRW R+ ET GEDP + G      V GL       
Sbjct: 165 EIRL-----QGGHISYGPVLDLARDPRWSRVEETFGEDPVLTGEIGKAMVEGL------- 212

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
              DL S P    +  KH+ AY +          F      +++ E FL PF   +  G 
Sbjct: 213 GGGDL-SHPYSTLATLKHFLAYGISESGQNGNPSFAGI---RELHENFLPPFRQAIDAG- 267

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           A SVM SYN ++G+P  A+  LL + +R EW   G +V+D  SI+ +  +H F+A + E+
Sbjct: 268 ALSVMTSYNSMDGVPCTANHSLLTELLRNEWKFRGIVVSDLYSIEGIHQSH-FVAPTMEE 326

Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
           A    L AG+D+D G   Y N   NAV  G++ +T +D S+  +  +   +G F+     
Sbjct: 327 AAILALSAGVDVDLGGDAYMNLM-NAVNTGRISKTALDASVARVLRLKFEMGLFENPYVD 385

Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
               K+++ S+E++ LA   A+  I LLKN+ + LPLN  K + VA++GP+A+    M+G
Sbjct: 386 PEKAKKEVRSEESVTLARRVAQASITLLKNEHSLLPLN--KNRKVALIGPNADNRYNMLG 443

Query: 449 NYAGIPCR-----YMSPIAGFSGYANVTYKTGC---DDVACKSNNSIFAASE-------- 492
           +Y            +  I      + V Y  GC   D V      ++ AA          
Sbjct: 444 DYTAPQEEENIKTVLDGIRAKLSSSQVEYVKGCSIRDTVTTDIEQAVAAAQRSEVIIAVV 503

Query: 493 ---AAKTADATIILAGLDLSVE--------AESLDREDLWLPGYQTQLINQVAEVAKGPV 541
              +A+    +    G  ++ E         E  DR  L L G Q +L+  +    K P+
Sbjct: 504 GGSSARDFKTSYKETGAAIADEKTISDMECGEGFDRATLSLLGKQQELLKALKATGK-PL 562

Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
           I+V +    +D  +A  N +  A+L A YPG+EGG AIADV+FG FNP GRLP +     
Sbjct: 563 IVVYIEGRPLDKNWASENAD--AVLTAYYPGQEGGIAIADVLFGDFNPAGRLPFSVPRS- 619

Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
            V  +PL      P          Y   +   LYPFGYGLSYT F Y+ L  +  +  + 
Sbjct: 620 -VGQIPLYYNKKAP------QSHDYVEMSASPLYPFGYGLSYTSFDYSDLHLSALMPRS- 671

Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
                                            FE     +N G  DG +V  +Y +   
Sbjct: 672 ---------------------------------FEISFKVRNTGKYDGEEVAQLYLRDEY 698

Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                 +KQ+  F R +++ G  + +KF+ +  +  ++VD     ++  G   I +G
Sbjct: 699 ASVVQPLKQLKHFARFYLKRGEEREVKFILSE-EDFSLVDRNLKKIVEPGTFQIMIG 754


>gi|153809292|ref|ZP_01961960.1| hypothetical protein BACCAC_03604 [Bacteroides caccae ATCC 43185]
 gi|149128062|gb|EDM19283.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           caccae ATCC 43185]
          Length = 946

 Score =  266 bits (680), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 235/819 (28%), Positives = 370/819 (45%), Gaps = 146/819 (17%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS------ 100
           + D + P   R++DL+S+MTL+EK  Q+    +G  R+    LP  EW    W       
Sbjct: 53  YEDPTAPIDARIEDLLSQMTLEEKTCQMVTL-YGYKRVLKDDLPTSEWKNQLWKDGIGAI 111

Query: 101 -EALHGVSNVG-PGTHFDDVIPG------------------------------------- 121
            E L+G    G P +  + V P                                      
Sbjct: 112 DEHLNGFQQWGLPPSDNEYVWPASKHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVESY 171

Query: 122 -ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWGR 179
            AT+FPT +    ++N  L  ++G     EAR +      G T  ++P ++V RD RWGR
Sbjct: 172 KATNFPTQLGLGHTWNRQLIHQVGLITGREARML------GYTNVYAPILDVGRDQRWGR 225

Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
             E  GE P++V    +  VRG+Q    H +         +V++  KH+ AY  +     
Sbjct: 226 YEEVYGESPYLVAELGIEMVRGMQ----HNH---------QVAATGKHFIAYSNNKGARE 272

Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
                D +++ +++E     PF+  ++E     VM SYN  +G P  +    L   +RGE
Sbjct: 273 GMARVDPQMSPREVEMLHAYPFKRVIREAGLLGVMSSYNDYDGFPIQSSYYWLTTRLRGE 332

Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAV 355
               GY+V+D D+++ +   H    D KE AV Q+++AGL++ C       Y       V
Sbjct: 333 MGFRGYVVSDSDAVEYLYTKHGTAKDMKE-AVRQSVEAGLNVRCTFRSPDSYVLPLRELV 391

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREGI 413
           ++G + E  I+  ++ +  V   +G FD +P    L   D  +   EN E+A +A+RE I
Sbjct: 392 KEGGLSEEVINDRVRDILRVKFLVGLFD-TPYQTDLKGADEEVEKKENEEVALQASRESI 450

Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG----FSGYAN 469
           VLLKN++N LPL+ +K++ +AV GP+A+     + +Y  +     S + G        A+
Sbjct: 451 VLLKNEKNVLPLDPSKIRKIAVCGPNADEHSYALTHYGPLAVEVTSVLKGIQEKMKDKAD 510

Query: 470 VTYKTGCDDVAC--------------KSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
           V Y  GCD V                +    I  A   AK AD  I++ G       E+ 
Sbjct: 511 VLYTKGCDLVDANWPESELIDYPLTDEEQKEIDKAVSQAKQADVAIVVLGGGQRTCGENK 570

Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
            R  L LPG Q  L+  V    K PV+LV+++   + I +A+    + AIL A YPG +G
Sbjct: 571 SRSSLDLPGRQLDLLKAVVATGK-PVVLVLINGRPLSINWAD--KFVPAILEAWYPGSKG 627

Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VDSLGYPGR--TYKFYN 630
           G A+AD++FG +NPGG+L +T+     V  +P  + P +P   +D    PG        N
Sbjct: 628 GIAVADILFGDYNPGGKLTVTFPK--TVGQIPF-NFPCKPSSQIDGGKNPGPDGNMSRAN 684

Query: 631 GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
           G  LYPFGYGLSYT F+Y+ L  +                           P ++  + +
Sbjct: 685 G-ALYPFGYGLSYTTFEYSDLKIS---------------------------PAIITPNQK 716

Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
              Y   KV   N G   G +V+ +Y +       TY K + GF+RV ++ G  K I F 
Sbjct: 717 A--YVTCKV--TNTGKRSGDEVIQLYVRDVLSSVTTYEKNLAGFERVHLKPGETKEITFP 772

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLN 789
            +  K+L +++   + ++  G+ T+ +  G  S  I LN
Sbjct: 773 IDR-KALELLNADMHWVVEPGDFTLML--GASSTDIRLN 808


>gi|255690486|ref|ZP_05414161.1| periplasmic beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260623937|gb|EEX46808.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
            finegoldii DSM 17565]
          Length = 1365

 Score =  266 bits (680), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 227/797 (28%), Positives = 352/797 (44%), Gaps = 146/797 (18%)

Query: 54   FCDSSLPYSIRVKDLVSRMTLDEKVQQLGD---------------------------FAH 86
            +  + LP   RVKDL+ RMT +EK+ Q+                             F  
Sbjct: 536  YQRADLPIEERVKDLLQRMTPEEKLAQIRHIHSWEIFNGQALDERKLEEKAQGMSWGFVE 595

Query: 87   GVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSF 125
            G P                     RLG+P +   +E+LHGV           V  GAT F
Sbjct: 596  GFPLTAENCAKNMLAIQRFMVEKTRLGIPIFTV-AESLHGV-----------VHEGATVF 643

Query: 126  PTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPG 185
            P  I   ++F+  L  +    ++ E  A+           SP I+V RD RWGR+ E+ G
Sbjct: 644  PQNIALGSTFDTDLAYRKTSMIADELHAV-----GMRQVLSPCIDVVRDLRWGRVEESFG 698

Query: 186  EDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFD 245
            EDP++ GR+ +  V+G  D                +S   KHY  +  +   G++    +
Sbjct: 699  EDPYLCGRFGIAEVKGYMDN--------------GISPMLKHYGPHG-NPLSGLNLASVE 743

Query: 246  ARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGY 305
              +  +D+ E +L+PFEM +K+    +VM +YN  N IP+ A   LL   +R EW   GY
Sbjct: 744  TSI--RDLHEVYLKPFEMVMKQAPTLAVMSAYNSWNRIPNSASHYLLTDVLRKEWGFKGY 801

Query: 306  IVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDI 365
            + +D  +I+ M+ N  F A + E+A  Q L AGLD++            +++G++    +
Sbjct: 802  VYSDWGAIE-MLKNFHFTARNSEEAALQALTAGLDVEASSDCYPAIPGLIERGELNREIV 860

Query: 366  DKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
            D++++ +     R+G FD  P      K  I S + I L+ + A E  VLLKN++  LPL
Sbjct: 861  DEAVRRVLYAKFRIGLFD-DPYGEKFAKGAIHSGKAIALSKKIADESTVLLKNERQLLPL 919

Query: 426  NSAKVKTVAVVGPHANATVAMIGNYAGI-PCRY-MSPIAGFSGYA----NVTYKTGCDDV 479
            +  K+K++AV+GP  NA     G+Y      R+ ++P+ G   +A     V Y  GC  V
Sbjct: 920  SIGKLKSIAVIGP--NADQIQFGDYTWTRDNRFGVTPLQGIRKWAGTNVKVNYAKGCSLV 977

Query: 480  ACKSNNSIFAASEAAKTADATIILAG---------LDLSVEAESLDREDLWLPGYQTQLI 530
            +    + I  A EAA+ +D  ++  G            S   E  D  DL L G Q  LI
Sbjct: 978  SM-DESGIRQAVEAAEQSDVCVLFCGSASAALARDYKSSTCGEGFDLNDLTLTGAQPALI 1036

Query: 531  NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
              V    K PVILV+++  G   A      NI AIL   Y GE+ G +IAD++FGK +P 
Sbjct: 1037 KAVQATGK-PVILVLVT--GKPFAIPWEKKNIPAILVQWYAGEQSGNSIADILFGKVSPS 1093

Query: 591  GRLPITWYNGDYVQMLPLTSMPLR-------PVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
            GRL  ++   +    LP+    LR          S   PGR Y F     L+ FG+GL+Y
Sbjct: 1094 GRLTFSF--PESTGHLPVYYNHLRSDRGFYKSPGSYDSPGRDYVFSAPVPLWSFGHGLTY 1151

Query: 644  TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
            T F+Y+ L                          +T     L+ND         ++  +N
Sbjct: 1152 TTFEYSNL--------------------------QTDRASYLLNDT-----VHVRIGLKN 1180

Query: 704  VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
             G  +G +VV +Y        A  ++Q+  F++V ++AG  + ++        L I++  
Sbjct: 1181 TGKCEGKEVVQLYVSDVCSSVAMPVRQLRDFRKVALQAGETQIVRLSI-PVSELTILNEK 1239

Query: 764  ANTLLPAGEHTIFVGNG 780
               ++  GE  I VG+ 
Sbjct: 1240 NEAIVEPGEFEIQVGSA 1256


>gi|399025438|ref|ZP_10727439.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
           CF314]
 gi|398078072|gb|EJL69004.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
           CF314]
          Length = 740

 Score =  266 bits (680), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 221/763 (28%), Positives = 357/763 (46%), Gaps = 111/763 (14%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFD------- 116
           +V +L+S+MTL+EKV QL  ++ G      PQ    +  L  + +   G+  +       
Sbjct: 26  KVSELLSKMTLEEKVGQLVQYS-GFEYATGPQNSNSATVLEEIKSGKVGSMLNVAGVEET 84

Query: 117 --------------------DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMY 155
                               DVI G  T+FP  +   AS++  L +K  +  +TEA A Y
Sbjct: 85  RSFQKLALQSRLKIPLLFGQDVIHGYRTTFPVNLGQAASWDLGLIEKSERIAATEASA-Y 143

Query: 156 NLGRAGLTYWS--PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
            +      +W+  P +++ARDPRWGR+ E  GED ++  +  +  ++G Q  +G  N   
Sbjct: 144 GI------HWTFAPMVDIARDPRWGRVMEGSGEDTYLGTQIGLARIKGFQG-KGLGNID- 195

Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
                  + +C KH+AAY      G D    D  + +  + ET+L PF+   + G  ++ 
Sbjct: 196 ------AIMACAKHFAAYGA-AVGGRDYNSVDMSLRQ--LNETYLPPFKAAAEAG-VATF 245

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           M S+N +NG+P+ A+  +L   ++G+W+  G++V+D  SI  M   H +  D K +A  +
Sbjct: 246 MNSFNDINGVPATANTYILRDLLKGKWNYKGFVVSDWGSIGEMT-YHGYTKD-KTEAAQK 303

Query: 334 TLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG 392
            + AG D+D   + Y       V++GKV    ID++ + + T    +G FD   ++    
Sbjct: 304 AILAGSDMDMESRVYMAELPKLVKEGKVDPKFIDEAARRILTKKFEMGLFDDPYRFSDDK 363

Query: 393 KQ--DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
           +Q     + EN +   E   + +VLLKN +N LP+ S   KTVA++GP    TVA  G +
Sbjct: 364 RQKDQTNNQENRKFGREFGSKSMVLLKNQKNILPI-SKSTKTVALIGPFGKETVANHGFW 422

Query: 451 A-GIPCRYMSPIAGFSGYAN-------VTYKTGCDDVACKSNNSIFA-ASEAAKTADATI 501
           A G        ++ F G  N       + Y  GC+      + S+FA A E AK AD  I
Sbjct: 423 AVGFKDDSQRIVSQFDGIRNQLDQNSALLYAKGCN--VDDQDRSMFAEAVETAKKADVVI 480

Query: 502 ILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTN 561
           +  G   ++  E+  R ++   G Q  L+ ++A+  K P++L+I +  G  + F     N
Sbjct: 481 MTLGEGHAMSGEAKSRSNIHFSGVQEDLLKEIAKTGK-PIVLMINA--GRPLVFDWAADN 537

Query: 562 IKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRP 615
           I  I++  + G E G +IADV+FGK NPGG+LP+T+   +    +P+      T  P + 
Sbjct: 538 IPTIMYTWWLGTEAGNSIADVLFGKVNPGGKLPMTFPRTE--GQIPVYYNHYNTGRPAKT 595

Query: 616 VDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
                Y        N P  +PFGYGLSYTQFKY+ +  +                     
Sbjct: 596 NTERNYVSAYIDLDNDPK-FPFGYGLSYTQFKYSDMILSSA------------------- 635

Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
                       DL+ +     KV+  N G+ DG +VV +Y +         +K++ GFQ
Sbjct: 636 ------------DLKGNQTLNIKVNISNTGNYDGEEVVQLYIRDLFGKVVRPVKELKGFQ 683

Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           ++F++ G  K + F     ++L   D A N     GE  I VG
Sbjct: 684 KIFLKKGETKIVSFNLTP-ENLKFYDDALNYDWEGGEFDIMVG 725


>gi|189461690|ref|ZP_03010475.1| hypothetical protein BACCOP_02354 [Bacteroides coprocola DSM 17136]
 gi|189431577|gb|EDV00562.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           coprocola DSM 17136]
          Length = 499

 Score =  266 bits (680), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 161/428 (37%), Positives = 241/428 (56%), Gaps = 43/428 (10%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ +   P   R+ DL+SR+T++EKV  L   + G+ RL +P+Y   +EALHGV  V PG
Sbjct: 27  LYKNEDAPLHERIMDLLSRLTVEEKVSLLRATSPGISRLDIPKYYHGNEALHGV--VRPG 84

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
                     T FP  I   A++N  L  ++   +S EARA +N    G          L
Sbjct: 85  RF--------TVFPQAIGLAATWNPELQYQVATVISDEARARWNELDQGKLQKGQFSDLL 136

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+WSP +N+ARDPRWGR  ET GEDP++ G     +VRGLQ  +         +R LKV 
Sbjct: 137 TFWSPTVNMARDPRWGRTPETYGEDPYLSGTMGTAFVRGLQGDD---------ARYLKVV 187

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S  KH+AA + ++    +R+  + +++E+ + E +L  FE C+K+G A+S+M +YN +N 
Sbjct: 188 STPKHFAANNEEH----NRFECNPQISEKQLREYYLPAFEACIKDGKAASIMSAYNAINN 243

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P   +  LL + +R +W   GY+V+DC    ++V+ HK++  +KE A   ++KAGLDL+
Sbjct: 244 VPCTLNSWLLTKVLRHDWGFQGYVVSDCGGPSLLVNAHKYV-KTKEAAATLSIKAGLDLE 302

Query: 343 CGQ--YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICS 398
           CG   YY     NA +Q  V + DID +  ++    MRLG FD      Y  +    I S
Sbjct: 303 CGDDVYYEPLL-NAYKQYMVSDADIDSTAYHVLKARMRLGLFDNGKNNPYTKISPSIIGS 361

Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM 458
             +  +A EAAR+ IVLLKN    LPL++ K+K++AVVG   NA     G+Y+G P   +
Sbjct: 362 KLHQRVALEAARQCIVLLKNHNWVLPLDTKKLKSIAVVG--INAGNCEFGDYSGSPV--I 417

Query: 459 SPIAGFSG 466
           +PI+   G
Sbjct: 418 APISILQG 425


>gi|345514226|ref|ZP_08793739.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
 gi|229437207|gb|EEO47284.1| glycoside hydrolase family beta-glycosidase [Bacteroides dorei
           5_1_36/D4]
          Length = 864

 Score =  266 bits (679), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 166/452 (36%), Positives = 234/452 (51%), Gaps = 40/452 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + +S+L    R +DL+ ++TL+EKV  + D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   ASF       I  AVS EARA      A        GLT W
Sbjct: 82  --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +P +N+ RDPRWGR  ET GEDP++     VN V+GLQ        TD N +  K+ +C 
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ-------CTDANQKYDKIHACA 186

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ F+A  +  +D+ ET+L PFE  VKEG    VMC+YNR+ G P
Sbjct: 187 KHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDP 243

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +R EW   G +++DC +I        HK   D+ E A A  + +G DL+
Sbjct: 244 CCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDA-ESASAAAVLSGTDLE 302

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDE 400
           CG  Y     +A ++G + E DID S+K L      LG  D     ++  +    +CS E
Sbjct: 303 CGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYSVVCSAE 361

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
           +  L+ + AR+ + LL N  N LPL     +T+AV+GP+AN +V   GNY G P   ++ 
Sbjct: 362 HDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTITL 420

Query: 461 IAGFSGYA----NVTYKTGCDDVACKSNNSIF 488
           + G          + Y+ GC  V      S+F
Sbjct: 421 LEGIRSAMGENDKLIYEQGCSWVERSLIRSVF 452



 Score =  126 bits (317), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 104/326 (31%), Positives = 144/326 (44%), Gaps = 62/326 (19%)

Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
           FSG A + +     D+  K   +I       K AD  I   G+  S+E E +        
Sbjct: 574 FSGDAQLNF-----DLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFR 628

Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
             DR D+ LP  Q +LI  + +  K  VI V  S  G  IA        +AIL A YPG+
Sbjct: 629 KGDRTDIELPAVQRELIKALCDAGK-KVIFVNFS--GSPIAMEPETKYCQAILQAWYPGQ 685

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
            GG+A A+V+FG +NP GRLP+T+Y    +  LP         +     GRTY+++ G  
Sbjct: 686 SGGKAAAEVLFGDYNPAGRLPVTFYRN--IAQLP-------DFEDYNMTGRTYRYFKGDP 736

Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
           L+PFGYGLSYT F Y+ +   +TI+V               + +K   P           
Sbjct: 737 LFPFGYGLSYTTFNYDNIKLDQTIKV--------------GETAKMVIP----------- 771

Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
                    N G+ DG +VV VY K   E A    K +  F+RV + AG+   ++     
Sbjct: 772 -------VTNAGNRDGEEVVQVYLK-KQEDAEGPAKTLRAFKRVQIPAGKTVNVELELTP 823

Query: 754 CKSLNIVDYAANTLLP-AGEHTIFVG 778
            K L   D   NT+   AG   I VG
Sbjct: 824 -KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|265752711|ref|ZP_06088280.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           3_1_33FAA]
 gi|263235897|gb|EEZ21392.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           3_1_33FAA]
          Length = 864

 Score =  266 bits (679), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 166/452 (36%), Positives = 234/452 (51%), Gaps = 40/452 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + +S+L    R +DL+ ++TL+EKV  + D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   ASF       I  AVS EARA      A        GLT W
Sbjct: 82  --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +P +N+ RDPRWGR  ET GEDP++     VN V+GLQ        TD N +  K+ +C 
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ-------CTDANQKYDKIHACA 186

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ F+A  +  +D+ ET+L PFE  VKEG    VMC+YNR+ G P
Sbjct: 187 KHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDP 243

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +R EW   G +++DC +I        HK   D+ E A A  + +G DL+
Sbjct: 244 CCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDA-ESASAAAVLSGTDLE 302

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDE 400
           CG  Y     +A ++G + E DID S+K L      LG  D     ++  +    +CS E
Sbjct: 303 CGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYSVVCSAE 361

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
           +  L+ + AR+ + LL N  N LPL     +T+AV+GP+AN +V   GNY G P   ++ 
Sbjct: 362 HDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTITL 420

Query: 461 IAGFSGYA----NVTYKTGCDDVACKSNNSIF 488
           + G          + Y+ GC  V      S+F
Sbjct: 421 LEGIRSAMGENDKLIYEQGCSWVERSLIRSVF 452



 Score =  125 bits (313), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 102/326 (31%), Positives = 142/326 (43%), Gaps = 62/326 (19%)

Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
           FSG A + +     D+  K   +I       K AD  I   G+  S+E E +        
Sbjct: 574 FSGDAQLNF-----DLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFR 628

Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
             DR D+ LP  Q +LI  + +  K  VI V  S  G  IA        +AIL A YPG+
Sbjct: 629 KGDRTDIELPAVQRELIKALCDAGK-KVIFVNFS--GSPIAMEPETQYCQAILQAWYPGQ 685

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
            GG+A A+V+FG +NP GRLP+T+Y    +  LP         +     GRTY+++ G  
Sbjct: 686 SGGKAAAEVLFGDYNPAGRLPVTFYRN--IAQLP-------DFEDYNMTGRTYRYFKGDP 736

Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
           L+PFGYGLSYT F Y  +   +TI+V               + +K   P           
Sbjct: 737 LFPFGYGLSYTTFNYGNIKLEQTIKV--------------GETAKMVIP----------- 771

Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
                    N G+ DG +VV VY K   +      K +  F+RV + AG+   ++     
Sbjct: 772 -------VTNTGNRDGEEVVQVYLKKQEDTEGP-AKTLRAFKRVQIPAGKTVNVELELTP 823

Query: 754 CKSLNIVDYAANTLLP-AGEHTIFVG 778
            K L   D   NT+   AG   I VG
Sbjct: 824 -KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|383115340|ref|ZP_09936096.1| hypothetical protein BSGG_2785 [Bacteroides sp. D2]
 gi|313695250|gb|EFS32085.1| hypothetical protein BSGG_2785 [Bacteroides sp. D2]
          Length = 735

 Score =  266 bits (679), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 212/765 (27%), Positives = 361/765 (47%), Gaps = 109/765 (14%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG--------------VP-RLGLPQYE 97
           L+ D   P   RV DL+SRMTL+EKV QL  +  G              VP  +G   Y 
Sbjct: 29  LYKDPKAPIEKRVNDLLSRMTLEEKVMQLNQYTLGRNNNVNNVGEEVKKVPAEIGSLIYF 88

Query: 98  WWSEALHGV--------SNVGPGTHFD-DVIPG-ATSFPTVILTTASFNESLWKKIGQAV 147
             + AL           S +G    F  D I G  T +P  +    S+N  L ++     
Sbjct: 89  ETNPALRNSMQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLAQACSWNPDLVEQACAVS 148

Query: 148 STEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
           + EAR       +G+ + +SP I+VARDPRWGR+ E  GEDP+  G +    V+G Q   
Sbjct: 149 AQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYTNGVFGAASVKGYQ--- 199

Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
                 DL++   ++++C KHY  Y         R +    +++Q + +T+L P+EM VK
Sbjct: 200 ----GDDLSAEN-RMAACLKHYVGYGASE---AGRDYVYTEISKQTLWDTYLLPYEMGVK 251

Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
            G A+++M S+N ++G+P  A+  ++ + ++  W   G+IV+D  +I+ +   ++ LA +
Sbjct: 252 AG-AATLMSSFNDISGVPGSANSYIMTEILKKRWGHDGFIVSDWGAIEQL--KNQGLAAT 308

Query: 327 KEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
           K++A      AGL++D   + Y       V++G+V    +D++++ +  +  RLG F+  
Sbjct: 309 KKEAAWHAFTAGLEMDMMSHAYDRHLQELVEEGRVSVAQVDEAVRRVLLLKFRLGLFERP 368

Query: 386 PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
               +  K+     +++++AA  A E +VLLKN+  TLPL     K +AV+GP A     
Sbjct: 369 YTPATSEKERFFRPQSMDIAARLAAESMVLLKNENKTLPLTDK--KKIAVIGPMAKNGWD 426

Query: 446 MIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNS--IFAASEAAKTA 497
           ++G++ G      +   Y      F+G A + Y  GC   A K +N      A EAA+ +
Sbjct: 427 LLGSWCGHGKDTDVAMLYNGLATEFAGKAELRYAAGC---ATKGDNKEGFAEALEAARWS 483

Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
           D  ++  G  ++   E+  R  + LP  Q +L  ++ +  K P++LV+++   +++   E
Sbjct: 484 DVVVLCLGEMMTWSGENASRSSIALPQIQEELAAELKKAGK-PIVLVLVNGRPLELNRLE 542

Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL-- 613
             ++  AIL    PG  G   +A ++ G+ NP G+L +T          P ++  +P+  
Sbjct: 543 PISD--AILEIWQPGVNGALPMAGILSGRINPSGKLAMT---------FPYSTGQIPIYY 591

Query: 614 -RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
            R     G+ G  YK      LYPFG+GLSYT+FKY  ++ +                  
Sbjct: 592 NRRKSGRGHQG-FYKDITSDPLYPFGHGLSYTEFKYGTVTPS------------------ 632

Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
                        V  ++  D    +V   NVG+ DG++ V  +   P       +K++ 
Sbjct: 633 -------------VTKVKRGDRLSVEVTVTNVGARDGAETVHWFISDPYCSITRPVKELK 679

Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
            F++  ++AG  K  +F  +  +    V+      L AGE+ I V
Sbjct: 680 HFEKQLIKAGETKTFRFDIDLERDFGFVNEDGKRFLEAGEYHILV 724


>gi|423230604|ref|ZP_17217008.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
           CL02T00C15]
 gi|423244313|ref|ZP_17225388.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
           CL02T12C06]
 gi|392630748|gb|EIY24734.1| hypothetical protein HMPREF1063_02828 [Bacteroides dorei
           CL02T00C15]
 gi|392642494|gb|EIY36260.1| hypothetical protein HMPREF1064_01594 [Bacteroides dorei
           CL02T12C06]
          Length = 864

 Score =  266 bits (679), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 166/452 (36%), Positives = 234/452 (51%), Gaps = 40/452 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + +S+L    R +DL+ ++TL+EKV  + D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   ASF       I  AVS EARA      A        GLT W
Sbjct: 82  --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +P +N+ RDPRWGR  ET GEDP++     VN V+GLQ        TD N +  K+ +C 
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ-------CTDANQKYDKIHACA 186

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ F+A  +  +D+ ET+L PFE  VKEG    VMC+YNR+ G P
Sbjct: 187 KHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDP 243

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +R EW   G +++DC +I        HK   D+ E A A  + +G DL+
Sbjct: 244 CCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDA-ESASAAAVLSGTDLE 302

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDE 400
           CG  Y     +A ++G + E DID S+K L      LG  D     ++  +    +CS E
Sbjct: 303 CGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYSVVCSAE 361

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
           +  L+ + AR+ + LL N  N LPL     +T+AV+GP+AN +V   GNY G P   ++ 
Sbjct: 362 HDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTITL 420

Query: 461 IAGFSGYA----NVTYKTGCDDVACKSNNSIF 488
           + G          + Y+ GC  V      S+F
Sbjct: 421 LEGIRSAMGENDKLIYEQGCSWVERSLIRSVF 452



 Score =  125 bits (315), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 102/326 (31%), Positives = 143/326 (43%), Gaps = 62/326 (19%)

Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
           FSG A + +     D+  K   +I       K AD  I   G+  S+E E +        
Sbjct: 574 FSGDAQLNF-----DLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFR 628

Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
             DR D+ LP  Q +LI  + +  K  VI V  S  G  IA        +AIL A YPG+
Sbjct: 629 KGDRTDIELPAVQRELIKALCDAGK-KVIFVNFS--GSPIAMEPETKYCQAILQAWYPGQ 685

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
            GG+A A+V+FG +NP GRLP+T+Y    +  LP         +     GRTY+++ G  
Sbjct: 686 SGGKAAAEVLFGDYNPAGRLPVTFYRN--IAQLP-------DFEDYNMTGRTYRYFKGDP 736

Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
           L+PFGYGLSYT F Y+ +   +TI+V               + +K   P           
Sbjct: 737 LFPFGYGLSYTTFNYDNIKLEQTIKV--------------GETAKMVIP----------- 771

Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
                    N G+ DG +VV VY K   +      K +  F+RV + AG+   ++     
Sbjct: 772 -------VTNTGNRDGEEVVQVYLKKQEDTEGP-TKTLRAFKRVQIPAGKTVNVELELTP 823

Query: 754 CKSLNIVDYAANTLLP-AGEHTIFVG 778
            K L   D   NT+   AG   I VG
Sbjct: 824 -KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|212692496|ref|ZP_03300624.1| hypothetical protein BACDOR_01992 [Bacteroides dorei DSM 17855]
 gi|212664971|gb|EEB25543.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           dorei DSM 17855]
          Length = 864

 Score =  266 bits (679), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 166/452 (36%), Positives = 234/452 (51%), Gaps = 40/452 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + +S+L    R +DL+ ++TL+EKV  + D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   ASF       I  AVS EARA      A        GLT W
Sbjct: 82  --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +P +N+ RDPRWGR  ET GEDP++     VN V+GLQ        TD N +  K+ +C 
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ-------CTDANQKYDKIHACA 186

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ F+A  +  +D+ ET+L PFE  VKEG    VMC+YNR+ G P
Sbjct: 187 KHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDP 243

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +R EW   G +++DC +I        HK   D+ E A A  + +G DL+
Sbjct: 244 CCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPDA-ESASAAAVLSGTDLE 302

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDE 400
           CG  Y     +A ++G + E DID S+K L      LG  D     ++  +    +CS E
Sbjct: 303 CGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYSVVCSAE 361

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
           +  L+ + AR+ + LL N  N LPL     +T+AV+GP+AN +V   GNY G P   ++ 
Sbjct: 362 HDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTITL 420

Query: 461 IAGFSGYA----NVTYKTGCDDVACKSNNSIF 488
           + G          + Y+ GC  V      S+F
Sbjct: 421 LEGIRSAMGENDKLIYEQGCSWVERSLIRSVF 452



 Score =  126 bits (316), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 104/326 (31%), Positives = 144/326 (44%), Gaps = 62/326 (19%)

Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
           FSG A + +     D+  K   +I       K AD  I   G+  S+E E +        
Sbjct: 574 FSGDAQLNF-----DLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFR 628

Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
             DR D+ LP  Q +LI  + +  K  VI V  S  G  IA        +AIL A YPG+
Sbjct: 629 KGDRTDIELPAVQRELIKALCDAGK-KVIFVNFS--GSPIAMEPETKYCQAILQAWYPGQ 685

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
            GG+A A+V+FG +NP GRLP+T+Y    +  LP         +     GRTY+++ G  
Sbjct: 686 SGGKAAAEVLFGDYNPAGRLPVTFYRN--IAQLP-------DFEDYNMTGRTYRYFKGDP 736

Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
           L+PFGYGLSYT F Y+ +   +TI+V               + +K   P           
Sbjct: 737 LFPFGYGLSYTTFNYDNIKLDQTIKV--------------GETAKMVIP----------- 771

Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
                    N G+ DG +VV VY K   E A    K +  F+RV + AG+   ++     
Sbjct: 772 -------VTNAGNRDGEEVVQVYLK-KQEDAEGPAKTLRAFKRVQIPAGKTVNVELELTP 823

Query: 754 CKSLNIVDYAANTLLP-AGEHTIFVG 778
            K L   D   NT+   AG   I VG
Sbjct: 824 -KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|381200965|ref|ZP_09908097.1| beta-glucosidase [Sphingobium yanoikuyae XLDN2-5]
          Length = 774

 Score =  266 bits (679), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 219/733 (29%), Positives = 338/733 (46%), Gaps = 101/733 (13%)

Query: 78  VQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNE 137
           V  L  +A    RLG+P   +  E LHG + VG           ATSFP  I   +S++ 
Sbjct: 109 VNGLQKWAMTQTRLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDP 156

Query: 138 SLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVN 197
           ++ +++ Q ++ E RA     R      SP +++ARDPRWGRI ET GEDP++VG   V 
Sbjct: 157 AMLRQVNQVIAREIRA-----RGVPMVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVA 211

Query: 198 YVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETF 257
            V GLQ V G       N     V +  KH   +      G +     A V+E+++ E F
Sbjct: 212 AVEGLQGV-GRSRTLQSN----HVFATLKHLTGHGQPE-SGTN--IGPAPVSERELRENF 263

Query: 258 LRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV 317
             PFE  VK     +VM SYN ++G+PS A+  LL   +R EW   G +V+D  ++  ++
Sbjct: 264 FPPFEQVVKRTGIEAVMASYNEIDGVPSHANRWLLENILREEWGFRGAVVSDYSAVDQLM 323

Query: 318 DNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT-GNAVQQGKVKETDIDKSLKYLYTVL 376
             H  +A + E+A  + L AG+D D  +  +  T G  V++GKV E  +D +++ +  + 
Sbjct: 324 SIH-HIAANLEEAAMRALDAGVDADLPEGLSYATLGKLVREGKVSEAKVDLAVRRMLELK 382

Query: 377 MRLGFFDGSPQYVSLGKQDICSDENIE-LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAV 435
            R G F+ +P   +     I ++E+   LA  AA+  I LLKND   LPL      T+AV
Sbjct: 383 FRAGLFE-NPYADANAAAAITNNEDARALARTAAQRSITLLKND-GMLPLKPE--GTIAV 438

Query: 436 VGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCD---------DVACK 482
           +GP  +A VA +G Y G P   +S + G        AN+ +  G           D   K
Sbjct: 439 IGP--SAAVARLGGYYGQPPHSVSILEGIKARVGTKANIVFAQGVKITEDDDWWADSVTK 496

Query: 483 SNNS-----IFAASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQTQLIN 531
           S+ +     I  A EAA+  D  I+  G       E        DR  L L   Q +L +
Sbjct: 497 SDPAENRKLIAQAVEAARNVDRIILTLGDTEQSSREGWADNHLGDRPSLDLVSEQQELFD 556

Query: 532 QVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGG 591
            +  + K P+ +V+++  G   +  + +    AIL   Y GE+GG A+AD++FG  NPGG
Sbjct: 557 ALKALGK-PITVVLIN--GRPASTVKVSEQANAILEGWYLGEQGGNAVADILFGDVNPGG 613

Query: 592 RLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLL 651
           +LP+T         LPL    ++P        R Y F     LYPFG+GLSYT F  +  
Sbjct: 614 KLPVTVPRS--AGQLPLF-YNMKPSAR-----RGYLFDTTDPLYPFGFGLSYTSFSLS-- 663

Query: 652 SFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSD 711
                                         P +    +         VD +N G+ +G +
Sbjct: 664 -----------------------------APRLSATKIGTGGKTSVSVDVRNTGAREGDE 694

Query: 712 VVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAG 771
           VV +Y +         +K++ GFQRV ++ G ++ + F     ++L + +   + ++  G
Sbjct: 695 VVQLYIRDKVSSVTRPVKELKGFQRVTLKPGESRTVTFTV-GPEALQMWNDQMHRVVEPG 753

Query: 772 EHTIFVGNGGVSF 784
           +  I  GN  V+ 
Sbjct: 754 DFEIMTGNSSVAL 766


>gi|393784569|ref|ZP_10372732.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
           CL02T12C01]
 gi|392665550|gb|EIY59074.1| hypothetical protein HMPREF1071_03600 [Bacteroides salyersiae
           CL02T12C01]
          Length = 929

 Score =  265 bits (678), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 149/415 (35%), Positives = 230/415 (55%), Gaps = 30/415 (7%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           F D SL +  R K+LVS +TL+EK+ Q+G     +PRL +  Y +W+EA+HGV+  G   
Sbjct: 42  FQDESLSFHERAKNLVSLLTLEEKINQVGHQTLAIPRLNIKGYNYWNEAIHGVARSGL-- 99

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVAR 173
                   ATSFP     +++++  L      A S EAR   N    GL YW P IN++R
Sbjct: 100 --------ATSFPVSKAMSSTWDLPLIFDCAVATSDEARVYSNTKDKGLIYWCPTINMSR 151

Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
           DPRWGR  E  GEDPF+ G+ AV Y++G+Q  +          +  K  +  KH+AA + 
Sbjct: 152 DPRWGRDEENYGEDPFLTGKIAVEYIKGMQGDD---------PKYYKTIATAKHFAANNY 202

Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
           +  +       DAR    ++ E +L  FEM VKEG+  SVM +YN +NGIP  A+ +LL 
Sbjct: 203 EKGRHSTSSDMDAR----NLREYYLPAFEMAVKEGNVRSVMSAYNALNGIPCGANHELLI 258

Query: 294 QTVRGEWDLHGYIVADCDSI-QVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
             +R EW  +G++ +DC ++  V   N     ++  +A A ++  G DL+CG  + ++  
Sbjct: 259 DILRTEWGFNGFVTSDCGAVDDVYQSNRHHFVNTAAEASAVSIVNGEDLNCGNTFQDYCK 318

Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAR 410
            A+++G ++E D+D +L  ++     +G FD +    + S+    +  +E+ +LA +AA+
Sbjct: 319 EAIEKGYMQEADLDTALVRVFEARFSVGEFDNASNVPWRSISDDVLDCEEHRQLAYKAAQ 378

Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
           E IVLLKND N LPL+  K K+VAV+GP  N     +G Y+G P    +P  G +
Sbjct: 379 EAIVLLKNDNNILPLD--KTKSVAVIGPFGNTIT--LGGYSGSPTALTTPFGGIA 429



 Score =  119 bits (298), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 88/277 (31%), Positives = 131/277 (47%), Gaps = 50/277 (18%)

Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVA 534
           GC  V   +  ++  A E A  AD  I  AG DL+V  ES DR +L LPG Q +L+  V 
Sbjct: 592 GCA-VTGTAETNLERAKEIAAKADVVIFAAGTDLTVSDESHDRTNLNLPGDQQKLLEAVY 650

Query: 535 EVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLP 594
             A   VIL++ +   V I +A+ +  + AI+ A Y G+  G+AIADV++G +NP G+L 
Sbjct: 651 S-ANPNVILLLQTCSSVTINWAKEH--VPAIIEAWYGGQAQGKAIADVLYGDYNPSGKLT 707

Query: 595 ITWYNGDYVQMLPLTSMPLRPVDSLGYPGR----TYKFYNGPTLYPFGYGLSYTQFKYNL 650
            TWYN          ++   P   L Y  R    TY +++   LYPFGYG+SYT F+Y  
Sbjct: 708 STWYN----------ALSDLPNGMLNYDIRDAKYTYMYHDKTPLYPFGYGMSYTTFEYQK 757

Query: 651 LSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGS 710
           L+ +K+                                L   +      D  N G   G+
Sbjct: 758 LNISKS-------------------------------RLAAGEELIVSADITNTGKYAGA 786

Query: 711 DVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
           ++V +Y+   + I    +KQ++GF RV +  G  K +
Sbjct: 787 EIVQLYAHVNSSIERP-LKQLVGFARVELEPGETKTV 822


>gi|383114908|ref|ZP_09935668.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
 gi|382948422|gb|EIC71783.1| hypothetical protein BSGG_5166 [Bacteroides sp. D2]
          Length = 782

 Score =  265 bits (678), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 215/726 (29%), Positives = 341/726 (46%), Gaps = 124/726 (17%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    EA HG   +G           AT FPT I   A+++  L K++GQ ++ 
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R+     + G   + P +++ RDPRW R+ ET GEDP + G    + V GL       
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGL------- 224

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
              +L+ +   +++  KH+ AY V        Y   A V  +D+ + FL PF   +  G 
Sbjct: 225 GGGNLSQKYATIATL-KHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDSG- 279

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           A SVM SYN ++GIP  ++  LL Q +R EW   G++V+D  SI+ + ++H F+A +KE+
Sbjct: 280 ALSVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FVALTKEN 338

Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
           A  Q++ AG+D+D G   YTN   +AVQ G++ +  ID ++  +  +   +G F+     
Sbjct: 339 AAIQSVTAGVDVDLGGDAYTNLC-HAVQSGQMDKAVIDTAVCRVLRMKFEMGLFEHPYVD 397

Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
             +  + +   E+IELA + A+  I LLKN+ + LPL S  +  VAV+GP+A+    M+G
Sbjct: 398 PKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPL-SKTINKVAVIGPNADNRYNMLG 456

Query: 449 NYA-------------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAK 495
           +Y              GI  + +SP       + V Y  GC  +   + N I  A EAA+
Sbjct: 457 DYTAPQEDSNVKTVLDGIITK-LSP-------SRVEYVRGCA-IRDTTVNEIEQAIEAAR 507

Query: 496 TAD----------------------ATIILAGLDLSVE-AESLDREDLWLPGYQTQLINQ 532
            ++                      A +   G    +E  E  DR  L L G Q +L+  
Sbjct: 508 RSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLES 567

Query: 533 VAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGR 592
           + +  K P+I+V +    ++  +A    +  A+L A YPG+EGG AIADV+FG +NP GR
Sbjct: 568 LQKTGK-PLIVVYIEGRPLEKNWASEYAD--ALLTAYYPGQEGGNAIADVLFGDYNPSGR 624

Query: 593 LPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS 652
           LPI+      V  +P+      P +        Y   +   LY FGYG+SYT F+Y+ L 
Sbjct: 625 LPISVPRS--VGQIPVYYNKKAPRN------HDYVEVSSSPLYSFGYGMSYTTFEYSALQ 676

Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDV 712
                                          V+    RC   FE     +N G  DG +V
Sbjct: 677 -------------------------------VVQKSARC---FEVSFKVKNTGKYDGEEV 702

Query: 713 VIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
             +Y +         +KQ+  F+R  ++ G  K++ FV    +   +V+Y    ++ +G 
Sbjct: 703 SQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGN 761

Query: 773 HTIFVG 778
             + +G
Sbjct: 762 FHLMIG 767


>gi|427384392|ref|ZP_18880897.1| hypothetical protein HMPREF9447_01930 [Bacteroides oleiciplenus YIT
           12058]
 gi|425727653|gb|EKU90512.1| hypothetical protein HMPREF9447_01930 [Bacteroides oleiciplenus YIT
           12058]
          Length = 954

 Score =  265 bits (678), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 226/760 (29%), Positives = 357/760 (46%), Gaps = 111/760 (14%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHG 105
           + ++  + D +LP   RV+ L+S MT ++K++ +  G    G+P L +P      EA+HG
Sbjct: 164 EKTALRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHG 222

Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
            S             GAT FP  +   A++N+ L ++I  AV  E      L    +  W
Sbjct: 223 FSYGS----------GATIFPQALAMGATWNKKLTEEIAMAVGDE-----TLAAGTMQAW 267

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SP ++VA+D RWGR  ET GEDP +V +    +++G Q       +  L + P       
Sbjct: 268 SPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ-------SKGLFTTP------- 313

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+  +      G D +  D  ++E++M E  L PF   ++  D  S+M +Y+   G+P 
Sbjct: 314 KHFGGHGAP-LGGRDSH--DIGLSEREMREVHLVPFRHVIRNYDCQSLMMAYSDFLGVPV 370

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
               +LL+  +R EW   G+IV+DC +I  +     + A  K +A  Q L AG+  +CG 
Sbjct: 371 AKSKELLHNILREEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGD 430

Query: 346 YYTNF-TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDE 400
            Y +     A + G++   ++D   + +  ++ R   F+ +P    L    I     SD 
Sbjct: 431 TYNDKEVIQAAKDGRLNMENLDNVCRTMLRMMFRNELFEKAPNK-PLDWNKIYPGWNSDN 489

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYM 458
           + E+A +AARE IV+L+N +N LPL+   ++++AV+GP A+      G+Y    +P +  
Sbjct: 490 HKEMARQAARESIVMLENKENILPLDKG-IRSIAVLGPGADDLQP--GDYTPKLLPGQLK 546

Query: 459 SPIAGFS----GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-- 512
           S + G          V Y+ GCD       N I  A +AA  +D  +++ G   + EA  
Sbjct: 547 SVLTGIKQAVGKQTKVIYEQGCDFTNLSETN-IPKAVKAASQSDVVVMVLGDCSTSEATT 605

Query: 513 -------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
                  E+ D   L LPG Q +L+  V    K PVILV+ +  G      + +   KAI
Sbjct: 606 DVYKTSGENHDYATLILPGKQQELLEAVCATGK-PVILVLQA--GRPYNLTKASKLCKAI 662

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           +    PG+EGG A ADV+FG +NP GRLP+T+    +V  LPL         +    GR 
Sbjct: 663 IVNWLPGQEGGPATADVLFGDYNPAGRLPMTF--PQHVGQLPLYY-------NFKTSGRR 713

Query: 626 YKFYNGP--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
           Y++ +     LY FGYGLSYT F+Y+ L           K+Q   N N T  A+      
Sbjct: 714 YEYSDLEYYPLYYFGYGLSYTSFEYSGL-----------KVQEKDNGNITVQAT------ 756

Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
                             +NVG   G +VV +Y         T I ++  F R+ ++ G 
Sbjct: 757 -----------------VKNVGQRAGDEVVQLYVTDMYASVKTRITELKDFTRINLKPGE 799

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
           +K + F       L++++   + ++  GE  I V  GGVS
Sbjct: 800 SKTVSFELTPY-DLSLLNDHMDRVVEKGEFKILV--GGVS 836


>gi|375149998|ref|YP_005012439.1| Beta-glucosidase [Niastella koreensis GR20-10]
 gi|361064044|gb|AEW03036.1| Beta-glucosidase [Niastella koreensis GR20-10]
          Length = 875

 Score =  265 bits (678), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 162/456 (35%), Positives = 235/456 (51%), Gaps = 36/456 (7%)

Query: 45  LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
           L  Q S F F +  L +  RV DLVSR+TL+EKV Q+ + A G+PRL +P Y+WW+E LH
Sbjct: 21  LQAQNSKFPFQNYRLSFEDRVNDLVSRLTLEEKVAQMLNAAPGIPRLDIPAYDWWNETLH 80

Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA---- 160
           GV+     T ++      T FP  I   A+++ +   ++    + E R ++N   A    
Sbjct: 81  GVAR----TPYN-----VTVFPQAIAMAATWDTAALYRMADCSALEGRVIHNKAIAAGKE 131

Query: 161 -----GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLN 215
                GLTYW+PNIN+ RDPRWGR  ET GEDP++    A  +VRGLQ   G++      
Sbjct: 132 KDRYLGLTYWTPNINIFRDPRWGRGQETYGEDPYLTAALADAFVRGLQ---GND------ 182

Query: 216 SRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMC 275
            + LK ++C KHYA   V +     R+ FD  VT  D+ +T+L  F+  V   + + VMC
Sbjct: 183 PKYLKAAACAKHYA---VHSGPEPSRHVFDVDVTPYDLWDTYLPSFKKLVTVSNVAGVMC 239

Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTL 335
           +YN     P CA   L+   +R +W   GY+ +DC +I     NHK   D+   +     
Sbjct: 240 AYNAFRKQPCCASDVLMTDILRNQWSFKGYVTSDCGAIDDFYRNHKTHPDAAAASADAVF 299

Query: 336 KAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD 395
             G D+DCG         AV++ K+ E  ID S+K L+ +  RLG FD  P  V   +  
Sbjct: 300 H-GTDIDCGNEAYRALVQAVKENKITEKQIDISVKRLFMIRFRLGMFD-PPSMVKYAQTP 357

Query: 396 ICSDENIELAAEA---AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
               E+   A  A   A E IVLLKN  NTLPL    +K + V+GP+A   +A +GNY+G
Sbjct: 358 ATELESAAHAKHALLMAHESIVLLKNANNTLPLKKG-LKKIVVLGPNATNVIAPLGNYSG 416

Query: 453 IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIF 488
            P + ++   G    A    +   +     +NN++ 
Sbjct: 417 TPSKLITLFQGIKEKAGAATQVVYEKAVNYTNNNVL 452



 Score =  114 bits (284), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 89/292 (30%), Positives = 128/292 (43%), Gaps = 55/292 (18%)

Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
           ADA I   G+   +E E +          DR  + LP  QT+L+  +    K PV+ V+M
Sbjct: 607 ADAFIFAGGISPQLEGEEMKVSDPGFKGGDRTTILLPAIQTELMKALQASGK-PVVFVMM 665

Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           +  G  +A    + NI AI+ A Y G+  G A+ADV+FG +NP GRLP+T+Y  D     
Sbjct: 666 T--GSALATPWESENIPAIVNAWYGGQAAGTALADVLFGDYNPSGRLPVTFYGSD----- 718

Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQH 666
                 L   +      RTY+++ G  LY FGYGLSYT F+Y+ L+   T        Q+
Sbjct: 719 ----NDLPSFEDYSMKNRTYRYFTGKPLYGFGYGLSYTTFRYDQLTMPVTA-------QN 767

Query: 667 CRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAAT 726
            + +  T                         V   N G T G +V  +Y         T
Sbjct: 768 GKPVKVT-------------------------VRVTNTGKTTGDEVAQIYVVNENTSIQT 802

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
            +K + GFQR+ +R   +K + FV  +   L  VD         G+  I VG
Sbjct: 803 ALKTLKGFQRISLRPAESKMVSFVLQS-DDLTYVDADGQRKPLTGKIQICVG 853


>gi|255530706|ref|YP_003091078.1| glycoside hydrolase family protein [Pedobacter heparinus DSM 2366]
 gi|255343690|gb|ACU03016.1| glycoside hydrolase family 3 domain protein [Pedobacter heparinus
           DSM 2366]
          Length = 801

 Score =  265 bits (678), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 232/823 (28%), Positives = 356/823 (43%), Gaps = 150/823 (18%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-FAHG-VPRLGLPQYEW----WSEAL--- 103
           +F D S P   RVKDL+ +M LDEK  Q    + +G V +  +P  EW    W + +   
Sbjct: 41  VFEDPSRPVDARVKDLLGQMNLDEKTCQTATLYGYGRVLKDEMPTAEWKTSIWKDGIANI 100

Query: 104 -------------------------HGVSNVG-----------PGTHFDDVIPG-----A 122
                                    H ++ V            P    ++ I G     A
Sbjct: 101 DEELNSLPYNKKAVTQYSFPFSKHAHAINTVQKWFVEETRLGIPVDFSNEGIHGLCHDRA 160

Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE 182
           T FP  +   +++N+S+  + G  V  EA+A   LG   +  ++P ++VARD RWGR+ E
Sbjct: 161 TPFPAPVNIGSTWNKSIVYQAGSIVGREAKA---LGYTNV--YAPILDVARDQRWGRVVE 215

Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY 242
              EDPF++         G+QD +G              ++  KHYA Y V       + 
Sbjct: 216 CYAEDPFLIAELGKQMTMGIQD-QG-------------TAATLKHYAVYSVPKGGRDGQA 261

Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
             D  V  ++M E FL PF   ++E     +M SYN  NG P       L + +R ++  
Sbjct: 262 RTDPHVAPREMHEMFLYPFRRVIQEAKPMGIMSSYNDWNGEPVTGSYYFLTELLRKQYGF 321

Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG---------N 353
            GY+V+D ++++ +   H    D K+ AV Q ++AGL++      T+FT           
Sbjct: 322 DGYVVSDSEAVEFISGKHHVAEDYKQ-AVKQAIEAGLNV-----RTHFTKPENFILPLRE 375

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV---SLGKQDICSDENIELAAEAAR 410
            V++G V    +D+ +  +  V  RLG FD    YV   +   + + +  + ELA +  R
Sbjct: 376 LVKEGSVSMKTLDERVADVLRVKFRLGLFDDP--YVKDPAAADKKVHTRADEELAVQLNR 433

Query: 411 EGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA-- 468
           E +VLLKND+N LPL+ AK K + V GP A         Y       +S + G   YA  
Sbjct: 434 ESMVLLKNDKNLLPLDIAKYKRILVSGPLATEINYTTSRYGPSNNPIVSILDGIKAYAGK 493

Query: 469 --NVTYKTGCDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEA 512
              + Y  GC+ +  K   S              I  A  AAK +D  I + G       
Sbjct: 494 NSTIAYSKGCEVIDAKWPESEIIPVELTTEEQLQIDQAVAAAKASDVIIAVVGETDEQVG 553

Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
           ES  R  L LPG Q  L+  +    K PV++V+++   + I +   N  + AIL AG+PG
Sbjct: 554 ESKSRTGLNLPGRQLMLLQALHATGK-PVVMVMVNGRPLTINWE--NRYLPAILQAGFPG 610

Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
              G+ +A+ +FG  NPGG+L +T+     +  + L + P +P    G         NG 
Sbjct: 611 PSAGKVVAETLFGDNNPGGKLTMTYPKS--IGQIEL-NFPFKPGSQAGQGKNDDPNGNGK 667

Query: 633 T-----LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
           T     LYPFGYGLSYT F+++ L        NL+K    + ++  +D            
Sbjct: 668 TRVLGALYPFGYGLSYTTFEFSNL--------NLDK----KEIHNQADV----------- 704

Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
                   +  VD +N G   G +VV +Y K       TY   + GF+RV +  G  K +
Sbjct: 705 --------QVSVDVKNTGQRKGDEVVQLYLKDVVSSVTTYESVLRGFERVSLAPGETKTL 756

Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNF 790
           KF  +    L I+D   N  +  G+  + +GN      +   F
Sbjct: 757 KFTLHP-DDLAILDKNMNRTVEPGKFIVMIGNSSEDIKLKKEF 798


>gi|404405497|ref|ZP_10997081.1| glycoside hydrolase family protein [Alistipes sp. JC136]
          Length = 804

 Score =  265 bits (678), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 211/726 (29%), Positives = 325/726 (44%), Gaps = 118/726 (16%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P  E+ +E +HG+++             AT  P  I   +++N +L  + G+    
Sbjct: 145 RLGIP-VEFTNEGIHGLNH-----------SRATPLPAPIAIGSTWNRALVHRAGEIAGH 192

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           EAR    LG   +  ++P ++VARDPRWGR+ E  GEDPF++    V  VRG+Q      
Sbjct: 193 EARV---LGYKNV--YAPILDVARDPRWGRVVECYGEDPFLIAELGVEMVRGIQS----- 242

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
                      V+S  KHYAAY V           D  +  +++ + +L PF   ++E  
Sbjct: 243 ---------QGVASTLKHYAAYSVPKGGRDGNCRTDPHIAPRELHQMYLYPFRRVIRESG 293

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
              VM SYN  +G+P  A    L   +R E+   GY+V+D ++++ +   H  +A++ ED
Sbjct: 294 PMGVMSSYNDWDGVPVTASRYFLTDLLRHEYGFDGYVVSDSEAVEYVHTKHA-VAETYED 352

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNA---------VQQGKVKETDIDKSLKYLYTVLMRLG 380
           AV Q L+AGL++      TNF+  A         V++G++    +D+ ++ +  V  RLG
Sbjct: 353 AVRQVLEAGLNV-----RTNFSPPARFILPVRKLVREGRLSMEVVDQRVREVLRVKFRLG 407

Query: 381 FFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
            FD           +  +D++ +   +  R+ +VLLKN+  TLPL+  K   V V GP A
Sbjct: 408 LFDNPYNDPREAVAEAGADKHRDFVLDIQRQSLVLLKNEDKTLPLDKKKTARVLVAGPLA 467

Query: 441 NATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCDDVACKSNNSIFAAS----- 491
           +    MI  Y       ++ + G   Y    A V Y  GCD V     +S   A+     
Sbjct: 468 DEDNFMISRYGPNDLPTVTVLDGIRNYLGDGAEVRYAKGCDVVDAGFPDSELTATPLTAA 527

Query: 492 ------EAAKTA---DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVI 542
                 EA K A   D  + + G D     ES  R  L LPG Q QL+  +      PV+
Sbjct: 528 ERAGINEAVKQAAGCDVIVAVLGEDDERVGESHSRTSLELPGRQQQLLEALHATGV-PVV 586

Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
           LV+++   + + +A    N+ AIL   +P  EGG AIA+ +FG +NPGG+L IT+     
Sbjct: 587 LVLINGQPLTVNWAA--QNVPAILEGWFPSVEGGTAIAETLFGDYNPGGKLTITF----- 639

Query: 603 VQMLPLTSMPLR---PVDSLGYPGRTYKFYNG-------PTLYPFGYGLSYTQFKYNLLS 652
               P ++  +    P     +  +  K  NG        ++YPFGYGLSYT F Y    
Sbjct: 640 ----PRSTGQIELNFPYKKGSHGAQPRKGPNGGGVTRVLGSIYPFGYGLSYTTFAY---- 691

Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDV 712
                          +NL    + S+T+              F    +  N G   G +V
Sbjct: 692 ---------------KNLRIAPEPSRTQGS------------FRVSCEVTNTGDRRGDEV 724

Query: 713 VIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
           V +Y         TY   + GF+RV +  G  K + F       L ++D   N  +  GE
Sbjct: 725 VQLYISDKFSSVVTYESVLRGFERVTLEPGETKTVSFEVTPSH-LELLDSNMNWTVEPGE 783

Query: 773 HTIFVG 778
             I +G
Sbjct: 784 FEIRIG 789


>gi|160886913|ref|ZP_02067916.1| hypothetical protein BACOVA_04927 [Bacteroides ovatus ATCC 8483]
 gi|423288977|ref|ZP_17267828.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
           CL02T12C04]
 gi|423294866|ref|ZP_17272993.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
           CL03T12C18]
 gi|156107324|gb|EDO09069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
 gi|392668741|gb|EIY62235.1| hypothetical protein HMPREF1069_02871 [Bacteroides ovatus
           CL02T12C04]
 gi|392676057|gb|EIY69498.1| hypothetical protein HMPREF1070_01658 [Bacteroides ovatus
           CL03T12C18]
          Length = 863

 Score =  265 bits (677), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 164/458 (35%), Positives = 243/458 (53%), Gaps = 45/458 (9%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           Q S + + D+ L    R  DL+ R+TL+EKV  + + +  +PRLG+  YEWW+EALHGV+
Sbjct: 22  QPSKYPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVA 81

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA---MYNLG-----R 159
             G           AT FP  I   ASFN+ L  ++  AVS EARA    +N        
Sbjct: 82  RAGL----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRY 131

Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT W+PN+N+ RDPRWGR  ET GEDP++ GR  +  VRGLQ  E  E          
Sbjct: 132 QGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD-------- 183

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
           K+ +C KH+A +    W   +R+ F+A  +  +D+ ET+L  F+  V++     VMC+YN
Sbjct: 184 KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYN 240

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI-----QVMVDNHKFLADSKEDAVAQ 333
           R  G P C   +LL Q +R +W   G +V DC +I     +   + H   A +  DAV  
Sbjct: 241 RFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVL- 299

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK 393
              +G DL+CG  + + T +AV++G + E  I+ S+K L      LG  + +  + ++  
Sbjct: 300 ---SGTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNSTHPWSNIPF 355

Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
             I   ++ ELA + A E +VLL+N+ N LPLN  +   VAV+GP+AN +V   GNY G 
Sbjct: 356 SVIDCPKHKELALKMAHESLVLLQNNNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGF 413

Query: 454 PCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
           P   ++ + G       A + Y+  C      + +S+F
Sbjct: 414 PSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLF 451



 Score =  119 bits (297), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 94/296 (31%), Positives = 131/296 (44%), Gaps = 56/296 (18%)

Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
           ++AD  I   G+   +E ES+          DR ++ LP  Q +++   A + K     V
Sbjct: 598 QSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVL---ALLKKNGKKTV 654

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
            ++  G  +A      N  AIL A YPG+ GG A+ADV+FG +NP GRLPIT+Y    +Q
Sbjct: 655 FVNFSGSAMAIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS--MQ 712

Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
            LP         +     GRTY+F     LYPFGYGLSYT+F Y   +      +N +KL
Sbjct: 713 QLP-------DYEDYSMKGRTYRFMTETPLYPFGYGLSYTRFSYGKAT------LNQSKL 759

Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
                   T                         +   NVG  DG +VV VY   P +  
Sbjct: 760 TKGEKAILT-------------------------IPVSNVGQRDGEEVVQVYICRPDDKE 794

Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA-GEHTIFVGN 779
               K + GFQRV +  G+ + ++       S    D A NT+ P  G + I  GN
Sbjct: 795 GPQ-KTLRGFQRVSIAKGKTQNVQIEL-PYDSFEWFDAATNTIRPLNGTYKILYGN 848


>gi|224535242|ref|ZP_03675781.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224523140|gb|EEF92245.1| hypothetical protein BACCELL_00103 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 864

 Score =  265 bits (677), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 163/450 (36%), Positives = 242/450 (53%), Gaps = 38/450 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + +  L  S R  DL+ RMTL+EKV Q+ + +  + RLG+P Y+WW+EALHGV+  G   
Sbjct: 24  YKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEALHGVARAGK-- 81

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
                   AT FP  I   A+F+     +    VS EARA Y+         G  GLT+W
Sbjct: 82  --------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERDGYKGLTFW 133

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN+ RDPRWGR  ET GEDP++     +  V+GLQ   G     D      K  +C 
Sbjct: 134 TPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQG--GGTGKYD------KAHACA 185

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KHYA +    W   +R+ FDA+ ++++D+ ET+L  F+  VKEG    VMC+YNR  G P
Sbjct: 186 KHYAVHSGPEW---NRHSFDAKNISQRDLWETYLSAFKTLVKEGKVKEVMCAYNRFEGEP 242

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSI-QVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
            C++ +LL + +R +W     +V+DC +I      NH     +   A A  + +G DL+C
Sbjct: 243 CCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSGTDLEC 302

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDEN 401
           G  Y++    AV++G + E  I++S+  L     +LG FD      +  +    + S E+
Sbjct: 303 GGSYSSLN-EAVRKGLISEEKINESVFRLLRARFQLGMFDDDALVSWSEIPYSVVESKEH 361

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
           +  A E AR+ +VLL N  +TLPL S  ++ VAV+GP+AN +V +  NY G P + ++ +
Sbjct: 362 VTKALEMARKSMVLLTNKNHTLPL-SKSIRKVAVLGPNANDSVMLWANYNGFPTKSVTIL 420

Query: 462 AGFSGY---ANVTYKTGCDDVACKSNNSIF 488
            G         V Y+ GCD V  ++  S F
Sbjct: 421 EGIKSKLPEGTVYYEKGCDYVNTQTVFSYF 450



 Score =  124 bits (311), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 85/286 (29%), Positives = 132/286 (46%), Gaps = 54/286 (18%)

Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
           D+  K   +    ++ A  ADA I + GL  ++E E +          DR ++ LP  Q 
Sbjct: 581 DIGIKKEINYKEVADKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQA 640

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
           +++  + +  K PVI V+ S  G  +A      N+ AIL A YPG++GG A+ADV+FG +
Sbjct: 641 EMLKALKKTGK-PVIFVLCS--GSTLALPWEAENLDAILEAWYPGQQGGTAVADVLFGDY 697

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
           NP GRLP+T+Y          +S  L   +      RTY+++ G  L+PFG+GLSYT F 
Sbjct: 698 NPAGRLPLTFY---------ASSNDLPDFEDYDMSNRTYRYFKGKALFPFGHGLSYTIFD 748

Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
           Y      K                                ++R  +     +  +N G  
Sbjct: 749 YGKAKVDK-------------------------------QNVRAGEGMTLTIPLKNTGKL 777

Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
           DG +V+ VY + PA+     IK +  F+RV + AG+ + I+    A
Sbjct: 778 DGDEVIQVYLRNPADKEGP-IKTLRAFRRVSLPAGQTENIRIELPA 822


>gi|336415919|ref|ZP_08596257.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
           3_8_47FAA]
 gi|335939822|gb|EGN01694.1| hypothetical protein HMPREF1017_03365 [Bacteroides ovatus
           3_8_47FAA]
          Length = 782

 Score =  265 bits (677), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 215/726 (29%), Positives = 341/726 (46%), Gaps = 124/726 (17%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    EA HG   +G           AT FPT I   A+++  L K++GQ ++ 
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R+     + G   + P +++ RDPRW R+ ET GEDP + G    + V GL       
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGL------- 224

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
              +L+ +   +++  KH+ AY V        Y   A V  +D+ + FL PF   +  G 
Sbjct: 225 GGGNLSQKYATIATL-KHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDAG- 279

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           A SVM SYN ++GIP  ++  LL Q +R EW   G++V+D  SI+ + ++H F+A +KE+
Sbjct: 280 ALSVMTSYNSIDGIPCTSNHNLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FVAPTKEN 338

Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
           A  Q++ AG+D+D G   YTN   +AVQ G++ +  ID ++  +  +   +G F+     
Sbjct: 339 AAIQSVTAGVDVDLGGDAYTNLC-HAVQSGQMDKAVIDTAVCRVLRMKFEMGLFEHPYVD 397

Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
             +  + +   E+IELA + A+  I LLKN+ + LPL S  +  VAV+GP+A+    M+G
Sbjct: 398 PKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPL-SKTINKVAVIGPNADNRYNMLG 456

Query: 449 NYA-------------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAK 495
           +Y              GI  + +SP       + V Y  GC  +   + N I  A EAA+
Sbjct: 457 DYTAPQEDSNVKTVLDGIITK-LSP-------SRVEYVRGCA-IRDTTVNEIEQAIEAAR 507

Query: 496 TAD----------------------ATIILAGLDLSVE-AESLDREDLWLPGYQTQLINQ 532
            ++                      A +   G    +E  E  DR  L L G Q +L+  
Sbjct: 508 RSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLES 567

Query: 533 VAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGR 592
           + +  K P+I+V +    ++  +A    +  A+L A YPG+EGG AIADV+FG +NP GR
Sbjct: 568 LQKTGK-PLIVVYIEGRPLEKNWASEYAD--ALLTAYYPGQEGGNAIADVLFGDYNPSGR 624

Query: 593 LPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS 652
           LPI+      V  +P+      P +        Y   +   LY FGYG+SYT F+Y+ L 
Sbjct: 625 LPISVPRS--VGQIPVYYNQKAPRN------HDYVEVSSSPLYSFGYGMSYTTFEYSDLQ 676

Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDV 712
                                          V+    RC   FE     +N G  DG +V
Sbjct: 677 -------------------------------VVQKSARC---FEVSFKVKNTGKYDGEEV 702

Query: 713 VIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
             +Y +         +KQ+  F+R  ++ G  K++ FV    +   +V+Y    ++ +G 
Sbjct: 703 SQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGN 761

Query: 773 HTIFVG 778
             + +G
Sbjct: 762 FHLMIG 767


>gi|449527525|ref|XP_004170761.1| PREDICTED: beta-D-xylosidase 1-like [Cucumis sativus]
          Length = 241

 Score =  265 bits (677), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 125/195 (64%), Positives = 145/195 (74%), Gaps = 8/195 (4%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           FC  SL    RVKDL+ R+TL EK++ L + A  VPRLG+  YEWWSEALHGVSNVGPGT
Sbjct: 46  FCQESLGIEERVKDLIGRLTLGEKIRLLVNNAIAVPRLGIRGYEWWSEALHGVSNVGPGT 105

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVAR 173
            F    PGATSFP VI T ASFN+SLW  IG+ VS EARAMYN G AGLTYWSPN+N+ R
Sbjct: 106 KFGGTFPGATSFPQVITTAASFNQSLWLLIGRVVSDEARAMYNGGTAGLTYWSPNVNIFR 165

Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
           DPRWGR  ETPGEDP +  +YA NYV+GLQ  +G         + LKV++CCKHY AYD+
Sbjct: 166 DPRWGRGQETPGEDPILAAKYAANYVQGLQGNDG--------KKRLKVAACCKHYTAYDL 217

Query: 234 DNWKGVDRYHFDARV 248
           DNW GVDRYHF+A+V
Sbjct: 218 DNWNGVDRYHFNAKV 232


>gi|393781488|ref|ZP_10369683.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676551|gb|EIY69983.1| hypothetical protein HMPREF1071_00551 [Bacteroides salyersiae
           CL02T12C01]
          Length = 850

 Score =  265 bits (677), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 168/459 (36%), Positives = 246/459 (53%), Gaps = 41/459 (8%)

Query: 45  LGLQMSSFL-FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEAL 103
           LG  +S+ L + +  L    R  DL+ R+T++EK+  + + + G+PRLG+  YEWW+EAL
Sbjct: 6   LGTTLSAQLPYQNPDLTPEQRATDLLQRLTVEEKISLMQNNSPGIPRLGIRPYEWWNEAL 65

Query: 104 HGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA----MYNLGR 159
           HGV+  G           AT FP  I   ASFN+SL +K+  AVS EARA      + G+
Sbjct: 66  HGVARAGL----------ATVFPQTIGMAASFNDSLVQKVFTAVSDEARAKNRAFNDQGQ 115

Query: 160 ----AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLN 215
                GLT W+PN+N+ RDPRWGR  ET GEDP++  R  V  V+GLQ  +        +
Sbjct: 116 YKRYQGLTMWTPNVNIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPD--------S 167

Query: 216 SRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVM 274
           +R  K+ +C KH+A +    W   +R+ F+A  +  +D+ ET+L  F+  V+E D   VM
Sbjct: 168 ARYDKLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKTLVQEADVKEVM 224

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM--VDNHKFLADSKEDAVA 332
           C+YNR  G P C   +LL Q +R EW  +G +V+DC +I        H    D+   A A
Sbjct: 225 CAYNRFEGDPCCGSNRLLTQILRDEWGFNGIVVSDCGAISDFWGAKKHNTHPDAAH-ASA 283

Query: 333 QTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG 392
             + +G DL+CG  Y   T +AV+ G + E  ID S+K L      LG  + S  + +L 
Sbjct: 284 DAVLSGTDLECGSNYRKLT-DAVKAGIISEEQIDISVKRLLKARFELGEMEESHPW-ALP 341

Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
              +   E+  LA + A E + LL+N +N LPL+  K   VAV+GP+AN +V   GNY G
Sbjct: 342 YSIVDCPEHRHLALQIAHETMTLLQNKENILPLD--KHAKVAVIGPNANDSVMQWGNYNG 399

Query: 453 IPCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
            P    + ++        A + Y+  C      + NS+F
Sbjct: 400 TPSHTSTLLSALRSKLPAAQLIYEPVCGLTDDITFNSLF 438



 Score =  107 bits (268), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 86/302 (28%), Positives = 131/302 (43%), Gaps = 58/302 (19%)

Query: 489 AASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAK 538
           A  E  K  +  I   G+   +E E +          DR D+ LP  Q  ++  + +  K
Sbjct: 579 ATLEKLKDTEIVIFAGGISPLLEGEEMKVSAAGFKGGDRTDIELPAVQRNVLAALKKAGK 638

Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
             VI V  S  G  +A      N  AIL A YPG+EGG A+ADV+FG +NP GRLP+T+Y
Sbjct: 639 -KVIFVNFS--GSAMALTPETENCDAILQAWYPGQEGGTAVADVLFGDYNPAGRLPVTFY 695

Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
               ++ LP         +     GRTY++     L+PFGYGLSYT F Y          
Sbjct: 696 KN--MEQLP-------DFEDYSMQGRTYRYMKEAPLFPFGYGLSYTTFTYG--------- 737

Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSK 718
                          + A K R        +   +     +   N+GS DG +VV VY +
Sbjct: 738 --------------KARADKKR--------ISTGEKMTLTIPVSNIGSRDGEEVVQVYLR 775

Query: 719 PPAEIAATYIKQVIGFQRVFVRAGR--NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
              +      K +  F+RV +  G+  N +I+  + A +  +   +  +++   GE+ + 
Sbjct: 776 REDDPEGP-TKTLRAFKRVEITKGKSLNVKIELPYTAFEWFDNSTHTMHSM--KGEYEVL 832

Query: 777 VG 778
            G
Sbjct: 833 YG 834


>gi|383114360|ref|ZP_09935124.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
 gi|313693934|gb|EFS30769.1| hypothetical protein BSGG_1469 [Bacteroides sp. D2]
          Length = 863

 Score =  265 bits (677), Expect = 7e-68,   Method: Compositional matrix adjust.
 Identities = 164/458 (35%), Positives = 243/458 (53%), Gaps = 45/458 (9%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           Q S + + D+ L    R  DL+ R+TL+EKV  + + +  +PRLG+  YEWW+EALHGV+
Sbjct: 22  QPSKYPYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVA 81

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA---MYNLG-----R 159
             G           AT FP  I   ASFN+ L  ++  AVS EARA    +N        
Sbjct: 82  RAGL----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRY 131

Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT W+PN+N+ RDPRWGR  ET GEDP++ GR  +  VRGLQ  E  E          
Sbjct: 132 QGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD-------- 183

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
           K+ +C KH+A +    W   +R+ F+A  +  +D+ ET+L  F+  V++     VMC+YN
Sbjct: 184 KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYN 240

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI-----QVMVDNHKFLADSKEDAVAQ 333
           R  G P C   +LL Q +R +W   G +V DC +I     +   + H   A +  DAV  
Sbjct: 241 RFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVL- 299

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK 393
              +G DL+CG  + + T +AV++G + E  I+ S+K L      LG  + +  + ++  
Sbjct: 300 ---SGTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNSTHPWSNIPF 355

Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
             I   ++ ELA + A E +VLL+N+ N LPLN  +   VAV+GP+AN +V   GNY G 
Sbjct: 356 SVIDCPKHKELALKMAHESLVLLQNNNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGF 413

Query: 454 PCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
           P   ++ + G       A + Y+  C      + +S+F
Sbjct: 414 PSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLF 451



 Score =  119 bits (297), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 94/296 (31%), Positives = 131/296 (44%), Gaps = 56/296 (18%)

Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
           ++AD  I   G+   +E ES+          DR ++ LP  Q +++   A + K     V
Sbjct: 598 QSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVL---ALLKKNGKKTV 654

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
            ++  G  +A      N  AIL A YPG+ GG A+ADV+FG +NP GRLPIT+Y    +Q
Sbjct: 655 FVNFSGSAMAIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS--MQ 712

Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
            LP         +     GRTY+F     LYPFGYGLSYT+F Y   +      +N +KL
Sbjct: 713 QLP-------DYEDYSMKGRTYRFMTETPLYPFGYGLSYTRFSYGKAT------LNQSKL 759

Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
                   T                         +   NVG  DG +VV VY   P +  
Sbjct: 760 TKGEKAILT-------------------------IPVSNVGQRDGEEVVQVYICRPDDKE 794

Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA-GEHTIFVGN 779
               K + GFQRV +  G+ + ++       S    D A NT+ P  G + I  GN
Sbjct: 795 GPQ-KTLRGFQRVSIAKGKTQNVQIEL-PYDSFEWFDAATNTIRPLNGTYKILYGN 848


>gi|374374543|ref|ZP_09632202.1| Beta-glucosidase [Niabella soli DSM 19437]
 gi|373233985|gb|EHP53779.1| Beta-glucosidase [Niabella soli DSM 19437]
          Length = 799

 Score =  265 bits (677), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 213/725 (29%), Positives = 334/725 (46%), Gaps = 107/725 (14%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P  ++ +E +HG++      H       AT+FP  I   +++N+ L  ++GQ +  
Sbjct: 138 RLGIP-VDFTNEGIHGLNQ----DH-------ATAFPAPIGIGSTWNKELVHQMGQIIGR 185

Query: 150 EARAMYNLGRAGLT-YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
           EA+A+      G T  ++P ++VARD RWGR+ ET GEDPF+V         G+Q     
Sbjct: 186 EAKAL------GYTNVYAPILDVARDQRWGRVVETYGEDPFLVAGLGTALAGGIQ----- 234

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
           EN          V+S  KH+A Y V           D  V  ++M++ FL PF   ++  
Sbjct: 235 ENG---------VASTLKHFAVYSVPKGGRDGNARTDPHVAPREMQQLFLYPFRKVIQNV 285

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
               VM SYN  +G+P  A    L Q +R ++   GY+V+D  +++ + + H    D KE
Sbjct: 286 HPLGVMSSYNDWDGMPVTASNYFLTQLLRQQFGFDGYVVSDSRAVEFVYEKHHVAKDYKE 345

Query: 329 DAVAQTLKAGLDLDCG-QYYTNFT---GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
            AV   ++AGL++       +NF       +++G +    +++ +  + +V  RLG FD 
Sbjct: 346 -AVKMVMEAGLNVRTEFNAPSNFILPLRQLIKEGGLSMETLNQRVGEVLSVKFRLGLFDA 404

Query: 385 SPQYVSLGK---QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHAN 441
              YV   K   + + ++ +  +A +  RE +VLLKND+N LPL+  + + + V GP A+
Sbjct: 405 P--YVKDPKAADKIVATEASEAVALQMNRESLVLLKNDKNILPLSLGQYRNILVTGPLAD 462

Query: 442 ATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCDDVACKSNNS----------- 486
                I  Y     + +S + G   +    A + Y  GC+        S           
Sbjct: 463 EKEHAISRYGPSNKKVISVLEGIRHFAAKKATINYIKGCEAADATWPESEIIDTPPTPQE 522

Query: 487 ---IFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVIL 543
              +  A EAAK  D  I + G +     ESL R  L LPG Q +L+ ++ +  K P++L
Sbjct: 523 IAEMNKAVEAAKQNDIIIAVMGENDKQVGESLSRTGLNLPGRQLRLLEELKKTGK-PMVL 581

Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW-YNGDY 602
           ++++   + I +   N  + AIL   +PG  GG A+A+ +FG +NPGG+L  T+      
Sbjct: 582 ILINGQPLTINWE--NRYLDAILETWFPGPAGGTAVAEAIFGAYNPGGKLTTTFPKTTGQ 639

Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYN-----GPTLYPFGYGLSYTQFKYNLLSFTKTI 657
           ++M    + P +P    G PG     Y      GP LYPFGYGLSYT F+Y         
Sbjct: 640 IEM----NFPFKPASHAGQPGDGPNGYGKTAVVGP-LYPFGYGLSYTTFEY--------- 685

Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
                        N   D  K R         + D      VD +N G   G +VV +Y 
Sbjct: 686 ------------ANLKVDPEKART--------QAD--ISVAVDVKNTGKVKGDEVVQLYV 723

Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
           K       TY   + GF+RV +  G  K + F       L+I+D   N ++  G   I V
Sbjct: 724 KQLVSSVTTYESILRGFERVSLSPGETKTVHFKLTP-DDLSILDKNMNFVVEPGAFDIMV 782

Query: 778 GNGGV 782
           G+  V
Sbjct: 783 GSSSV 787


>gi|423295566|ref|ZP_17273693.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
           CL03T12C18]
 gi|392672275|gb|EIY65744.1| hypothetical protein HMPREF1070_02358 [Bacteroides ovatus
           CL03T12C18]
          Length = 782

 Score =  265 bits (677), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 215/726 (29%), Positives = 341/726 (46%), Gaps = 124/726 (17%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    EA HG   +G           AT FPT I   A+++  L K++GQ ++ 
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 176

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R+     + G   + P +++ RDPRW R+ ET GEDP + G    + V GL       
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGTLGASMVDGL------- 224

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
              +L+ +   +++  KH+ AY V        Y   A V  +D+ + FL PF   +  G 
Sbjct: 225 GGGNLSQKYATIATL-KHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDAG- 279

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           A SVM SYN ++GIP  ++  LL Q +R EW   G++V+D  SI+ + ++H F+A +KE+
Sbjct: 280 ALSVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FVAPTKEN 338

Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
           A  Q++ AG+D+D G   YTN   +AVQ G++ +  ID ++  +  +   +G F+     
Sbjct: 339 AAIQSVMAGVDVDLGGDAYTNLC-HAVQSGQMDKAVIDTAVCRVLRMKFEMGLFEHPYVD 397

Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
             +  + +   E+IELA + A+  I LLKN+ + LPL S  +  VAV+GP+A+    M+G
Sbjct: 398 PKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPL-SKMINKVAVIGPNADNRYNMLG 456

Query: 449 NYA-------------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAK 495
           +Y              GI  + +SP       + V Y  GC  +   + N I  A EAA+
Sbjct: 457 DYTAPQEDSNVKTVLDGIITK-LSP-------SRVEYVRGCA-IRDTTVNEIEQAIEAAR 507

Query: 496 TAD----------------------ATIILAGLDLSVE-AESLDREDLWLPGYQTQLINQ 532
            ++                      A +   G    +E  E  DR  L L G Q +L+  
Sbjct: 508 RSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLES 567

Query: 533 VAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGR 592
           + +  K P+I+V +    ++  +A    +  A+L A YPG+EGG AIADV+FG +NP GR
Sbjct: 568 LQKTGK-PLIVVYIEGRPLEKNWASEYAD--ALLTAYYPGQEGGNAIADVLFGDYNPSGR 624

Query: 593 LPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS 652
           LPI+      V  +P+      P +        Y   +   LY FGYG+SYT F+Y+ L 
Sbjct: 625 LPISVPRS--VGQIPVYYNQKAPRN------HDYVEVSSSPLYSFGYGMSYTTFEYSDLQ 676

Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDV 712
                                          V+    RC   FE     +N G  DG +V
Sbjct: 677 -------------------------------VVQKSARC---FEVSFKVKNTGKYDGEEV 702

Query: 713 VIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
             +Y +         +KQ+  F+R  ++ G  K++ FV    +   +V+Y    ++ +G 
Sbjct: 703 SQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGN 761

Query: 773 HTIFVG 778
             + +G
Sbjct: 762 FHLMIG 767


>gi|300777563|ref|ZP_07087421.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
 gi|300503073|gb|EFK34213.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
          Length = 896

 Score =  265 bits (677), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 155/453 (34%), Positives = 245/453 (54%), Gaps = 41/453 (9%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + F +  LP + R+++L++ +T +EK+  + D +  VPRL +P Y WW+EALHGV+  G 
Sbjct: 44  YPFRNPDLPVNERIENLLTLLTTEEKIGMMMDNSQAVPRLEIPAYGWWNEALHGVARAGI 103

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGR-AGL 162
                     AT FP  I   A+++     K  + +S EARA YN         GR  GL
Sbjct: 104 ----------ATVFPQAIGMAATWDVPEHFKTFEMISDEARAKYNRSFDEALKTGRYEGL 153

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+W+PNIN+ RDPRWGR  ET GEDP++     V  V+GLQ  +          +  K  
Sbjct: 154 TFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQGND---------PKFFKTH 204

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           +C KH+A +    W   +R+ ++A ++++D+ ET+L  F+  V+EG+   VMC+YN  +G
Sbjct: 205 ACAKHFAVHSGPEW---NRHSYNAEISKRDLYETYLPAFKALVQEGNVREVMCAYNAFDG 261

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN--HKFLADSKEDAVAQTLKAGLD 340
            P CA+  LL + +RG+W   G +V+DC ++        H    D K  A A  LK   D
Sbjct: 262 QPCCANNTLLTEILRGKWKYDGMVVSDCWALADFFQKKYHGTHPDEKTTA-ADALKHSTD 320

Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICS 398
           L+CG  Y N    ++  G + E DID+S++ +      LG  D   S  + ++    + S
Sbjct: 321 LECGDTYNNLN-KSLASGLITEKDIDESMRRILKGWFELGMLDPKSSVHWNTIPYSVVDS 379

Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM 458
           +E+ + A + A++ IVL+KN++N LPLN   +K +AVVGP+A+  +  +GNY G P   +
Sbjct: 380 EEHKKQALKMAQKSIVLMKNEKNILPLNR-NIKKIAVVGPNADDGLMQLGNYNGTPSSIV 438

Query: 459 SPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
           + + G       A + Y+ G +     S  S++
Sbjct: 439 TILDGIKTKFPNAEIIYEKGSEVTDPSSRTSLY 471



 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 80/302 (26%), Positives = 134/302 (44%), Gaps = 48/302 (15%)

Query: 489 AASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAK 538
           +  E  K AD  +   GL  S+E E +          D+  + LP  Q  L+ ++ +  K
Sbjct: 615 SVREKVKNADVIVFAGGLSPSLEGEEMMVNAEGFKGGDKTSIALPKVQRDLLAELRKTGK 674

Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
            PV+ V+ +  G  +   +   N  A+L A Y G+ GG A+ADV+ G +NP G+LPIT+Y
Sbjct: 675 -PVVFVLCT--GSALGLEQDEKNYDALLNAWYGGQSGGTAVADVLAGDYNPSGKLPITFY 731

Query: 599 -NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
            N + +      +      ++    GRTY++     LYPFG+GLSY++F Y     +K  
Sbjct: 732 KNLEQLDNALSKTSKHEGFENYDMQGRTYRYMTEKPLYPFGHGLSYSKFVYGDSKLSK-- 789

Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
                                        N +  ++     +   N+   +G +VV VY 
Sbjct: 790 -----------------------------NSISVNENVTITIPVTNISEREGEEVVQVYI 820

Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA-GEHTIF 776
           K   +  A  +K +  F+R  +++   K I+ + +   S    D  A+ L+   G++TIF
Sbjct: 821 KRNNDAQAP-VKTLRAFERTPIKSKETKNIQLILSK-DSFAFYDEKADDLVSKPGDYTIF 878

Query: 777 VG 778
            G
Sbjct: 879 YG 880


>gi|390957160|ref|YP_006420917.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
 gi|390412078|gb|AFL87582.1| beta-glucosidase-like glycosyl hydrolase [Terriglobus roseus DSM
           18391]
          Length = 908

 Score =  265 bits (676), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 166/439 (37%), Positives = 237/439 (53%), Gaps = 42/439 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + + +L    R  DLV RMTL+EK  Q+ + A  +PRL +P Y++W+E LHGV+  G   
Sbjct: 24  YLNPALTPQQRAADLVGRMTLEEKSLQMVNGAAAIPRLNVPAYDYWNEGLHGVARSG--- 80

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGRA------GLTYW 165
                   AT FP  I   A+++  L K+IG  ++TEARA  N  L R       GLT+W
Sbjct: 81  -------YATMFPQAIGMAATWDAPLLKQIGDVIATEARAKNNEALRRNNHDIYFGLTFW 133

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SPNIN+ RDPRWGR  ET GEDP +  +  VN++ GLQ  +          +  KV +  
Sbjct: 134 SPNINIFRDPRWGRGQETYGEDPHLTTQLGVNFIEGLQGTD---------PKFYKVIATP 184

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A   V +     R+ FD   T  D+ +T+L  F   + +  A S+MC+YNR++G P+
Sbjct: 185 KHFA---VHSGPEEGRHKFDVEPTPHDLWDTYLPQFRAAIVDAKADSIMCAYNRIDGQPA 241

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
           C    LL   +R +W   G++ +DC +I      + H+   D+ E A    L AG D +C
Sbjct: 242 CGSKLLLVDILRNDWKFQGFVTSDCGAIDDFFRPNTHQTEPDA-EHADKAALLAGTDTNC 300

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDEN 401
           G  Y    G+AV+ G +KE+DID SL+ L+   +RLG FD  GS  Y  +    + S  N
Sbjct: 301 GSTYRKL-GDAVKSGLIKESDIDVSLRRLFEARVRLGLFDPAGSVPYAQIPFSQVNSPAN 359

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
             +A  AA E +VLLKND   LPL + K KT+AV+GP+  +  ++ GNY G+      P+
Sbjct: 360 AAVAKRAAEESMVLLKND-GILPLKAGKYKTIAVIGPNGASLSSLEGNYNGMAHDPRMPV 418

Query: 462 ----AGFSGYANVTYKTGC 476
               +  SG  NV Y  G 
Sbjct: 419 DALRSALSG-TNVVYAPGA 436



 Score =  124 bits (311), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 89/301 (29%), Positives = 136/301 (45%), Gaps = 56/301 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A EAA  +D  + + GL   +E E +          DR D+ LP  Q  L+  +    K 
Sbjct: 624 ALEAANKSDLVVAMLGLSPDLEGEEMPVKLPGFVGGDRTDISLPASQQALLQGLIATGK- 682

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           P I+V+++   + I  A+   N  AIL + YPGE G  A+AD + G+ NP GRLPIT+Y 
Sbjct: 683 PTIVVLLNGSALAINLADEKAN--AILESWYPGEAGSTALADTLVGRNNPSGRLPITFYK 740

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
            +       + +P    +      RTY+++ G  LY FG+GLSYT+F Y+ L   K    
Sbjct: 741 SE-------SDLP--GFEDYSMQNRTYRYFKGAPLYGFGFGLSYTKFAYSGLKLAKA--- 788

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                                        L   D    +V  +N G   G +V  +Y  P
Sbjct: 789 ----------------------------KLNAGDTLTAEVTVKNTGKVAGEEVAELYLLP 820

Query: 720 PAEIAA--TYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFV 777
           PAE  A  +  +Q+ GFQRV ++ G ++++ F     + L+ VD      +  G + I +
Sbjct: 821 PAEGNAGLSPKQQLEGFQRVMLKPGESRKLTFTLTP-RQLSEVDAKGTRAIQPGTYAIAI 879

Query: 778 G 778
           G
Sbjct: 880 G 880


>gi|167765233|ref|ZP_02437346.1| hypothetical protein BACSTE_03621 [Bacteroides stercoris ATCC
           43183]
 gi|167696861|gb|EDS13440.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           stercoris ATCC 43183]
          Length = 818

 Score =  265 bits (676), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 209/729 (28%), Positives = 332/729 (45%), Gaps = 123/729 (16%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P  ++ +E +HG+++             AT  P  I   +++N+ L ++ G     
Sbjct: 157 RLGIP-VDFTNEGIHGLNHTK-----------ATPLPAPIAIGSTWNKELVRRAGVIAGQ 204

Query: 150 EARAMYNLGRAGLT-YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
           EA+A+      G T  ++P +++ RDPRWGR  E  GE+P+++       V G+Q  +G 
Sbjct: 205 EAKAL------GYTNVYAPILDIVRDPRWGRTLECYGEEPYLIAALGTEMVNGIQS-QG- 256

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
                       V++  KHYA Y V           D  V  +++ E FL PF+  ++  
Sbjct: 257 ------------VAATLKHYAVYSVPKGGRDGNCRTDPHVAPRELHELFLYPFKKVIQNS 304

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
               VM SYN  +G+P  A    L + +R E+   GY+V+D ++++ +   H  +AD+ +
Sbjct: 305 HPMGVMSSYNDWDGVPVSASYYFLTELLREEYGFDGYVVSDSEAVEFVESKH-HVADTYD 363

Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNA---------VQQGKVKETDIDKSLKYLYTVLMRL 379
           +AV Q L+AGL++      T+FT  +         +++ K+    IDK +  +  V  RL
Sbjct: 364 EAVRQVLEAGLNVR-----THFTPPSDFILPIRRLLEEKKISMAVIDKRVSEVLRVKFRL 418

Query: 380 GFFDGSPQYVSLGKQDIC--SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG 437
           G FD  P        D    +D N++   +  ++ +VLLKN+ N LPL+  ++K V V G
Sbjct: 419 GLFD-QPYVADTKAADRVGGADRNMDFVKQMQQQALVLLKNENNILPLDKRQIKKVLVTG 477

Query: 438 PHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCDDV-------------- 479
           P A+    M   Y       ++ +AG   Y    A V Y  GCD V              
Sbjct: 478 PLADEDNFMTSRYGPNGLETVTVLAGLRNYLKGIAEVDYAKGCDIVDAGWPATEILPAPM 537

Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKG 539
           + +    I  A   A  +D  I + G D     ES  R  L LPG Q QL+  +    K 
Sbjct: 538 SEQEKQGIAEAVAKAGESDVIIAVLGEDEYRTGESRSRTSLDLPGRQQQLLEALHATGK- 596

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PVILV+++   + + +A  N  I AIL + +PG +GG  IA+ +FG+ NPGG+L +T+  
Sbjct: 597 PVILVLINGQPLTVNWA--NAYIPAILESWFPGCQGGTVIAETLFGEHNPGGKLTVTFPK 654

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT----------LYPFGYGLSYTQFKYN 649
              V  + L + P +P      P      ++GP           LYPFG+GLSYT F Y+
Sbjct: 655 S--VGQIEL-NFPFKPGSHGAQP------HSGPNGSGATRIIGELYPFGFGLSYTTFAYS 705

Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
            L         ++ LQ      YT                        KV+  N G   G
Sbjct: 706 DLE--------VSPLQQHTQGEYT-----------------------IKVNVTNTGKRAG 734

Query: 710 SDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP 769
            +VV +Y +       TY  Q+ GF+RV ++ G  +++ F     + L I+D   N  + 
Sbjct: 735 DEVVQLYVRDKVSSVITYDSQLRGFERVSLQPGETRQVTFSLKP-EDLQILDRNMNWTVE 793

Query: 770 AGEHTIFVG 778
            GE  + +G
Sbjct: 794 PGEFEVMIG 802


>gi|298482082|ref|ZP_07000270.1| beta-glucosidase [Bacteroides sp. D22]
 gi|298271639|gb|EFI13212.1| beta-glucosidase [Bacteroides sp. D22]
          Length = 863

 Score =  265 bits (676), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 164/456 (35%), Positives = 241/456 (52%), Gaps = 41/456 (8%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           Q S + + D+ L    R  DL+ R+TL+EKV  + + +  +PRLG+  YEWW+EALHGV+
Sbjct: 22  QPSKYPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVA 81

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA----MYNLGR---- 159
             G           AT FP  I   ASFN+ L  ++  AVS EARA        G+    
Sbjct: 82  RAGL----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRY 131

Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT W+PN+N+ RDPRWGR  ET GEDP++ GR  +  VRGLQ  E  E          
Sbjct: 132 QGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAVVRGLQGPEDAEYD-------- 183

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
           K+ +C KH+A +    W   +R+ F+A  +  +D+ ET+L  F+  V++     VMC+YN
Sbjct: 184 KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYN 240

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA- 337
           R  G P C   +LL Q +R +W   G +V DC +I       K   ++  DAV  +  A 
Sbjct: 241 RFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKH--ETHPDAVHASADAV 298

Query: 338 --GLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD 395
             G DL+CG  + + T +AV++G + E  I+ S+K L      LG  + +  + ++    
Sbjct: 299 LNGTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNPTHPWSNIPYSV 357

Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
           I   ++ ELA + A E +VLL+N  N LPLN  +   VAV+GP+AN +V   GNY G P 
Sbjct: 358 IDCPKHKELALKMAHESLVLLQNKNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGFPS 415

Query: 456 RYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
             ++ + G       A + Y+  C      + +S+F
Sbjct: 416 HTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLF 451



 Score =  120 bits (300), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 95/296 (32%), Positives = 131/296 (44%), Gaps = 56/296 (18%)

Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
           K AD  I   G+   +E ES+          DR ++ LP  Q +++   A + K     V
Sbjct: 598 KNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVL---ALLKKNGKKTV 654

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
            ++  G  +A      +  AIL A YPG+ GG A+ADV+FG +NP GRLPIT+Y    +Q
Sbjct: 655 FVNFSGSAMAIVPETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS--IQ 712

Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
            LP         +     GRTY+F     LYPFGYGLSYT+F Y   +     Q  LNK 
Sbjct: 713 QLP-------DYEDYSMKGRTYRFMTETPLYPFGYGLSYTRFSYGKATLN---QSKLNKG 762

Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
           +                  +L             +   NVG  DG +VV VY   P +  
Sbjct: 763 EKA----------------ILT------------IPVSNVGQRDGEEVVQVYICRPDDKE 794

Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVGN 779
               K + GFQRV +  G+ + +        S    D A NT+ P +G + I  GN
Sbjct: 795 GPQ-KTLRGFQRVNIAKGKTQNVSIEL-PYDSFEWFDTATNTIRPLSGTYKILYGN 848


>gi|423227459|ref|ZP_17213920.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392623089|gb|EIY17195.1| hypothetical protein HMPREF1062_06106 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 864

 Score =  265 bits (676), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 163/450 (36%), Positives = 242/450 (53%), Gaps = 38/450 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + +  L  S R  DL+ RMTL+EKV Q+ + +  + RLG+P Y+WW+EALHGV+  G   
Sbjct: 24  YKNPELSPSERAWDLLKRMTLEEKVSQMKNGSPAIERLGIPAYDWWNEALHGVARAGK-- 81

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYW 165
                   AT FP  I   A+F+     +    VS EARA Y+         G  GLT+W
Sbjct: 82  --------ATVFPQAIGLAATFDNQAVYETFDIVSDEARAKYHDFQRKGERDGYKGLTFW 133

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PNIN+ RDPRWGR  ET GEDP++     +  V+GLQ   G     D      K  +C 
Sbjct: 134 TPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQG--GGTGKYD------KAHACA 185

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KHYA +    W   +R+ FDA+ ++++D+ ET+L  F+  VKEG    VMC+YNR  G P
Sbjct: 186 KHYAVHSGPEW---NRHSFDAKNISQRDLWETYLPAFKTLVKEGKVKEVMCAYNRFEGEP 242

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSI-QVMVDNHKFLADSKEDAVAQTLKAGLDLDC 343
            C++ +LL + +R +W     +V+DC +I      NH     +   A A  + +G DL+C
Sbjct: 243 CCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPTAAAASADAVVSGTDLEC 302

Query: 344 GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDEN 401
           G  Y++    AV++G + E  I++S+  L     +LG FD      +  +    + S E+
Sbjct: 303 GGSYSSLN-EAVRKGLISEEKINESVFRLLRARFQLGMFDDDALVSWSEIPYSVVESKEH 361

Query: 402 IELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI 461
           +  A E AR+ +VLL N  +TLPL S  ++ VAV+GP+AN +V +  NY G P + ++ +
Sbjct: 362 VAKALEMARKSMVLLTNKNHTLPL-SKSIRKVAVLGPNANDSVMLWANYNGFPTKSVTIL 420

Query: 462 AGFSGY---ANVTYKTGCDDVACKSNNSIF 488
            G         V Y+ GCD V  ++  S F
Sbjct: 421 EGIKSKLPEGTVYYEKGCDYVNTQTVFSYF 450



 Score =  124 bits (310), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 85/286 (29%), Positives = 132/286 (46%), Gaps = 54/286 (18%)

Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
           D+  K   +    ++ A  ADA I + GL  ++E E +          DR ++ LP  Q 
Sbjct: 581 DIGIKKEINYKEVADKAAEADAIIFVGGLSPTLEGEEMPVDLPGFRKGDRTNIDLPHVQA 640

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
           +++  + +  K PVI V+ S  G  +A      N+ AIL A YPG++GG A+ADV+FG +
Sbjct: 641 EMLKALKKTGK-PVIFVLCS--GSTLALPWEAENLDAILEAWYPGQQGGTAVADVLFGDY 697

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
           NP GRLP+T+Y          +S  L   +      RTY+++ G  L+PFG+GLSYT F 
Sbjct: 698 NPAGRLPLTFY---------ASSDDLPDFEDYDMSNRTYRYFKGKALFPFGHGLSYTIFD 748

Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
           Y      K                                ++R  +     +  +N G  
Sbjct: 749 YGKAKVDK-------------------------------QNVRAGEGMTLTIPLKNTGKL 777

Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
           DG +V+ VY + PA+     IK +  F+RV + AG+ + I+    A
Sbjct: 778 DGDEVIQVYLRNPADKEGP-IKTLRAFRRVSLPAGQTENIRIELPA 822


>gi|325286191|ref|YP_004261981.1| beta-glucosidase [Cellulophaga lytica DSM 7489]
 gi|324321645|gb|ADY29110.1| Beta-glucosidase [Cellulophaga lytica DSM 7489]
          Length = 754

 Score =  265 bits (676), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 222/741 (29%), Positives = 352/741 (47%), Gaps = 119/741 (16%)

Query: 76  EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
           +K++   DFA    RL +P + + S+ +HG                 T+FP  + T +S+
Sbjct: 81  KKLKIAQDFAVNDTRLKIPLF-FGSDVIHGYK---------------TTFPIPLATASSW 124

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRY 194
           +  L KK+ +  + EA A       G+ + +SP ++VARDPRWGRI E  GEDP++    
Sbjct: 125 DMDLIKKMAETAALEATA------DGINWNFSPMVDVARDPRWGRIAEGAGEDPYLGSAI 178

Query: 195 AVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
           A   V G Q      N T  N+    + +  KH+A Y      G D    D   T+  M 
Sbjct: 179 AKAMVHGYQ----GNNLTAKNT----MLATVKHFALYGAAE-AGRDYNSVDMSRTK--MF 227

Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
             +L P++  +  G A+S+M S+N V+GIP+  +  LL   +R +W   G++V+D  S+ 
Sbjct: 228 NQYLPPYKAGIDAG-AASIMTSFNDVDGIPASGNKWLLTDLLRKKWGFKGFVVSDYTSVN 286

Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
            M+ +   L D  +D  A +LKAGLD+D  G+ +      ++ +G+V E +I  + + + 
Sbjct: 287 EMIAHG--LGDL-QDVSALSLKAGLDMDMVGEGFLTTLKKSLDEGRVTEEEITNACRRIL 343

Query: 374 TVLMRLGFFDGSPQYVSLG--KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK 431
               +LG FD   +Y+     K+DI + ++  LA EAA+   VL KN  N LPL  +K  
Sbjct: 344 EAKYKLGLFDDPYKYIDAKRPKKDILTKKSKTLAREAAKRSFVLFKNHNNILPL--SKTA 401

Query: 432 TVAVVGPHANATVAMIGNYA--GIPCRYMSPIAGF---SGYANVTYKTGC---DDVACKS 483
            +A+VGP AN    M+G +A  G P   +  + GF   +  A +TY  G    DD     
Sbjct: 402 KIALVGPLANNKNNMLGTWAPTGDPQLSIPILNGFKNVASKAKITYAKGANITDDTELAK 461

Query: 484 NNSIFA----------------ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQT 527
             ++F                 A + AKT+D  + + G    +  E+  R D+ +P  Q 
Sbjct: 462 KVNVFGTRVDIDKRSSEELLQEALDLAKTSDVVVAVVGEASEMSGEAASRTDISIPNSQK 521

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
           +LI ++ +  K PV+LV+MS   + I   E N  + +IL   +PG E G A+ADV+FG +
Sbjct: 522 RLIQELVKTGK-PVVLVLMSGRPLTIE-EEFNLPV-SILQVWHPGIEAGNAVADVIFGDY 578

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKF----------YNGPTLYPF 637
           NP G+L  TW     V  +P+       + + G P  +  F           N P L PF
Sbjct: 579 NPSGKLTATWPRN--VGQIPI----YHSIKNTGRPAPSPAFEKFKSNYLDVKNAP-LLPF 631

Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
           GYGLSYT FKY+         +NL+K +  +  + T                        
Sbjct: 632 GYGLSYTSFKYS--------NINLSKKEIAQGEDVT-----------------------V 660

Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
            V  +N G+ DG +VV +Y +         ++Q+ GF++VF++ G +K ++ V  A   L
Sbjct: 661 SVTVKNTGNFDGEEVVQLYLRDVVRSITPPMRQLKGFKKVFLKKGESKTVELVLTA-DDL 719

Query: 758 NIVDYAANTLLPAGEHTIFVG 778
              +   + +   G+  IFVG
Sbjct: 720 KFYNSTLDFVAEPGDFEIFVG 740


>gi|160882671|ref|ZP_02063674.1| hypothetical protein BACOVA_00625 [Bacteroides ovatus ATCC 8483]
 gi|423289150|ref|ZP_17268000.1| hypothetical protein HMPREF1069_03043 [Bacteroides ovatus CL02T12C04]
 gi|423298450|ref|ZP_17276507.1| hypothetical protein HMPREF1070_05172 [Bacteroides ovatus CL03T12C18]
 gi|156111986|gb|EDO13731.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
            ovatus ATCC 8483]
 gi|392662991|gb|EIY56545.1| hypothetical protein HMPREF1070_05172 [Bacteroides ovatus CL03T12C18]
 gi|392667846|gb|EIY61351.1| hypothetical protein HMPREF1069_03043 [Bacteroides ovatus CL02T12C04]
          Length = 1049

 Score =  265 bits (676), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 220/769 (28%), Positives = 362/769 (47%), Gaps = 108/769 (14%)

Query: 56   DSSLPYSIR----VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG- 110
            +S LP++      VKDL+SRMT++EK+ QL  +  G   L  P+ E+ S++L     VG 
Sbjct: 328  NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 386

Query: 111  -------------------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIG 144
                                     P     DVI G  T FPT +  + S++ +  ++  
Sbjct: 387  VLNISGAKTLRDLQEKNMRHSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 446

Query: 145  QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
            +  + E+ A      AGL + ++P +++ARD RWGR+ E  GED ++    A   V G Q
Sbjct: 447  KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 500

Query: 204  DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
                  N  + NS    V +C KH+ AY +       R +    ++E+ + +T+L PF+ 
Sbjct: 501  -----WNLWENNS----VLACAKHWVAYGLPQ---AGRDYAPVDMSERTLFDTYLPPFKA 548

Query: 264  CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
            C+  G   + M ++N +NGIP+ A P LL   +RG+W+ +G++V+D ++++ +V   + +
Sbjct: 549  CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLV--AQGV 605

Query: 324  ADSKEDAVAQTLKAGLDLDCGQ-YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
            A+  +DA      +G+D+D     Y  +    ++ GK+   D+D S+  +  +   LG F
Sbjct: 606  AEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 665

Query: 383  DGSPQYVS--LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
                ++ +     Q I   E ++ A + A +  VLLKND +TLPL +  V+++AVVGP A
Sbjct: 666  VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 724

Query: 441  NATVAMIGNY-AGIPCRYMSPIAGFSGYAN--------VTYKTGCDDVACKSNNSIFAAS 491
            +    ++G++ A    R+++ +    G  N        V Y  GC D   +  +    A 
Sbjct: 725  DNQTELLGSWRARGEDRHVTTV--LQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAV 781

Query: 492  EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
            + A  +D  I + G    +  ES  R  L LPG Q +LI ++    K PV++V+M+   +
Sbjct: 782  KLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPL 840

Query: 552  DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD-YVQMLPLTS 610
             I +   + N+ AIL   + G   G AIAD++FG +NP GRL I++   +  V +     
Sbjct: 841  SIEW--VDKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPIYYNYK 898

Query: 611  MPLRPVDSL-GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
               RP D L     R     N P LYPFGYGLSYT F Y+    T+              
Sbjct: 899  KSGRPGDMLHSSTTRHIDVPNAP-LYPFGYGLSYTTFSYSAPQSTQK------------- 944

Query: 670  LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
              YT   +                     V   N G  DG + V +Y           +K
Sbjct: 945  -EYTRQET-----------------ISVSVTVTNTGDRDGEETVQLYVNDKVASVVRPVK 986

Query: 730  QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
            ++  F+++F++AG +K ++F  +   +L   D A N ++  GE  I  G
Sbjct: 987  ELKAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 1034


>gi|336415490|ref|ZP_08595829.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
           3_8_47FAA]
 gi|335940369|gb|EGN02236.1| hypothetical protein HMPREF1017_02937 [Bacteroides ovatus
           3_8_47FAA]
          Length = 863

 Score =  265 bits (676), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 164/458 (35%), Positives = 241/458 (52%), Gaps = 45/458 (9%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           Q S + + D+ L    R  DL+ R+TL+EKV  + + +  +PRLG+  YEWW+EALHGV+
Sbjct: 22  QPSKYPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVA 81

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA----MYNLGR---- 159
             G           AT FP  I   ASFN+ L  ++  AVS EARA        G+    
Sbjct: 82  RAGL----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRY 131

Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT W+PN+N+ RDPRWGR  ET GEDP++ GR  +  VRGLQ  E  E          
Sbjct: 132 QGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD-------- 183

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
           K+ +C KH+A +    W   +R+ F+A  +  +D+ ET+L  F+  V++     VMC+YN
Sbjct: 184 KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYN 240

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI-----QVMVDNHKFLADSKEDAVAQ 333
           R  G P C   +LL Q +R +W   G +V DC +I     +   + H   A +  DAV  
Sbjct: 241 RFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLN 300

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK 393
               G DL+CG  + + T +AV++G + E  I+ S+K L      LG  + +  + ++  
Sbjct: 301 ----GTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNPTHPWSNIPY 355

Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
             I   ++ ELA + A E +VLL+N  N LPLN  +   VAV+GP+AN +V   GNY G 
Sbjct: 356 SVINCPKHKELALKMAHESLVLLQNKNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGF 413

Query: 454 PCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
           P   ++ + G       A + Y+  C      + +S+F
Sbjct: 414 PSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLF 451



 Score =  119 bits (297), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 94/296 (31%), Positives = 130/296 (43%), Gaps = 56/296 (18%)

Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
           K AD  I   G+   +E ES+          DR ++ LP  Q +++   A + K     V
Sbjct: 598 KNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVL---ALLKKNGKKTV 654

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
            ++  G  +A      +  AIL A YPG+ GG A+ADV+FG +NP GRLPIT+Y    +Q
Sbjct: 655 FVNFSGSAMAIVPETQSCDAILQAWYPGQAGGTAVADVLFGNYNPAGRLPITFYKS--IQ 712

Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
            LP         +     GRTY+F     LYPFGYGLSYT+F Y   +      +N +KL
Sbjct: 713 QLP-------DYEDYSMKGRTYRFMTETPLYPFGYGLSYTRFSYGKAT------LNQSKL 759

Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
                   T                         +   NVG  DG +VV VY   P +  
Sbjct: 760 AKGEKAILT-------------------------IPVSNVGQRDGEEVVQVYICRPDDKG 794

Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVGN 779
               K + GFQRV +  G+ + +        S    D A NT+ P +G + I  GN
Sbjct: 795 GPQ-KTLRGFQRVNIAKGKTQNVNIEL-PYDSFEWFDTATNTIRPLSGTYKILYGN 848


>gi|333377833|ref|ZP_08469566.1| hypothetical protein HMPREF9456_01161 [Dysgonomonas mossii DSM
           22836]
 gi|332883853|gb|EGK04133.1| hypothetical protein HMPREF9456_01161 [Dysgonomonas mossii DSM
           22836]
          Length = 780

 Score =  265 bits (676), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 214/720 (29%), Positives = 340/720 (47%), Gaps = 107/720 (14%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    E  HG   +G            T FPT I   A++N +L +++   +S 
Sbjct: 125 RLGIPIF-LAEECPHGHMAIG-----------TTVFPTAIGQAATWNPNLIQQMSAVISK 172

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           EAR+     +     + P +++AR+ RW R+ ET GEDP ++ +    +V G        
Sbjct: 173 EARS-----QGSHIGYGPVLDLAREARWSRVEETYGEDPVLISKMGEAFVTGF------- 220

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
            + DL S+P  + S  KH+ AY + +  G    + ++ V  +D++E +L PFE  VK G 
Sbjct: 221 GSGDL-SKPYSLISTLKHFVAYGIPD--GGHNGNSNS-VGMRDLKENYLPPFEKAVKAG- 275

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           A SVM +YN V+GIP  ++  LL   +  +W   G+ V+D  SI+ +  +H ++  + ++
Sbjct: 276 ALSVMTAYNSVDGIPCTSNEYLLKDVLCKDWGFKGFTVSDLGSIEGLKGSH-YVVSTIQE 334

Query: 330 AVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV 389
           A   +L +GLD D G        +AV++G V ET ID ++  +  +   +G F+      
Sbjct: 335 AAILSLTSGLDCDLGGNAFFTLSDAVKKGMVGETQIDSAVYKILKLKFDMGLFENPYVDE 394

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
           +  +Q + + ENI LA + ARE IVLL+N  N LPLN +K+K +AV+GP+A+     +G+
Sbjct: 395 NNARQVVRTQENIVLARQVARESIVLLENKNNVLPLNKSKIKKIAVIGPNADNVYNQLGD 454

Query: 450 YAGIP-----CRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATI--- 501
           Y            +  I      + + Y  GC  +    N  I  A +AA  +D  +   
Sbjct: 455 YTAPQDDSNVKTVLDGIRSKLKQSQIEYVKGCA-IRDTLNTDIDKAVQAALRSDVAVVVV 513

Query: 502 ------------ILAGLDLSVE--------AESLDREDLWLPGYQTQLINQVAEVAKGPV 541
                       I  G  ++ E         E  DR  L L G Q +L+  +    K PV
Sbjct: 514 GGSSARDFKTKYIETGAAVADEHSISDMESGEGFDRVSLDLMGKQLELLKAIKATGK-PV 572

Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
           ++V +    +++ +A  N +  A+L A YPG+EGG AIADV+FG++NP GRLP++     
Sbjct: 573 VVVYIQGRPLNMNWASENAD--ALLSAWYPGQEGGNAIADVLFGEYNPAGRLPMSV--AK 628

Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
            V  LP+      P  S  Y   T K      LY FGYGLS+T F+Y+ L   K+     
Sbjct: 629 SVGQLPVYYNHRNPA-SHDYVEMTSK-----PLYSFGYGLSFTSFEYSNLKINKS----- 677

Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
                                         +   E  V+ +N G+ DG +VV +Y +   
Sbjct: 678 ------------------------------NSGVEVTVELRNSGNFDGDEVVQLYLRNNR 707

Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVGNG 780
                 I Q+  F+RV ++ G  K IK +       +I+D   N ++ P G+ T  VG+ 
Sbjct: 708 ASVVQPIMQLKAFERVNLKKGETKTIKLLLTK-DDFSIIDKKMNRVVEPNGDFTFMVGSA 766


>gi|299148437|ref|ZP_07041499.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298513198|gb|EFI37085.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 863

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 164/458 (35%), Positives = 241/458 (52%), Gaps = 45/458 (9%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           Q S + + D+ L    R  DL+ R+TL+EKV  + + +  +PRLG+  YEWW+EALHGV+
Sbjct: 22  QPSKYPYQDTKLTAEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVA 81

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA----MYNLGR---- 159
             G           AT FP  I   ASFN+ L  ++  AVS EARA        G+    
Sbjct: 82  RAGL----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNERGQYKRY 131

Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT W+PN+N+ RDPRWGR  ET GEDP++ GR  +  VRGLQ  E  E          
Sbjct: 132 QGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD-------- 183

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
           K+ +C KH+A +    W   +R+ F+A  +  +D+ ET+L  F+  V++     VMC+YN
Sbjct: 184 KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYN 240

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI-----QVMVDNHKFLADSKEDAVAQ 333
           R  G P C   +LL Q +R +W   G +V DC +I     +   + H   A +  DAV  
Sbjct: 241 RFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVLN 300

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK 393
               G DL+CG  + + T +AV++G + E  I+ S+K L      LG  + +  + ++  
Sbjct: 301 ----GTDLECGGNFKSIT-DAVKKGLISEEKINTSVKRLLKARFELGEMNPTHPWSNIPY 355

Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
             I   ++ ELA + A E +VLL+N  N LPLN  +   VAV+GP+AN +V   GNY G 
Sbjct: 356 SVINCPKHKELALKMAHESLVLLQNKNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGF 413

Query: 454 PCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
           P   ++ + G       A + Y+  C      + +S+F
Sbjct: 414 PSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLF 451



 Score =  118 bits (296), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 94/296 (31%), Positives = 130/296 (43%), Gaps = 56/296 (18%)

Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
           K AD  I   G+   +E ES+          DR ++ LP  Q +++   A + K     V
Sbjct: 598 KNADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVL---ALLKKNGKKTV 654

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
            ++  G  +A      +  AIL A YPG+ GG A+ADV+FG +NP GRLPIT+Y    +Q
Sbjct: 655 FVNFSGSAMAIVPETQSCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS--IQ 712

Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
            LP         +     GRTY+F     LYPFGYGLSYT+F Y   +      +N +KL
Sbjct: 713 QLP-------DYEDYSMKGRTYRFMTETPLYPFGYGLSYTRFSYGKAT------LNQSKL 759

Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
                   T                         +   NVG  DG +VV VY   P +  
Sbjct: 760 AKGEKAILT-------------------------IPVSNVGQRDGEEVVQVYICRPDDKG 794

Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVGN 779
               K + GFQRV +  G+ + +        S    D A NT+ P +G + I  GN
Sbjct: 795 GPQ-KTLRGFQRVNIAKGKTQNVNIEL-PYDSFEWFDTATNTIRPLSGTYKILYGN 848


>gi|282878201|ref|ZP_06286997.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
           buccalis ATCC 35310]
 gi|281299619|gb|EFA91992.1| glycosyl hydrolase family 3 C-terminal domain protein [Prevotella
           buccalis ATCC 35310]
          Length = 947

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 204/719 (28%), Positives = 333/719 (46%), Gaps = 102/719 (14%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P  ++ +E + GV +             AT+FPT +    ++N  L  ++G     
Sbjct: 159 RLGIP-VDFTNEGIRGVESFK-----------ATNFPTQLGLGTTWNRKLIHQVGYITGR 206

Query: 150 EARAMYNLGRAGLT-YWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
           EAR +      G T  ++P ++V RD RWGR  E  GE PF+V    +   RGLQ     
Sbjct: 207 EARLL------GYTNVYAPILDVGRDQRWGRYEEVYGESPFLVAELGIQMTRGLQT---- 256

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
                      +V+S  KH+AAY  +          D +++ ++++   L P+   V+E 
Sbjct: 257 ---------NYQVASTGKHFAAYSNNKGAREGMARVDPQMSPREVQNIHLYPWGRVVREA 307

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
                M SYN  +G+P       L + +R ++   GY+V+D D+++ +   H+  A+ KE
Sbjct: 308 GLLGAMSSYNDYDGVPIQGSFHWLTEVLRQQFGFKGYVVSDSDALEYLFSKHRTAANMKE 367

Query: 329 DAVAQTLKAGLDLDCG----QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
            AV + + AGL++ C       +       V++G++    ID+ L+ +  V   +G FD 
Sbjct: 368 -AVYKAVMAGLNVRCTFRSPDSFVLPLRELVKEGRIPMKVIDERLRDILRVKFMVGIFDR 426

Query: 385 SPQY-VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
             Q  +    +++    + ++A +A+RE IVLLKN  NTLPLN A +K +AV GP+AN  
Sbjct: 427 PYQMNLQAADKEVDGKSHQQVALQASRESIVLLKNQNNTLPLNKASIKKIAVCGPNANDA 486

Query: 444 VAMIGNYAGIPCRYMSPIAGFSGYA----NVTYKTGCDDV--------------ACKSNN 485
              + +Y  +     +   G          VTY  GCD V                   N
Sbjct: 487 AYALTHYGPLAVEVTTVFEGIRNKVGSDVEVTYTKGCDLVDAHWPESELVDYPMTADEQN 546

Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
            I  A E  + +D  +++ G +     E+  R  L LPG Q QL+  V    K  VILV+
Sbjct: 547 EIDKAVEQVRQSDVAVVVLGGNSRTCGENKSRSSLELPGRQLQLLKAVQATGK-TVILVL 605

Query: 546 MSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQM 605
           ++   + + +A+    + AI+ A YPG +GG A+ADV+FG +NPGG+L +T+     V  
Sbjct: 606 INGRPLSVNWADKF--VPAIVEAWYPGSQGGTAVADVLFGDYNPGGKLTVTF--PKTVGQ 661

Query: 606 LPLTSMPLRPV------DSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
           +P  + P +P       + LG  G   +  NG  LY FG+GLSYT FKY+ L  +     
Sbjct: 662 IPF-NFPSKPAALVDGGNKLGLHGNASR-ANG-ALYYFGHGLSYTTFKYSNLRLS----- 713

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                   +N++ T D+    C                  D  N G   G +VV +Y + 
Sbjct: 714 -------AQNISPT-DSVVVSC------------------DITNTGQRAGDEVVQLYIQD 747

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                 TY K + GF+RV ++ G  + + FV    + L +++     ++  G+  + +G
Sbjct: 748 VLSTVTTYEKNLRGFERVHLKPGETRTLSFVIKP-EHLQLINEQYQHVVEPGDFKVMMG 805


>gi|189464310|ref|ZP_03013095.1| hypothetical protein BACINT_00651 [Bacteroides intestinalis DSM
           17393]
 gi|189438100|gb|EDV07085.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           intestinalis DSM 17393]
          Length = 864

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 160/434 (36%), Positives = 246/434 (56%), Gaps = 41/434 (9%)

Query: 46  GLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHG 105
           G+  +  L+ D   P   R+ DL+SR+T++EK+  L   + G+PRL +P+Y   +EALHG
Sbjct: 20  GVAQAQELYKDEKAPMHERIMDLLSRLTVEEKISLLRATSPGIPRLDIPKYYHGNEALHG 79

Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG---- 161
           V  V PG          T FP  I   A++N  L  ++   +S EARA +N    G    
Sbjct: 80  V--VRPGRF--------TVFPQAIGLAATWNPELQLQVATVISDEARARWNELDQGREQK 129

Query: 162 ------LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLN 215
                 LT+WSP +N+ARDPRWGR  ET GEDP++ G     +V+GLQ   G ++     
Sbjct: 130 SQFSDLLTFWSPTVNMARDPRWGRTPETYGEDPYLSGVMGTAFVKGLQ---GDDD----- 181

Query: 216 SRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMC 275
            R LK+ S  KH+AA + ++    +R+  + +++E+ + E +L  FE CVK+G ++S+M 
Sbjct: 182 -RYLKIVSTPKHFAANNEEH----NRFVCNPQISEKQLREYYLPAFEACVKDGKSASIMS 236

Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTL 335
           +YN +N +P   +  LL + +R +W   GY+V+DC    ++V+ HK++  +KE A   ++
Sbjct: 237 AYNALNDVPCTLNAWLLTKVLREDWGFKGYVVSDCGGPSLLVNAHKYVK-TKEAAATLSI 295

Query: 336 KAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLG 392
           KAGLDL+CG   +     +A +Q  V   DID +   +    M+LG FD   +  Y  + 
Sbjct: 296 KAGLDLECGDDVFDEPLLSAYRQYMVTNADIDSAAYRVLRARMQLGLFDSGEKNPYTKIS 355

Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
              + S ++ E+A  AARE IVLLKN +  LPLN+ KVK++AVVG   NA     G+Y+G
Sbjct: 356 PAVVGSAKHQEVALNAARECIVLLKNQKKMLPLNAKKVKSIAVVG--INAGNCEFGDYSG 413

Query: 453 IPCRYMSPIAGFSG 466
            P   ++PI+   G
Sbjct: 414 SPV--IAPISVLQG 425



 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 98/293 (33%), Positives = 146/293 (49%), Gaps = 54/293 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A +A +  +  + + G++ S+E E  DR D+ LP  Q + + ++ +V   P I+V++ AG
Sbjct: 595 AGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQMEFLQEIYKV--NPNIVVVLVAG 652

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
              +A    + ++ AI+ A YPGE GG+A+A+V+FG +NPGGRLP+T+Y         L 
Sbjct: 653 S-SLAVNWMDEHVPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-------LD 704

Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
            +P  P D      GRTYK++ G  LYPFGYGLSYT FKY+       +QV         
Sbjct: 705 ELP--PFDDYDITKGRTYKYFKGDVLYPFGYGLSYTTFKYS------NLQV--------- 747

Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ--NVGSTDGSDVVIVYSKPPAEIAAT 726
                                  D   E  V FQ  N G   G +V  VY K P      
Sbjct: 748 ----------------------ADGEEEINVSFQLKNAGKYAGDEVAQVYVKLPERDEVM 785

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVG 778
            +K++ GF+RV +++G NK++         L   D A    + P+G++TI VG
Sbjct: 786 PVKELKGFERVALKSGENKKMTLKLRK-DLLRYWDEAKGKFVYPSGDYTIMVG 837


>gi|393781366|ref|ZP_10369565.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
           CL02T12C01]
 gi|392676859|gb|EIY70281.1| hypothetical protein HMPREF1071_00433 [Bacteroides salyersiae
           CL02T12C01]
          Length = 854

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 158/432 (36%), Positives = 243/432 (56%), Gaps = 41/432 (9%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           Q    ++ D + P   R+ DL+S++T++EK+  L   + G+PRL + +Y   +EALHGV 
Sbjct: 22  QKGKDVYLDMNAPQHERILDLLSKLTIEEKISLLRATSPGIPRLQIDKYYHGNEALHGV- 80

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG------ 161
            V PG          T FP  I   A +N  L  +I  A+S EARA +N    G      
Sbjct: 81  -VRPGNF--------TVFPQAIGLAAMWNPQLLNEISTAISDEARARWNELEQGKKQLGQ 131

Query: 162 ----LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSR 217
               LT+WSP +N+ARDPRWGR  ET GEDPF+ G+  V++V+GLQ  +          R
Sbjct: 132 FSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVSFVKGLQGDD---------PR 182

Query: 218 PLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
            LK+ S  KH+AA + ++    +R+  +  ++E+D+ E +L  FE C+ EG A+S+M +Y
Sbjct: 183 YLKIVSTPKHFAANNEEH----NRFECNPIISEKDLREYYLPAFEKCIIEGKAASIMTAY 238

Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
           N +N +P   +  LL + +R +W   GY+V+DC +   +V +HK++  + E A   +++A
Sbjct: 239 NAINDVPCTLNNWLLKKVLRHDWGFDGYVVSDCGAPDFLVTHHKYVK-TLEAAATLSIQA 297

Query: 338 GLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS--PQYVSLGKQ 394
           GLDL+CG   Y     NA +Q  V E +ID +  ++    MRLG FD      Y  +   
Sbjct: 298 GLDLECGDNVYMEPLLNAYKQYMVTEAEIDSAAYHILRARMRLGLFDDPNLNPYNKISPS 357

Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
            +  +++ +LA EAAR+ IVLLKN++  LPL+  K+K++AVVG   NA     G+Y+G P
Sbjct: 358 VVGCEKHSQLALEAARQSIVLLKNEKKFLPLDLKKIKSIAVVG--INAGNCEFGDYSGTP 415

Query: 455 CRYMSPIAGFSG 466
                P++   G
Sbjct: 416 VN--QPVSILEG 425



 Score =  126 bits (316), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 91/293 (31%), Positives = 137/293 (46%), Gaps = 50/293 (17%)

Query: 489 AASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSA 548
           AA +A +  D TI + G++ S+E E  DR  + LP  Q   I +  ++    V++++   
Sbjct: 594 AAGDAMRKCDLTIAVVGINKSIEREGQDRYSIELPKDQQIFIEEAYKINPNTVVVLV--- 650

Query: 549 GGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP- 607
            G  +A    + +I AI+ A YPGE GG A+A+V+FG +NPGG+LP+T+Y    +  LP 
Sbjct: 651 AGSSLAINWMDEHIPAIVNAWYPGEAGGTAVAEVLFGDYNPGGKLPLTYYRS--LDELPA 708

Query: 608 LTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHC 667
                +R        GRTY+F+ G  LY FG+GLSYT F Y  L                
Sbjct: 709 FDDYDIR-------KGRTYQFFEGNPLYAFGHGLSYTTFSYKKL---------------- 745

Query: 668 RNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA--EIAA 725
            N++ T DA K                F  K    N G  DG +V  +Y K      +  
Sbjct: 746 -NIDSTGDAVKVS--------------FALK----NTGKYDGDEVAQLYVKYQGNDSLVK 786

Query: 726 TYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
             +KQ+ GF+RV ++ G +KR+       +     +       PAG++   VG
Sbjct: 787 LPLKQLKGFERVHLKKGESKRVTLTVPKSELRFWDEEKGEFYTPAGDYLFMVG 839


>gi|423240769|ref|ZP_17221883.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
           CL03T12C01]
 gi|392643731|gb|EIY37480.1| hypothetical protein HMPREF1065_02506 [Bacteroides dorei
           CL03T12C01]
          Length = 864

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 165/452 (36%), Positives = 234/452 (51%), Gaps = 40/452 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + +S+L    R +DL+ ++TL+EKV  + D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 24  YKNSNLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   ASF       I  AVS EARA      A        GLT W
Sbjct: 82  --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNTAYSAAGSYERYQGLTMW 133

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +P +N+ RDPRWGR  ET GEDP++     VN V+GLQ        TD N +  K+ +C 
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ-------CTDANQKYDKIHACA 186

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ F+A  +  +D+ ET+L PFE  VKEG    VMC+YNR+ G P
Sbjct: 187 KHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEGKVKEVMCAYNRLEGDP 243

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +R EW   G +++DC +I        HK   ++ E A A  + +G DL+
Sbjct: 244 CCGSDRLLMQILRQEWGYEGIVLSDCGAIDDFYREKGHKTHPNA-ESASAAAVLSGTDLE 302

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDE 400
           CG  Y     +A ++G + E DID S+K L      LG  D     ++  +    +CS E
Sbjct: 303 CGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMDDPSKVEWTKIPYSVVCSAE 361

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
           +  L+ + AR+ + LL N  N LPL     +T+AV+GP+AN +V   GNY G P   ++ 
Sbjct: 362 HDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTITL 420

Query: 461 IAGFSGYA----NVTYKTGCDDVACKSNNSIF 488
           + G          + Y+ GC  V      S+F
Sbjct: 421 LEGIRSAMGENDKLIYEQGCSWVERSLIRSVF 452



 Score =  126 bits (317), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 104/326 (31%), Positives = 144/326 (44%), Gaps = 62/326 (19%)

Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
           FSG A + +     D+  K   +I       K AD  I   G+  S+E E +        
Sbjct: 574 FSGDAQLNF-----DLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFR 628

Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
             DR D+ LP  Q +LI  + +  K  VI V  S  G  IA        +AIL A YPG+
Sbjct: 629 KGDRTDIELPAVQRELIKALCDAGK-KVIFVNFS--GSPIAMEPETKYCQAILQAWYPGQ 685

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
            GG+A A+V+FG +NP GRLP+T+Y    +  LP         +     GRTY+++ G  
Sbjct: 686 SGGKAAAEVLFGDYNPAGRLPVTFYRN--IAQLP-------DFEDYNMTGRTYRYFKGDP 736

Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
           L+PFGYGLSYT F Y+ +   +TI+V               + +K   P           
Sbjct: 737 LFPFGYGLSYTTFNYDNIKLDQTIKV--------------GETAKMVIP----------- 771

Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
                    N G+ DG +VV VY K   E A    K +  F+RV + AG+   ++     
Sbjct: 772 -------VTNAGNRDGEEVVQVYLK-KQEDAEGPAKTLRAFKRVQIPAGKTVNVELELTP 823

Query: 754 CKSLNIVDYAANTLLP-AGEHTIFVG 778
            K L   D   NT+   AG   I VG
Sbjct: 824 -KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|374310554|ref|YP_005056984.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
 gi|358752564|gb|AEU35954.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
          Length = 739

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 235/760 (30%), Positives = 353/760 (46%), Gaps = 119/760 (15%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           ++ DS      RV  L++ MTLDEK+  L      VPRLG+       E LHG++  GPG
Sbjct: 44  VYLDSHADPESRVTALLAAMTLDEKIHALSTDP-SVPRLGVAGTNH-VEGLHGLALGGPG 101

Query: 113 THFD---------DVIPGATSFPTVILTTASFNESLWKKIGQAVSTEAR-AMYNLGRAGL 162
            H++         +VIP  T FP       +++ +L +K     + E R A     R GL
Sbjct: 102 -HWEGHSEGRTMLNVIP-TTQFPQSRGLGQTWDPALLQKAAAQEAYETRFAFGKYHRGGL 159

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
              +PN +++RDPRWGR  E+ GEDPF+VG  A  +  GLQ  + H   T         +
Sbjct: 160 VVRAPNADLSRDPRWGRGEESYGEDPFLVGTLATAFAHGLQGDDPHVWMT---------A 210

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S  KH+ A   ++ +     +FDAR+      E +  PF M ++EG A ++M SYN  N 
Sbjct: 211 SLLKHFLANSNEDGRDGSSSNFDARL----FHEYYAVPFRMAIEEGHADAMMTSYNAWNS 266

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD-- 340
           +P  A+P ++   V  +W L G +  D  ++  MV  H   A   E A A  + AG++  
Sbjct: 267 VPMTANP-VVRDVVMAQWGLDGIVCTDAGALTNMVKQHHTYATMPE-AAAAAIHAGINQF 324

Query: 341 LDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD---GSPQYVSLGKQDIC 397
           LD    Y     +A+QQ  + E DID++L+ +Y V++ LG  D    SP Y  +G  D  
Sbjct: 325 LDD---YQQPVRDALQQKLITEQDIDRNLRGVYRVMLHLGLLDPTANSP-YSHIGAFDQA 380

Query: 398 SDE--NIE----LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
             +  N E    L   A  E IVLLKN    LPL++AK+K++AV+G   + TVA+   Y+
Sbjct: 381 QSDPWNTEAPRALVRRATDESIVLLKNTGGALPLDAAKLKSIAVIGQWGD-TVAL-DWYS 438

Query: 452 GIPCRYMSPIAGF---SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDL 508
           G P   ++P+ G    +  A+V +  G D+ A  +          A  ++A I++ G   
Sbjct: 439 GTPLLSVTPVEGIRRRAAGASVVFNDGKDEAAAAA---------LAARSEAVIVIVGNHP 489

Query: 509 SVEA------------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFA 556
           + +A            E++DR+ L LP     L+  V  +A  P  +V++          
Sbjct: 490 TCDAGWNKCALPSEGKEAIDRKSLTLP--DESLVKAV--LAANPHAVVVLQT-SFPYTTN 544

Query: 557 ETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPV 616
            T  +  AIL   +  EE G A+ADV+FG +NP GRL  TW      Q+ P+    LR  
Sbjct: 545 WTQEHAPAILEITHNSEEQGTALADVLFGDYNPAGRLTQTW-PASLEQLPPMMDYDLR-- 601

Query: 617 DSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDA 676
                 GRTY +     LYPFG+GLSYT F Y+ L+ T+                     
Sbjct: 602 -----HGRTYLYAEKAPLYPFGFGLSYTSFAYSDLTVTQ--------------------- 635

Query: 677 SKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQR 736
              R   + V           +V   N GS  G +VV +Y+          I+++  F+R
Sbjct: 636 ---RGKSIAV-----------QVTVANTGSRAGDEVVQIYAAHQGSTVPRPIEELKAFRR 681

Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
           V +RAG  + ++F      SL   D A +  +  G+   F
Sbjct: 682 VALRAGEKQVVRFEM-PVTSLAYWDEATHRFIVEGDRVEF 720


>gi|334365132|ref|ZP_08514098.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
           sp. HGB5]
 gi|313158675|gb|EFR58064.1| glycosyl hydrolase family 3 N-terminal domain protein [Alistipes
           sp. HGB5]
          Length = 771

 Score =  264 bits (675), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 208/718 (28%), Positives = 333/718 (46%), Gaps = 108/718 (15%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    EA HG   +G           AT+FPT     +++N  L +++G+ ++ 
Sbjct: 120 RLGIPLF-LAEEAPHGHMAIG-----------ATTFPTAPGQASTWNPELIERMGKVIAA 167

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R      + G   + P +++ RDPRW R  E+ GED ++  R    YVRG        
Sbjct: 168 EIRL-----QGGHICYGPVLDIVRDPRWSRTEESYGEDCYLTARIGEAYVRGT------- 215

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
            + DL S+     S  KH+ AY           +    + E+++ ET+L PFE  VK G 
Sbjct: 216 GSGDL-SQSRHALSTLKHFIAYGASEGGQNGGSNL---LGERELRETYLPPFEAAVKAG- 270

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           A SVM +YN V+GIP  A+ ++L   +RGEW   G++V+D  SI+ + + H      +E 
Sbjct: 271 ARSVMTAYNSVDGIPCTANRRMLTDILRGEWGFDGFVVSDLLSIEGLHETHGVAGSVREA 330

Query: 330 AVAQTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
           AV Q L+AG+D D  G  + +    A + G V E +ID++++ +  +   +G F+ +P  
Sbjct: 331 AV-QALRAGVDADLKGGAFASLR-EAAEAGDVAEAEIDRAVERVLALKFEMGLFE-NPYI 387

Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
                 ++    + ELA EAAR+ + LL+N   TLPL+  +++ VAV+GP+A+     +G
Sbjct: 388 DEAAAAEVGCAAHSELALEAARQSVTLLENRSGTLPLDPRRLRRVAVIGPNADNIYNQLG 447

Query: 449 NYAGIPCRYMSPIAGFS---GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG 505
           +Y        +   G     G   V Y  GC  V     + I AA  AA+  DA +++ G
Sbjct: 448 DYTAQQTAANTVRDGLEKLLGRDRVVYSRGC-TVRGGDRSEIAAAVSAARGTDAAVVVIG 506

Query: 506 ----LDLSVE-------------------AESLDREDLWLPGYQTQLINQVAEVAKGPVI 542
                D   E                    E  DR  L L G Q +L+ ++      P+I
Sbjct: 507 GSSARDFDTEFLQTGAAKAAHDEVRDMECGEGFDRATLALLGEQEELLRRIKATGT-PLI 565

Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
           +V ++   +D+  A    +  A+L A YPG  GG A+A+ + G+ NP GRLPIT    + 
Sbjct: 566 VVCIAGRPLDLRRASEQAD--ALLMAWYPGARGGDAVAETILGRNNPAGRLPITIPRAE- 622

Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
              +P+     RP +        Y       LYPFGYGLSY+ F+Y  L   +  Q   N
Sbjct: 623 -GQIPVYYNKKRPANH------DYTDLTAAPLYPFGYGLSYSTFEYGSL---EARQSGDN 672

Query: 663 KLQ-HCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
            L+  CR  N               +D   D+  +  +          SD+V    +PP 
Sbjct: 673 VLEVSCRIRN--------------TSDREGDEVVQLYI----------SDMVASTVRPP- 707

Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
                  +Q+ GF+R+ +  G  +++ F     ++L ++D     ++  G+  I VG+
Sbjct: 708 -------RQLGGFRRIRLAPGEQRQVSFTLGD-EALALIDPQGRRVVEKGDFVIAVGS 757


>gi|375309610|ref|ZP_09774891.1| glycoside hydrolase [Paenibacillus sp. Aloe-11]
 gi|375078919|gb|EHS57146.1| glycoside hydrolase [Paenibacillus sp. Aloe-11]
          Length = 769

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 227/817 (27%), Positives = 366/817 (44%), Gaps = 150/817 (18%)

Query: 49  MSSFLFCDSSLPYSIRVKDLVSRMTLDEK----VQQLG---------------DFAH--- 86
           M+  ++ D S P   RVK L+  MT++EK    VQ  G               DF     
Sbjct: 1   MTMLIYKDKSKPIEERVKHLIGLMTIEEKVGQLVQPFGWQVYEHTDGELSLHHDFKQQVQ 60

Query: 87  --GVPRL-GLPQYEWWS--------------EALHGVSN-------------VGPGTHFD 116
             GV  L G+ + + W+              EA++ +               +G      
Sbjct: 61  NGGVGSLYGVLRADPWTGVTLENGLSAKQGAEAVNLIQRYAVEHSRLGIPILIGEECSHG 120

Query: 117 DVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPR 176
            +    T FP  +   +++N  L++ + +AV++E RA     + G   +SP ++V RDPR
Sbjct: 121 HMAIDGTVFPVPLSIGSTWNVDLYRDMCRAVASETRA-----QGGAVTYSPVLDVVRDPR 175

Query: 177 WGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-DVDN 235
           WGR  E  GEDP+++G +AV  V GLQ     E+    +S    V++  KH+A Y   + 
Sbjct: 176 WGRTEECFGEDPYLIGEFAVAAVEGLQG----ESLLSEHS----VAATLKHFAGYGSSEG 227

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
            +     H   R    ++ E  L PF+  V  G A S+M +YN ++G+P   + +LL+  
Sbjct: 228 GRNAGPVHMGWR----ELLEVDLYPFQKAVVAG-AQSIMPAYNEIDGVPCTVNAELLDDI 282

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNA 354
           +R  W   G ++ DC +I+++V+ H  + ++  DA  Q ++AG+D++  G+ + +    A
Sbjct: 283 LRQSWGFDGLVITDCGAIEMLVNGHD-VTENGSDAAVQAIRAGIDMEMSGEMFGSHLVEA 341

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
              GK++ + +D++ + + T+  RLG FD         +Q I   E+I LA + A EGIV
Sbjct: 342 AHAGKLETSVLDQAGRRVLTLKYRLGLFDNPYVNAERAEQVIGRAEHIRLARQLATEGIV 401

Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP--CRYMSPIAGFSGYA---- 468
           LLKN   TLPL     K +AV+GP+A+     +G+Y       R ++ + G         
Sbjct: 402 LLKNVNRTLPL-PKNSKRIAVIGPNADQVYNQLGDYTSPQPRSRVVTVLDGIRSKLSKHQ 460

Query: 469 -NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG-----------LDLSVEA---- 512
            +V Y  GC  +  +S      A   A  AD  +++ G           +DL   A    
Sbjct: 461 DDVLYTPGC-RIKGESREGFENALACAAEADTVVMVVGGSSARDFGEGTIDLKTGASKVA 519

Query: 513 ----------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
                     E +DR  L L G Q QL+ ++  + K    LV++   G  IA      + 
Sbjct: 520 DHDWNDMECGEGIDRMTLGLAGVQLQLMQEIYSLGKE---LVVVYMNGRPIAEPWVEEHA 576

Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
            AI+ A YPG+EGG AIAD++FG  NP GRL ++     +V  LP+     R        
Sbjct: 577 HAIVEAWYPGQEGGHAIADILFGDVNPSGRLTLSIPK--HVGQLPVYYNGKRS------R 628

Query: 623 GRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP 682
           G+ Y   +    YPFGYGLSYT F Y  L+ +                            
Sbjct: 629 GKRYLEDDAEPRYPFGYGLSYTTFSYERLTLS---------------------------- 660

Query: 683 GVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
               N +R D+     VD  N G  +G++VV +Y           ++++ GF +V ++ G
Sbjct: 661 ---TNSIRADESVTVTVDVTNTGEREGAEVVQLYISDTVSSVTRPVRELKGFCKVVLQPG 717

Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             + ++FV  + K L  +      ++ AG  +I VG 
Sbjct: 718 ETRTVEFVVGSDK-LQYIGRDLQPVVEAGRFSIQVGR 753


>gi|295132888|ref|YP_003583564.1| beta-glucosidase [Zunongwangia profunda SM-A87]
 gi|294980903|gb|ADF51368.1| beta-glucosidase [Zunongwangia profunda SM-A87]
          Length = 855

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 157/443 (35%), Positives = 243/443 (54%), Gaps = 40/443 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R +DLV+R+TL+EK   + D +  +PRLG+ ++ WWSEALHG +N       DDV    T
Sbjct: 24  RAEDLVNRLTLEEKASLMFDVSEAIPRLGIKKFNWWSEALHGFANN------DDV----T 73

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG-RAG--------LTYWSPNINVARD 174
            FP  +   ASF++ L  ++  A S E RA Y+   R G        L+ W+PN+N+ RD
Sbjct: 74  VFPEPVGMAASFDDELVYQVFDATSDEVRAKYHEALRNGEENKRFLSLSVWTPNVNIFRD 133

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR  ET GEDP++  R  V  V+GLQ  E        +++  K+ +C KHYA +   
Sbjct: 134 PRWGRGQETYGEDPYLTSRMGVQVVKGLQGPE--------DAKYKKLLACAKHYAVHSGP 185

Query: 235 NWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
            W    R+  +   V+++D+ ET+L  F++ V++ +   VMC+Y R++  P C   +LL 
Sbjct: 186 EW---SRHELNLNNVSQRDLWETYLPAFKVLVQDANVRQVMCAYQRLDDEPCCGSDRLLQ 242

Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT-- 351
           Q +R +W     +V+DC +IQ    +H   +D+   A A+ + AG D++C     N+   
Sbjct: 243 QILREKWGFEHLVVSDCGAIQDFYTSHNVSSDAVH-AAAKAVLAGTDVECQWDKHNYKLL 301

Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENIELAAEAA 409
             AV++G VKE DID+S+K +      LG  D      Y  +    I ++E+ +LA + A
Sbjct: 302 PEAVEKGLVKEEDIDRSVKRVLIGRFELGEMDPDEIVPYAQIPASVINNEEHRQLALKMA 361

Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---G 466
           RE + LL+N  N LPL+  + + +AV+GP+A+    + GNY G P R +S + G +   G
Sbjct: 362 RESMTLLQNKNNILPLSKGQDR-IAVIGPNADDEPMLWGNYNGTPVRTISILDGITSKIG 420

Query: 467 YANVTYKTGCDDVACKSNNSIFA 489
             ++ Y   CD V  K   S F+
Sbjct: 421 EKSIVYDKACDLVEDKVTQSYFS 443



 Score =  103 bits (258), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 80/296 (27%), Positives = 127/296 (42%), Gaps = 56/296 (18%)

Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
           K  +  I + GL   +E E +          DR D+ LP  Q   +  + +  K    ++
Sbjct: 590 KGIETVIFVGGLSTKLEGEEMPVSYPGFKGGDRTDIALPSVQRNCLKTLKDAGKK---VI 646

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
            ++  G  I      T+  AIL A Y GE GG+A+ADV+FG +NP G+LP+T+Y  D  Q
Sbjct: 647 FVNNSGSAIGLVPETTSCDAILQAWYGGESGGQAVADVLFGDYNPSGKLPVTFYK-DTTQ 705

Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
           +       +         GRTY+F     L+PFG+GLSYT FK          Q++ +++
Sbjct: 706 LPDFEDYSMN--------GRTYRFMKAEPLFPFGHGLSYTNFKIG------EAQLDKSEI 751

Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
               ++N T                         +   N G T+G +++ VY      + 
Sbjct: 752 DTSSSVNIT-------------------------ISISNEGKTEGVEIIQVYVHKQG-LE 785

Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL-LPAGEHTIFVGN 779
              IK + GF+RV ++    K +        S    D  A ++ +  G + IF GN
Sbjct: 786 EGPIKTLKGFKRVNLKPNEMKNVTINL-PSNSFEFYDKKARSMKVMPGNYEIFYGN 840


>gi|390945417|ref|YP_006409177.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
           17242]
 gi|390421986|gb|AFL76492.1| beta-glucosidase-like glycosyl hydrolase [Alistipes finegoldii DSM
           17242]
          Length = 771

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 208/718 (28%), Positives = 333/718 (46%), Gaps = 108/718 (15%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    EA HG   +G           AT+FPT     +++N  L +++G+ ++ 
Sbjct: 120 RLGIPLF-LAEEAPHGHMAIG-----------ATTFPTAPGQASTWNPELIERMGKVIAA 167

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R      + G   + P +++ RDPRW R  E+ GED ++  R    YVRG        
Sbjct: 168 EIRL-----QGGHICYGPVLDIVRDPRWSRTEESYGEDCYLTARIGEAYVRGT------- 215

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
            + DL S+     S  KH+ AY           +    + E+++ ET+L PFE  VK G 
Sbjct: 216 GSGDL-SQSRHALSTLKHFIAYGASEGGQNGGSNL---LGERELRETYLPPFEAAVKAG- 270

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           A SVM +YN V+GIP  A+ ++L   +RGEW   G++V+D  SI+ + + H      +E 
Sbjct: 271 ARSVMTAYNSVDGIPCTANRRMLTDILRGEWGFDGFVVSDLLSIEGLHETHGVAGSVREA 330

Query: 330 AVAQTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
           AV Q L+AG+D D  G  + +    A + G V E +ID++++ +  +   +G F+ +P  
Sbjct: 331 AV-QALRAGVDADLKGGAFASLR-EAAEAGDVAEAEIDRAVERVLALKFEMGLFE-NPYI 387

Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
                 ++    + ELA EAAR+ + LL+N   TLPL+  +++ VAV+GP+A+     +G
Sbjct: 388 DEAAAAEVGCAAHSELALEAARQSVTLLENRSGTLPLDPRRLRRVAVIGPNADNIYNQLG 447

Query: 449 NYAGIPCRYMSPIAGFS---GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG 505
           +Y        +   G     G   V Y  GC  V     + I AA  AA+  DA +++ G
Sbjct: 448 DYTAQQTAANTVRDGLEKLLGRDRVVYSRGC-TVRGGDRSEIAAAVSAARGTDAAVVVIG 506

Query: 506 ----LDLSVE-------------------AESLDREDLWLPGYQTQLINQVAEVAKGPVI 542
                D   E                    E  DR  L L G Q +L+ ++      P+I
Sbjct: 507 GSSARDFDTEFLQTGAAKAAHDEVRDMECGEGFDRATLALLGEQEELLRRIKATGT-PLI 565

Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
           +V ++   +D+  A    +  A+L A YPG  GG A+A+ + G  NP GRLPIT    + 
Sbjct: 566 VVCIAGRPLDLRRASEQAD--ALLMAWYPGARGGDAVAETILGHNNPAGRLPITIPRAE- 622

Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLN 662
              +P+     RP +        Y       LYPFGYGLSY+ F+Y  L   +  Q   N
Sbjct: 623 -GQIPVYYNKKRPAN------HDYTDLTAAPLYPFGYGLSYSTFEYGSL---EARQSGDN 672

Query: 663 KLQ-HCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
            L+  CR  N               +D   D+  +  +          SD+V    +PP 
Sbjct: 673 VLEVSCRIRN--------------TSDREGDEVVQLYI----------SDMVASTVRPP- 707

Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
                  +Q+ GF+R+ +  G  +++ F     ++L+++D     ++  G+  I VG+
Sbjct: 708 -------RQLGGFRRIRLAPGEQRQVSFTLGD-EALSLIDPQGRRVVEKGDFVIAVGS 757


>gi|373952814|ref|ZP_09612774.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373889414|gb|EHQ25311.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 862

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 157/438 (35%), Positives = 237/438 (54%), Gaps = 37/438 (8%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + + +L    R KDLV+R+TL EKV  + D +  VPRLG+ ++ WWSEALHG +N GP  
Sbjct: 24  YQNPALSSEARAKDLVTRLTLKEKVGLMKDVSEAVPRLGIKKFNWWSEALHGYANQGP-- 81

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                    T FP  +   ASF++     +  AVS EARA  N  R          L+ W
Sbjct: 82  --------VTVFPEPVGMAASFDDQKLFHVFDAVSDEARAKNNEYRKQVESQRFHDLSVW 133

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +PN+N+ RDPRWGR  ET GEDP++  R  V+ V+GLQ         D   R  K+ +C 
Sbjct: 134 TPNVNIFRDPRWGRGQETYGEDPYLTSRMGVSVVKGLQ------GPADAKYR--KLLACA 185

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KHYA +    W    R+  +   VT +D+ ET+L  F+  V++ D   VMC+Y R++  P
Sbjct: 186 KHYAVHSGPEWS---RHEMNVTDVTPRDLWETYLPAFKSLVQDADVREVMCAYQRLDDEP 242

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
            C + +LL Q +R +W     +V+DC +I    ++H   +D+   A A+ + +G D++C 
Sbjct: 243 CCGNSRLLGQILREDWGFKYLVVSDCGAITDFYNSHHSSSDATH-ASAKAVLSGTDVECV 301

Query: 345 QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDICSDENI 402
            Y  +   +AV +G +KE DI+ S+  L T    LG  D      +  +    + S+++ 
Sbjct: 302 GYAFDKIPDAVYRGLIKEKDINTSVVRLMTQRFELGEMDKDELVPWTKIPLSVVNSEDHQ 361

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIA 462
           +LA + ARE + LL+N+ N LPL S  +  +AV+GP+AN +  + GNY G P R ++ + 
Sbjct: 362 KLALDMARETMTLLQNNNNILPL-SKSIGKLAVIGPNANDSQMLSGNYNGTPLRTINILE 420

Query: 463 GFS---GYANVTYKTGCD 477
           G     G  +V Y  GCD
Sbjct: 421 GIKTKLGADHVIYDAGCD 438



 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 79/266 (29%), Positives = 117/266 (43%), Gaps = 54/266 (20%)

Query: 492 EAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPV 541
           E  K AD  + + G+   +E E +          DR D+ LP  Q   I  + +  K   
Sbjct: 594 EKVKDADIVVFVGGISPKLEGEEMPVQLPGFKGGDRTDIELPAVQRNCIEALRKAGKK-- 651

Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
            +V ++  G  IA      N  AIL A Y GE GG+A+ADV+FG +NP G LP+T+Y   
Sbjct: 652 -IVFVNCSGSAIAMVPETQNCDAILQAWYAGESGGQAVADVLFGDYNPSGHLPVTFYRN- 709

Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
            VQ LP  S            GRTY++     L+PFG+GLSYT F       TK      
Sbjct: 710 -VQQLPDFS-------DYSMKGRTYRYLKSAPLFPFGFGLSYTTFNIGEAKLTK------ 755

Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
                                    N++   +  + +V   N G TDG++++ VY +   
Sbjct: 756 -------------------------NNITKGEAIQLRVPVANAGKTDGTELLQVYIRKVD 790

Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRI 747
           +      K + GF+R+ V AG+ + +
Sbjct: 791 DPDGAS-KTLRGFKRIPVSAGKTEMV 815


>gi|427387362|ref|ZP_18883418.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
           12058]
 gi|425725523|gb|EKU88394.1| hypothetical protein HMPREF9447_04451 [Bacteroides oleiciplenus YIT
           12058]
          Length = 865

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 167/467 (35%), Positives = 245/467 (52%), Gaps = 52/467 (11%)

Query: 51  SFLFCDSSL-------PY-------SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQY 96
           SF FC  +L       PY       S R  DL+ RMTL+EK+ Q+ + +  + RLG+P Y
Sbjct: 8   SFCFCAVALVATAQNEPYKNPDLTPSERAWDLLKRMTLEEKISQMKNGSPAIERLGIPAY 67

Query: 97  EWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN 156
            WW+EALHGV+  G           AT FP  I   A+F+     +    VS EARA Y+
Sbjct: 68  NWWNEALHGVARAGK----------ATVFPQAIGLAATFDNQAVHETFSIVSDEARAKYH 117

Query: 157 --------LGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
                    G  GLT+W+PNIN+ RDPRWGR  ET GEDP++     +  V+GLQ     
Sbjct: 118 DFQRKGERDGYKGLTFWTPNINIYRDPRWGRGMETYGEDPYLTSLMGLAVVKGLQG---- 173

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKE 267
               D   +  K  +C KHYA +    W   +R+ FDA+ ++++D+ ET+L  F+  V E
Sbjct: 174 ----DGTGKYDKTHACAKHYAVHSGPEW---NRHSFDAKNISQRDLWETYLPAFKTLVTE 226

Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI-QVMVDNHKFLADS 326
           G    VMC+YNR  G P C++ +LL + +R +W     +V+DC +I      NH     +
Sbjct: 227 GKVKEVMCAYNRYEGEPCCSNKQLLIRILREDWGYDDIVVSDCGAIGDFYYPNHHETHPT 286

Query: 327 KEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
              A A  + +G DL+CG  Y++    AV++G + E  I++S+  L     +LG FD + 
Sbjct: 287 AAAASADAVVSGTDLECGGSYSSLN-EAVRKGLISEDKINESVFRLLRARFQLGMFDDNT 345

Query: 387 --QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
              +  +    + S E++  A E AR+ +VLL N  N LPL S  V+ VAV+GP+AN +V
Sbjct: 346 LVSWSEIPYSVVESKEHVAKALEMARKSMVLLTNKNNILPL-SKSVRKVAVLGPNANDSV 404

Query: 445 AMIGNYAGIPCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
            +  NY G P + ++ + G         V Y+ GCD V  ++  S F
Sbjct: 405 MLWANYNGFPTKSVTILEGIRNKLPEGAVYYEKGCDFVNTQTVFSYF 451



 Score =  120 bits (302), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 82/282 (29%), Positives = 129/282 (45%), Gaps = 54/282 (19%)

Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
           D+  K   +    ++ A  AD  I + GL  S+E E +          DR ++ LP  Q 
Sbjct: 582 DIGIKKEINYKEMADKAAEADVIIFVGGLSSSLEGEEMPVDLPGFRKGDRTNIDLPQVQE 641

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
           +++  + +  K PV+ V+ S  G  +A      N+ AI+ A YPG++GG A+ADV+FG +
Sbjct: 642 EMLKALKKTGK-PVVFVLCS--GSTLALPWEAENLDAIIEAWYPGQQGGTAVADVLFGDY 698

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
           NP GRLP+T+Y          +S  L   +      RTY+++ G  L+PFG+GLSYT F 
Sbjct: 699 NPAGRLPLTFY---------ASSSDLPDFEDYDMSNRTYRYFKGRPLFPFGHGLSYTTFD 749

Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
           Y      K I                               LR  +     +  +N+G  
Sbjct: 750 YGKAKADKKI-------------------------------LRAGEGLTLTIPLKNIGKL 778

Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
            G +VV VY + P +     IK +  F+R+ + AG+ + + F
Sbjct: 779 SGDEVVQVYLRNPGDKEGP-IKTLRAFRRISLEAGQAEDVLF 819


>gi|399025517|ref|ZP_10727513.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
           CF314]
 gi|398077894|gb|EJL68841.1| beta-glucosidase-like glycosyl hydrolase [Chryseobacterium sp.
           CF314]
          Length = 875

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 160/459 (34%), Positives = 241/459 (52%), Gaps = 45/459 (9%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           Q   + F + +LP   R+++L+  +T+DEK+  + D +  VPRL +P Y WW+EALHGV+
Sbjct: 19  QNYKYPFRNPNLPVEQRIENLLGLLTVDEKIGMMMDNSKAVPRLEIPAYGWWNEALHGVA 78

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--------LGR 159
             G           AT FP  I   A+++     K  + +S EARA YN         GR
Sbjct: 79  RAGT----------ATVFPQAIGMAAAWDVPEHLKTFEMISDEARAKYNKSFDEASKTGR 128

Query: 160 -AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
             GLT+W+PNIN+ RDPRWGR  ET GEDP++     V  V+GLQ  +          + 
Sbjct: 129 YEGLTFWTPNINIFRDPRWGRGQETYGEDPYLTSVLGVAAVKGLQGND---------PKY 179

Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
            K  +C KH+A +    W   +R+ ++A V+++D+ ET+L  F+  V EG+   VMC+YN
Sbjct: 180 FKTHACAKHFAVHSGPEW---NRHSYNAEVSKRDLYETYLPAFKSLVLEGNVREVMCAYN 236

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN--HKFLADSKEDAVAQTLK 336
             +G P CA   LLN+ +RG+W   G +V+DC ++        H    D K  A A  LK
Sbjct: 237 AFDGQPCCASNTLLNEILRGKWKYDGMVVSDCWALADFYQEKYHGTHPDEKSTA-ADALK 295

Query: 337 AGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD- 395
              DL+CG  Y N    ++  G + E DID S++ +      LG  D  P+   L  Q  
Sbjct: 296 HSTDLECGDTYNNLN-KSLAGGLITEKDIDISMRRILKGWFELGMLD--PKSSVLWNQIP 352

Query: 396 ---ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
              + SDE+ + A + A++ IVL+KN+ N LP N   +K +AVVGP+A+  +  +GNY G
Sbjct: 353 YSVVDSDEHKKQALKMAQKSIVLMKNENNILPFNK-NIKKIAVVGPNADDEMMQLGNYNG 411

Query: 453 IPCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
            P   ++ + G         + Y+ G +     S  S++
Sbjct: 412 TPSSIVTILEGIKAKFPNTEIIYEKGSEVADPSSRASLY 450



 Score =  108 bits (270), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 81/302 (26%), Positives = 136/302 (45%), Gaps = 48/302 (15%)

Query: 489 AASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAK 538
           +  E  K AD  +   GL  S+E E +          D+  + LP  Q +L+ ++ +  K
Sbjct: 594 SVKEKVKDADVIVFAGGLSPSLEGEEMLVNAEGFKGGDKTSIELPKVQRELLAELRKTGK 653

Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
            PV+ V+ +  G  +   +   N   +L A Y G+ GG A+ADV+ G +NP GRLP+T+Y
Sbjct: 654 -PVVFVLCT--GSSLGLEQDEKNYDVLLNAWYGGQSGGTAVADVLAGDYNPSGRLPVTFY 710

Query: 599 -NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI 657
            N + +      +   +  ++    GRTY++     LY FG+GLSY++F Y     +K  
Sbjct: 711 KNLEQLDNALSKTSKHQGFENYDMQGRTYRYMTENPLYAFGHGLSYSKFNYGNAKLSK-- 768

Query: 658 QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYS 717
                                        N +  ++     V   N+   DG +VV VY 
Sbjct: 769 -----------------------------NSISPNEDIIITVPVTNISDRDGEEVVQVYV 799

Query: 718 KPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIF 776
           K   ++ A  +K +  F+RV +R+   K I+   +  +S    D  A+ L+  +G++TI 
Sbjct: 800 KRNNDVLAP-VKTLRAFERVLIRSKETKNIQLTISK-ESFKFYDEKADDLISKSGDYTIL 857

Query: 777 VG 778
            G
Sbjct: 858 YG 859


>gi|322371968|ref|ZP_08046510.1| glycoside hydrolase family 3 domain protein [Haladaptatus
           paucihalophilus DX253]
 gi|320548390|gb|EFW90062.1| glycoside hydrolase family 3 domain protein [Haladaptatus
           paucihalophilus DX253]
          Length = 776

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 214/745 (28%), Positives = 338/745 (45%), Gaps = 126/745 (16%)

Query: 76  EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
           ++  +L DF     RLG+P      E L G              P  T+FP ++   +++
Sbjct: 81  KRTNELQDFLGSETRLGIPAIPH-EECLSGYMG-----------PSGTTFPQMLGVASTW 128

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRY 194
           +  L  +I   +  +  A+      G T+  SP +++ARD RWGR+ ET GEDP++V   
Sbjct: 129 SPDLVAEITDTIRGQLEAI------GTTHALSPVLDIARDLRWGRVEETFGEDPYLVAAM 182

Query: 195 AVNYVRGLQ-DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDM 253
           A  YV GLQ D +G             +S+  KH+A +      G +R   +  V  +++
Sbjct: 183 ARGYVNGLQGDGDG-------------ISATLKHFAGHGAGE-GGKNRSSVN--VGRREL 226

Query: 254 EETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI 313
            ET L PFE  +K  DA SVM +Y+ ++GIP  +D  LL   +RGEW   G +V+D  S+
Sbjct: 227 RETHLFPFEAVIKTADAESVMNAYHDIDGIPCASDGWLLTDVLRGEWGFDGTVVSDYYSV 286

Query: 314 QVMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNAVQQGKVKETDIDKS 368
           + +   H  +A SK+ A    ++AGLD+     DC   Y +   NAV+ G V E  ++ +
Sbjct: 287 EFLQSEHG-VAASKQAAGVMAVEAGLDVELPYTDC---YGDHLVNAVEDGDVAEATVNTA 342

Query: 369 LKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSA 428
           ++ +       G  D     V        ++   +L   AARE + LLKN+ + LP +  
Sbjct: 343 VRRVLRAKAEKGLLDDPTVDVDAAAAPFNTENARDLTTRAARESMTLLKNEDDFLPFDGE 402

Query: 429 KVKTVAVVGPHANATVAMIGNYAGIPCRY---------MSPI---------AGFSGYANV 470
           +++TVAVVGP A+    ++G+YA  P  Y          +P+         AGF    +V
Sbjct: 403 ELETVAVVGPKADNAQELMGDYA-YPAHYPTEEVDLDATTPLDAIEARGEHAGF----DV 457

Query: 471 TYKTGCDDVACKSNN---------------SIFAASEAAKTADATIILAGL-DLSVEAES 514
            Y+ GC      + +               +   A  A   +D     A L  +    E 
Sbjct: 458 RYEQGCTTTGSSTEDFDSAAEAAEAADVAVTFVGARSAVDFSDIDEKQADLPSVPTSGEG 517

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
            D  DL LPG Q +L+ +V E    P+++V++S     + +        A+L+A  PGE 
Sbjct: 518 CDVVDLDLPGVQQELVERVHETGT-PLVVVVVSGKPHSVEW--IAEEAPALLYAWLPGER 574

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
           GG  IA+V+FG+ NPGGRLP++           +  +P+            + +     L
Sbjct: 575 GGEGIAEVLFGEHNPGGRLPVSIPRS-------VGQLPVYYNRKPNTANEEHVYTESTPL 627

Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
           YPFG+GLSYT F+Y  LS                 L+  S A   R              
Sbjct: 628 YPFGHGLSYTDFEYGDLS-----------------LSTDSIAPSGRVSA----------- 659

Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
              +V   N G  DG +VV +Y+   +   A  +++++GF+R+F+ AG +KRI F  +A 
Sbjct: 660 ---EVTVSNTGDRDGHEVVQLYASAKSPSQARPVQELVGFERIFLAAGESKRIIFEIDAS 716

Query: 755 KSLNIVDYAANTLLPAGEHTIFVGN 779
           + L   D   N  +  G + + VG 
Sbjct: 717 Q-LAFHDRDMNLAVERGPYELRVGR 740


>gi|435848436|ref|YP_007310686.1| beta-glucosidase-like glycosyl hydrolase [Natronococcus occultus
           SP4]
 gi|433674704|gb|AGB38896.1| beta-glucosidase-like glycosyl hydrolase [Natronococcus occultus
           SP4]
          Length = 771

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 202/693 (29%), Positives = 327/693 (47%), Gaps = 102/693 (14%)

Query: 120 PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWG 178
           P AT+FP +I   ++++  L +++ + +  E  A+      G T+  SP ++VARD RWG
Sbjct: 113 PEATTFPQMIGMASTWDPELLEEVTETIRGELEAL------GTTHALSPVLDVARDLRWG 166

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R+ ET GEDP +V   A  YV GLQ           + R   VS+  KH+  +   +  G
Sbjct: 167 RVEETFGEDPLLVAAMACGYVSGLQG----------DGRADGVSATLKHFVGHGATDG-G 215

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
            +R   +  V  +++ E  L P+E  ++  DA SVM +Y+ ++GIP  +   LL   +RG
Sbjct: 216 KNRSSLN--VGPRELREVHLFPYEAAIRTADAESVMNAYHDIDGIPCASSEWLLTDLLRG 273

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC--GQYYTNFTGNAVQ 356
           E+   G +V+D  S++ +V  H   A++K +A    L+AGLD++     YY      AV+
Sbjct: 274 EFGFDGTVVSDYYSVRHLVTEHG-TANTKPEAATAALEAGLDVELPYTDYYGEHLITAVE 332

Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLL 416
            G++ E  +D+S++ +     R G  D              +DE   L   AAR  + LL
Sbjct: 333 NGELSEKTLDESVRRVLREKARKGLLDDPSVDAEAAADAFRTDEAAALNRRAARRSMTLL 392

Query: 417 KNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY---------MSPIAGFSGY 467
           KN+   LPL +    +VAV+GP A+A   ++G+YA     Y          +P+A     
Sbjct: 393 KNENELLPLTA---DSVAVIGPKADAKKELLGDYA-YAAHYPEEEYASDATTPLAALESR 448

Query: 468 --ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE-------------- 511
               V+Y+ GC  V+  S +    A++ A+ AD  +   G   +V+              
Sbjct: 449 DGLEVSYEQGC-TVSGPSTDGFEPAAQVAEDADVALAFVGARSAVDFSDGDASKEEKPSV 507

Query: 512 ---AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWA 568
               E  D  DL LPG Q +LI+++ E    P+ +VI+S  G   +      ++ A+L+A
Sbjct: 508 PTSGEGCDVTDLGLPGVQEELIDRLQETGT-PLAVVIVS--GRPHSIERITADVPAVLYA 564

Query: 569 GYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP--LTSMPLRPVDSLGYPGRTY 626
             PG+EGG AI DV+FG+ NP GRLP++         LP  +  +P+          ++Y
Sbjct: 565 WLPGDEGGSAIVDVLFGEHNPSGRLPVS---------LPKSVGQLPVYYNRKANTANKSY 615

Query: 627 KFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV 686
            + +G  +YPFG+GLSYT+F+Y  LS ++     L  +                      
Sbjct: 616 VYTDGEPVYPFGHGLSYTEFEYGTLSLSEKRVSPLETVVAS------------------- 656

Query: 687 NDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKR 746
                       V   N G   G++VV +Y+       A  ++++IGF+RV + AG  KR
Sbjct: 657 ------------VPVTNEGDRSGAEVVQLYAHAANPSQARPVQELIGFERVPLEAGETKR 704

Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           + F  +  + L   D +    +  G + I VG 
Sbjct: 705 VSFELSPTQ-LAFHDESMTLTVEEGPYEIRVGR 736


>gi|254786805|ref|YP_003074234.1| glycoside hydrolase family 3 domain-containing protein
           [Teredinibacter turnerae T7901]
 gi|237686035|gb|ACR13299.1| glycoside hydrolase family 3 domain protein [Teredinibacter
           turnerae T7901]
          Length = 888

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 168/484 (34%), Positives = 249/484 (51%), Gaps = 50/484 (10%)

Query: 10  CFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLV 69
              L++A L+F+  + D N    PV        S+         + D++L    RV DLV
Sbjct: 11  ILGLTLASLLFTGCSPDNNPVPKPV--------SERSTANEQPAYMDTTLDIDTRVDDLV 62

Query: 70  SRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVI 129
           SRM L EK+ Q+ + +  +  LG+ +Y+WW+EALHGV+  G           AT FP  I
Sbjct: 63  SRMDLAEKISQMYNESPAIEHLGIAEYDWWNEALHGVARAG----------KATVFPQAI 112

Query: 130 LTTASFNESLWKKIGQAVSTEARAMYN--------LGRAGLTYWSPNINVARDPRWGRIT 181
              A ++      I +AVS EARA ++            GLT+WSPNIN+ RDPRWGR  
Sbjct: 113 GMAAMWDRETMFDIAEAVSDEARAKHHYFVENGVHFRYTGLTFWSPNINIFRDPRWGRGQ 172

Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
           ET GEDP++ G  A+ Y+ GLQ           N + LK ++  KH+A   V +     R
Sbjct: 173 ETYGEDPYLTGELALPYISGLQGE---------NPKYLKTAAMAKHFA---VHSGPEKSR 220

Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
           +  +   + +D+ ET+L  FE  V EGD  SVMC+YNRVN  P+C +  LL +T+RG+W 
Sbjct: 221 HSDNYIASPKDLNETYLPAFEKAVVEGDVESVMCAYNRVNDEPACGNDMLLKETLRGKWG 280

Query: 302 LHGYIVADCDSI-QVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGN---AVQQ 357
             G++V+DC +I          +  +   A A  +++G DL+CG    +   N   A+Q+
Sbjct: 281 FKGHVVSDCGAIADFYAPEAHHVVMAPAAAAAWAVRSGTDLNCGTDRLSTFANLHFALQR 340

Query: 358 GKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVL 415
             + + +ID+S+K L     +LG FD   Q  Y  +    + S  ++ L  +AA +  VL
Sbjct: 341 EMITQDEIDQSVKRLMKTRFKLGMFDPDDQVPYSKIPMDVVGSQAHLALTQKAAEKSFVL 400

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTY 472
           LKN    LPL   K   VA++GP+A     ++GNY G P + ++P+ G   Y    NV Y
Sbjct: 401 LKN-SGILPLK--KSSKVAIIGPNATNPTVLVGNYFGDPIKPVTPLDGIQQYLGEENVFY 457

Query: 473 KTGC 476
             G 
Sbjct: 458 APGS 461



 Score =  108 bits (270), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 90/281 (32%), Positives = 128/281 (45%), Gaps = 48/281 (17%)

Query: 503 LAGLDLSVEAESLD---REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
           L G ++SVE E  D   R D+ LP  Q +L+  + ++ K P++LV  S  G  IA    N
Sbjct: 634 LEGEEMSVEIEGFDHGDRTDIRLPEPQRKLLATLKKLNK-PIVLVNFS--GSAIALNWAN 690

Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
            N+ AIL   YPGE  G A+A +++G+ +P GRLPIT+Y         L  +P       
Sbjct: 691 NNVDAILQGFYPGEATGTALARILWGEVSPSGRLPITFYRS-------LDDLP--GFKDY 741

Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
               RTYK+Y G  LYPFGYGLSYTQF Y+ LS   T       +     L  T+  S  
Sbjct: 742 AMTNRTYKYYQGDVLYPFGYGLSYTQFAYSELSAPAT-------MASGEPLAITAQVS-- 792

Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
                                  N G     +VV VY        +   +++  F+R+++
Sbjct: 793 -----------------------NSGKVASDEVVQVYVSMKVPGLSLPQRELKEFKRIYL 829

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
             G ++ ++F   A K L+ VD         G  T+ VG G
Sbjct: 830 EPGASQTVEFSI-AGKDLSYVDDQGVRHPYHGPLTLSVGGG 869


>gi|313203744|ref|YP_004042401.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312443060|gb|ADQ79416.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 1286

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 153/425 (36%), Positives = 232/425 (54%), Gaps = 29/425 (6%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           ++ ++S  +  R  DL+SR+TL+EK   LG+    +PRLG+     WSEALHG+     G
Sbjct: 32  IYLNTSYSFEERAADLISRLTLEEKESLLGNSMAAIPRLGIKSMNVWSEALHGILG---G 88

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
            +    I G TSFP  +   ++++ +L ++   A++ EARA+   G  GLTYWSP +   
Sbjct: 89  ANQSVGISGPTSFPNSVALGSAWDPALMQREAMAIADEARAINQTGTKGLTYWSPVVEPI 148

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           RDPRWGR  E+ GEDPF+    A  +VRG+       + T L S P     C KHY A  
Sbjct: 149 RDPRWGRTGESYGEDPFLAAEIAGGFVRGMV----GNDPTYLKSVP-----CAKHYFA-- 197

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
             N    DR+   + +  +DM E +L P++  +++ +  S+M SYN VNG+P+ A    L
Sbjct: 198 --NNSEFDRHVSSSNMDSRDMREFYLAPYKKLIEQDNLPSIMSSYNAVNGVPTSASQLYL 255

Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
           +   R  + L GYI  DC +I+ +   H ++  + E+A A+ LKAG+D DCG  Y  +  
Sbjct: 256 DTIARRTYGLKGYITGDCAAIEDIYTGHYYV-KTAEEATAKGLKAGVDSDCGSIYQRYAI 314

Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAR 410
            A+++G +   DID++L  ++ V MR G FD   +  Y       + S  N  LA E A 
Sbjct: 315 AALKKGLITMADIDRALLNIFIVRMRTGEFDPPAKVLYAQFQPNIVNSPANKALAKEIAT 374

Query: 411 EGIVLLKN------DQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR--YMSPIA 462
           +  VLLKN      ++  LPLN A +K +A++GPHA+     +G Y+G P +   ++P A
Sbjct: 375 KTPVLLKNNISLKTNRKALPLNPADLKKIALIGPHADK--VELGPYSGRPAQENMITPFA 432

Query: 463 GFSGY 467
           G   Y
Sbjct: 433 GIKKY 437



 Score =  122 bits (306), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 89/265 (33%), Positives = 125/265 (47%), Gaps = 40/265 (15%)

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           G D     E  DR  L LPG Q +LI  VA V     I+V+ + G V++   +   NI  
Sbjct: 619 GTDEKTATEEADRLTLLLPGNQVELIKAVAAVNPN-TIVVMQTLGCVEVEEFKNLQNIPG 677

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP-LTSMPLRPVDSLGYPG 623
           I+W GY G+  G AIA V+FG+ NPGG+L  TWY    V+ LP +T   LR  +  G  G
Sbjct: 678 IIWVGYNGQAQGDAIASVLFGEVNPGGKLNGTWYKS--VKDLPEITDYTLRGGN--GKNG 733

Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
           RT+ +++    Y FG+G+SYT F+Y+    +K                            
Sbjct: 734 RTFWYFDKDVSYEFGFGMSYTTFEYSNFRISK---------------------------- 765

Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY--IKQVIGFQRVFVRA 741
              N +   D     VD +N G  +G +V+ VY K P   A+    IK++ GF+RV + A
Sbjct: 766 ---NSIIPHDKITVSVDVKNTGKVEGDEVIQVYMKTPDSPASLQRPIKRLKGFKRVTLPA 822

Query: 742 GRNKRIKFVFNACKSLNIVDYAANT 766
           G+ K +    N C  L   D   NT
Sbjct: 823 GQTKTVNIDIN-CADLWFWDMDKNT 846


>gi|373951852|ref|ZP_09611812.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373888452|gb|EHQ24349.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 871

 Score =  263 bits (673), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 152/419 (36%), Positives = 225/419 (53%), Gaps = 34/419 (8%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           Q S + + +  L ++ RV DLV RMTL+EKV Q+ + +  +PRL +P Y+WW+E LHGV+
Sbjct: 22  QTSDYPYQNYHLDFTTRVNDLVKRMTLEEKVSQMLNSSPAIPRLKIPAYDWWNEVLHGVA 81

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA------- 160
                T F       T +P  I   A+F+     ++    + E RA++N           
Sbjct: 82  R----TPFK-----VTVYPQAIAMAATFDRQSLNQMADYAALEGRAVHNKALQMRKPGEK 132

Query: 161 --GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
             GLTYW+PNIN+ RDPRWGR  ET GEDPF+ G     +V GLQ  +          + 
Sbjct: 133 YLGLTYWTPNINIFRDPRWGRGQETYGEDPFLTGAMGSAFVSGLQGND---------PKY 183

Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
           LK ++C KHYA   V +     R+ F+A ++  D+ +T+L  F+  V +   + VMC+YN
Sbjct: 184 LKAAACAKHYA---VHSGPEPLRHVFNADISTYDLWDTYLPAFKKLVVDDKVAGVMCAYN 240

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
                P C    L+   +R +W   GY+ +DC  I     NHK  A + EDA    +  G
Sbjct: 241 AFKTQPCCGSDLLMVDILRNQWKFSGYVTSDCGGIDDFFKNHKTHA-TAEDASTDAVLHG 299

Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLGKQDI 396
            D++CG         AV++GK+ ET ID S+K L+ +  RLG FD S   +Y       +
Sbjct: 300 TDIECGTDAYKSLVAAVKEGKISETQIDISVKRLFMIRFRLGMFDPSDVVKYAQTPVSVL 359

Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
            S E+   A + AR+ +VLLKN  +TLPL S  ++ + V+GP+A+  +A++GNY G P 
Sbjct: 360 ESPEHQAHALKMARQSVVLLKNANHTLPL-SKTIRKIVVLGPNADNPIAILGNYNGTPS 417



 Score =  120 bits (300), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 96/335 (28%), Positives = 142/335 (42%), Gaps = 70/335 (20%)

Query: 466 GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL---------- 515
           G AN+ +  G           + A  +    ADA + + G+   +E E +          
Sbjct: 577 GKANIRFSAGN-----YKKTDVAALVKRVADADAIVYVGGISPQLEGEEMQVNYPGFNGG 631

Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
           DR  + LP  QT L+  +    K PV+ V+M+  G  +A      NI AI+ A Y G+  
Sbjct: 632 DRTSIQLPAAQTNLMKTLQATGK-PVVFVMMT--GSALATPWEAENIPAIVNAWYGGQAA 688

Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLY 635
           G A+ADV+FG +NP GRLP+T+Y  D       T +P           RTY+++ G  LY
Sbjct: 689 GTAVADVLFGDYNPAGRLPVTFYKSD-------TDLP--DFTDYSMTNRTYRYFKGIPLY 739

Query: 636 PFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYF 695
            FGYGLSYTQFKY+ L    T+                                +     
Sbjct: 740 GFGYGLSYTQFKYDKLIVPATV--------------------------------KSGKAI 767

Query: 696 EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFN--- 752
              V   N G   G +VV +Y K  ++     +K + GF RV+++AG  + + F+ +   
Sbjct: 768 HLSVTVTNSGQIAGDEVVQIYMKHHSQRIKVPLKALKGFARVYLKAGERRTLNFILSPDD 827

Query: 753 -ACKSLN--IVDYAANTLLPAG-----EHTIFVGN 779
            A  S N  +V       + AG     EH +  GN
Sbjct: 828 LAVTSSNGGLVPIKGKITISAGGSQPDEHNVTSGN 862


>gi|237718444|ref|ZP_04548925.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
 gi|229452377|gb|EEO58168.1| glycoside hydrolase [Bacteroides sp. 2_2_4]
          Length = 746

 Score =  263 bits (672), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 220/777 (28%), Positives = 362/777 (46%), Gaps = 124/777 (15%)

Query: 56  DSSLPYSIR----VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG- 110
           +S LP++      VKDL+SRMT++EK+ QL  +  G   L  P+ E+ S++L     VG 
Sbjct: 25  NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 83

Query: 111 -------------------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIG 144
                                    P     DVI G  T FPT +  + S++ +  ++  
Sbjct: 84  VLNISGAKTLRDLQEKNMRYSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 143

Query: 145 QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
           +  + E+ A      AGL + ++P +++ARD RWGR+ E  GED ++    A   V G Q
Sbjct: 144 KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 197

Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
                 N  + NS    V +C KH+ AY +       R +    ++E+ + +T+L PF+ 
Sbjct: 198 -----WNLWENNS----VLACAKHWVAYGLPQ---AGRDYAPVDMSERTLFDTYLPPFKA 245

Query: 264 CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
           C+  G   + M ++N +NGIP+ A P LL   +RG+W+ +G++V+D ++++ +V   + +
Sbjct: 246 CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLV--AQGV 302

Query: 324 ADSKEDAVAQTLKAGLDLDCGQ-YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
           A+  +DA      +G+D+D     Y  +    ++ GK+   D+D S+  +  +   LG F
Sbjct: 303 AEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 362

Query: 383 DGSPQYVS--LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
               ++ +     Q I   E ++ A + A +  VLLKND +TLPL +  V+++AVVGP A
Sbjct: 363 VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 421

Query: 441 NATVAMIGNY-AGIPCRYMSPIAGFSGYAN--------VTYKTGCDDVACKSNNSIFAAS 491
           +    ++G++ A    R+++ +    G  N        V Y  GC D   +  +    A 
Sbjct: 422 DNQTELLGSWRARGEDRHVTTV--LQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAV 478

Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
           + A  +D  I + G    +  ES  R  L LPG Q +LI ++    K PV++V+M+   +
Sbjct: 479 KLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPL 537

Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT----------WYNGD 601
            I +   + N+ AIL   + G   G AIAD++FG +NP GRL I+          +YN  
Sbjct: 538 SIEW--VDKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPVYYN-- 593

Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
           Y +      MP           R     N P LYPFGYGLSYT F Y++   T+      
Sbjct: 594 YKKSGRPGDMPHSSTT------RHIDVPNAP-LYPFGYGLSYTTFSYSVPQSTQK----- 641

Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
                     YT   +                     V   N G  DG + V +Y     
Sbjct: 642 ---------EYTRQET-----------------ISVSVTVTNTGDRDGEETVQLYVNDKV 675

Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                 +K++  F+++F++AG +K ++F  +   +L   D A N ++  GE  I  G
Sbjct: 676 ASVVRPVKELKAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 731


>gi|423223721|ref|ZP_17210190.1| hypothetical protein HMPREF1062_02376 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638096|gb|EIY31949.1| hypothetical protein HMPREF1062_02376 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 954

 Score =  263 bits (672), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 226/760 (29%), Positives = 356/760 (46%), Gaps = 111/760 (14%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHG 105
           + +S  + D +LP   RV+ L+S MT ++K++ +  G    G+P L +P      EA+HG
Sbjct: 164 EKTSLRYMDPTLPVEERVESLLSVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHG 222

Query: 106 VSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYW 165
            S             GAT FP  +   A++N+ L + +  AV  E      L    +  W
Sbjct: 223 FSYGS----------GATIFPQALAMGATWNKKLTEDVAMAVGDE-----TLAAGTMQAW 267

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           SP ++VA+D RWGR  ET GEDP +V +    +++G Q       +  L + P       
Sbjct: 268 SPVLDVAQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ-------SKGLFTTP------- 313

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+  +      G D +  D  ++E++M E  L PF   ++  D  SVM +Y+   G+P 
Sbjct: 314 KHFGGHGAP-LGGRDSH--DIGLSEREMREVHLVPFRHVIRNYDCQSVMMAYSDYLGVPV 370

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
               +LL+  +R EW   G+IV+DC +I  +     + A  K +A  Q L AG+  +CG 
Sbjct: 371 AKSRELLHSILREEWGFDGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGD 430

Query: 346 YYTNF-TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC----SDE 400
            Y +     A + G++   ++D+  + +  ++ R   F+ +P    L    I     SD 
Sbjct: 431 TYNDKEVIQAAKDGRINMENLDEVCRTMLRMMFRNELFEKTPNK-PLDWNKIYPGWNSDS 489

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCRYM 458
           + E+A +AARE IV+L+N  N LPL +  ++T+AVVGP A+      G+Y    +P +  
Sbjct: 490 HKEMARQAARESIVMLENKDNILPL-AKDMRTIAVVGPGADDLQP--GDYTPKLLPGQLK 546

Query: 459 SPIAGFS----GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA-- 512
           S + G          V Y+ GC D    +   I  A +AA  +D  +++ G   + E+  
Sbjct: 547 SVLTGIKQAVGKQTKVVYEQGC-DFTSSNGTDIPKAVKAASQSDVVVLVLGDCSTSESTT 605

Query: 513 -------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
                  E+ D   L LPG Q +L+  V    K PVIL++ +  G     ++ +   KAI
Sbjct: 606 DVYKTSGENHDYATLILPGKQQELLEAVCATGK-PVILILQA--GRPYNLSKASELCKAI 662

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           L    PG+EGG A ADV+FG +NP GRLP+T+    +V  LPL         +    GR 
Sbjct: 663 LVNWLPGQEGGPATADVLFGDYNPAGRLPMTFPR--HVGQLPLYY-------NFKTSGRR 713

Query: 626 YKFYNGP--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
           Y++ +     LY FGYGLSYT F+Y+ L           K+Q   N N    A+      
Sbjct: 714 YEYSDMEFYPLYYFGYGLSYTSFEYSGL-----------KIQEKDNGNVAIQAT------ 756

Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
                             +NVG   G +VV +Y         T I ++  F RV ++   
Sbjct: 757 -----------------VKNVGQRAGDEVVQLYITDMYASVKTRITELKDFTRVHLQPDE 799

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVS 783
           +K + F     + L++++   + ++  GE  I V  GGVS
Sbjct: 800 SKIVSFELTPYE-LSLLNDRMDRVVEKGEFKILV--GGVS 836


>gi|86143269|ref|ZP_01061671.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
 gi|85830174|gb|EAQ48634.1| beta-glucosidase precursor [Leeuwenhoekiella blandensis MED217]
          Length = 873

 Score =  263 bits (672), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 160/424 (37%), Positives = 235/424 (55%), Gaps = 36/424 (8%)

Query: 51  SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG 110
            F F +  L    R+ DLVSRMTL+EK+ QL   A  + RL +P+Y WW+E+LHGV+  G
Sbjct: 23  QFPFQNEQLDLETRLNDLVSRMTLEEKISQLMSDAPAIERLNIPKYNWWNESLHGVARAG 82

Query: 111 PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN--LGR------AGL 162
                      AT FP  I   AS++  L +++  A+S EARA ++  L R       GL
Sbjct: 83  ----------YATVFPQSISIAASWDAQLVREVATAISDEARAKHHEYLRRDQHDIYQGL 132

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T WSPNIN+ RDPRWGR  ET GEDPF+ G     YV+GLQ           +   LKV 
Sbjct: 133 TMWSPNINIFRDPRWGRGHETYGEDPFLTGTLGAQYVKGLQGD---------DPEYLKVV 183

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           +  KH+A   V +     R++FDA  +E+D+ ET+L  F M VK+    SVM +YNR  G
Sbjct: 184 ATAKHFA---VHSGPEESRHYFDANTSERDLWETYLPAFRMLVKDAQVQSVMTAYNRFRG 240

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
             + ++ KLL   +R +W   GY+V+DC +I  + ++HK +      A A  L+ G DL+
Sbjct: 241 EAASSN-KLLFDILRNKWGFDGYVVSDCGAINDIWEDHK-ITADAASASALALETGTDLN 298

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDE 400
           CG  Y +    A+  G + E  I+ +++ L+   ++LG FD      Y ++      +  
Sbjct: 299 CGATYKSLK-EAIANGLITEEKINIAIERLFRARLKLGMFDTEENLSYATIPFSVNTNAS 357

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
           +  LA +AA+E IVLLKN+ + LPL S  +K +AV+GP+A+   ++ GNY G P   ++ 
Sbjct: 358 HTALARKAAQESIVLLKNEAHMLPL-SKDLKQIAVIGPNAHNVQSLWGNYNGTPKNPVTV 416

Query: 461 IAGF 464
           + G 
Sbjct: 417 VQGI 420



 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 101/313 (32%), Positives = 155/313 (49%), Gaps = 59/313 (18%)

Query: 480 ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQL 529
           +    N +  A   A+ +D TI++ GL+  +E E +          DR  L LP  Q +L
Sbjct: 582 STPEKNKLERAVNLAEDSDVTILVLGLNERLEGEEMRIDVEGFSKGDRTALDLPLEQREL 641

Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
           +  +    K P++LV+++   + I +A+ +  + AIL AGYPG+EGG AIADV+FG +NP
Sbjct: 642 MRALVATGK-PIVLVLLNGSALAINYAQEH--VPAILSAGYPGQEGGNAIADVLFGDYNP 698

Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYN 649
            GRLP+T+Y    V  LP         +     GRTY+++ G  LYPFGYGLSYTQF Y 
Sbjct: 699 AGRLPVTYYKS--VDDLP-------DFEDYSMKGRTYRYFEGEALYPFGYGLSYTQFSY- 748

Query: 650 LLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDG 709
                                    DA KT         L  D     +V   N G  DG
Sbjct: 749 -------------------------DAIKTS------GRLAADKVLNVQVTVTNSGDRDG 777

Query: 710 SDVVIVYSKPPAEIAATYIKQV--IGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
            +VV +Y K   E+A+T   QV  +GF+R+ ++ G  + ++F  +A +  ++++     +
Sbjct: 778 DEVVQLYLKD--EVASTTRPQVQLVGFKRIHLQKGETQTVEFRLDA-RQFSMINDQEQLV 834

Query: 768 LPAGEHTIFVGNG 780
           +  G  T++ G G
Sbjct: 835 VEPGWFTLYAGGG 847


>gi|395492941|ref|ZP_10424520.1| glycoside hydrolase family protein [Sphingomonas sp. PAMC 26617]
          Length = 865

 Score =  263 bits (672), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 168/443 (37%), Positives = 240/443 (54%), Gaps = 49/443 (11%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           L+ D   P   RV DL+ RMTL+EK  Q+ + A  +PRLG+P Y++W+EALHGV+  G  
Sbjct: 13  LYFDPGQPIEARVDDLMRRMTLEEKAAQMQNVAPAIPRLGIPPYDYWNEALHGVARAGE- 71

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTY 164
                    AT FP  I   A+++  +    GQ V+TE RA YN  +A        GLT+
Sbjct: 72  ---------ATVFPQAIGMAATWDRDMMLAEGQTVATEGRAKYNQAQAQKNYDRYYGLTF 122

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSC 224
           WSPNIN+ RDPRWGR  ET GEDP++ G  AV +V G+Q        TD N   LK  + 
Sbjct: 123 WSPNINIFRDPRWGRGQETLGEDPYLTGTMAVPFVHGVQ-------GTDANY--LKAIAT 173

Query: 225 CKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
            KH+A   V +     R+ F+   + +D+ ET+L  F   + +G A S+MC+YN V+   
Sbjct: 174 PKHFA---VHSGPEQLRHQFNVDPSPRDLSETYLPAFRRAIVDGRAESLMCAYNAVDTKA 230

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG 344
           +CA+  LL  T+RG W   G++ +DC +I  +   H     + E A A  +KAG D  C 
Sbjct: 231 ACANTMLLKDTLRGAWGFKGFVTSDCGAIDDITTGHHNSPTNPEGA-ALAVKAGTDTGC- 288

Query: 345 QYYTNFTGN------AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDI 396
               +F         AV+ G + E D+D +L+ L+T  M+LG FD + +  + ++   + 
Sbjct: 289 ----DFKDEMLDLPRAVKAGYLTEGDMDVALRRLFTARMKLGMFDPAARVPFSTISIAEN 344

Query: 397 CSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCR 456
            S  +  LA  AARE IVLLKND   LPL +A  + +AVVGP A + +A+ GNY G P  
Sbjct: 345 HSPAHRALALRAARESIVLLKND-GVLPL-AAGARRIAVVGPTAASLIALEGNYNGTPVG 402

Query: 457 YMSPIAGFS---GYANVTYKTGC 476
            + P+ G +   G   + Y  G 
Sbjct: 403 AVLPVDGMTAAFGADRIVYAQGS 425



 Score =  103 bits (257), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 85/286 (29%), Positives = 127/286 (44%), Gaps = 55/286 (19%)

Query: 505 GLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
           GL+  +E E +          DR  + LP  Q+QL++ +    K P+++V+ S  G  IA
Sbjct: 602 GLNAWLEGEEMPLQVPGFAGGDRTAIALPAAQSQLLDALFATGK-PLVIVLQS--GSAIA 658

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
                   +A+L A YPGE GG+AIA+V+ G  NP GRLP+T+Y       LP       
Sbjct: 659 LGAQEAKARAVLEAWYPGEAGGQAIAEVLSGTVNPSGRLPVTFYAS--TDQLPA------ 710

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
             D      RTY+++ G   YPFG+GLSYT+F Y+ L                      +
Sbjct: 711 -FDDYRMANRTYRYFAGRVEYPFGHGLSYTRFAYSALR--------------------PA 749

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
            +S     G  V+           V  +N G   G +V  +Y   P    A  I+ + G+
Sbjct: 750 TSSVAAGQGTSVS-----------VAVRNTGVLAGDEVAQLYLSVPGREGAP-IRSLKGY 797

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNG 780
           QRV + AG  K + F     + L + + A    +    + I+VG G
Sbjct: 798 QRVHLAAGETKTLTFALEP-RDLALANAAGAMAVTKATYQIWVGGG 842


>gi|299146513|ref|ZP_07039581.1| beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298517004|gb|EFI40885.1| beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 736

 Score =  263 bits (671), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 214/726 (29%), Positives = 341/726 (46%), Gaps = 124/726 (17%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    EA HG   +G            T FPT I   A+++  L K++GQ ++ 
Sbjct: 83  RLGIPMF-LAEEAPHGHMAIG-----------TTVFPTGIGMAATWSPELVKEVGQVIAK 130

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R+     + G   + P +++ RDPRW R+ ET GEDP + G    + V GL       
Sbjct: 131 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGL------- 178

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
              +L+ +   +++  KH+ AY V        Y   A V  +D+ + FL PF   +  G 
Sbjct: 179 GGGNLSQKYATIATL-KHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDAG- 233

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           A SVM SYN ++GIP  ++  LL + +R EW   G++V+D  SI+ + ++H F+A +KE+
Sbjct: 234 ALSVMTSYNSIDGIPCTSNHYLLTKLLRNEWKFRGFVVSDLYSIEGIHESH-FVAPTKEN 292

Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
           A  Q++ AG+D+D G   YTN   +AVQ G++ +T ID ++  +  +   +G F+     
Sbjct: 293 AAIQSVMAGVDVDLGGDAYTNLC-HAVQSGQMDKTVIDTAVCRVLRMKFEMGLFEHPYVD 351

Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
             +  + +   E+IELA + A+  I LLKN+ + LPL S  +  VAV+GP+A+    M+G
Sbjct: 352 PKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPL-SKMINKVAVIGPNADNRYNMLG 410

Query: 449 NYA-------------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAK 495
           +Y              GI  + +SP       + V Y  GC  +   + N I  A EAA+
Sbjct: 411 DYTAPQEDSNVKTVLDGIITK-LSP-------SRVEYVRGCA-IRDTTVNEIEQAIEAAR 461

Query: 496 TAD----------------------ATIILAGLDLSVE-AESLDREDLWLPGYQTQLINQ 532
            ++                      A +   G    +E  E  DR  L L G Q +L+  
Sbjct: 462 RSEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLES 521

Query: 533 VAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGR 592
           + +  K P+I+V +    ++  +A    +  A+L A YPG+EGG AIADV+FG +NP GR
Sbjct: 522 LQKTGK-PLIVVYIEGRPLEKNWASEYAD--ALLTAYYPGQEGGNAIADVLFGDYNPSGR 578

Query: 593 LPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS 652
           LPI+      V  +P+      P +        Y   +   LY FGYG+SYT F+Y+ L 
Sbjct: 579 LPISVPRS--VGQIPVYYNQKAPRN------HDYVEVSSSPLYSFGYGMSYTTFEYSDLQ 630

Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDV 712
                                          V+    RC   FE     +N G  DG +V
Sbjct: 631 -------------------------------VVQKSARC---FEVSFKVKNTGKYDGEEV 656

Query: 713 VIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
             +Y +         +KQ+  F+R  ++ G  K++ FV    +   +V+Y    ++ +G 
Sbjct: 657 SQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDFFLVNYTLKKVVESGN 715

Query: 773 HTIFVG 778
             + +G
Sbjct: 716 FHLMIG 721


>gi|423289665|ref|ZP_17268515.1| hypothetical protein HMPREF1069_03558 [Bacteroides ovatus
           CL02T12C04]
 gi|423298158|ref|ZP_17276217.1| hypothetical protein HMPREF1070_04882 [Bacteroides ovatus
           CL03T12C18]
 gi|392663699|gb|EIY57246.1| hypothetical protein HMPREF1070_04882 [Bacteroides ovatus
           CL03T12C18]
 gi|392667376|gb|EIY60886.1| hypothetical protein HMPREF1069_03558 [Bacteroides ovatus
           CL02T12C04]
          Length = 955

 Score =  263 bits (671), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 223/748 (29%), Positives = 353/748 (47%), Gaps = 107/748 (14%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + D+SLP   RV+ L++ MT ++K++ +  G    G+P L +P      EA+HG S    
Sbjct: 171 YMDASLPVEERVESLLAVMTPEDKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 228

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
                    GAT FP  +   A++N  L +++   +  E   + N  +A    WSP ++V
Sbjct: 229 ---------GATIFPQALAMGATWNRKLTEEVAMVIGDET-VVANTKQA----WSPVLDV 274

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
           A+D RWGR  ET GEDP +V +    +++G Q            SR L  +   KH+  +
Sbjct: 275 AQDARWGRCEETFGEDPVLVSQIGGAWIKGYQ------------SRGLFTTP--KHFGGH 320

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
                 G D +  D  ++E++M E  L PF   V+  D  S+M +Y+   GIP     +L
Sbjct: 321 GAP-LGGRDSH--DIGLSEREMREVHLVPFRHVVRNYDCQSLMMAYSDYMGIPVAGSTEL 377

Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF- 350
           L Q +R EW  +G+IV+DC +I  +     + A  K +A  Q L AG+  +CG  Y +  
Sbjct: 378 LQQILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNDKE 437

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-QYVSLGK--QDICSDENIELAAE 407
              A + G++   ++D   + +   + R   F+ +P + +   K      SD + E+A +
Sbjct: 438 VIQAAKDGRINMVNLDNVCRTMLATMFRNELFEKNPCKPLDWNKIYPGWNSDRHREMARQ 497

Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI--PCRYMSPIAGFS 465
           AARE IV+L+N  N LPL S  +KT+AV+GP A+      G+Y     P +  S ++G  
Sbjct: 498 AARESIVMLENKDNLLPL-SKTLKTIAVLGPGADDLQP--GDYTPKLQPGQLKSVLSGIK 554

Query: 466 G----YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA--------- 512
                   V Y+ GCD     + N I  A +AA  +D  +++ G   + EA         
Sbjct: 555 AAVGKQTKVLYEQGCDFTTPDATN-IPKAVKAASQSDVVVMVLGDCSTSEATNNVRKTCG 613

Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
           E+ D   L LPG Q +L+  V    K PV+L++ +    D+  A  +   KAIL    PG
Sbjct: 614 ENNDWATLILPGKQQELLEAVCATGK-PVVLILQAGRPYDLLKA--SEMCKAILVNWLPG 670

Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP 632
           +EGG A ADV+FG +NPGGRLP+T+    +V  LPL         +    GR Y++ +  
Sbjct: 671 QEGGPATADVLFGDYNPGGRLPMTFPR--HVGQLPLYY-------NFKTSGRRYEYVDME 721

Query: 633 --TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
              LY FGYGLSYT F+Y+ L           K+Q   N N    A+             
Sbjct: 722 FYPLYRFGYGLSYTSFEYSDL-----------KIQEKSNGNVMVQAT------------- 757

Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
                      +NVG   G +V  +Y         T + ++  F R+ ++ G +K + F 
Sbjct: 758 ----------VKNVGGCAGDEVAQLYITDMYASVKTRVMELKDFTRIHLQPGESKNVSFE 807

Query: 751 FNACKSLNIVDYAANTLLPAGEHTIFVG 778
                 +++++   + ++  GE  + VG
Sbjct: 808 LTPY-DISLLNDRMDRVVEKGEFKVMVG 834


>gi|260642727|ref|ZP_05417108.2| periplasmic beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260620819|gb|EEX43690.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 768

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 211/731 (28%), Positives = 345/731 (47%), Gaps = 104/731 (14%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVP----------RLGLPQYEWWSEALHGVSNVG--- 110
           +V+ L+ +MTL+EK+ Q+   +   P           +G        E ++ +  +    
Sbjct: 53  KVEALLDKMTLEEKLGQMNQLSPWDPNELANKVRNGEIGSILNYMNPEEVNKIQKIAMEE 112

Query: 111 -----PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY 164
                P     DVI G  T FP  +   A+FN  + +   +  + EA A       G+ +
Sbjct: 113 SRLGIPLLVSRDVIHGYKTIFPIPLGQAATFNPQIVENGARVAAIEASA------DGIRW 166

Query: 165 -WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSS 223
            ++P I+++RDPRWGRI E+ GEDP++     V  ++G Q          LNS P  +++
Sbjct: 167 TFAPMIDISRDPRWGRIAESCGEDPYLTSVMGVAMIKGFQ-------GDSLNS-PTSMAA 218

Query: 224 CCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGI 283
           C KH+ AY     +G   Y+    + E+ +   +L PF+  V  G  ++ M S+N  +G+
Sbjct: 219 CAKHFVAYGAS--EGGKDYN-STFIPERVLRNVYLPPFKAAVDAG-CATFMTSFNDNDGV 274

Query: 284 PSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD- 342
           PS A+  +L   +R EW   G +V D  S   M+ NH F AD KE A  +++ AG+D+D 
Sbjct: 275 PSTANKFVLKDILRDEWKYDGMVVTDWASAAEMI-NHGFCADGKE-AAEKSVNAGVDMDM 332

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
             + +      ++ + KV    ID +++ +  +  R+G F+    Y+   +    ++E++
Sbjct: 333 VSETFIKNLKQSLAENKVSIESIDDAVRNILRLKYRMGLFENP--YIVTPQNVKYAEEHL 390

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYMSP 460
           ++A EA  + ++LLKND  TLPL + K++TVAVVGP A+A    +G +   G      +P
Sbjct: 391 KIAKEAVEQSVILLKNDTQTLPLTN-KIRTVAVVGPMADAPYEQMGTWVFDGEKDHTQTP 449

Query: 461 IAG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLD 516
           +      +    NV ++        K+ N I  A  AA+ AD  +   G +  +  E+  
Sbjct: 450 LKAIREMYGDQVNVIFEPALGYSRDKNLNGIAKAVNAARHADVVLAFVGEEAILSGEAHS 509

Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
             +L L G Q+QLI  ++   K P++ ++M+  G  +  A       A+L+A +PG  GG
Sbjct: 510 LANLNLQGAQSQLIQALSTTGK-PLVTIVMA--GRQLTIASEVEASDAVLYAFHPGTMGG 566

Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPVDSL-----GYPGRT 625
            AIAD++FGK NP  + P+T+        +P+      T  P  P + L        G+T
Sbjct: 567 PAIADILFGKVNPSAKTPVTFPR--MTGQVPIYYAHNSTGRPANPKEMLIDEIPVEAGQT 624

Query: 626 ----YKFY---NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASK 678
                 FY       LYPFGYGLSYT F+Y+                   NL  TSD   
Sbjct: 625 SVGCRSFYLDAGASPLYPFGYGLSYTTFEYS-------------------NLKLTSD--- 662

Query: 679 TRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVF 738
                     L  +      VD +N G  DG++VV +Y +         +K++  FQRV 
Sbjct: 663 ---------KLAINGEISVTVDLKNTGKYDGTEVVQLYIQDKVGSVTRPVKELKAFQRVE 713

Query: 739 VRAGRNKRIKF 749
           ++AG +K + F
Sbjct: 714 LKAGESKNVSF 724


>gi|150003731|ref|YP_001298475.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
 gi|319640047|ref|ZP_07994774.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
 gi|345517061|ref|ZP_08796539.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|149932155|gb|ABR38853.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Bacteroides vulgatus ATCC 8482]
 gi|254833833|gb|EET14142.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp.
           4_3_47FAA]
 gi|317388325|gb|EFV69177.1| glycoside hydrolase family 3 [Bacteroides sp. 3_1_40A]
          Length = 864

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 168/454 (37%), Positives = 233/454 (51%), Gaps = 42/454 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + DSSL    R +DL+ ++TL+EKV  + D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 24  YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   ASF       I  AVS EARA      A        GLT W
Sbjct: 82  --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMW 133

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +P +N+ RDPRWGR  ET GEDP++     VN V+GLQ         D N +  K+ +C 
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ-------CMDANQKYDKIHACA 186

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ F+A  +  +D+ ET+L PFE  VKE     VMC+YNR+ G P
Sbjct: 187 KHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDP 243

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +R +W   G +++DC +I        HK   D+ E A A  + +G DL+
Sbjct: 244 CCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDA-ESASAAAVLSGTDLE 302

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK---QDICSD 399
           CG  Y     +A ++G + E DID S+K L      LG  D  P  V   K     +CS 
Sbjct: 303 CGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPDKVEWTKIPYSVVCSA 360

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           E+  L+ + AR+ + LL N  N LPL     +T+AV+GP+AN +V   GNY G P   ++
Sbjct: 361 EHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTIT 419

Query: 460 PIAGFSGYA----NVTYKTGCDDVACKSNNSIFA 489
            + G          + Y+ GC  V      S+F+
Sbjct: 420 LLEGIRSAMGENDKLIYEQGCSWVERSLIRSVFS 453



 Score =  129 bits (324), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 104/326 (31%), Positives = 145/326 (44%), Gaps = 62/326 (19%)

Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
           FSG A + +     D+  K   +I       K AD  I   G+  S+E E +        
Sbjct: 574 FSGDAQLNF-----DLGFKEEVNIKNTVAKVKDADVVIFAGGISPSLEGEEMGVNLPGFR 628

Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
             DR D+ LP  Q +LI  + +  K  VI V  S  G  IA        +AIL A YPG+
Sbjct: 629 KGDRTDIELPAVQRELIKALCDAGK-KVIFVNFS--GSPIAMEPETKYCQAILQAWYPGQ 685

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
            GG+A A+V+FG +NP GRLP+T+Y         +T +P    +     GRTY+++ G  
Sbjct: 686 SGGKAAAEVLFGDYNPAGRLPVTFYRN-------ITQLP--DFEDYNMTGRTYRYFKGDP 736

Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
           L+PFGYGLSYT F Y  +   +TI+V               + +K   P           
Sbjct: 737 LFPFGYGLSYTTFNYGNIKLEQTIKV--------------GETAKIIVP----------- 771

Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
                    N G+ DG +VV VY K   E A   +K +  F+RV + AG+   ++     
Sbjct: 772 -------VTNTGNRDGEEVVQVYLK-KQEDAEGPVKTLRAFKRVQIPAGKTVNVELELTP 823

Query: 754 CKSLNIVDYAANTLLP-AGEHTIFVG 778
            K L   D   NT+   AG   I VG
Sbjct: 824 -KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|294777452|ref|ZP_06742903.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
 gi|294448520|gb|EFG17069.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           vulgatus PC510]
          Length = 864

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 168/454 (37%), Positives = 233/454 (51%), Gaps = 42/454 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + DSSL    R +DL+ ++TL+EKV  + D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 24  YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   ASF       I  AVS EARA      A        GLT W
Sbjct: 82  --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMW 133

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +P +N+ RDPRWGR  ET GEDP++     VN V+GLQ         D N +  K+ +C 
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ-------CMDANQKYDKIHACA 186

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ F+A  +  +D+ ET+L PFE  VKE     VMC+YNR+ G P
Sbjct: 187 KHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDP 243

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +R +W   G +++DC +I        HK   D+ E A A  + +G DL+
Sbjct: 244 CCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDA-ESASAAAVLSGTDLE 302

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK---QDICSD 399
           CG  Y     +A ++G + E DID S+K L      LG  D  P  V   K     +CS 
Sbjct: 303 CGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPDKVEWTKIPYSVVCSA 360

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           E+  L+ + AR+ + LL N  N LPL     +T+AV+GP+AN +V   GNY G P   ++
Sbjct: 361 EHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTIT 419

Query: 460 PIAGFSGYA----NVTYKTGCDDVACKSNNSIFA 489
            + G          + Y+ GC  V      S+F+
Sbjct: 420 LLEGIRSAMGENDKLIYEQGCSWVERSLIRSVFS 453



 Score =  127 bits (320), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 103/326 (31%), Positives = 143/326 (43%), Gaps = 62/326 (19%)

Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
           FSG A + +     D+  K   +I       K AD  I   G+  S+E E +        
Sbjct: 574 FSGDAQLNF-----DLGFKEEVNIKNTVAKVKDADIVIFAGGISPSLEGEEMGVNLPGFR 628

Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
             DR D+ LP  Q +LI  + +  K  VI V  S  G  IA        +AIL A YPG+
Sbjct: 629 KGDRTDIELPAVQRELIKALCDAGK-KVIFVNFS--GSPIAMEPETKYCQAILQAWYPGQ 685

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
            GG+A A+V+FG +NP GRLP+T+Y           +  L   +     GRTY+++ G  
Sbjct: 686 SGGKAAAEVLFGDYNPAGRLPVTFYR---------NTAQLPDFEDYNMTGRTYRYFKGDP 736

Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
           L+PFGYGLSYT F Y  +   +TI+V               + +K   P           
Sbjct: 737 LFPFGYGLSYTTFNYGNIKLEQTIKV--------------GETAKIIVP----------- 771

Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
                    N G+ DG +VV VY K   E A   +K +  F+RV + AG+   ++     
Sbjct: 772 -------VTNTGNRDGEEVVQVYLK-KQEDAEGPVKTLRAFKRVQIPAGKTVNVELELTP 823

Query: 754 CKSLNIVDYAANTLLP-AGEHTIFVG 778
            K L   D   NT+   AG   I VG
Sbjct: 824 -KQLEWWDAQTNTMRTIAGNFDIMVG 848


>gi|293371439|ref|ZP_06617870.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
            ovatus SD CMC 3f]
 gi|292633636|gb|EFF52194.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
            ovatus SD CMC 3f]
          Length = 1049

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 219/769 (28%), Positives = 362/769 (47%), Gaps = 108/769 (14%)

Query: 56   DSSLPYSIR----VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG- 110
            +S LP++      VKDL+SRMT++EK+ QL  +  G   L  P+ E+ S++L     VG 
Sbjct: 328  NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 386

Query: 111  -------------------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIG 144
                                     P     DVI G  T FPT +  + S++ +  ++  
Sbjct: 387  VLNISGAKTLRDLQEKNMRYSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 446

Query: 145  QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
            +  + E+ A      AGL + ++P +++ARD RWGR+ E  GED ++    A   V G Q
Sbjct: 447  KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 500

Query: 204  DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
                  N  + NS    V +C KH+ AY +       R +    ++E+ + +T+L PF+ 
Sbjct: 501  -----WNLWENNS----VLACAKHWVAYGLPQ---AGRDYAPVDMSERTLFDTYLPPFKA 548

Query: 264  CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
            C+  G   + M ++N +NGIP+ A P LL   +RG+W+ +G++V+D ++++ +V   + +
Sbjct: 549  CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLV--AQGV 605

Query: 324  ADSKEDAVAQTLKAGLDLDCGQ-YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
            A+  +DA      +G+D+D     Y  +    ++ GK+   D+D S+  +  +   LG F
Sbjct: 606  AEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 665

Query: 383  DGSPQYVS--LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
                ++ +     Q I   E ++ A + A +  VLLKND +TLPL +  V+++AVVGP A
Sbjct: 666  VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 724

Query: 441  NATVAMIGNY-AGIPCRYMSPIAGFSGYAN--------VTYKTGCDDVACKSNNSIFAAS 491
            +    ++G++ A    R+++ +    G  N        V Y  GC D   +  +    A 
Sbjct: 725  DNQTELLGSWRARGEDRHVTTV--LQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAV 781

Query: 492  EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
            + A  +D  I + G    +  ES  R  L LPG Q +LI ++    K PV++V+M+   +
Sbjct: 782  KLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPL 840

Query: 552  DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD-YVQMLPLTS 610
             I +   + N+ AIL   + G   G AIAD++FG +NP GRL I++   +  V +     
Sbjct: 841  SIEW--VDKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPVYYNYK 898

Query: 611  MPLRPVD-SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
               RP D       R     N P LYPFGYGLSYT F Y++   T+              
Sbjct: 899  KSGRPGDMPHSSTTRHIDVPNAP-LYPFGYGLSYTTFSYSVPQSTQK------------- 944

Query: 670  LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
              YT   +                     V   N G  DG + V +Y           +K
Sbjct: 945  -EYTRQET-----------------ISVSVTVTNTGDRDGEETVQLYVNDKVASVVRPVK 986

Query: 730  QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
            ++  F+++F++AG +K ++F  +   +L   D A N ++  GE  I  G
Sbjct: 987  ELKAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 1034


>gi|375143423|ref|YP_005005864.1| Beta-glucosidase [Niastella koreensis GR20-10]
 gi|361057469|gb|AEV96460.1| Beta-glucosidase [Niastella koreensis GR20-10]
          Length = 793

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 231/801 (28%), Positives = 350/801 (43%), Gaps = 139/801 (17%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFA--HGVPRLGLPQYEW----WS------ 100
           ++ D     + R  DL+S+MTLDEK  Q+      H V +  LP   W    W       
Sbjct: 42  IYEDPKQSVNARTADLLSKMTLDEKTCQMATLYGWHRVLKDSLPTDSWKNAIWKDGIANI 101

Query: 101 -EALHGVSNVGPGTHFDDV------------------------IPG-------------- 121
            E L+G +  G     D V                        IP               
Sbjct: 102 DEHLNGFAGWGKTAPIDLVKDMEKHVWAMNETQRFFIEQTRLGIPADFTNEGIRGVEAYE 161

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWGRI 180
           AT FPT +    ++N+ L  + G     EARA+      G T  ++P ++VARD RWGR+
Sbjct: 162 ATGFPTELNMGMTWNKELVHQEGIITGREARAL------GYTNVYAPIMDVARDQRWGRL 215

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            E+ GEDP++V    +   +G+Q  +G            KV+S  KH+A Y  +      
Sbjct: 216 EESYGEDPYLVASMGIALAKGIQQ-DG------------KVASTAKHFAVYSANKGAREG 262

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
           +   D +V  +++E   L PF+  +KE     VM SYN  +GIP       L Q +R E 
Sbjct: 263 QARTDPQVAPREVENLLLYPFKKVIKEAGIMGVMSSYNDYDGIPVSGSNYWLIQRLRVEM 322

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL----DCGQYYTNFTGNAVQ 356
              GY+V+D D+++ +   H   A+ KE AV Q   AG+++            +    V+
Sbjct: 323 GFTGYVVSDSDALEYLATKHHVAANLKE-AVFQAFMAGMNVRTTFKAPDSIIIYLRQLVK 381

Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG--KQDICSDENIELAAEAAREGIV 414
           +G++    I+  +  +  V  RLG FD  P   S    ++ + SD + ++A +A+RE +V
Sbjct: 382 EGRIPMDTINHRVADVLRVKFRLGLFD-HPYVESAAETRKVVNSDASQQIALQASRESVV 440

Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS---GYANVT 471
           LLKN+ N LPL  + +  +AVVGP+A        +Y  +    ++ + G     G   V 
Sbjct: 441 LLKNNNNILPLVKS-LDKIAVVGPNATDDDYAHTHYGPLGSPSVNVLQGIQAKLGAGKVL 499

Query: 472 YKTGCDDVACKSNNS--------------IFAASEAAKTADATIILAGLDLSVEAESLDR 517
           Y  G D V      S              + +A    K A   I++ G +     ES  R
Sbjct: 500 YAKGVDLVDKNWPESEILPEPMDAGEQAMLDSAVNITKQAQMAIVVLGGNTRTAGESKSR 559

Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
            DL LPG+Q +L+  +    K PV++V++    + I +   +  I  I++AGYPG +GG 
Sbjct: 560 TDLDLPGHQLELVKAIKATGK-PVVVVLLGTQPMTINW--IDKYIDGIVYAGYPGVKGGI 616

Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
           A+ADV+FG +NPGG+L +TW     V  +PL + P +P  +    G   K      LYPF
Sbjct: 617 AVADVLFGDYNPGGKLTLTWPKS--VGQIPL-NFPSKP-GAQSDEGEHAKIKG--LLYPF 670

Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
           G+GLSYT F Y  L  +                       KT    V V           
Sbjct: 671 GFGLSYTSFGYTNLKIS---------------------TGKTAADPVAVT---------- 699

Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
            VD  N G   G +VV  Y +       TY K + GF+RV ++AG  K I F     + L
Sbjct: 700 -VDVTNTGKLAGDEVVQCYIRDVLSSVTTYEKLLKGFERVHLQAGETKTISFTI-PREEL 757

Query: 758 NIVDYAANTLLPAGEHTIFVG 778
            + +     +L  GE ++ +G
Sbjct: 758 KLYNREMKFVLEPGEFSVMIG 778


>gi|323451833|gb|EGB07709.1| hypothetical protein AURANDRAFT_64764 [Aureococcus anophagefferens]
          Length = 819

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 234/740 (31%), Positives = 340/740 (45%), Gaps = 119/740 (16%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+SLP + R+  L   + L++ + QL + A  V  + LP Y W ++  HGV     GT
Sbjct: 71  YLDASLPEADRLAWLADNVPLEDMIGQLVNAAPAVDAVDLPAYNWLNDNEHGVK----GT 126

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN-----LGRA-------- 160
                   AT +P      AS++  L  ++G A+  E+RA +N      G A        
Sbjct: 127 AH------ATVYPMGASLGASWSVDLAWRVGAAIGNESRATHNGLADKSGNACGSTSTGE 180

Query: 161 ------GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
                 G+T ++PN+N+ RDPRWGR  E  GEDP +    AV  V GLQ      + +  
Sbjct: 181 VVANGCGITLYAPNVNLVRDPRWGRAEEVYGEDPHLTAELAVGMVTGLQG-NAEGSTSGP 239

Query: 215 NSRPLKVSSCCKHYAA----YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDA 270
              PL   +CCKH+AA    Y  ++    DR   DA V+ +D+ ET+L   + CV    A
Sbjct: 240 GGGPLVTGACCKHFAAHFAVYQNEDLP-ADRMVLDANVSSRDLWETYLPVMKACVVRAKA 298

Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
           +        VNG P+CA P+LLN  +R  W   G++V+D D+   +V  HK+++ + E+A
Sbjct: 299 T-------HVNGKPTCAHPELLNDVLRESWGFDGFVVSDYDAWSNLVTTHKYVS-TWEEA 350

Query: 331 VAQTLKAGLDLDC--GQYY-TNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ 387
            A  + AG+D +   G Y   +   +AV+ G V    + +S + L  V +RLG FD    
Sbjct: 351 AAAGINAGMDQEGGFGDYSPVDALPDAVRNGTVAAATVRRSFERLMRVRLRLGMFDPPAS 410

Query: 388 YVSLGKQDIC-----SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
               G+   C     +   + LA EAAREGIVL KN    LPL  AK   +A+VGP  + 
Sbjct: 411 TAVYGEAYQCDYQCETAAKLALAREAAREGIVLFKNAGGALPL--AKGARIALVGPQVDD 468

Query: 443 TVAMIG--NYAGIPCRYMSPIA---GFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTA 497
              ++G  NYA      ++P+    G    ANV+   GCD VAC +   +  A   A  A
Sbjct: 469 WRVLLGAVNYAFEDGPDVAPVTIQKGLEAVANVSVAAGCDSVACAALVDVDGAKRLAAAA 528

Query: 498 DATIILAG---------------LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVI 542
           DAT+++ G                D   E+ES DR  + LPG Q  L+  +   +   V 
Sbjct: 529 DATVVVLGDSFGATDGWPLCRGTRDDGCESESHDRATIELPGEQVALVAALRAASSRLVC 588

Query: 543 LVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDY 602
           +++        A A+   +  A+L    PG+ GG A+ADV+FG ++P GR PIT Y    
Sbjct: 589 VLVHGGAVALGAAAD---DCDAVLDLWVPGQMGGAALADVLFGDYSPAGRSPITMYAA-- 643

Query: 603 VQMLPLTSMPLRPVDSLGYP---GRTYKFYNGPT-LYPFGYGLSYTQFKYNLLSFTKTIQ 658
              LP    P+   D        G TY++Y GP   Y FG GLSY  F Y   +   T  
Sbjct: 644 TSDLP----PMGVFDEYAGESSNGTTYRYYAGPAPTYAFGDGLSYASFSYAWAAAPPT-- 697

Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSK 718
                         T DA                     +V   N GS    +VV VY++
Sbjct: 698 --------------TVDACGA---------------IRLRVAVTNTGSVASDEVVQVYAR 728

Query: 719 -PPAEIAATYIKQVIGFQRV 737
            P A + A  I+ ++ F RV
Sbjct: 729 VPDATVPAPAIR-LVAFDRV 747


>gi|317474379|ref|ZP_07933653.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
 gi|316909060|gb|EFV30740.1| glycosyl hydrolase family 3 N terminal domain-containing protein
           [Bacteroides eggerthii 1_2_48FAA]
          Length = 733

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 216/790 (27%), Positives = 367/790 (46%), Gaps = 122/790 (15%)

Query: 45  LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFA------------------- 85
           L +Q    ++ D+  P  IRVKDL+ RMTL EKV QL  +                    
Sbjct: 16  LSVQSQKPIYQDAGQPVEIRVKDLLKRMTLHEKVLQLNQYTFGENDNPNNIGKEVKNLPA 75

Query: 86  --------HGVPRL-GLPQYEWWSEALHGVSNVGPGTHFDDVIPG-ATSFPTVILTTASF 135
                   H  P+L    Q +   E+  G+    P     DVI G  T +P  +    SF
Sbjct: 76  EIGSLIYLHTDPKLRNQIQRKAMEESRLGI----PILFGFDVIHGLRTVYPISLAQACSF 131

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRY 194
           N  L     +  + E+        +G+ + +SP I+VARDPRWGRI+E  GEDP+     
Sbjct: 132 NPDLVTLACRVAAKESVL------SGIDWTFSPMIDVARDPRWGRISECYGEDPY----- 180

Query: 195 AVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
            +N V G+  V+G++   +  S P  +++C KHY  Y V    G D  + D  ++ Q + 
Sbjct: 181 -LNTVFGIASVKGYQG--EKLSDPYSIAACLKHYVGYGVSE-GGRDYRYTD--ISPQALW 234

Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
           ET+L P+E  VK G A+++M S+N ++GIP+ ++  +L + ++ +W   G++V+D ++I+
Sbjct: 235 ETYLPPYEAGVKAG-AATLMSSFNDISGIPATSNHYILTEILKNKWQHDGFVVSDWNAIE 293

Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
            ++  ++ +A  +++A  +   AG+++D     Y  +    V + K++ + ID ++  + 
Sbjct: 294 QLI--YQGVAKDRKEAAYKAFHAGVEMDMRDNVYCEYLEQLVAEKKIQVSQIDDAVARIL 351

Query: 374 TVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
            +  RLG FD       + ++     E+I LA   A E +VLLKN  N LP +S  +K V
Sbjct: 352 RLKFRLGLFDEPYAKELIEQERYLQQEDIALAGRLAEESMVLLKNANNLLPFSSM-IKKV 410

Query: 434 AVVGPHANATVAMIGNYA------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSI 487
           AV+GP A  +V ++G +A       +   Y      F     + Y+ GC      S+ S 
Sbjct: 411 AVIGPIAKDSVNLLGAWAFKGKAEDVETIYEGMQKEFGDKVRLDYEQGC--ALDGSDESG 468

Query: 488 FAAS-EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
           F+A+ + A+ +D  ++  G       E+  R  + LP  Q +L+  + +  K P++LV+ 
Sbjct: 469 FSAALKTAEASDVVVLCLGESKQWSGENASRSTIALPDIQEKLLLHLKQANK-PIVLVLS 527

Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           S  G  +        ++AI+    PG  GG  +A ++ G+ NP G+L +T+         
Sbjct: 528 S--GRPLELIRLEPQVEAIIEMWQPGVAGGTPLAGILSGRVNPSGKLSVTF--------- 576

Query: 607 PLTS--MPL--------RPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT 656
           PL++  +P+        RP D++G     Y+      LY FGYGLSYT F Y        
Sbjct: 577 PLSTGQIPVYYNMRQSARPFDAMG----DYQDIPTEPLYSFGYGLSYTTFVY-------- 624

Query: 657 IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY 716
                            SDA  +         +R D     +V   N G  +G + V+ Y
Sbjct: 625 -----------------SDAKLSSL------KIRKDQKITAEVTVTNAGKVEGKETVLWY 661

Query: 717 SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
              P    +  +K++  F++  + AG ++  +F  +  + L+  D      L  GE  + 
Sbjct: 662 VSDPFCTISRPMKELKFFEKQSLNAGESRVFRFDIDPMRDLSYTDATGKRFLEPGEFIVS 721

Query: 777 VGNGGVSFPI 786
           VG   ++F +
Sbjct: 722 VGGRKLTFEV 731


>gi|365121873|ref|ZP_09338785.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363644185|gb|EHL83481.1| hypothetical protein HMPREF1033_02131 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 850

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 163/447 (36%), Positives = 245/447 (54%), Gaps = 47/447 (10%)

Query: 42  FSKLGLQMSSFLFC------DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
           F  L L  S  LF       D   P   R+ DL+SR+T++EK+  L   + G+PRL + +
Sbjct: 9   FVVLALVFSGTLFAQKEVYKDMDAPQHERIMDLLSRLTIEEKISLLRATSPGIPRLEIEK 68

Query: 96  YEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY 155
           Y   +EALHG+  V PG          T FP  I   + +N     +I   +S EARA +
Sbjct: 69  YYHGNEALHGI--VRPGNF--------TVFPQAIGLASMWNPDFLYEISTVISDEARARW 118

Query: 156 NLGRAG----------LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV 205
           N    G          LT+WSP +N+ARDPRWGR  ET GEDPF+ G+  V +V+GLQ  
Sbjct: 119 NELNRGKDQKRLFSDLLTFWSPTVNMARDPRWGRTPETYGEDPFLSGKLGVAFVKGLQ-- 176

Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
            G++       R LKV S  KH+AA + ++    +R+  + +++E+D+ E +L  FE C+
Sbjct: 177 -GND------PRYLKVVSTPKHFAANNEEH----NRFECNPQISERDLREYYLPAFERCI 225

Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
            +G A S+M +YN +N +P   +  LL + +R +W  +GY+V+DC +  ++V +HK++  
Sbjct: 226 IDGKAQSIMTAYNAINDVPCTLNTWLLKKVLRTDWGFNGYVVSDCGAPSLLVTHHKYVK- 284

Query: 326 SKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
           + E A    LKAGLDL+CG   Y     NA +Q  V E +ID +   +    M LG FD 
Sbjct: 285 TPEAAATLALKAGLDLECGDNVYIEPLMNAYKQYMVSEAEIDTAAYRILRARMMLGLFDD 344

Query: 385 SPQ--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
             +  Y +L    +  +++  +A EAAR+ +VLLKN+ N LP+N  K+K++AVVG   NA
Sbjct: 345 PAKNPYNALSPSIVGCEKHKNMALEAARQSLVLLKNENNFLPINPKKIKSIAVVG--INA 402

Query: 443 TVAMIGNYAGIPCRYMSPIAGFSGYAN 469
                G+Y+G P     P++   G  N
Sbjct: 403 GNCEFGDYSGKPVNV--PVSVLDGIRN 427



 Score =  120 bits (301), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 89/291 (30%), Positives = 131/291 (45%), Gaps = 50/291 (17%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A +A +  D TI + G++ S+E E  DR+ + LP  Q   I +  ++   P + V++ AG
Sbjct: 594 AKKAIQECDMTIAVMGINKSIEREGRDRDHIELPKDQELFIEEAYKL--NPKMAVVLVAG 651

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
              +A    + ++ AIL A YPGE+GG A+A+ +FG +NP GRLP+T+Y         L 
Sbjct: 652 S-SLAVNWMDEHVPAILNAWYPGEQGGTAVAEALFGDYNPAGRLPLTYYRS-------LD 703

Query: 610 SMPLRPVDSLG-YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
            +P  P D       RTY ++ G  LY FGYGLSYT+F Y                   R
Sbjct: 704 DLP--PFDDYAVQKNRTYMYFTGKPLYAFGYGLSYTKFDY-------------------R 742

Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
            L+   DA   R                     +N G  +G +V  VY + P       I
Sbjct: 743 KLSVDQDAENVR----------------LSFTIKNSGKYNGDEVAQVYVQFPEIGVKVPI 786

Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVG 778
           KQ+ GF+RV +  G+   +       K L I +        P+G +   VG
Sbjct: 787 KQLKGFERVHIAKGKTLPVTITV-PKKELRIWNERKGEFFTPSGNYVFMVG 836


>gi|160887545|ref|ZP_02068548.1| hypothetical protein BACOVA_05565 [Bacteroides ovatus ATCC 8483]
 gi|156107956|gb|EDO09701.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
          Length = 736

 Score =  263 bits (671), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 218/741 (29%), Positives = 340/741 (45%), Gaps = 154/741 (20%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    EA HG   +G           AT FPT I   A+++  L K++GQ ++ 
Sbjct: 83  RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSPELVKEVGQVIAK 130

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R+     + G   + P +++ RDPRW R+ ET GEDP + G    + V GL       
Sbjct: 131 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGL------- 178

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
              +L+ +   +++  KH+ AY V        Y   A V  +D+ + FL PF   +  G 
Sbjct: 179 GGGNLSQKYATIATL-KHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDAG- 233

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           A SVM SYN ++G P  ++  LL Q +R EW   G++V+D  SI+ + ++H F+A +KE+
Sbjct: 234 ALSVMTSYNSIDGTPCTSNHYLLTQLLRNEWKFRGFVVSDLYSIEGIHESH-FVAPTKEN 292

Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
           A  Q++ AG+D+D G   YTN   +AVQ G++ +T ID ++  +  +   +G F+     
Sbjct: 293 AAIQSVMAGVDVDLGGDAYTNLC-HAVQSGQMDKTVIDTAVCRVLRMKFEMGLFEHPYVD 351

Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
             +  + +   E+IELA + A+  I LLKN+ + LPL S  +  VAV+GP+A+    M+G
Sbjct: 352 PKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPL-SKTINKVAVIGPNADNRYNMLG 410

Query: 449 NYA-------------GIPCRYMSPIAGFSGYANVTYKTGC---DDVACKSNNSIFAASE 492
           +Y              GI  + +SP         V Y  GC   D    +   +I AA  
Sbjct: 411 DYTAPQEDSNVKTVLDGILTK-LSPF-------RVEYVRGCAIRDTTVNEIEQAIKAARR 462

Query: 493 AA------------------KTADATIILAGLDLSVE-AESLDREDLWLPGYQTQLINQV 533
           +                   K   A +   G    +E  E  DR  L L G Q +L+  +
Sbjct: 463 SEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESL 522

Query: 534 AEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRL 593
            +  K P+I+V +    ++  +A    +  A+L A YPG+EGG AIADV+FG +NP GRL
Sbjct: 523 QKTGK-PLIVVYIEGRPLEKNWASEYAD--ALLTAYYPGQEGGNAIADVLFGDYNPSGRL 579

Query: 594 PIT----------WYNG------DYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
           PI+          +YN       DYV+M   +S P                     LY F
Sbjct: 580 PISVPRSVGQIPVYYNKKAPRNHDYVEM---SSFP---------------------LYSF 615

Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
           GYG+SYT F+Y+ L                                V+    RC   FE 
Sbjct: 616 GYGMSYTTFEYSDLQ-------------------------------VVQKSARC---FEV 641

Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
               +N G  DG +V  +Y +         +KQ+  F+R  ++ G  K++ FV    +  
Sbjct: 642 SFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDF 700

Query: 758 NIVDYAANTLLPAGEHTIFVG 778
            +V+Y    ++ +G   + +G
Sbjct: 701 FLVNYTLKKVVESGNFHLMIG 721


>gi|373956830|ref|ZP_09616790.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
 gi|373893430|gb|EHQ29327.1| glycoside hydrolase family 3 domain protein [Mucilaginibacter
           paludis DSM 18603]
          Length = 823

 Score =  262 bits (670), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 226/822 (27%), Positives = 361/822 (43%), Gaps = 141/822 (17%)

Query: 35  FVCDPGRFSK----LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
           F  DP  + K    L       ++ DS+ P   R+ DL+ +MTL+EK  QL    +G  R
Sbjct: 50  FKADPPIYKKGWIDLNKNGKKDIYEDSTQPIEARLNDLIGQMTLEEKTCQLATL-YGYKR 108

Query: 91  L---GLPQYEWWSEALH-GVSNVG------------------------------------ 110
           +    +P  EW +E    G++N+                                     
Sbjct: 109 ILKDSVPTPEWKNEIWKDGIANIDEHLNGFITWGKTSDLPLVTDVKKHVWAMNQTQRFFI 168

Query: 111 -------PGTHFDDVIPG-----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG 158
                  P    ++ I G     AT+FPT +    ++++ L  ++G     EARA   LG
Sbjct: 169 EQTRLGIPVDFTNEGIRGVEAYQATAFPTQLNMGMTWDKPLVNQMGNITGMEARA---LG 225

Query: 159 RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
              +  ++P ++VARD RWGR+ E  GEDP++V R  V   +G+Q               
Sbjct: 226 YTNV--YAPILDVARDQRWGRLEEVYGEDPYLVARLGVEMAKGMQQNN------------ 271

Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
            ++++  KH+A Y  +          D +V  +++E   L PF+  +KE     VM SYN
Sbjct: 272 -QIAATAKHFAVYSANKGGREGLARTDPQVAPREVENILLYPFKKVIKEAGLMGVMSSYN 330

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
             +GIP       L Q +R E+   GY+V+D D+++ + + H   AD K DAV Q   AG
Sbjct: 331 DYDGIPISGSSYWLIQRLRQEFGFKGYVVSDSDALEYLYNKHHVAADLK-DAVYQAFMAG 389

Query: 339 LDLDCGQYYTN----FTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGK 393
           +++       +    +    V++GK+    I+  ++ +  V  +LG FD    Q      
Sbjct: 390 MNVRTTFRTPDSIIIYARQLVKEGKLPIDTINSRVRDVLRVKFKLGLFDHPYVQDAEASA 449

Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
           + +    N  +A +A++E IVLLKN    LPL  +K +T+AV+GP+A        +Y  +
Sbjct: 450 KLVNCAANQAVALQASKESIVLLKNKGAILPL--SKQQTLAVIGPNALNDDYAHTHYGPL 507

Query: 454 PCRYMSPIAGFS---GYANVTYKTGCD--------------DVACKSNNSIFAASEAAKT 496
             + ++ + G     G   V Y  GC+              D        I +A   A+ 
Sbjct: 508 ASKSINILEGIQAKVGAGKVLYALGCNLVDKHWPESEILPQDPDQAEQAKIDSAVTIARH 567

Query: 497 ADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFA 556
           AD  +++ G +     E+  R  L LPGYQ +L+  V    K PV++V++ +  + I + 
Sbjct: 568 ADVAVVVLGGNTQTAGENKSRTSLDLPGYQLRLVKAVKATGK-PVVVVLIGSQPMTINW- 625

Query: 557 ETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPV 616
             + +I  I++AGYPG +GG A+ADV+FG +NPGG+L +T+     V  LP  + P +P 
Sbjct: 626 -IDQHIDGIIYAGYPGTQGGTAVADVLFGDYNPGGKLTLTFPKS--VGQLPF-NFPTKP- 680

Query: 617 DSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDA 676
           +S    G   K      LYPFG+GLSYT F Y+ L  +  IQ +          N T   
Sbjct: 681 NSETDEGELAKIKG--LLYPFGFGLSYTTFAYSDLKISPAIQSDQG--------NVTVSC 730

Query: 677 SKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQR 736
             T                       N G   G +VV +Y +       TY K + GF R
Sbjct: 731 KVT-----------------------NTGKVAGDEVVQLYLRDVLSTVTTYEKVLRGFDR 767

Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           + ++ G  K + F       L + +     ++  GE  + VG
Sbjct: 768 LSLKPGETKEVMFTI-VPDDLKLYNRQMKYVVEPGEFKVMVG 808


>gi|423215778|ref|ZP_17202304.1| hypothetical protein HMPREF1074_03836 [Bacteroides xylanisolvens
            CL03T12C04]
 gi|392691421|gb|EIY84666.1| hypothetical protein HMPREF1074_03836 [Bacteroides xylanisolvens
            CL03T12C04]
          Length = 1049

 Score =  262 bits (670), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 219/769 (28%), Positives = 362/769 (47%), Gaps = 108/769 (14%)

Query: 56   DSSLPYSIR----VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG- 110
            +S LP++      VKDL+SRMT++EK+ QL  +  G   L  P+ E+ S++L     VG 
Sbjct: 328  NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 386

Query: 111  -------------------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIG 144
                                     P     DVI G  T FPT +  + S++ +  ++  
Sbjct: 387  VLNISGAKTLRDLQEKNMRHSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 446

Query: 145  QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
            +  + E+ A      AGL + ++P +++ARD RWGR+ E  GED ++    A   V G Q
Sbjct: 447  KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 500

Query: 204  DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
                  N  + NS    V +C KH+ AY +       R +    ++E+ + +T+L PF+ 
Sbjct: 501  -----WNLWENNS----VLACAKHWVAYGLPQ---AGRDYAPVDMSERTLFDTYLPPFKA 548

Query: 264  CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
            C+  G   + M ++N +NGIP+ A P LL   +RG+W+ +G++V+D ++++ +V   + +
Sbjct: 549  CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLV--AQGV 605

Query: 324  ADSKEDAVAQTLKAGLDLDCGQ-YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
            A+  +DA      +G+D+D     Y  +    ++ GK+   D+D S+  +  +   LG F
Sbjct: 606  AEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 665

Query: 383  DGSPQYVS--LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
                ++ +     Q I   E ++ A + A +  VLLKND +TLPL +  V+++AVVGP A
Sbjct: 666  VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 724

Query: 441  NATVAMIGNY-AGIPCRYMSPIAGFSGYAN--------VTYKTGCDDVACKSNNSIFAAS 491
            +    ++G++ A    R+++ +    G  N        V Y  GC D   +  +    A 
Sbjct: 725  DNQTELLGSWRARGEDRHVTTV--LQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAV 781

Query: 492  EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
            + A  +D  I + G    +  ES  R  L LPG Q +LI ++    K PV++V+M+   +
Sbjct: 782  KLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPL 840

Query: 552  DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD-YVQMLPLTS 610
             I +   + N+ AIL   + G   G AIAD++FG +NP GRL I++   +  V +     
Sbjct: 841  SIEW--VDKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPVYYNYK 898

Query: 611  MPLRPVD-SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
               RP D       R     N P LYPFGYGLSYT F Y++   T+              
Sbjct: 899  KSGRPGDMPHSSTTRHIDVPNAP-LYPFGYGLSYTTFSYSVPQSTQK------------- 944

Query: 670  LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
              YT   +                     V   N G  DG + V +Y           +K
Sbjct: 945  -EYTRQET-----------------ISVSVTVTNTGDRDGEETVQLYVNDKVASVVRPVK 986

Query: 730  QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
            ++  F+++F++AG +K ++F  +   +L   D A N ++  GE  I  G
Sbjct: 987  ELKAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 1034


>gi|336399403|ref|ZP_08580203.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
 gi|336069139|gb|EGN57773.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
          Length = 757

 Score =  262 bits (670), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 221/769 (28%), Positives = 348/769 (45%), Gaps = 130/769 (16%)

Query: 65  VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH------GVSNVG-------- 110
           V+DL+ +MTL EK+ QL  +  G    G PQ    S++L        + NVG        
Sbjct: 46  VRDLIKKMTLTEKIGQLSQYVGGSLLTG-PQSGALSDSLFVRGMVGSILNVGGVESLRKL 104

Query: 111 ------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
                       P     DVI G  T FPT +  + S++      +G    T   A    
Sbjct: 105 QEKNMQSSRLKIPVLFAFDVIHGYKTIFPTPLAESCSWD------LGLMFETAKAAAIEA 158

Query: 158 GRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
             +G+ + ++P +++ARDPRWGRI E  GED ++  + A   VRG Q   G         
Sbjct: 159 SASGIHWTFAPMVDIARDPRWGRIVEGAGEDTYLACKIAETRVRGFQWNLG--------- 209

Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
           +P  V +C KH+ AY      G D    D  ++   + E +L PF+ CV  G   + M +
Sbjct: 210 KPNSVYACAKHFVAYGAPQ-AGRDYAPVDLSLST--LAEVYLPPFKACVDAG-VHTFMSA 265

Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
           +N +NG+P+  +  L+   +R +W  HG++V+D +++Q +  +   +A++  DA      
Sbjct: 266 FNSLNGVPATGNRWLMTDILRNQWKFHGFVVSDWNAVQELKAHG--VAETDTDAALMAFD 323

Query: 337 AGLDLDCGQ-YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQ- 394
           AG+D+D     Y      AV +GK+    ID S++ +      LG FD   +++ + ++ 
Sbjct: 324 AGVDMDMTDGLYNRCLEKAVCEGKLDMQAIDTSVERILRAKYALGLFDDPYRFLDVKRER 383

Query: 395 -DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA-- 451
            +I S+   +LA +AA   +VLLKND  TLPL S   K +A++GP A+    ++G++   
Sbjct: 384 REIRSEAVTKLARKAAASSMVLLKNDHATLPL-SKHTKRIALIGPLADNRSEVMGSWKAR 442

Query: 452 -----------GIPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
                      GI  +  S +A       VTY  GCD +   S     AA EAAK +D  
Sbjct: 443 GEESDVVTVLDGIKKKLGSDVA-------VTYVQGCDFLE-PSTREFPAAFEAAKQSDVV 494

Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
           I + G    +  ES  R  L LPG Q  L++ + +  + P+++V+M+  G  +   + + 
Sbjct: 495 IAVVGEKALMSGESRSRAVLRLPGQQEALLDTLQKAGR-PLVVVLMN--GRPLCLQKVDR 551

Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPL---RPVD 617
              A+L A +PG + G A+AD++FG   P  +L  ++         PLT   +       
Sbjct: 552 QADALLEAWFPGTQCGNAVADILFGDAVPSAKLTTSF---------PLTEGQIPNNYNYK 602

Query: 618 SLGYPG-----RTYKFYNGPT--LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
             G PG      T +  + P   LYPFGYGLSYT F Y                      
Sbjct: 603 RSGRPGDMSHSSTVRHIDVPNRNLYPFGYGLSYTTFSYG--------------------- 641

Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
                  + +CP         D   +  VD  N G  DG ++V +Y           +K+
Sbjct: 642 -------EMQCP----KQFNADGTLQVSVDVTNTGGYDGEEIVQLYVADKVASMVRPVKE 690

Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           + GFQ+VF+  G+ KRI F  NA + L   + +   ++  G   I VG 
Sbjct: 691 LKGFQKVFIPKGQTKRIDFTLNA-RDLGFWNNSMQYIVEPGTFEIMVGT 738


>gi|423346097|ref|ZP_17323785.1| hypothetical protein HMPREF1060_01457 [Parabacteroides merdae
           CL03T12C32]
 gi|409220895|gb|EKN13848.1| hypothetical protein HMPREF1060_01457 [Parabacteroides merdae
           CL03T12C32]
          Length = 955

 Score =  262 bits (670), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 221/810 (27%), Positives = 361/810 (44%), Gaps = 146/810 (18%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
           ++ D ++P   RV+DL+S+M ++EK  Q+    +G  R+    LP  +W    W      
Sbjct: 60  VYEDPTVPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTSDWKKQLWKDGIGA 118

Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
             E L+G    G                                  P    ++ I G   
Sbjct: 119 IDEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVES 178

Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
             AT+FPT +    ++N +L  K+G     E R +      G T  ++P ++V RD RWG
Sbjct: 179 YIATNFPTQLGLGHTWNRNLVHKVGYITGREGRLL------GYTNVYAPILDVGRDQRWG 232

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  E  GE P++V    +   +G+Q        TD      +V++  KHY AY  +    
Sbjct: 233 RYEEVYGESPYLVAELGIEMAKGMQ--------TDH-----QVAATSKHYIAYSNNKGGR 279

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
                 D +++ +++E   + P++  +KE     VM SYN  +G P  +    L   +RG
Sbjct: 280 EGMARVDPQMSPREVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRG 339

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
           E+   GY+V+D D+++ +   H   AD KE +V Q++ AGL++ C       Y       
Sbjct: 340 EFGFRGYVVSDSDAVEYLFSKHGTAADMKE-SVLQSVLAGLNIRCTFRSPDSYVLPLREL 398

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREG 412
           + +G +  + ID  ++ +  V   +G FD  P  + L + D  +   EN ++A +A++E 
Sbjct: 399 IAEGAIPMSTIDDRVRDILRVKFLVGLFD-HPYQIDLKETDKEVNCAENQQVALQASKES 457

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----A 468
           +VLLKN    LPL+  K+  +AV GP+A+     + +Y  +     + + G         
Sbjct: 458 LVLLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGT 517

Query: 469 NVTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
           NV +  GCD V                +  + I  A E AK +D T+++ G       E+
Sbjct: 518 NVLFTKGCDLVDANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSDRTCGEN 577

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
             R  L LPG Q  L+  V    K PV+L++++   + I +A+    + AIL A YPG +
Sbjct: 578 KSRSSLDLPGRQLDLLQAVVATGK-PVVLILINGRPLSINWAD--KYVPAILEAWYPGSQ 634

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VD---SLGYPGRTYKF 628
           GG AIAD +FG +NPGG+L +T+     V  +P  + P +P   VD   + G  G   + 
Sbjct: 635 GGTAIADALFGDYNPGGKLTVTF--PKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRV 691

Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
            NGP LYPFGYGLSYT F+Y+ +S    I   +  +                        
Sbjct: 692 -NGP-LYPFGYGLSYTTFEYSDISIQPAIVTQVQPVT----------------------- 726

Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
           +RC           N G   G +VV +Y +       TY K ++GF R+ +  G  K + 
Sbjct: 727 VRC--------KVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELT 778

Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           F     + L +++   + ++  G+  + VG
Sbjct: 779 FTIEP-RDLQLLNSDNHWVVEPGDFKVMVG 807


>gi|423313129|ref|ZP_17291065.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
           CL09T03C04]
 gi|392686343|gb|EIY79649.1| hypothetical protein HMPREF1058_01677 [Bacteroides vulgatus
           CL09T03C04]
          Length = 864

 Score =  262 bits (670), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 168/454 (37%), Positives = 233/454 (51%), Gaps = 42/454 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + DSSL    R +DL+ ++TL+EKV  + D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 24  YKDSSLSPEERAEDLLQQLTLEEKVALMMDNSKPVERLGIKPYNWWNEALHGVARSGL-- 81

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   ASF       I  AVS EARA      A        GLT W
Sbjct: 82  --------ATVFPQPIGMAASFEPDAIHTIYTAVSDEARAKNAAYSAAGSYERYQGLTMW 133

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +P +N+ RDPRWGR  ET GEDP++     VN V+GLQ         D N +  K+ +C 
Sbjct: 134 TPTVNIYRDPRWGRGIETYGEDPYLTSVMGVNVVKGLQ-------CMDANQKYDKIHACA 186

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ F+A  +  +D+ ET+L PFE  VKE     VMC+YNR+ G P
Sbjct: 187 KHFAVHSGPEW---NRHEFNAENIKPRDLHETYLVPFEALVKEAKVKEVMCAYNRLEGDP 243

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQ--VMVDNHKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +R +W   G +++DC +I        HK   D+ E A A  + +G DL+
Sbjct: 244 CCGSDRLLMQILRQDWGYDGIVLSDCGAIDDFYREKGHKTHPDA-ESASAAAVLSGTDLE 302

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK---QDICSD 399
           CG  Y     +A ++G + E DID S+K L      LG  D  P  V   K     +CS 
Sbjct: 303 CGSSYKALVESA-KKGLISEKDIDVSVKRLLKARFELGEMD-DPDKVEWTKIPYSVVCSA 360

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           E+  L+ + AR+ + LL N  N LPL     +T+AV+GP+AN +V   GNY G P   ++
Sbjct: 361 EHDSLSLDIARKSMTLLLNKNNILPLKRGG-QTIAVMGPNANDSVMQWGNYNGTPKHTIT 419

Query: 460 PIAGFSGYA----NVTYKTGCDDVACKSNNSIFA 489
            + G          + Y+ GC  V      S+F+
Sbjct: 420 LLEGIRSAMGENDKLIYEQGCSWVERSLIRSVFS 453



 Score =  130 bits (327), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 104/326 (31%), Positives = 146/326 (44%), Gaps = 62/326 (19%)

Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
           FSG A + +     D+  K   +I       K AD  I   G+  S+E E +        
Sbjct: 574 FSGDAQLNF-----DLGFKEEVNIKNTVAKVKDADVVIFAGGISPSLEGEEMGVNLPGFR 628

Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
             DR D+ LP  Q +LI  + +  K  VI V  S  G  IA        +AIL A YPG+
Sbjct: 629 KGDRTDIELPAVQRELIKALCDAGK-KVIFVNFS--GSPIAMEPETKYCQAILQAWYPGQ 685

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
            GG+A+A+V+FG +NP GRLP+T+Y         +T +P    +     GRTY+++ G  
Sbjct: 686 SGGKAVAEVLFGDYNPAGRLPVTFYRN-------ITQLP--NFEDYNMTGRTYRYFKGDP 736

Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
           L+PFGYGLSYT F Y  +   +TI+V               + +K   P           
Sbjct: 737 LFPFGYGLSYTTFNYGNIKLEQTIKV--------------GETAKIIVP----------- 771

Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
                    N G+ DG +VV VY K   E A   +K +  F+RV + AG+   ++     
Sbjct: 772 -------VTNTGNRDGEEVVQVYLK-KQEDAEGPVKTLRAFKRVQIPAGKTVNVELELTP 823

Query: 754 CKSLNIVDYAANTLLP-AGEHTIFVG 778
            K L   D   NT+   AG   I VG
Sbjct: 824 -KQLEWWDTQTNTMRTLAGNFDIMVG 848


>gi|409198288|ref|ZP_11226951.1| glycoside hydrolase 3 [Marinilabilia salmonicolor JCM 21150]
          Length = 747

 Score =  262 bits (669), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 225/766 (29%), Positives = 357/766 (46%), Gaps = 117/766 (15%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPR---------------LGLPQYEWWSE----ALH 104
           RV+ L+SRMTL+EK+ Q+       P                L + Q E  +E    AL 
Sbjct: 33  RVESLLSRMTLEEKIGQMNQLNGRNPDEKLMSRIRNGEVGSLLNIEQPELINEIQRIALE 92

Query: 105 GVSNVGPGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT 163
                 P     DVI G  T FP  +   ASFN S+   +G      AR     G     
Sbjct: 93  ESRLGIPLLIARDVIHGYKTIFPIPLGQAASFNPSI---VGTGARVAAREATQDG----I 145

Query: 164 YWS--PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
            W+  P ++++RDPRWGRI E+ GED ++  + +   +RG Q         DL + P  +
Sbjct: 146 RWTFAPMMDISRDPRWGRIAESFGEDTYLTTKLSSAMIRGFQ-------GNDLKN-PSSM 197

Query: 222 SSCCKHYAAYD-VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
           ++C KH+  Y  V+  K  +  +   R     +   +L PF+  V+EG  +++M S+N  
Sbjct: 198 AACAKHFIGYGAVEGGKDYNSTYIPPR----QLRNVYLPPFKAAVEEG-VATIMTSFNSN 252

Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLD 340
           +GIP   DP LL   +R EW   G +V+D  S++ M+  H F  + KE A+ + + AGLD
Sbjct: 253 DGIPPSGDPWLLTGILRDEWKFDGVVVSDWASVKEMI-AHGFAENGKEAAL-KAVNAGLD 310

Query: 341 LDCGQ--YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC- 397
           ++     Y+TN   + + +GKV E  ID +++ +  + +RLG FD    Y+S     +  
Sbjct: 311 MEMVSECYFTNIK-DLINEGKVSEKTIDDAVRNILRLKLRLGLFDNP--YISEEDPRVAY 367

Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPC 455
           S E+++ A  AA E +VLLKN+  TLP++S  VKT+ VVGP A+A    +G +   G   
Sbjct: 368 SKEHLDAAKMAAEESMVLLKNEDQTLPISSV-VKTICVVGPLADAPHDQMGTWVFDGEKE 426

Query: 456 RYMSPIAG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
           + ++P+      +    N+ Y+        K  +       AA+ +D  I   G +  + 
Sbjct: 427 KTITPLKALRQLYGDKVNIIYEPTLKYSRDKDRSKFSKTLAAARKSDVVIAFVGEESILS 486

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI-KAILWAGY 570
            E+    DL L G Q +LI+ ++E A  P++ V+M+   + I    T   + K++++A +
Sbjct: 487 GEAHSLADLNLRGAQLELISALSE-AGTPLVTVVMAGRPLTIG---TEVELSKSVIYAWH 542

Query: 571 PGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRP----VDSLG 620
           PG  GG AIAD++FGK  P G+LP+T+     V  +P+      T  P R     +D + 
Sbjct: 543 PGTMGGPAIADILFGKTVPSGKLPVTFPK--MVGQIPVFYNHNSTGRPARGTEVLIDDIP 600

Query: 621 YPGRTYKFYNGP--------TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
              R     N           L+ FGYGLSYT F+Y+ L                 NL+ 
Sbjct: 601 LEARQSSLGNTSYYLDAGFDPLFHFGYGLSYTSFEYSDL-----------------NLSN 643

Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
           +S                  D     V   N G   G+++V +Y+   +      +K++ 
Sbjct: 644 SS--------------FHPSDTLRVSVQLSNTGDFQGTEIVQLYTADKSASVVRPVKELK 689

Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           GFQRV V+ G  K + F     +   +  +    ++ AGE +I VG
Sbjct: 690 GFQRVLVQPGETKDVVFHLPMSE---LSFWNDGDVVEAGEFSIMVG 732


>gi|261880245|ref|ZP_06006672.1| beta-glucosidase [Prevotella bergensis DSM 17361]
 gi|270333079|gb|EFA43865.1| beta-glucosidase [Prevotella bergensis DSM 17361]
          Length = 854

 Score =  262 bits (669), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 151/445 (33%), Positives = 246/445 (55%), Gaps = 37/445 (8%)

Query: 45  LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
           L ++   F + ++ L    R  DL SR+TL+EK + + + +  +PRLG+PQ+EWWSEALH
Sbjct: 16  LPMKAQQFPYQNTDLSPKERAADLCSRLTLEEKSKIMQNGSPAIPRLGIPQFEWWSEALH 75

Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR----- 159
           G+   G           AT FP  +   +S++++L +K+  AVS E R      +     
Sbjct: 76  GIGRNG----------FATVFPITMGMASSWDDALLQKVFDAVSDEGRVKAQQAKRSGTI 125

Query: 160 ---AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
               GL++W+PNIN+ RDPRWGR  ET GEDP++  R  +  VRGLQ           +S
Sbjct: 126 KRYQGLSFWTPNINIFRDPRWGRGQETYGEDPYLTSRMGLAVVRGLQGPS--------DS 177

Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMC 275
           +  K+ +C KH+A +    W   +R+ F+   + E+D+ ET+L  F+  V++GD + VMC
Sbjct: 178 KYRKLLACAKHFAVHSGPEW---NRHTFNVEDLPERDLWETYLPAFKALVQQGDVAEVMC 234

Query: 276 SYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI-QVMVDNHKFLADSKEDAVAQT 334
           +Y R++G P C + + L   +R EW+  G +V+DC ++       H  ++     A A+ 
Sbjct: 235 AYQRIDGQPCCGNNRFLKSILRNEWNYQGMVVSDCWAVPDFWKKGHHEVSPDATHASAKA 294

Query: 335 LKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP--QYVSLG 392
           + +G D++CG  Y+N    AV+ G +KE D+D S++ L      LG FD      +  + 
Sbjct: 295 VLSGTDVECGSDYSNLP-EAVRAGIIKEADVDVSVRRLLEARFALGDFDPDELVPWTKIS 353

Query: 393 KQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG 452
           +  + S  + +LA + AR+ +VLL+N+ + LPL  +  K V VVG +A  +  M GNY+G
Sbjct: 354 ESVVASKAHKQLALDMARKSMVLLQNN-DILPLKRSGQKIV-VVGANAIDSTMMWGNYSG 411

Query: 453 IPCRYMSPIAGFSGYAN-VTYKTGC 476
            P + ++ + G    ++ VT+  GC
Sbjct: 412 YPTQTVTILQGLQTKSDQVTFIPGC 436



 Score =  120 bits (300), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 90/300 (30%), Positives = 140/300 (46%), Gaps = 68/300 (22%)

Query: 497 ADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIM 546
           AD  I + G+   +E E +          DR  + LP  Q ++I  ++E  +    +V +
Sbjct: 599 ADVVIFVGGISPRLEGEEMEVSDPGFKGGDRTTIELPQAQREVIKALSEAGRR---IVFV 655

Query: 547 SAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQML 606
           +  G  IA    +  + AIL A YPGE+GG A+ADV+FG +NP G+LP+T+Y  D    L
Sbjct: 656 NCSGSAIALTPESQRVDAILQAWYPGEQGGTAVADVLFGDYNPSGKLPVTFYKND--AQL 713

Query: 607 PLTSMPLRPVDSLGY--PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
           P         D L Y   GRTY+++    L+PFGYGLSYTQF              + + 
Sbjct: 714 P---------DFLDYRMAGRTYRYFKETPLFPFGYGLSYTQF-------------TIGQP 751

Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
           ++  N                          + +V   N G  DG +VV VY +   + A
Sbjct: 752 RYINN--------------------------QVQVSVSNTGKRDGDEVVQVYIR-RTDDA 784

Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL-LPAGEHTIFVGNGGVS 783
           A  IK + GFQRV ++ G  K++       +S    D ++NT+ +  G + + VG+  ++
Sbjct: 785 AGPIKTLRGFQRVSLKVGETKQVSVSL-PRESFEWWDASSNTMRVIPGNYEVMVGSSSMA 843


>gi|293371041|ref|ZP_06617583.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292633971|gb|EFF52518.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 791

 Score =  262 bits (669), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 212/735 (28%), Positives = 333/735 (45%), Gaps = 142/735 (19%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    EA HG   +G            T FPT I   A+++  L K++GQ ++ 
Sbjct: 138 RLGIPMF-LAEEAPHGHMAIG-----------ITVFPTGIGMAATWSPELVKEVGQVIAK 185

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R+     + G   + P +++ RDPRW R+ ET GEDP + G      V GL  + G+ 
Sbjct: 186 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGTLGAAMVDGL--INGN- 237

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
                 SR     +  KH+ AY V       +    A V  +++ E FL PF+  +  G 
Sbjct: 238 -----ISRKNSTIATLKHFLAYAVPEG---GQNGNQALVGMRELHENFLPPFKKAIDAG- 288

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           A SVM SYN ++GIP  A+  LLNQ +R EW   G++V+D  SI+ + ++H + A S ED
Sbjct: 289 ALSVMTSYNSIDGIPCTANSYLLNQLLRNEWKFRGFVVSDLYSIEGIYESH-YTASSIED 347

Query: 330 AVAQTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
           A  Q + AG+D+D  G+ YTN    AV++ ++ E  ID+ +  +  +   +G F+     
Sbjct: 348 AAIQAVSAGVDVDLGGEAYTNIY-RAVKEKRLSEAIIDEVVCRVLRLKFEMGLFENPYVD 406

Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
             +  + + +  +I  A   A+  + LLKN  + LPL S  ++ VAV+GP+A+    M+G
Sbjct: 407 PQIAIERVRNANHIANARRMAQASVTLLKNRHDILPL-SKNIRKVAVIGPNADNCYNMLG 465

Query: 449 NYAGIPCR------YMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATII 502
           +Y   P +       +  I      + V Y  GC  +   +NN I  A EAA  AD  I 
Sbjct: 466 DYTA-PQKDENIKTVLDGIISKLSLSRVEYVRGC-AIRDTTNNEIAKAVEAANRADVVIA 523

Query: 503 LAGLDLSVE-----------------------AESLDREDLWLPGYQTQLINQVAEVAKG 539
           + G   + +                        E  DR  L L G Q +L+  +    K 
Sbjct: 524 VVGGSSARDFKTTYKETGAAIADKSQISDMECGEGFDRATLSLLGKQLELLESLKSTRK- 582

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT--- 596
           P+I+V +    ++  +A  + +  A+L A YPG+EGG AIADV+FG +NP GRLP++   
Sbjct: 583 PLIVVYIEGRPLNKNWAAEHAD--ALLTAYYPGQEGGDAIADVLFGDYNPAGRLPVSVPR 640

Query: 597 -------WYNG------DYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
                  +YN       DYV+M                        +   LY FGYGLSY
Sbjct: 641 SEGQIPVYYNKKTPKCHDYVEM------------------------SASPLYSFGYGLSY 676

Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
           + F+Y+ L  T+   +                                  +FE   D +N
Sbjct: 677 STFEYSNLKVTQQAPL----------------------------------HFEISFDVEN 702

Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
            G  DG +V  +Y +         ++Q+  F+R F++ G  K I F     + L+I++  
Sbjct: 703 TGKYDGEEVAQLYIRDEYASVVRALRQLKHFKRFFLKQGEKKTIVFTL-VEEDLSIINQK 761

Query: 764 ANTLLPAGEHTIFVG 778
              ++  G   + +G
Sbjct: 762 MERIVEPGSFQLMIG 776


>gi|395802372|ref|ZP_10481625.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
 gi|395435613|gb|EJG01554.1| glycoside hydrolase family 3 protein [Flavobacterium sp. F52]
          Length = 745

 Score =  262 bits (669), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 234/790 (29%), Positives = 354/790 (44%), Gaps = 151/790 (19%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD---FAH-GVPRLGLPQYEWWSEAL 103
           Q   ++  + S  +   +  L+S+MTL+EKV  L     FA+ GV RLG+P+ +     L
Sbjct: 33  QTEEYVGKEISTDHDAEIDKLISQMTLEEKVGMLHGNSMFANAGVKRLGIPELKMADGPL 92

Query: 104 HGV------SNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
            GV       N  P    +D    AT +P      A++N  +    G ++  E RA    
Sbjct: 93  -GVREEISRDNWAPAGWTNDF---ATYYPAGGALAATWNAEMAHTFGTSLGEELRA---- 144

Query: 158 GRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSR 217
            R      SP IN+ R P  GR  E   EDPF+  + AV  + GLQ+ +           
Sbjct: 145 -RDKDMLLSPAINMVRTPLGGRTYEYMSEDPFLNKKIAVPLIVGLQEKD----------- 192

Query: 218 PLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
              V +C KHYAA    N +  +R   D ++ E+ + E +L  FE  VKE  A S+M +Y
Sbjct: 193 ---VMACVKHYAA----NNQETNRDFVDVQIDERTLREIYLPAFEASVKEAKAYSIMGAY 245

Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
           N+  G   C +  +LN+ +R EW   G +V+D  ++                + A++LK 
Sbjct: 246 NKFRGEYLCENDYMLNKILRDEWGFKGVVVSDWAAVH---------------STAKSLKN 290

Query: 338 GLDLDCGQ---YYTNFTGN----AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVS 390
           GLD++ G    +   F  +    AV+ G+V E +ID  +K +  VL ++    G  +   
Sbjct: 291 GLDIEMGTPKPFNEFFLADKLIVAVKSGEVSEKEIDLHVKRILRVLFQVKAMGGGER--- 347

Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
             K  I ++ + + A + A E IVLLKN+ N LPL    VK++AV+G +A    A+ G  
Sbjct: 348 -AKGSIATEAHYQDAYKIAAEAIVLLKNENNALPLQLDGVKSIAVIGNNATKKNALGGFG 406

Query: 451 AGIPC-RYMSPIAGFSGY----ANVTYKTGCDDVACKSNN-------------------- 485
           AG+   R ++P+ G          + Y  G  +   K N                     
Sbjct: 407 AGVKTKREVTPLEGLKNRLPSSVKINYAEGYLERYDKKNRGNLGNITANGPVTIDELDPA 466

Query: 486 SIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVI 545
            +  A +AAK +D  II AG +   E E+ DR DL LP  Q +LI +V  +A  P  +V+
Sbjct: 467 KVQEAVDAAKNSDVAIIFAGSNRDYETEASDRRDLHLPFGQEELIKKV--LAVNPKTIVV 524

Query: 546 MSAGG-VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
           M AG   DI   E +    A++W+ + G EGG A+ADV+ GK NP G+LP       +  
Sbjct: 525 MIAGAPFDIN--EVSKKSSALVWSWFNGSEGGNALADVILGKVNPSGKLP-------WTM 575

Query: 605 MLPLTSMPLRPVDSLGYPG--------------RTYKFYNGPTLYPFGYGLSYTQFKYNL 650
            + L   P    +S  +PG              R +   N   LYPFGYGLSYT F    
Sbjct: 576 PIALKDSPAHATNS--FPGDKAVNYAEGLLIGYRWFDTKNVAPLYPFGYGLSYTSF---A 630

Query: 651 LSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGS 710
           L   KT     +K  + +N                       D  E  VD +N G  DG 
Sbjct: 631 LDNAKT-----DKTSYAQN-----------------------DVIEVTVDVKNTGKVDGK 662

Query: 711 DVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN--TLL 768
           +VV +Y+           +++ GF++  V+AG + ++       K L   D A+   T+ 
Sbjct: 663 EVVQLYTSKSDSKITRAAQELKGFKKAEVKAGSSTKVTIKV-PVKELAYYDVASKKWTVE 721

Query: 769 PAGEHTIFVG 778
           P G++TI +G
Sbjct: 722 P-GKYTIKLG 730


>gi|315500297|ref|YP_004089100.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
           excentricus CB 48]
 gi|315418309|gb|ADU14949.1| glycoside hydrolase family 3 domain protein [Asticcacaulis
           excentricus CB 48]
          Length = 882

 Score =  261 bits (668), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 160/468 (34%), Positives = 241/468 (51%), Gaps = 45/468 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+S P   R  DLVSRMTL+EK  QL + A  +PRL + +Y WW+E LHGV+  G   
Sbjct: 35  YQDASKPPEARAADLVSRMTLEEKTAQLINDAPAIPRLNVREYNWWNEGLHGVAAAG--- 91

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTY 164
                   AT FP  +   A+++E L  ++ + +S E RA Y   R          GLT 
Sbjct: 92  -------YATVFPQAVGLAATWDEPLIHRVAETISVEFRAKYLKERHRFGGSDWFGGLTV 144

Query: 165 WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL--KVS 222
           WSPNIN+ RDPRWGR  ET GEDP++  R  V +VRGLQ              P+  +  
Sbjct: 145 WSPNINIFRDPRWGRGQETYGEDPYLTARMGVAFVRGLQ-----------GDDPVYYRTV 193

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           +  KHYA   V +     R+  +   +  D+ +T+L  F   + EG A S+MC+YN +NG
Sbjct: 194 ATPKHYA---VHSGPEAGRHRDNVNPSPYDLADTYLPAFRATITEGQAGSIMCAYNAING 250

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSI-QVMVDNHKFLADSKEDAVAQTLKAGLDL 341
            P+CA+  LL + +R +W   GY+V+DCD++  +          + E+ V    + G DL
Sbjct: 251 QPACANEDLLVKYLRKDWGFKGYVVSDCDAVGDIYYKTSHAYRPTPEEGVTAAYQVGTDL 310

Query: 342 DCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ-YVSLGKQDICSD 399
            CG     +    AV+QG + E  +D +L  L+T   +LG FD   + +  +  +D  + 
Sbjct: 311 ICGNANEADHLTRAVRQGLLPEKTLDTALIRLFTARFKLGQFDPPAKVFPKITAEDYDTP 370

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
            N + + + A   +VLLKN+ N LPL   + + +AV+GP+A++  +++GNY G P   ++
Sbjct: 371 ANRDFSQKVAESAMVLLKNENNLLPLK-GEPRQIAVIGPNADSMDSLVGNYNGDPSHPVT 429

Query: 460 PIAGFSGY---ANVTYKTGC---DDVACKSNNSIFAASEAAKTADATI 501
            ++G       A VTY  G    D V     +S F   EA      T+
Sbjct: 430 VLSGIRARFPKATVTYAPGSGLIDPVMTAVPDSAFCRDEACTQTGVTV 477



 Score =  130 bits (326), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 90/308 (29%), Positives = 147/308 (47%), Gaps = 55/308 (17%)

Query: 483 SNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQ 532
           S+    +A  AAK AD  + +AGL   VE E +          DR  L LP  Q +++ Q
Sbjct: 592 SDTGAQSAVAAAKEADLVVFVAGLSQRVEGEEMRVETEGFSGGDRTTLNLPPAQQKVLEQ 651

Query: 533 VAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGR 592
           V+   K PV+LV+++   + I +A+ N  + AI+ A YPG +GG A+A ++ G ++P GR
Sbjct: 652 VSAAGK-PVVLVLINGSALGINWADKN--VPAIIEAWYPGGQGGAAVARLIAGDYSPAGR 708

Query: 593 LPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS 652
           LP+T+Y       LP         +     GRTY+++ G  LYPFGYGLS+T F+Y  L+
Sbjct: 709 LPVTFYRS--ADQLPA-------FNDYNMKGRTYRYFKGEALYPFGYGLSFTTFRYAPLT 759

Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDV 712
            +                                  +  D       D  N GS D  +V
Sbjct: 760 LS-------------------------------ARQVAGDGQVSVSADVTNSGSRDSDEV 788

Query: 713 VIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
           V +Y   P +  A  I+ +  F+R+ ++AG  K ++F  +  ++L+ V+   +  +  G+
Sbjct: 789 VQLYVSYPGQKLAP-IRALARFERIHLKAGETKTVRFTLDP-QALSTVNADGSRSVKPGK 846

Query: 773 HTIFVGNG 780
             +++G G
Sbjct: 847 VELWLGGG 854


>gi|410096880|ref|ZP_11291865.1| hypothetical protein HMPREF1076_01043 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409225497|gb|EKN18416.1| hypothetical protein HMPREF1076_01043 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 799

 Score =  261 bits (668), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 213/730 (29%), Positives = 335/730 (45%), Gaps = 125/730 (17%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P  ++ +E +HG+++             AT  P  I   +++N  L  + G     
Sbjct: 139 RLGIP-VDFSNEGIHGLNHTK-----------ATPLPAPINIGSTWNRDLVHQAGDIAGK 186

Query: 150 EARAM-YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
           EA+A+ YN        ++P ++VARDPRWGR+ ET GEDP++VG   +  V+G+Q     
Sbjct: 187 EAKALGYN------NVYAPILDVARDPRWGRVLETYGEDPYLVGELGIQMVKGIQ----- 235

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
           +N          V+S  KH+A Y +           D  V  +++ E  L PF+  V++ 
Sbjct: 236 QNG---------VASTLKHFAVYSIPKGGRDAAVRTDPHVAPRELHEIHLYPFKRVVQKA 286

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
               VM SYN  +G+P  A    L Q +R E+   GYIV+D ++++ +   H  +ADS E
Sbjct: 287 HPKGVMSSYNDWDGVPVTASYYFLTQLLRQEYGFKGYIVSDSEAVEFVQTKH-HVADSYE 345

Query: 329 DAVAQTLKAGLDLDCGQYYTNFTG---------NAVQQGKVKETDIDKSLKYLYTVLMRL 379
           +AV Q ++AGL++      TNFT            V++GK+    +D+ +  +  V   L
Sbjct: 346 EAVRQVVEAGLNV-----RTNFTHPKDYILPVRKLVKEGKLSMKSVDRMVADVLRVKFEL 400

Query: 380 GFFDGSPQYVSLGK---QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVV 436
           G FD SP YV   K   + + +D++ +   +  ++ +VLLKN+ N LPL+  + K V + 
Sbjct: 401 GLFD-SP-YVKDPKAADKIVGADKHRDFVLDMQKQSLVLLKNENNLLPLDKNQTKKVLIA 458

Query: 437 GPHANATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGCD--------------D 478
           GP A  T  MI  Y       ++   G   Y      V Y  GC+               
Sbjct: 459 GPLAKETNYMISRYGPQGLDNITVYDGIKDYLGNQTEVVYAKGCEVKDANWPDSEIVPTP 518

Query: 479 VACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAK 538
           +  +    I  A+ AA   D  I + G D S   ES  R  L LPG Q QL+  +    K
Sbjct: 519 LTDEEKKGIAEAATAAADCDVIIAVLGEDESCTGESKSRTGLDLPGRQQQLLEALHATGK 578

Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
            PV+LV+++   + I +A  + NI +IL A +PG+ GG AIA  +FG +NPGGRL +T+ 
Sbjct: 579 -PVVLVLINGQPLTINWA--DRNIPSILEAWFPGQLGGEAIAQTLFGDYNPGGRLSVTFP 635

Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGP----------TLYPFGYGLSYTQFKY 648
               +  +   + P +P    G      +++ GP           LYPFGYGLSYT F Y
Sbjct: 636 RS--IGQIEF-NFPFKPGSQDG------QYFEGPNGSGRTRVNGALYPFGYGLSYTTFAY 686

Query: 649 NLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTD 708
           +                   NL+   +   ++ P  +  D+               G   
Sbjct: 687 S-------------------NLSVKQETPYSQSPVTVTVDVTN------------TGKRA 715

Query: 709 GSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL 768
           G +VV +Y +        Y   + GF+R+ ++ G  K + FV    + L I+D      +
Sbjct: 716 GDEVVQLYIRDKVSSVIAYESVLRGFERISLQPGETKTVSFVL-LPEDLQILDRHMEWTV 774

Query: 769 PAGEHTIFVG 778
             GE  + +G
Sbjct: 775 EPGEFEVRIG 784


>gi|299149090|ref|ZP_07042152.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
 gi|298513851|gb|EFI37738.1| periplasmic beta-glucosidase [Bacteroides sp. 3_1_23]
          Length = 1049

 Score =  261 bits (668), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 219/769 (28%), Positives = 361/769 (46%), Gaps = 108/769 (14%)

Query: 56   DSSLPYSIR----VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVG- 110
            +S LP++      VKDL+SRMT++EK+ QL  +  G   L  P+ E+ S++L     VG 
Sbjct: 328  NSKLPHTPEADSFVKDLLSRMTVEEKIGQLSQYV-GRTLLTGPESEYLSDSLIARGLVGS 386

Query: 111  -------------------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIG 144
                                     P     DVI G  T FPT +  + S++ +  ++  
Sbjct: 387  VLNISGAKTLRDLQEKNMRHSRIKIPILFGMDVIHGYKTIFPTPLAESCSWDLAAIERAA 446

Query: 145  QAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
            +  + E+ A      AGL + ++P +++ARD RWGR+ E  GED ++    A   V G Q
Sbjct: 447  KIAAIESSA------AGLHWTFAPMVDIARDARWGRVVEGAGEDTYLGSEIAKARVNGFQ 500

Query: 204  DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEM 263
                  N  + NS    V +C KH+ AY +       R +    ++E+ + +T+L PF+ 
Sbjct: 501  -----WNLWENNS----VLACAKHWVAYGLPQ---AGRDYAPVDMSERTLFDTYLPPFKA 548

Query: 264  CVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFL 323
            C+  G   + M ++N +NGIP+ A P LL   +RG+W+ +G++V+D ++++ +V   + +
Sbjct: 549  CIDAG-VLTFMSAFNDINGIPASAHPFLLKDLLRGQWNFNGFVVSDWEAVKQLV--AQGV 605

Query: 324  ADSKEDAVAQTLKAGLDLDCGQ-YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFF 382
            A+  +DA      +G+D+D     Y  +    ++ GK+   D+D S+  +  +   LG F
Sbjct: 606  AEDDKDATRLAFNSGIDMDMTDGLYNKYMKELIEAGKISMEDVDNSVSRILHIKYALGLF 665

Query: 383  DGSPQYVS--LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
                ++ +     Q I   E ++ A + A +  VLLKND +TLPL +  V+++AVVGP A
Sbjct: 666  VDPYKFCNEEYESQTIMKKEFLDAALDMAHKSAVLLKNDNHTLPL-AKNVRSIAVVGPLA 724

Query: 441  NATVAMIGNY-AGIPCRYMSPIAGFSGYAN--------VTYKTGCDDVACKSNNSIFAAS 491
            +    ++G++ A    R+++ +    G  N        V Y  GC D   +  +    A 
Sbjct: 725  DNQTELLGSWRARGEDRHVTTV--LQGIKNKIGGNKTKVGYARGC-DFDGEDKSGFKEAV 781

Query: 492  EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
            + A  +D  I + G    +  ES  R  L LPG Q +LI ++    K PV++V+M+   +
Sbjct: 782  KLASKSDMVIAVVGEKALMSGESRSRAQLDLPGVQEELIKELVATGK-PVVVVLMNGRPL 840

Query: 552  DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD-YVQMLPLTS 610
             I +   + N+ AIL   + G   G AIAD++FG +NP GRL I++   +  V +     
Sbjct: 841  SIEW--VDKNVSAILETWFLGTSAGTAIADILFGDYNPSGRLTISFPRVEGQVPVYYNYK 898

Query: 611  MPLRPVD-SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
               RP D       R     N P LYPFGYGLSYT F Y+    T+              
Sbjct: 899  KSGRPGDMPHSSTTRHIDVPNAP-LYPFGYGLSYTTFSYSAPQSTQK------------- 944

Query: 670  LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
              YT   +                     V   N G  DG + V +Y           +K
Sbjct: 945  -EYTRQET-----------------ISVSVTVTNTGDRDGEETVQLYVNDKVASVVRPVK 986

Query: 730  QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
            ++  F+++F++AG +K ++F  +   +L   D A N ++  GE  I  G
Sbjct: 987  ELKAFKKIFLKAGESKTVQFDISPL-ALGFYDAAMNYVVEPGEFEIMTG 1034


>gi|374312362|ref|YP_005058792.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
 gi|358754372|gb|AEU37762.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
          Length = 874

 Score =  261 bits (668), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 160/425 (37%), Positives = 228/425 (53%), Gaps = 42/425 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R+ +L+++MT+ E++ QL D A  + RLGLP Y WW+E LHG++  G           AT
Sbjct: 38  RIDELIAKMTVSERIAQLQDRAPAIERLGLPSYNWWNEGLHGLARDG----------YAT 87

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMY------NLGR-AGLTYWSPNINVARDPR 176
            FP  I   A+++  L  ++G  VSTEARA +      N  R  GLT WSPNIN+ RDPR
Sbjct: 88  VFPQAIGLAATWDAPLLHEVGDVVSTEARAKFYSHGGENTPRFGGLTVWSPNINIFRDPR 147

Query: 177 WGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP--LKVSSCCKHYAAYDVD 234
           WGR  ET GEDPF+       +V G+Q            + P  LK  +  KH+AA+   
Sbjct: 148 WGRGQETYGEDPFLTATLGTQFVEGVQ-----------GNDPFYLKADATPKHFAAHSGP 196

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
             +G D   F+A V+  D+ +T+L  F        A+++MCSYN ++G PSCA    L  
Sbjct: 197 E-EGRDS--FNAVVSPHDLADTYLPAFHALTTNAHAAALMCSYNEIDGTPSCASGNNLQD 253

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNA 354
            VR  W   GY+V+DCD++  +   H F  D+   A A  L AG+DLDCG  Y   +  +
Sbjct: 254 LVRERWGFKGYVVSDCDAVGNIAGYHHFATDNAHGA-ADALNAGVDLDCGNTYAALS-KS 311

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDG---SPQYVSLGKQDICSDENIELAAEAARE 411
           + Q    E  ++++L  L    +RLG  D    SP Y  +G +++ S  +  LA  AA E
Sbjct: 312 LDQNLTTEAKLNQALHRLLLARVRLGMLDPLSCSP-YRDIGAEELDSPAHHTLALRAAEE 370

Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGF-SGYANV 470
            IVLLKND   LPL  A  + V+V+GP A+    +  NY G     ++P+ GF S + +V
Sbjct: 371 SIVLLKND-GVLPLQ-ASTQKVSVIGPTADMVKVLEANYHGTALHPITPLDGFRSRFHDV 428

Query: 471 TYKTG 475
           +Y  G
Sbjct: 429 SYAQG 433



 Score =  125 bits (314), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 91/299 (30%), Positives = 140/299 (46%), Gaps = 55/299 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKG 539
           A + A  +D  +   GL   +E E+L          DR  L LP  Q  L++++ ++ K 
Sbjct: 598 AVQTAAKSDVIVAFVGLSPDLEGEALQLRLKGFNGGDRTSLDLPEAQRTLLSRLTQLHK- 656

Query: 540 PVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN 599
           PVI+V+ S  GV  A      +   +L A YPGE GG A+A ++ G  NP GRLP+T+Y 
Sbjct: 657 PVIIVLTSGSGV--ALGPEAKDAAGVLEAWYPGEAGGEALAGILAGNVNPSGRLPVTFYR 714

Query: 600 GDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQV 659
                   +  +P     S+ +  RTY++++GP L+PFGYGLSY+ F+Y           
Sbjct: 715 S-------VDDLPAFTDYSMAH--RTYRYFDGPVLFPFGYGLSYSHFQYG---------- 755

Query: 660 NLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKP 719
                     L  ++   KT  P V +            V   N    +G++V  +Y +P
Sbjct: 756 ---------QLRLSTHMLKTSEPLVAM------------VTVHNESQREGTEVAELYLQP 794

Query: 720 PAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           P    A  +  + G QRV +R G  + + F   A   L+ VD +    + AGE+ +FVG
Sbjct: 795 PQASGAPRLT-LQGVQRVALRPGETRELTFKL-APGQLSTVDTSGARTVRAGEYKLFVG 851


>gi|316980598|dbj|BAJ51947.1| putative beta-D-xylosidase [Glycyrrhiza uralensis]
          Length = 285

 Score =  261 bits (668), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 128/277 (46%), Positives = 184/277 (66%), Gaps = 10/277 (3%)

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
           GLD S+EAE  DR  L LPG+Q +L+++VA VA+GPVILV+MS G +D++FA+ +  I A
Sbjct: 2   GLDQSIEAEFRDRVGLLLPGHQQELVSRVARVARGPVILVLMSGGPIDVSFAKNDPKISA 61

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           ILW GYPG+ GG AIADV+FG  NPGGRLP+TWY  +Y+  +P+T+M +RP  + GYPGR
Sbjct: 62  ILWVGYPGQAGGTAIADVIFGTTNPGGRLPMTWYPQNYLAKVPMTNMDMRPNPATGYPGR 121

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
           TY+FY GP ++PFG+GLSYT+F ++L    K + V    LQ   N   ++  +      V
Sbjct: 122 TYRFYKGPVVFPFGHGLSYTRFTHSLAIAPKQVSVPFATLQAFTNSTVSTSKA------V 175

Query: 685 LVNDLRCDDY-FEFKVDFQNVGSTDGSDVVIVYSK-PPAEIAATYIKQVIGFQRVFVRAG 742
            V+   CD     F VD +N GS DG++ ++V+SK PP + +AT  KQ++ F + +V AG
Sbjct: 176 RVSHANCDAMEVGFHVDVKNEGSMDGTNTLLVFSKPPPGKWSAT--KQLVSFHKTYVPAG 233

Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
             +R+K   + CK L++VD      +P GEH + +G+
Sbjct: 234 SKQRVKVGVHVCKHLSVVDEFGIRRIPMGEHELQIGD 270


>gi|313205375|ref|YP_004044032.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312444691|gb|ADQ81047.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 858

 Score =  261 bits (668), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 176/466 (37%), Positives = 242/466 (51%), Gaps = 56/466 (12%)

Query: 47  LQMSSFLFCDS----SLPY-------SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQ 95
           L   +FLF  S     LPY        +R  DL++R+TL EK   + + +  +PRLG+  
Sbjct: 6   LTFIAFLFTVSLVAQQLPYQNPKLSAEVRATDLLARLTLAEKAALMQNNSPAIPRLGIKA 65

Query: 96  YEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY 155
           YEWW+EALHGV   G           AT FP  I   ASFN  L      AVS EARA  
Sbjct: 66  YEWWNEALHGVGRSGV----------ATVFPQAIGMAASFNNGLLFDAFTAVSDEARAKS 115

Query: 156 N-------LGR-AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG 207
           N       L R  GLTYW+PN+N+ RDPRWGR  ET GEDP++     V  V+GLQ  + 
Sbjct: 116 NKFSEQGGLKRYQGLTYWTPNVNIFRDPRWGRGQETYGEDPYLTSLMGVAVVKGLQGPD- 174

Query: 208 HENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVK 266
                  N+   K+ +C KH+A +    W   +R+ F+A  +  +D+ ET+L  F+  V+
Sbjct: 175 -------NAEYDKLHACAKHFAVHSGPEW---NRHSFNAENINPRDLWETYLPAFKALVQ 224

Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV--DNHKFLA 324
           + D   VMC+YNR    P C   +LL Q +R +W   G +V+DC +I      + H    
Sbjct: 225 KADVKEVMCAYNRFEDEPCCGSNRLLTQILRNDWKFDGLVVSDCWAISDFYKPNAHATQP 284

Query: 325 DSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
           D+   A A  +  G DL+CG  + N    AV+ G ++E  ID SLK L      LG  + 
Sbjct: 285 DATH-AAANAVLNGTDLECGSDFRNLP-EAVKAGLIEEKRIDVSLKRLLKARFELGEMN- 341

Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
           S Q   +    + S+++  LA   A E IVLL+N+ N LPL S K+K +AV+GP+AN +V
Sbjct: 342 SDQVWPISYSVVNSEKHQNLALRMAEESIVLLQNNNNILPL-SKKLK-IAVMGPNANDSV 399

Query: 445 AMIGNYAGIPCRYMSPIAG----FSGYANVTYKTGCD---DVACKS 483
              GNY G P   ++ +      F G A + Y+ GCD   DVA  S
Sbjct: 400 MQWGNYNGFPAHTVTLLEAMRKSFPG-AQLIYEPGCDRTMDVAVSS 444



 Score =  100 bits (250), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 78/301 (25%), Positives = 133/301 (44%), Gaps = 56/301 (18%)

Query: 489 AASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAK 538
           A+    K AD  +   G+  S+E E +          DR D+ LP  Q +L+  + +  K
Sbjct: 587 ASIAKVKDADVVVFAGGIAPSLEGEEMRVTVPGFKGGDRTDIELPAIQRRLLQALKDAGK 646

Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
               +V ++  G  +       + +AIL A YPG+ GG A+A+V+ G +NP GRLP+T+Y
Sbjct: 647 K---VVFVNFSGSAMGLVPETQSCEAILQAWYPGQAGGTAVANVLLGNYNPSGRLPVTFY 703

Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
               V  LP         +     GRTY++     L+ FGYGLSYT+F   +L   K   
Sbjct: 704 KN--VAQLP-------DFEDYSMKGRTYRYMTEKPLFSFGYGLSYTKF---VLGTAK--- 748

Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSK 718
             LNK                       + ++ ++  +  V   N G   G++V+ VY +
Sbjct: 749 --LNK-----------------------SSIKANETLKITVPVTNAGKVAGTEVLQVYVR 783

Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFV 777
              ++     K + GF++V +  G+  +I     +  +    D+    ++   GE+ ++ 
Sbjct: 784 KVKDVDGP-AKTLRGFKKVNIEPGKTSQISIDLTSS-AFEFYDWTQRKMMVTPGEYEVYY 841

Query: 778 G 778
           G
Sbjct: 842 G 842


>gi|313204103|ref|YP_004042760.1| glycoside hydrolase [Paludibacter propionicigenes WB4]
 gi|312443419|gb|ADQ79775.1| glycoside hydrolase family 3 domain protein [Paludibacter
           propionicigenes WB4]
          Length = 1278

 Score =  261 bits (668), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 157/427 (36%), Positives = 240/427 (56%), Gaps = 35/427 (8%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           ++ +++  +  R  DLVSRMTL+EK  QLG+    +PRLG+ +Y+ W EALHGV  VG  
Sbjct: 38  IYLNTAYSFKERAADLVSRMTLEEKQSQLGNTMPPIPRLGVNKYDVWGEALHGV--VGRN 95

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVA 172
            +   +   ATSFP  +   ++++ +L K+    V+ EAR   +     LTYWSP I  A
Sbjct: 96  NNSGMI---ATSFPNSVAVGSTWDPALIKRETSVVADEARGFNHDLIFTLTYWSPVIEPA 152

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           RDPRWGR  ET GEDPF+V +    +V+GL      ++ T L + P     C KHY A  
Sbjct: 153 RDPRWGRTAETFGEDPFLVSQIGSGFVQGLM----GDDPTYLKTVP-----CGKHYFA-- 201

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
             N    +R++  A + ++DM E +L P+   +++    S+M +Y+ VNG+P  A   L+
Sbjct: 202 --NNSEFNRHNGSANMDDRDMREFYLTPYRTLIQKDKLPSIMTAYSAVNGVPMSASKFLV 259

Query: 293 NQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTG 352
           +   +  + L GY+  DCD++  +V++H++ A SK +A A  LK G+D DCG  Y     
Sbjct: 260 DTIAKRTYGLDGYVTGDCDAVADVVNSHRY-AKSKAEAAAMGLKTGVDSDCGGIYQTSAL 318

Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ----YVSLGKQDICSDENIELAAEA 408
            A++QG + E D+DK+L  +YT+ MRLG FD  PQ    Y  +    I    + +LA E 
Sbjct: 319 EALKQGLISEADMDKALVNIYTIRMRLGEFD--PQNIVPYAGIKPSIINDPSHNDLALEI 376

Query: 409 AREGIVLLKND------QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI--PCRYMSP 460
           A +  VLLKN+      +  LPLN+  +K +AV+GP A+     +G+Y+G   P   ++P
Sbjct: 377 ATKSPVLLKNNLVGKSGKKALPLNAGTIKKIAVLGPQADK--VELGDYSGEADPKYKITP 434

Query: 461 IAGFSGY 467
           + G   Y
Sbjct: 435 LEGIKNY 441



 Score =  132 bits (332), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 87/259 (33%), Positives = 126/259 (48%), Gaps = 39/259 (15%)

Query: 492 EAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGV 551
           + A +AD  ++  G D +   E  DR  + LPG Q +LI  +A V     I+VI   G V
Sbjct: 610 DMAASADVAVVFVGTDQTTGREESDRFAITLPGNQNELIKSIAAVNPN-TIVVIQGMGMV 668

Query: 552 DIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLP-LTS 610
           ++   + N N+  I++ GY G+  G A+A V+FG  NPGG+  +TWY    +  LP LT 
Sbjct: 669 EVEQFKNNPNVAGIIFTGYNGQAQGTAMAKVLFGDVNPGGKTSLTWYKS--INDLPALTD 726

Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
             LR     G  GRTY ++N    Y FGYGLSYT F Y+  + +KT              
Sbjct: 727 YTLR--GGAGKNGRTYMYFNKDVSYEFGYGLSYTTFAYSNFNISKT-------------- 770

Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATY--I 728
                             +  +D     VD +N G+ DG +VV +Y K P   A+    I
Sbjct: 771 -----------------SITPNDKVTVTVDVKNTGTVDGDEVVQIYVKTPDSPASLERPI 813

Query: 729 KQVIGFQRVFVRAGRNKRI 747
           K++ GF+RV + AG+ K +
Sbjct: 814 KRLKGFKRVAIPAGQTKTV 832


>gi|293370402|ref|ZP_06616956.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292634550|gb|EFF53085.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 863

 Score =  261 bits (667), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 162/458 (35%), Positives = 242/458 (52%), Gaps = 45/458 (9%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVS 107
           Q S + + D+ L    R  DL+ R+TL+EKV  + + +  +PRLG+  YEWW+EALHGV+
Sbjct: 22  QPSKYPYQDTKLTVEQRADDLLQRLTLEEKVALMQNNSPAIPRLGIKPYEWWNEALHGVA 81

Query: 108 NVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA---MYNLG-----R 159
             G           AT FP  I   ASFN+ L  ++  AVS EARA    +N        
Sbjct: 82  RAGL----------ATVFPQAIGMAASFNDELLYEVFDAVSDEARAKNRQFNEKGQYKRY 131

Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
            GLT W+PN+N+ RDPRWGR  ET GEDP++ GR  +  VRGLQ  E  E          
Sbjct: 132 QGLTMWTPNVNIFRDPRWGRGQETYGEDPYLSGRMGMAAVRGLQGPEDAEYD-------- 183

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
           K+ +C KH+A +    W   +R+ F+A  +  +D+ ET+L  F+  V++     VMC+YN
Sbjct: 184 KLHACAKHFAVHSGPEW---NRHSFNAENIAPRDLWETYLPAFKELVQKAGVKEVMCAYN 240

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI-----QVMVDNHKFLADSKEDAVAQ 333
           R  G P C   +LL Q +R +W   G +V DC +I     +   + H   A +  DAV  
Sbjct: 241 RFEGDPCCGSNRLLTQILRNDWGFKGIVVTDCGAIGDFFQRKKHETHPDAAHASADAVL- 299

Query: 334 TLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK 393
              +G DL+CG  + + T +AV++  + E  I+ S+K +      LG  + +  + ++  
Sbjct: 300 ---SGTDLECGGNFKSIT-DAVKKDLISEEKINTSVKRVLKARFELGEMNSTHPWSNIPF 355

Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
             I   ++ ELA + A E +VLL+N+ N LPLN  +   VAV+GP+AN +V   GNY G 
Sbjct: 356 SVIDCPKHKELALKMAHESLVLLQNNNNILPLN--RQMKVAVIGPNANDSVMQWGNYNGF 413

Query: 454 PCRYMSPIAGFSGY---ANVTYKTGCDDVACKSNNSIF 488
           P   ++ + G       A + Y+  C      + +S+F
Sbjct: 414 PSHTVTLLEGIRAKLPDAQIIYEPVCGYTNDTTLHSLF 451



 Score =  119 bits (297), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 94/296 (31%), Positives = 131/296 (44%), Gaps = 56/296 (18%)

Query: 495 KTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILV 544
           ++AD  I   G+   +E ES+          DR ++ LP  Q +++   A + K     V
Sbjct: 598 QSADVVIFAGGISPLLEGESMRVSDPGFKGGDRTEIELPAIQREVL---ALLKKNGKKTV 654

Query: 545 IMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQ 604
            ++  G  +A      N  AIL A YPG+ GG A+ADV+FG +NP GRLPIT+Y    +Q
Sbjct: 655 FVNFSGSAMAIVPETQNCDAILQAWYPGQAGGTAVADVLFGDYNPAGRLPITFYKS--MQ 712

Query: 605 MLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKL 664
            LP         +     GRTY+F     LYPFGYGLSYT+F Y   +      +N +KL
Sbjct: 713 QLP-------DYEDYSMKGRTYRFMTETPLYPFGYGLSYTRFSYGKAT------LNQSKL 759

Query: 665 QHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIA 724
                   T                         +   NVG  DG +VV VY   P +  
Sbjct: 760 TKGEKAILT-------------------------IPVSNVGQRDGEEVVQVYICRPDDKE 794

Query: 725 ATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA-GEHTIFVGN 779
               K + GFQRV +  G+ + ++       S    D A NT+ P  G + I  GN
Sbjct: 795 GPQ-KTLRGFQRVSIAKGKTQNVQIEL-PYDSFEWFDAATNTIRPLNGTYKILYGN 848


>gi|423287910|ref|ZP_17266761.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
           CL02T12C04]
 gi|392671925|gb|EIY65396.1| hypothetical protein HMPREF1069_01804 [Bacteroides ovatus
           CL02T12C04]
          Length = 782

 Score =  261 bits (667), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 218/741 (29%), Positives = 340/741 (45%), Gaps = 154/741 (20%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    EA HG   +G           AT FPT I   A+++  L K++GQ ++ 
Sbjct: 129 RLGIPMF-LAEEAPHGHMAIG-----------ATVFPTGIGMAATWSLELVKEVGQVIAK 176

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R+     + G   + P +++ RDPRW R+ ET GEDP + G    + V GL       
Sbjct: 177 EIRS-----QGGHISYGPVLDLTRDPRWSRVEETFGEDPVLSGILGASMVDGL------- 224

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
              +L+ +   +++  KH+ AY V        Y   A V  +D+ + FL PF   +  G 
Sbjct: 225 GGGNLSQKYATIATL-KHFLAYAVPEGGQNGNY---ASVGIRDLHQNFLPPFRKAIDSG- 279

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKED 329
           A SVM SYN ++GIP  ++  LL Q +R EW   G++V+D  SI+ + ++H F+A +KE+
Sbjct: 280 ALSVMTSYNSIDGIPCTSNHYLLTQLLRNEWKFCGFVVSDLYSIEGIHESH-FVALTKEN 338

Query: 330 AVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQY 388
           A  Q++ AG+D+D G   YTN   +AVQ G++ +  ID ++  +  +   +G F+     
Sbjct: 339 AAIQSVTAGVDVDLGGDAYTNLC-HAVQSGQMDKAVIDTAVCRVLRMKFEMGLFEHPYVD 397

Query: 389 VSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIG 448
             +  + +   E+IELA + A+  I LLKN+ + LPL S  +  VAV+GP+A+    M+G
Sbjct: 398 PKIAAKTVRRKEHIELARKIAQSSITLLKNENSILPL-SKTINKVAVIGPNADNRYNMLG 456

Query: 449 NYA-------------GIPCRYMSPIAGFSGYANVTYKTGC---DDVACKSNNSIFAASE 492
           +Y              GI  + +SP         V Y  GC   D    +   +I AA  
Sbjct: 457 DYTAPQEDSNVKTVLDGILTK-LSPF-------RVEYVRGCAIRDTTVNEIEQAIKAARR 508

Query: 493 AA------------------KTADATIILAGLDLSVE-AESLDREDLWLPGYQTQLINQV 533
           +                   K   A +   G    +E  E  DR  L L G Q +L+  +
Sbjct: 509 SEVVIVVVGGSSARDFKTSYKETGAAVAEEGSVSDMECGEGFDRASLSLLGRQQELLESL 568

Query: 534 AEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRL 593
            +  K P+I+V +    ++  +A    +  A+L A YPG+EGG AIADV+FG +NP GRL
Sbjct: 569 QKTGK-PLIVVYIEGRPLEKNWASEYAD--ALLTAYYPGQEGGNAIADVLFGDYNPSGRL 625

Query: 594 PIT----------WYNG------DYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPF 637
           PI+          +YN       DYV+M   +S P                     LY F
Sbjct: 626 PISVPRSVGQIPVYYNKKAPRNHDYVEM---SSFP---------------------LYSF 661

Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
           GYG+SYT F+Y+ L                                V+    RC   FE 
Sbjct: 662 GYGMSYTTFEYSDLQ-------------------------------VVQKSARC---FEV 687

Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSL 757
               +N G  DG +V  +Y +         +KQ+  F+R  ++ G  K++ FV    +  
Sbjct: 688 SFKVKNTGKYDGEEVSQLYMRDEYASVVQPMKQLKHFERFHLKKGEEKKVTFVLTE-EDF 746

Query: 758 NIVDYAANTLLPAGEHTIFVG 778
            +V+Y    ++ +G   + +G
Sbjct: 747 FLVNYTLKKVVESGNFHLMIG 767


>gi|393782348|ref|ZP_10370533.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
           CL02T12C01]
 gi|392673619|gb|EIY67078.1| hypothetical protein HMPREF1071_01401 [Bacteroides salyersiae
           CL02T12C01]
          Length = 852

 Score =  261 bits (666), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 163/415 (39%), Positives = 235/415 (56%), Gaps = 39/415 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           LF D + P   R+ DL+SR+T++EK+  L + A  +PRL + +Y   +EALHG+  V PG
Sbjct: 29  LFRDMNAPQHERLLDLLSRLTIEEKISLLVNDAREIPRLNIDKYYHGNEALHGI--VRPG 86

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY---NLGR---AG----L 162
                     T FP  I   A++N  L  ++  A+S EAR  +   + G+   AG    L
Sbjct: 87  EF--------TVFPQAIGLAATWNPGLIFEVSSAISDEARGRWKELDYGKKQIAGASDLL 138

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+WSP +N+ARDPRWGR  ET GEDPF+ G     +V+GLQ           + R LK  
Sbjct: 139 TFWSPTVNMARDPRWGRTPETYGEDPFLTGVIGCEFVKGLQGD---------HPRYLKTV 189

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S  KH+AA + ++    +R   +AR++E+D+ E +L  FE C+ +  A S+M +YN VNG
Sbjct: 190 STPKHFAANNEEH----NRSSCNARMSERDLREFYLPSFERCIVDAKAQSIMMAYNAVNG 245

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P   +  L+   +RG+W  +GYIV+DC + + MV  HK++ D  + A    +KAGLDL+
Sbjct: 246 VPCTVNTYLIKNVLRGDWGFNGYIVSDCSAPEWMVTKHKYVRDL-DAAATLAIKAGLDLE 304

Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
           CG + YT     A  +  V + DID +   +    M LG FD   Q  Y  +    I   
Sbjct: 305 CGDRVYTAPLLKAYNESMVSKADIDSAAYRVLRGRMLLGLFDDPSQNPYNQIEPSVIGCK 364

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
           ++ ELA E AR+ +VLLKN +N LPLN  KVK++AVVG   NA     G+Y+GIP
Sbjct: 365 KHQELALETARQSMVLLKNQKNFLPLNLKKVKSIAVVG--INAGHCEFGDYSGIP 417



 Score =  136 bits (342), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 95/291 (32%), Positives = 141/291 (48%), Gaps = 49/291 (16%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A +AAK  D T+ + G++ S+E E  DR  L LP  Q + I ++ +V    V++++    
Sbjct: 597 AGKAAKECDVTVAVLGINKSIEREGQDRYSLELPTDQQEFIRELYKVNPNTVVVLV---A 653

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
           G  +A    + N+ AIL A YPGE+GG AIA+V+FG +NPGGRLP+T+YN        L 
Sbjct: 654 GSSLAINWIDENVPAILNAWYPGEQGGTAIAEVLFGDYNPGGRLPLTYYNS-------LD 706

Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
            +P    D+     RTY+++ G  LY FGYGLSYT+F Y              K ++   
Sbjct: 707 ELP--SFDNYSVQNRTYQYFKGKPLYEFGYGLSYTKFNY--------------KKKNVSI 750

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
            N T D +                   FKV   N G  DG +V  VY + P       +K
Sbjct: 751 ANDTIDIT-------------------FKV--SNAGKYDGDEVAQVYVQYPETGTYMPLK 789

Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVGN 779
           Q+ GF RV ++ G++  +       K L   D      + P G++   +G+
Sbjct: 790 QLRGFSRVHIKKGKSADVTISVPK-KELRYWDEKTRQFVTPEGKYVFLIGS 839


>gi|383302737|gb|AFH08276.1| hypothetical protein [uncultured bacterium]
          Length = 768

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 211/710 (29%), Positives = 342/710 (48%), Gaps = 123/710 (17%)

Query: 101 EALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA 160
           +A+HG +N           P  T +PT I   +SF+  +  KI +  + E RAM NL   
Sbjct: 134 DAIHGNANA----------PDNTVYPTNIGLASSFDPEMAYKIARQTAAEMRAM-NL--- 179

Query: 161 GLTYWS--PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
              +W+  PN++V RDPRWGR+ ET GEDP+++       V G + V+G++   D    P
Sbjct: 180 ---HWTFNPNVDVVRDPRWGRVGETFGEDPYLIS------VLGAESVKGYQGTLDT---P 227

Query: 219 LKVSSCCKHY--AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
             V +C KH+    +  +   G         V+E+ + E  L PFE  V+ G A S+M S
Sbjct: 228 NDVLACIKHFVGGGFPANGTNGSP-----TDVSERTLREVLLPPFEAGVEAG-AGSLMTS 281

Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
           +N VNGIP+ ++  L+   +RGEW   G++V+D   I+ + D H+  A++ ++A  Q++ 
Sbjct: 282 HNEVNGIPAHSNEWLMRDVLRGEWGFKGFVVSDWMDIEHIYDLHR-TAENLKEAFYQSIM 340

Query: 337 AGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD 395
           AG+D+   G Y+       V++G++ E+ ID+S++ +  V  RLG F+      +   + 
Sbjct: 341 AGMDMHMHGIYWNELVCELVREGRIPESRIDESVRRILDVKFRLGIFENPYADEARTMEV 400

Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI-- 453
             S  +   A EAAR  IVLLKND   LPL+++K K V V G +A+    ++G+++    
Sbjct: 401 RLSPGHRATALEAARNSIVLLKND-GVLPLDASKYKRVMVTGINADDE-NILGDWSASQR 458

Query: 454 PCRYMSPIAGFSGYANVTYKTGCD---DVACKSNNSIFAASEAAKTADATIILAG----- 505
           P    + + G    A  T+    D   +    S   +  A+E A+ AD  I++AG     
Sbjct: 459 PENVTTILEGLREVAPDTHFEFVDQGWNPQTMSPAQVEKAAEHARHADLNIVVAGEYMMR 518

Query: 506 --LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
               L    E  DR D+ L G Q +LI +VA   K P IL++++   + + +A    N+ 
Sbjct: 519 HRWALRTGGEDTDRSDIDLVGLQNELIEKVAASGK-PTILILVNGRQLGVEWAA--ENLP 575

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
           AI+ A  PG  GG+A+A++++G  NP  +LP+T         +P      R V      G
Sbjct: 576 AIVEAWEPGMYGGQAVAEILYGTVNPSAKLPVT---------IP------RSV------G 614

Query: 624 RTYKFYN-GPTLY--------------PFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
           +   +YN  P+LY              PFG+GLSYT ++Y+                   
Sbjct: 615 QIQMYYNHKPSLYFHPYAAGKSSSPLWPFGFGLSYTTYEYS------------------- 655

Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYI 728
           +L  +SD            ++  D   +  V  +N GS DG +++ +Y +         +
Sbjct: 656 DLRLSSD------------EIAADGTLDVTVRVKNTGSRDGVEIIQLYIRDLYSSVTRPV 703

Query: 729 KQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           K++  F RV ++AG  K I F     K L  +D     ++  GE  + VG
Sbjct: 704 KELKDFGRVALKAGETKDITFTITPDK-LQFLDKDLRPVVEPGEFVVMVG 752


>gi|410613210|ref|ZP_11324278.1| beta-glucosidase [Glaciecola psychrophila 170]
 gi|410167352|dbj|GAC38167.1| beta-glucosidase [Glaciecola psychrophila 170]
          Length = 743

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 206/727 (28%), Positives = 344/727 (47%), Gaps = 104/727 (14%)

Query: 59  LPYSIR---VKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHF 115
           L YSIR   V  +++ + L   V +L   A    RLG+P              +G     
Sbjct: 53  LAYSIRQGRVGSILNEVRL-HTVNELQRLAVEESRLGIPLL------------IG----- 94

Query: 116 DDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVAR 173
            DVI G  T FP  +   AS+     K+     + EA ++      G+ + ++P I+++R
Sbjct: 95  RDVIHGFNTIFPIPLGQAASWCVETVKQCAHISALEAASV------GVNWTFAPMIDISR 148

Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
           DPRWGRI E+ GEDP++     V  ++G Q  E H+N +        +++C KH+A Y  
Sbjct: 149 DPRWGRIAESLGEDPYLCSVLGVAMLQGFQGDELHKNGS--------IAACAKHFAGYGA 200

Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
                  R +    + E ++   +L PF+     G A+  M +++ +NG+P+  +  L+ 
Sbjct: 201 GE---SGRDYSTTNIPENELRNVYLPPFKAAADAGVAT-FMAAFSDLNGVPASGNKWLMT 256

Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTG 352
             +R EWD  G++V+D +S+ + +  H F  D+K DA  +   AG+D++     Y     
Sbjct: 257 DILREEWDYKGFVVSDWESV-IQLTTHGFSKDNK-DAAYEAANAGIDMEMVSSAYFEHLP 314

Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREG 412
           + V +G++    I+ ++K +  +  +LG FD      SL  + + S +N++ A +AA + 
Sbjct: 315 DLVAEGRIDMRQINNAVKKILHLKWQLGLFDSPYTDASLLPKPLNS-QNLQAAKDAAIKS 373

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYMSPIAG----FSG 466
            VLLKND+N LPL++  + +VA++GP A+     +G +   G P    + +       SG
Sbjct: 374 CVLLKNDKNILPLSAGSLHSVAIIGPLADDPYEQLGTWIFDGDPQHSQTCLTAITQELSG 433

Query: 467 YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
            AN+ +                 A ++A TAD  I++ G +  +  E+  R ++ LPG Q
Sbjct: 434 KANIHHVKAMQTSRSHDQADFKQAVKSASTADVAILILGEESILSGEAHCRAEIDLPGCQ 493

Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
            QLIN +AE    P++LVIM+  G  +        + A+L+A +PG  GG AIAD++FGK
Sbjct: 494 EQLINAIAETGT-PIVLVIMA--GRPLTIETVLPKVDAVLFAWHPGTMGGPAIADLLFGK 550

Query: 587 FNPGGRLPITWYNGD------YVQ-----------MLPLTSMPLR-PVDSLGYPGRTYKF 628
             P G+LP+T+          Y Q            + + ++P+  P  SLG        
Sbjct: 551 ACPSGKLPVTFPRKVGQVPIYYAQKHSGKPATEQAFIHMDNIPVHSPQTSLGMAATHLDT 610

Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
           +  P L+PFG+GLSYTQF Y  L           +L H                      
Sbjct: 611 HFSP-LFPFGFGLSYTQFSYQNL-----------ELSH--------------------KT 638

Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
           L+  +    +V   NVG TDG ++  +Y +         +K++  F+RV + AG+N+ + 
Sbjct: 639 LKLGETLVVRVLLTNVGDTDGEEIAQLYIRDLVGSVTRPVKELKDFKRVKLTAGKNEWVT 698

Query: 749 FVFNACK 755
           F  +  K
Sbjct: 699 FELSTDK 705


>gi|354582345|ref|ZP_09001247.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
 gi|353199744|gb|EHB65206.1| glycoside hydrolase family 3 domain protein [Paenibacillus lactis
           154]
          Length = 765

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 199/708 (28%), Positives = 330/708 (46%), Gaps = 109/708 (15%)

Query: 76  EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
           E V ++  +A    RLG+P      E  HG   +G           AT FP  +   +++
Sbjct: 89  EAVNEIQRYAVEHSRLGIPIL-IGEECSHGHMAIG-----------ATVFPVPLSLGSTW 136

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
           N  L++++ +AV+ E R+     + G   +SP ++V RDPRWGR  E  GEDP+++G +A
Sbjct: 137 NTELYREMCRAVALETRS-----QGGAVTYSPVLDVVRDPRWGRTEECFGEDPYLIGEFA 191

Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDME 254
              V GLQ       A+        V++  KH+  Y   +  +     H   R    ++ 
Sbjct: 192 AASVEGLQGESLDGEAS--------VAATLKHFVGYGSSEGGRNAGPVHMGTR----ELM 239

Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
           E  + PF+  V+ G A+S+M +YN ++G+P   + +LL+  +R EW   G ++ DC +I 
Sbjct: 240 EVDMYPFKKAVEAG-AASIMPAYNEIDGVPCTVNEELLDGVLRKEWGFDGMVITDCGAIN 298

Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
           ++   H    D   DA    + AG+D++  G+ +  +   AVQ+ ++  + +D++++ + 
Sbjct: 299 MLAAGHDTAEDGM-DAAVSAISAGIDMEMSGEMFGMYLERAVQEKRLDVSVLDEAVRRVL 357

Query: 374 TVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
           T+  +LG F+      +  +Q I    + E+A + A EGIVLLKN+ +TLPL S +   +
Sbjct: 358 TLKFKLGLFENPYADPARAEQVIGCSRHREMARQLAAEGIVLLKNEGSTLPL-SKEDGVI 416

Query: 434 AVVGPHANATVAMIGNYAG--IPCRYMSPIAGFSG-----YANVTYKTGCDDVACKSNNS 486
           AV+GP+A+     +G+Y     P R ++ + G           V Y  GC  +   S   
Sbjct: 417 AVIGPNADQGYNQLGDYTSPQPPSRVVTVLEGIRAKLGGDKGRVLYAPGC-RINGDSREG 475

Query: 487 IFAASEAAKTADATIILAG-----------LDLSVEA--------------ESLDREDLW 521
              A   A  AD  +++ G           +DL   A              E +DR  L 
Sbjct: 476 FELALSCAGQADTVVLVLGGSSARDFGEGTIDLRTGASKVTGNDWSDMDCGEGIDRMTLQ 535

Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
           L G Q +L  ++ ++ K    LV++   G  IA    + +  AIL A YPG+EGG A+AD
Sbjct: 536 LSGVQLELAREIHKLGK---RLVVVYINGRPIAEPWIDRHADAILEAWYPGQEGGHAVAD 592

Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
           ++FG  NP G+L I+     +V  LP+     R        G+ Y   +    YPFGYGL
Sbjct: 593 ILFGDVNPSGKLTISIPK--HVGQLPVYYNGKRS------RGKRYLEEDSQPQYPFGYGL 644

Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
           SYT+F+Y+ L  T                                  +R  +     V+ 
Sbjct: 645 SYTEFRYSDLQVTP-------------------------------QTIRTGETAVVTVNV 673

Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
           +N GS  G++VV +Y    A       K++ GF+++++  G  +RI+F
Sbjct: 674 ENSGSVAGAEVVQLYINDAASRFTRPAKELKGFRKIYLEPGEKQRIEF 721


>gi|295135338|ref|YP_003586014.1| glycoside hydrolase [Zunongwangia profunda SM-A87]
 gi|294983353|gb|ADF53818.1| glycoside hydrolase family protein [Zunongwangia profunda SM-A87]
          Length = 764

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 211/734 (28%), Positives = 336/734 (45%), Gaps = 107/734 (14%)

Query: 76  EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
           EK++   D+A    R+G+P     S+ +HG                 T+FP  + T AS+
Sbjct: 90  EKIRVAQDYAVNDTRMGIPLL-IGSDVIHGYK---------------TTFPIPLGTAASW 133

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRY 194
           +  + KK  +  + EA A       G+ + +SP +++ARDPRWGRI E  GEDP++  + 
Sbjct: 134 DMEMIKKTAEIAAQEATA------DGINWNFSPMVDIARDPRWGRIAEGAGEDPYLGSQI 187

Query: 195 AVNYVRGLQ-DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDM 253
           A   V G Q D    EN          + +  KH+A Y         R +    ++   M
Sbjct: 188 AKAMVEGYQGDDLAKENT---------MIATVKHFALYGASE---AGRDYNTTDMSRVKM 235

Query: 254 EETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI 313
              +L P++  +  G A SVM S+N V+G+P+  +  LL   +R  W   G++ +D  S+
Sbjct: 236 FNEYLPPYKAAIDAG-AESVMSSFNDVDGVPATGNKWLLTDLLRDRWGFEGFVTSDYTSL 294

Query: 314 QVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYL 372
             M+ +   + D +    A  LKAGLD+D  G+ Y      ++ +GKV E +I  + + +
Sbjct: 295 NEMIAHG--MGDLQA-VSALALKAGLDMDMVGEGYLKTLKKSLDEGKVTEAEITTAARRI 351

Query: 373 YTVLMRLGFFDGSPQYV--SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKV 430
                +LG FD   +Y+  S  ++DI S+EN   + + A    VLLK D    PL   K 
Sbjct: 352 LEAKYKLGLFDDPYKYLDESRPEKDILSEENRTFSRKVAAHSFVLLKKDAGVFPLK--KN 409

Query: 431 KTVAVVGPHANATVAMIGNYA--GIPCRYMSPIAGFSGY---ANVTYKTGC---DDVACK 482
             +A++GP AN    M+G +A  G P   +  + G       A VTY  G    DD    
Sbjct: 410 AKIALIGPLANNKNNMLGTWAPTGNPQLSVPVLQGVKNVAPKAKVTYAQGANITDDAQLA 469

Query: 483 SNNSIFA----------------ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
            N ++F                 A + AK +D  + + G    +  E+  R +L +P  Q
Sbjct: 470 ENINVFGPRAEISETSPEKMLEEALKVAKKSDVIVAVVGEATEMSGEAASRTNLLIPESQ 529

Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
            +LI ++A+  K P+ LV+MS   ++I+  E+  NI  IL   +PG E G AIADV+FG 
Sbjct: 530 KKLIRELAKTGK-PMALVLMSGRPLNIS-EESEMNID-ILQVWHPGVEAGNAIADVIFGD 586

Query: 587 FNPGGRLPITW-YNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT--LYPFGYGLSY 643
           +NP G++  +W  N   V +        RP +  G+     +F + P   LYPFGYGLSY
Sbjct: 587 YNPSGKITASWPRNVGQVPVYYAMKRTGRPGEVEGFQKFKSEFLDTPNSPLYPFGYGLSY 646

Query: 644 TQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQN 703
           T+F+Y                         SD   +       ++L+ D          N
Sbjct: 647 TEFEY-------------------------SDVKAS------ADELKMDGTLTLSAIITN 675

Query: 704 VGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYA 763
            G  DG +VV +Y           +KQ+IGF+++ ++ G +K + F  +A + L   + +
Sbjct: 676 TGDYDGEEVVQLYIHDKVRSITPPMKQLIGFEKIMLKKGESKTVTFEISA-EDLKFYNSS 734

Query: 764 ANTLLPAGEHTIFV 777
              +   GE   F+
Sbjct: 735 LEYVAEPGEFEFFI 748


>gi|423342899|ref|ZP_17320613.1| hypothetical protein HMPREF1077_02043 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409217154|gb|EKN10133.1| hypothetical protein HMPREF1077_02043 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 955

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 222/810 (27%), Positives = 358/810 (44%), Gaps = 146/810 (18%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
           ++ D + P   RV+DL+S+M ++EK  Q+    +G  R+    LP  +W    W      
Sbjct: 60  VYEDPTAPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTPDWKNQLWKDGMGA 118

Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
             E L+G    G                                  P    ++ I G   
Sbjct: 119 IDEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVES 178

Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
             AT+FPT +    ++N  L  K+G     E R +      G T  ++P ++V RD RWG
Sbjct: 179 YIATNFPTQLGLGHTWNRDLVHKVGYITGREGRLL------GYTNVYAPILDVGRDQRWG 232

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  E  GE P++V    V   +G+Q        TD      +V++  KHY AY  +    
Sbjct: 233 RYEEVYGESPYLVAELGVEMAKGMQ--------TDY-----QVAATSKHYIAYSNNKGGR 279

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
                 D +++ +++E   + P++  +KE     VM SYN  +G P  +    L   +RG
Sbjct: 280 EGMARVDPQMSPREVEMLHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRG 339

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
           E+   GY+V+D D+++ +   H   AD KE +V Q++ AGL++ C       Y       
Sbjct: 340 EFGFRGYVVSDSDAVEYLFSKHGTAADMKE-SVLQSVLAGLNIRCTFRSPDSYVLPLREL 398

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREG 412
           + +G +  + ID  ++ +  V   +G FD  P  + L + D  + S EN ++A +A++E 
Sbjct: 399 IAEGALPMSTIDDRVRDILRVKFLVGLFD-QPYQIDLKQADKEVNSAENQQVALQASKES 457

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----A 468
           +VLLKN    LPL+  K+  +AV GP+A+     + +Y  +     + + G         
Sbjct: 458 LVLLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIQNKVKPGT 517

Query: 469 NVTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
            V +  GCD V                +  + I  A E AK +D  +++ G       E+
Sbjct: 518 EVLFTKGCDLVDANWPESELIRYPLTSEEQSEIDKAVENAKKSDVAVVVLGGSNRTCGEN 577

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
             R  L LPG Q  L+  V    K PV+LV+++   + I +A+    + AIL A YPG +
Sbjct: 578 KSRSSLELPGRQLDLLQAVVATGK-PVVLVLINGRPISINWAD--KYVPAILEAWYPGSQ 634

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VD---SLGYPGRTYKF 628
           GG AIAD +FG +NPGG+L +T+     V  +P  + P +P   VD   + G  G   + 
Sbjct: 635 GGTAIADALFGDYNPGGKLTVTF--PKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRV 691

Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
            NGP LYPFGYGLSYT F+Y+ +S    I   +  +                        
Sbjct: 692 -NGP-LYPFGYGLSYTTFEYSDISIQPAIVTQVQPVT----------------------- 726

Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
           +RC           N G   G +VV +Y +       TY K ++GF R+ +  G  K + 
Sbjct: 727 VRC--------KVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELT 778

Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           F     + L +++   + ++  G+  + VG
Sbjct: 779 FTIEP-RDLQLLNSDNHWVVEPGDFKVMVG 807


>gi|218258058|ref|ZP_03474485.1| hypothetical protein PRABACTJOHN_00138 [Parabacteroides johnsonii
           DSM 18315]
 gi|218225777|gb|EEC98427.1| hypothetical protein PRABACTJOHN_00138 [Parabacteroides johnsonii
           DSM 18315]
          Length = 955

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 222/810 (27%), Positives = 358/810 (44%), Gaps = 146/810 (18%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
           ++ D + P   RV+DL+S+M ++EK  Q+    +G  R+    LP  +W    W      
Sbjct: 60  VYEDPTAPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTPDWKNQLWKDGMGA 118

Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
             E L+G    G                                  P    ++ I G   
Sbjct: 119 IDEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVES 178

Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWG 178
             AT+FPT +    ++N  L  K+G     E R +      G T  ++P ++V RD RWG
Sbjct: 179 YIATNFPTQLGLGHTWNRDLVHKVGYITGREGRLL------GYTNVYAPILDVGRDQRWG 232

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R  E  GE P++V    V   +G+Q        TD      +V++  KHY AY  +    
Sbjct: 233 RYEEVYGESPYLVAELGVEMAKGMQ--------TDY-----QVAATSKHYIAYSNNKGGR 279

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
                 D +++ +++E   + P++  +KE     VM SYN  +G P  +    L   +RG
Sbjct: 280 EGMARVDPQMSPREVEMLHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRG 339

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNA 354
           E+   GY+V+D D+++ +   H   AD KE +V Q++ AGL++ C       Y       
Sbjct: 340 EFGFRGYVVSDSDAVEYLFSKHGTAADMKE-SVLQSVLAGLNIRCTFRSPDSYVLPLREL 398

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREG 412
           + +G +  + ID  ++ +  V   +G FD  P  + L + D  + S EN ++A +A++E 
Sbjct: 399 IAEGALPMSTIDDRVRDILRVKFLVGLFD-QPYQIDLKQADKEVNSAENQQVALQASKES 457

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----A 468
           +VLLKN    LPL+  K+  +AV GP+A+     + +Y  +     + + G         
Sbjct: 458 LVLLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIQNKVKPGT 517

Query: 469 NVTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
            V +  GCD V                +  + I  A E AK +D  +++ G       E+
Sbjct: 518 EVLFTKGCDLVDANWPESELIRYPLTSEEQSEINKAVENAKKSDVAVVVLGGSNRTCGEN 577

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
             R  L LPG Q  L+  V    K PV+LV+++   + I +A+    + AIL A YPG +
Sbjct: 578 KSRSSLELPGRQLDLLQAVVATGK-PVVLVLINGRPISINWAD--KYVPAILEAWYPGSQ 634

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VD---SLGYPGRTYKF 628
           GG AIAD +FG +NPGG+L +T+     V  +P  + P +P   VD   + G  G   + 
Sbjct: 635 GGTAIADALFGDYNPGGKLTVTF--PKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRV 691

Query: 629 YNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
            NGP LYPFGYGLSYT F+Y+ +S    I   +  +                        
Sbjct: 692 -NGP-LYPFGYGLSYTTFEYSDISIQPAIVTQVQPVT----------------------- 726

Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
           +RC           N G   G +VV +Y +       TY K ++GF R+ +  G  K + 
Sbjct: 727 VRC--------KVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELT 778

Query: 749 FVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           F     + L +++   + ++  G+  + VG
Sbjct: 779 FTIEP-RDLQLLNSDNHWVVEPGDFKVMVG 807


>gi|254295141|ref|YP_003061164.1| glycoside hydrolase [Hirschia baltica ATCC 49814]
 gi|254043672|gb|ACT60467.1| glycoside hydrolase family 3 domain protein [Hirschia baltica ATCC
           49814]
          Length = 897

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 181/531 (34%), Positives = 264/531 (49%), Gaps = 72/531 (13%)

Query: 1   MAKVVSSLLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLP 60
           M  V S LL    +IA L F+T    +   +      +  + S+       F F D SL 
Sbjct: 1   MKSVKSILLG---TIASLAFATACSSSQTDTETAQTTEEAKSSE-------FRFMDPSLS 50

Query: 61  YSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIP 120
              R  DLVS MTL+EK  Q+ D A  +PRLGL +Y WW+EALHGV+  G          
Sbjct: 51  PKERALDLVSHMTLEEKAAQMYDKAAAIPRLGLHEYNWWNEALHGVARAG---------- 100

Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL--------GRAGLTYWSPNINVA 172
            AT FP  I   A+++E L  ++   +S E RA ++            GLT+WSPNIN+ 
Sbjct: 101 HATVFPQAIGMAATWDEDLMLEVANVISDEGRAKHHFYANEDVYAMYGGLTFWSPNINIF 160

Query: 173 RDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYD 232
           RDPRWGR  ET GEDP++ GR AVN++ GLQ   G ++      +  K  +  KHYA   
Sbjct: 161 RDPRWGRGQETYGEDPYLTGRMAVNFINGLQ---GDDD------KYFKSVATVKHYA--- 208

Query: 233 VDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLL 292
           V +     R+  +   T+ D+ ET+L  F+    E + +SVMC+YN V G P+C   +L+
Sbjct: 209 VHSGPEPSRHRDNYIATDADLYETYLPAFKTAFDETEVASVMCAYNAVWGDPACGSERLM 268

Query: 293 NQTVRGEWDLHGYIVADCDSI-QVMVDNHKFL-----------ADSKEDAVAQTLKAGLD 340
              +R E    GY+V+DC +I     D  K              D++  A A ++  G D
Sbjct: 269 KDLLREELGFDGYVVSDCGAIGDFYYDEEKKAEGTAPYAAHDHVDTRAQAAALSVNMGTD 328

Query: 341 LDCGQYYTNFTG---NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV---SLGKQ 394
           L+CG    N       AV++G + E  ID+S+  LY+ L +LG +D  P  V   ++   
Sbjct: 329 LNCGDGEGNKMDALPQAVKEGLITEETIDQSVVRLYSALFKLGMYD-DPSLVPWSNISID 387

Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
            + S  ++E + EAAR  +VLLKND   LPL       VAV+GP+A+    ++ NY G P
Sbjct: 388 TVASPSHLEKSEEAARASLVLLKND-GILPLKPD--TKVAVIGPNADNWWTLVANYYGQP 444

Query: 455 CRYMSPIAGFS---GYANVTYKTGC-------DDVACKSNNSIFAASEAAK 495
              ++ + G     G  NV+Y  G         +     +N++F  +EA +
Sbjct: 445 TAPVTALKGIKAKIGAENVSYSVGSTIAGDIYSNYKAVPSNTLFHKNEAGE 495



 Score =  109 bits (273), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 69/198 (34%), Positives = 99/198 (50%), Gaps = 22/198 (11%)

Query: 501 IILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
           +   G+D ++E E +          DR  + LP  Q +L+ ++    K PV+LV  S  G
Sbjct: 634 LFFGGIDANLEGEEMGVELDGFLGGDRTHINLPAPQEKLLKELHATGK-PVVLVNFS--G 690

Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
             +A    + N+ AI+ A YPGE+ G AIAD+++G+F+P GRLP+T+Y         L  
Sbjct: 691 SAMALNWEDENLPAIVQAFYPGEKSGTAIADLLWGEFSPSGRLPVTFYKS-------LEG 743

Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
           MP    D      RTYK+Y G  LYPFG+GLSYT F+Y+ L        N N     +  
Sbjct: 744 MPA--FDDYSMENRTYKYYEGEQLYPFGHGLSYTSFEYSDLKLETAYAANENLQVSVKVT 801

Query: 671 NYTSDASKTRCPGVLVND 688
           N    AS+      +  D
Sbjct: 802 NSGDKASREIVQAYVTRD 819


>gi|305663349|ref|YP_003859637.1| glycoside hydrolase family protein [Ignisphaera aggregans DSM
           17230]
 gi|304377918|gb|ADM27757.1| glycoside hydrolase family 3 domain protein [Ignisphaera aggregans
           DSM 17230]
          Length = 757

 Score =  260 bits (664), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 218/790 (27%), Positives = 352/790 (44%), Gaps = 149/790 (18%)

Query: 64  RVKDLVSRMTLDEKVQQL----------------------------------GDFAHGVP 89
           RV++L+ RM+++EK+ QL                                  G  A   P
Sbjct: 6   RVRELIGRMSIEEKIAQLISIPLESVLDGKKFSVEKAREVLKYGVGEILRIGGSSARLSP 65

Query: 90  RLGLPQYEWWSEALHGVSNVG-PGTHFDDVI-----PGATSFPTVILTTASFNESLWKKI 143
           R  +  Y      L   + +G P    ++ I     P AT FP  +   ++++  L  ++
Sbjct: 66  REAVEIYNAIQRFLTRETRLGIPAIVHEESIAGLLAPTATVFPIPLALASTWDPDLVYRV 125

Query: 144 GQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ 203
             A+  +  A+           +P +++ R+PRWGR  ET GED ++     + YV+G+Q
Sbjct: 126 AVAIRRQIMAI-----GSRHTLAPVLDLCREPRWGRCEETYGEDSYLAASMGIAYVKGIQ 180

Query: 204 DVEGHENATDLNSRPLKVSSCCKHYAAYDV-DNWKGVDRYHFDARVTEQDMEETFLRPFE 262
                    D+      V +  KH+  + V +  + +   H   R    ++ E ++ PFE
Sbjct: 181 -------GDDIR---YGVIATGKHFVGHGVPEGGRNIASIHVGLR----ELLEIYMYPFE 226

Query: 263 MCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF 322
             VKE +  S+M +Y+ ++ +P  A+  LL   +RG W   G  V+D + ++ +   H+ 
Sbjct: 227 ATVKEANLLSIMPAYHDIDNVPCHANKWLLTDILRGSWGFKGIAVSDYEGVKQLHTIHRV 286

Query: 323 LADSKEDAVAQTLKAGLDLD--CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLG 380
             D  E AV + +KAG+D++   G+ +      AV++G + E  I+++++ +  +   LG
Sbjct: 287 ARDCMEAAV-KAIKAGVDIEYPSGECFKQLV-EAVRKGLIDEDTINRAVERVLKLKFMLG 344

Query: 381 FFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
            F+      +     + ++ + ELA E AR+ IVLLKND   LPL    +KT+AV+GP+A
Sbjct: 345 LFENPFIDETKVPTTLDNEADRELAREVARKAIVLLKND-GILPLKR-DIKTIAVIGPNA 402

Query: 441 NATVAMIGNYAGIPCRYMSPIAGFSGY------------------------ANVTYKTGC 476
           N   AM+G+Y      Y + I  F G                           V Y  GC
Sbjct: 403 NDPWAMLGDY-----HYDAHIGSFDGTYGKISPSVRIVTVLEAIKSRVSPSTEVLYAKGC 457

Query: 477 DDVACKSNNSIFAASEAAKTADATIILAG-------LDLSVEAESLDREDLWLPGYQTQL 529
           D +     +    A E AK AD  I + G       L +    E +DR  L LPG Q +L
Sbjct: 458 DTIG-DDRSGFGEAIEIAKRADIIIAVMGDRSGLFNLKMFTSGEGVDRASLKLPGVQEEL 516

Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
           + ++A + K P+ILV+++  G  +A +     + AI+ A  PGEEGG AIAD++FG ++P
Sbjct: 517 LKELASLGK-PIILVLIN--GRPLALSSILPYVNAIVEAWRPGEEGGNAIADILFGDYSP 573

Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG--RTYKFYNGPTLYPFGYGLSYTQFK 647
           GGRLP++         LP     L P+     P   R Y  Y    L+PFGYGLSYTQF 
Sbjct: 574 GGRLPVS---------LPYDVGQL-PIYYSRKPNCFRDYVEYPAKPLFPFGYGLSYTQFA 623

Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
           Y                      N   ++++ R P         D      VD +NVGS 
Sbjct: 624 YE---------------------NLVVESTEVRDP---------DTVIRVSVDVKNVGSM 653

Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
            G +VV +Y           + ++ GF+R+ +  G  K + F     + L   D   N +
Sbjct: 654 AGDEVVQLYISRDYASVTRPVAELKGFKRITLEPGEKKTVVFEI-PLELLAYYDMDMNYV 712

Query: 768 LPAGEHTIFV 777
           +  GE+T  +
Sbjct: 713 VEPGEYTFMI 722


>gi|383119099|ref|ZP_09939838.1| hypothetical protein BSHG_1822 [Bacteroides sp. 3_2_5]
 gi|251946311|gb|EES86688.1| hypothetical protein BSHG_1822 [Bacteroides sp. 3_2_5]
          Length = 859

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 216/768 (28%), Positives = 337/768 (43%), Gaps = 144/768 (18%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-------------------------- 83
           ++F + ++SLP  +RV+DL+SRMTL+EK+ Q+                            
Sbjct: 22  TNFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNY 81

Query: 84  -FAHGV---------------------PRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
            F  G+                     PRLG+P +   +E+LHG            V  G
Sbjct: 82  GFIEGITLPGKECLTLMNEVQKYMREKPRLGIPVFTL-TESLHG-----------SVHDG 129

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
           +T FP  I   ++FN  L  ++  A++ E      L   G+T   +P I+V RD RWGR+
Sbjct: 130 STIFPQAIALGSTFNPILAYEMTSAIAKE------LSAQGITQSLTPVIDVCRDLRWGRV 183

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            E  GEDPF+V R  V+ VRG  D +              VS   KH+ A+      G++
Sbjct: 184 EECFGEDPFLVSRMGVSQVRGYLDNQ--------------VSPMIKHFGAHGTPQ-GGLN 228

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
                    ++++   +L+ FE  VKE    +VM SYN  N  P+ +   L+ + +R  W
Sbjct: 229 LASVSC--GQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRW 286

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
           D  GY+ +D  +I ++   HK   +S E A+ Q L AGLD +            V+ G +
Sbjct: 287 DFQGYVYSDWGAIGMLNYFHKTAQNSAEAAI-QALTAGLDAEASDNSYAELQQLVENGML 345

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
               ID+++  + T    +G F+          + + +  ++ LA + A E IVLL+N+ 
Sbjct: 346 DVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405

Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------GIPCRYMSPIAGFSGYANVTYK 473
           N LPL   K+K++AV+GP  NA     G+Y        G+     +     S    + Y 
Sbjct: 406 NILPLQMNKLKSIAVIGP--NADQVQFGDYTWSRDNKDGVTL-LEALKERVSNQLTLNYA 462

Query: 474 TGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---------ESLDREDLWLPG 524
            GC D+     +    A + AK +D  I++ G   +  A         E  D  DL L G
Sbjct: 463 KGC-DLVTDDCSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTG 521

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q  L+  +    K PVI+V++S  G   A +    NI  I+   YPGE+GG A+AD++ 
Sbjct: 522 VQEDLVEAIHATGK-PVIVVLLS--GKPFAMSWIKENIPGIVVQWYPGEQGGLALADMLL 578

Query: 585 GKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGY 639
           GK NP G+L  ++         Y   LP      R   S   PG+ Y F +   L+ FG+
Sbjct: 579 GKVNPSGKLNYSFPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFGH 638

Query: 640 GLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKV 699
           GLSYT F+Y  LS T + +                             D  C+D  E  +
Sbjct: 639 GLSYTDFEY--LSATTSKE-----------------------------DYACEDVIEVTI 667

Query: 700 DFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
             +N G  DG +V  VY +         ++++ GF++V ++ G  K++
Sbjct: 668 AIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQV 715


>gi|423722678|ref|ZP_17696831.1| hypothetical protein HMPREF1078_00891 [Parabacteroides merdae
           CL09T00C40]
 gi|409241951|gb|EKN34716.1| hypothetical protein HMPREF1078_00891 [Parabacteroides merdae
           CL09T00C40]
          Length = 955

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 222/809 (27%), Positives = 360/809 (44%), Gaps = 144/809 (17%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
           ++ D ++P   RV+DL+S+M ++EK  Q+    +G  R+    LP  +W    W      
Sbjct: 60  VYEDPTVPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTSDWKKQLWKDGIGA 118

Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
             E L+G    G                                  P    ++ I G   
Sbjct: 119 IDEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVES 178

Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGR 179
             AT+FPT +    ++N +L  K+G     E R    LG   +  ++P ++V RD RWGR
Sbjct: 179 YIATNFPTQLGLGHTWNRNLVHKVGYITGREGRL---LGYTNV--YAPILDVGRDQRWGR 233

Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
             E  GE P++V    V   +G+Q        TD      +V++  KHY AY  +     
Sbjct: 234 YEEVYGESPYLVAELGVEMAKGMQ--------TDY-----QVAATSKHYIAYSNNKGGRE 280

Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
                D +++ +++E   + P++  +KE     VM SYN  +G P  +    L   +RGE
Sbjct: 281 GMARVDPQMSPREVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGE 340

Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAV 355
           +   GY+V+D D+++ +   H   AD KE +V Q++ AGL++ C       Y       +
Sbjct: 341 FGFRGYVVSDSDAVEYLFSKHGTAADMKE-SVLQSVLAGLNIRCTFRSPDSYVLPLRELI 399

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREGI 413
            +G +  + ID  ++ +  V   +G FD  P  + L + D  +   EN  +A +A++E +
Sbjct: 400 AEGAIPMSTIDDRVRDILRVKFLVGLFD-HPYQIDLKETDKEVNCAENQLVALQASKESL 458

Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----AN 469
           VLLKN    LPL+  K+  +AV GP+A+     + +Y  +     + + G         +
Sbjct: 459 VLLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTD 518

Query: 470 VTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
           V +  GCD V                +  + I  A E AK +D T+++ G       E+ 
Sbjct: 519 VLFTKGCDLVDANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSNRTCGENK 578

Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
            R  L LPG Q  L+  V    K PV+LV+++   + I +A+    + AIL A YPG +G
Sbjct: 579 SRSSLDLPGRQLDLLQAVVATGK-PVVLVLINGRPLSINWAD--KYVPAILEAWYPGSQG 635

Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VD---SLGYPGRTYKFY 629
           G AIAD +FG +NPGG+L +T+     V  +P  + P +P   VD   + G  G   +  
Sbjct: 636 GTAIADALFGDYNPGGKLTVTF--PKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRV- 691

Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
           NGP LYPFGYGLSYT F+Y+ +S    I   +  +                        +
Sbjct: 692 NGP-LYPFGYGLSYTTFEYSDISIQPAIVTQVQPVT-----------------------V 727

Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
           RC           N G   G +VV +Y +       TY K ++GF R+ +  G  K + F
Sbjct: 728 RC--------KVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTF 779

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                + L +++   + ++  G+  + VG
Sbjct: 780 TIEP-RDLQLLNSDNHWVVEPGDFKVMVG 807


>gi|160882475|ref|ZP_02063478.1| hypothetical protein BACOVA_00426 [Bacteroides ovatus ATCC 8483]
 gi|156112056|gb|EDO13801.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
          Length = 859

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 220/798 (27%), Positives = 345/798 (43%), Gaps = 142/798 (17%)

Query: 51  SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD--------------------------- 83
           SF + +  LP  +RV DL+ RMTL+EK+ Q+                             
Sbjct: 24  SFSYKNPLLPTELRVNDLLGRMTLEEKIAQIRHLHSWDVFDGQILNQEKLDKMCGGIGYG 83

Query: 84  FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGA 122
           F  G P                     RLG+P +   +E+LHGV           V  G 
Sbjct: 84  FFEGFPLTAASCRKTFREIQTYMVEKTRLGIPGFPV-AESLHGV-----------VHEGT 131

Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE 182
           T +P  I   ++FN  L  +  + ++ E   M           +P I+V RD RWGR+ E
Sbjct: 132 TIYPQNIAMGSTFNPELAYEKTKHIAGELNTM-----GVKQVLAPCIDVVRDLRWGRVEE 186

Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY 242
           + GEDPF+  + AV  V+G  +   H            +S   KHY  +  +   G++  
Sbjct: 187 SFGEDPFLCSKMAVAEVKGYME---H-----------GISPMLKHYGPHG-NPLGGLNLA 231

Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
             +  V  +D+ + +L+PFE  + E +  +VM SYN  N IP+ A   +L   +R  +  
Sbjct: 232 SVECGV--RDLFDIYLKPFEAVLAETEIMAVMSSYNSWNRIPNSASRFMLTDILRNRFGF 289

Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKE 362
            GY+ +D   + ++   HK  AD  E A  Q L AG+D++          + ++ G+   
Sbjct: 290 RGYVYSDWGVVSMLKTFHKTAADDFE-AARQVLTAGMDVEASSSCYAVLADKIRNGEFDI 348

Query: 363 TDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNT 422
           + ID++++ +      LG F+   Q  ++ +  + S E+++L+   A E  VLLKND   
Sbjct: 349 SYIDQAVRRVLRAKFELGLFEDPYQEQAVYRLPLRSKESVKLSRRIADESTVLLKNDGQL 408

Query: 423 LPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY--MSPIAGFSGY----ANVTYKTGC 476
           LPLN   +K+VAV+GP  NA     G+Y     +   ++P+ G          + Y  GC
Sbjct: 409 LPLNVRNLKSVAVIGP--NADNVQFGDYTWSKKKEDGVTPLQGIKNLLGDRVKINYAKGC 466

Query: 477 DDVACKSNNSIFAASEAAKTADATIILAG----------LDLSVEAESLDREDLWLPGYQ 526
             +A    + I  A +AA+ +D  +I  G           + S   E +D  D+ L G Q
Sbjct: 467 -SLASLDTSGIAEAVDAARHSDVALIFVGSSSTAFVRHTQEPSTSGEGIDLSDISLTGAQ 525

Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
            QLI +V  V K PV++++++  G   A      NI AIL   Y GE+ G +IAD++FG 
Sbjct: 526 EQLIREVFAVGK-PVVVILVA--GKPFAIPWVKENIPAILAQWYAGEQEGNSIADILFGN 582

Query: 587 FNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
            NP G+L  ++         Y   LP      +   +   PGR Y F N   L+ FGYGL
Sbjct: 583 VNPSGKLTFSFPQSTGHLPVYYNYLPTDKGYYKEPGTYEKPGRDYVFSNSSPLWAFGYGL 642

Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
           SYTQF+Y                     L   +D    +      ND  C       V  
Sbjct: 643 SYTQFEY---------------------LKAVTDKELYQA-----NDTVC-----VTVQL 671

Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
           +N G   G +V+ VY +       T +KQ+ GF++V +  G+ +    +        + D
Sbjct: 672 KNTGKRTGKEVIQVYMRDVVSSVMTQVKQLKGFRKVDLLPGQTRETTIMI-PVHEFYLTD 730

Query: 762 YAANTLLPAGEHTIFVGN 779
              N  L +G+  + VG 
Sbjct: 731 DLGNRYLESGKFELQVGT 748


>gi|86142030|ref|ZP_01060554.1| putative beta-glucosidase [Leeuwenhoekiella blandensis MED217]
 gi|85831593|gb|EAQ50049.1| putative beta-glucosidase [Leeuwenhoekiella blandensis MED217]
          Length = 803

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 216/724 (29%), Positives = 330/724 (45%), Gaps = 113/724 (15%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P +    EA+HG   +G            T FP+ I   ++FN  L KK+G AV+ 
Sbjct: 135 RLGIPLF-LAEEAMHGHMAIG-----------TTEFPSAIGQASTFNPQLNKKMGAAVAK 182

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E RA     +     + P +++AR+PRW R+ ET GEDP+++    +  + G Q  EG E
Sbjct: 183 ELRA-----QGAHIGYGPILDLAREPRWSRVEETFGEDPYLISEMGLGVIEGFQG-EGIE 236

Query: 210 NATDLNSRPLKVSSCCKHYAAYDV-DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
           N       P  V S  KH+AAY V +        H   R   QD    ++ PF+  +  G
Sbjct: 237 N-------PESVISTLKHFAAYGVSEGGHNGGAVHIGQRELMQD----YMYPFKKAIDAG 285

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ-VMVDNHKFLADSK 327
              SVM +Y+ V+GIPS ++  LL   +R +W   G++V+D  SI+ +  D+H       
Sbjct: 286 -VLSVMTAYSSVDGIPSTSNKALLTGLLREQWGFEGFVVSDLASIEGIKGDHHAAATFED 344

Query: 328 EDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
             A+A  + AG+D D G   + +   NA + GKV E  +D+++KY+  +  ++G F+   
Sbjct: 345 AAALA--MNAGVDADLGGNGFDDELLNAFKNGKVSEARLDEAVKYVLRLKFKMGLFENPY 402

Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
                 K+ + S  +I +A E A EG+ LLKN+   LPL S ++K +AV+GP+A+     
Sbjct: 403 VEEKAPKKVVRSAAHIAIAKEMALEGVTLLKNENGLLPL-SKELKKIAVIGPNADMMYNQ 461

Query: 447 IGNYAG--IPCRYMSPIAGFSG---YANVTYKTGCDDVACKSNNSIFAASEAAKTADATI 501
           +G+Y     P   ++P+ G       A +TY  G         +   A + A     A +
Sbjct: 462 LGDYTAPQEPEFIVTPLEGIRAKMPKAEITYVKGTAIRDTTQTDIPAAVAAAKSAEVAIV 521

Query: 502 ILAG---LDLSVE----------------------AESLDREDLWLPGYQTQLINQVAEV 536
           +L G    D   E                       E  DR  L L G Q +L+ Q  E 
Sbjct: 522 VLGGSSARDFKTEYLETGAATVSSKEDQVLSDMESGEGYDRSTLDLMGKQLELL-QAVEA 580

Query: 537 AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT 596
              P ILV+++  G  +       +I AI+   YPG +GG A+ADV+FG +NP GRLP++
Sbjct: 581 TGTPTILVLIT--GRPLLINWPAKHIPAIIDTWYPGSQGGHALADVLFGDYNPAGRLPVS 638

Query: 597 WYNGDYVQMLPLTSMPLRPV--DSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
                    +P  S+   PV  +      R Y       LY FG+GLSYT F Y+ L   
Sbjct: 639 ---------IP-KSVGQSPVYYNHWWPKRRDYVEETSAPLYAFGHGLSYTTFDYSDL--- 685

Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
                   K+    N   T+                     E  V+  N G  DG +VV 
Sbjct: 686 --------KISQSGNATNTT--------------------IEVSVEVTNTGDRDGDEVVQ 717

Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
           +Y         T +KQ+ GF+R+ +  G +K + F+    + L + D   N +  AGE  
Sbjct: 718 LYLSDVVSSVVTPVKQLRGFERIHLDKGESKTVTFILTPAE-LALFDAEMNHVAEAGEFE 776

Query: 775 IFVG 778
           + +G
Sbjct: 777 VQLG 780


>gi|325104789|ref|YP_004274443.1| glycoside hydrolase family protein [Pedobacter saltans DSM 12145]
 gi|324973637|gb|ADY52621.1| glycoside hydrolase family 3 domain protein [Pedobacter saltans DSM
           12145]
          Length = 802

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 227/829 (27%), Positives = 358/829 (43%), Gaps = 147/829 (17%)

Query: 42  FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-FAHG-VPRLGLPQYEW- 98
           F+K G++    +F D S P   RV+DL+S+MT+ EK  Q    + +G V +  +P  EW 
Sbjct: 39  FNKNGIKD---VFEDQSQPIEKRVEDLLSQMTVAEKTNQTATLYGYGRVLKDEMPTSEWK 95

Query: 99  ---WS-------EALHGVSN-------------------------------VGPGTHF-D 116
              W        EAL+ + N                               +G    F +
Sbjct: 96  KSIWKDGIANMDEALNSLPNNKKAQTEYSFPYSKHATAINTLQKWFIEETRLGIPVDFTN 155

Query: 117 DVIPG-----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNIN 170
           + I G     AT F   I   +S+N++L +K G+    E +A+      G T  ++P ++
Sbjct: 156 EGIHGLCHDRATPFCAPIGIGSSWNKNLVRKAGEIAGREGKAL------GYTNVYAPILD 209

Query: 171 VARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAA 230
           +ARDPRWGR+ E  GEDPF+VG    N V GLQ                 +++  KHYA 
Sbjct: 210 LARDPRWGRVVECYGEDPFLVGELGKNMVSGLQSN--------------GIAATLKHYAV 255

Query: 231 YDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPK 290
           Y V           D  VT +++ +  L PF+  V+E     VM SYN  +GIP      
Sbjct: 256 YSVPKGGRDGHARTDPHVTPRELHQIHLYPFKKVVQEAKPLGVMSSYNDWDGIPVTGSYY 315

Query: 291 LLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QY 346
            L + +R ++  +GY+V+D ++++ +   H+   D KE +V   LKAGL++         
Sbjct: 316 FLTELLRKQYGFNGYVVSDSEAVEFIASKHRVAKDFKEASVI-ALKAGLNVWTNFRQPDN 374

Query: 347 YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELA 405
           Y N    +V  G +    +++ ++ + +V  RLG FD    +  +   + + + E+ + A
Sbjct: 375 YINNLRASVADGSLDMETLNQRVREVLSVKFRLGLFDRPFTENPAASDKKVQTPEDKKFA 434

Query: 406 AEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFS 465
            +  +E IVLLKN  + LPL+  K + + V GP A      I  Y        S + G  
Sbjct: 435 EQMNKESIVLLKNGNDFLPLDKNKNQKILVTGPLAAEVGYTISRYGPSNNPSTSILDGLK 494

Query: 466 GY----ANVTYKTGC--------------DDVACKSNNSIFAASEAAKTADATIILAGLD 507
            Y     N+ Y  GC              + V  K    I  A   AK  D  I + G +
Sbjct: 495 QYNNGKLNIDYAKGCEIVNEGWPGTEIIDEPVTEKEKAMIADAVAKAKNVDVIIAVVGEN 554

Query: 508 LSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILW 567
             +  ESL R  L LPG Q +L+  +    K PV++V+++   + I +   N  + AIL 
Sbjct: 555 EKIVGESLSRTSLNLPGRQLELLKALHATGK-PVVMVLVNGRPLTINWE--NHYLTAILE 611

Query: 568 AGYPGEEGGRAIADVVFGKFNPGGRLPITWYNG-DYVQMLPLTSMPLRPVDSLGYP---- 622
             + G   G+ +A+ +FG +NPGG+L +T+      ++M    + P +P      P    
Sbjct: 612 TWFLGPSAGKVVAETLFGDYNPGGKLSVTFPKSIGQIEM----NFPFKPGSHANQPSSGD 667

Query: 623 -GRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRC 681
            G      NG  LYPFGYGLSYT+F Y+ L                              
Sbjct: 668 NGFGKSRVNG-VLYPFGYGLSYTKFSYSDLKL---------------------------- 698

Query: 682 PGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRA 741
                 D    D        +N+G  DG +VV +Y +       TY  Q+  F+R+ ++A
Sbjct: 699 ------DFSKPDSISASFVLKNIGKRDGDEVVQLYFRDLISSVITYDTQLRAFERIHLKA 752

Query: 742 GRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNF 790
           G  K++   F A K L I+D   N  +  G+  + +G+      +   F
Sbjct: 753 GETKQLNLKF-ARKDLAILDKDMNWAVEPGDFEVLIGSSSEDIRLKEKF 800


>gi|409197254|ref|ZP_11225917.1| glycoside hydrolase 3 [Marinilabilia salmonicolor JCM 21150]
          Length = 734

 Score =  259 bits (662), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 210/761 (27%), Positives = 352/761 (46%), Gaps = 109/761 (14%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPR--------LGLPQYEWWSEALHGVSNVG----- 110
           RV+ L+  MTLDEK+ Q+   + G           +G    E   E ++ +  +      
Sbjct: 23  RVEQLLGEMTLDEKIGQMCQVSGGQGNEESIRQGMIGSILNEVDPENINRLQKIAVEESR 82

Query: 111 ---PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-W 165
              P     DVI G  T FP  +   A++N  L +K  +  ++EA +       G+ + +
Sbjct: 83  LGIPIIVARDVIHGFKTVFPIPLGQAATWNPELVQKGSRIAASEAAS------TGVRWTF 136

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +P I+++RD RWGRI E+ GEDP++        V G Q          LN     +++C 
Sbjct: 137 APMIDISRDARWGRIAESLGEDPYLTSVLGAAMVTGFQ-------GDSLNGE-TSIAACA 188

Query: 226 KHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPS 285
           KH+A Y         R +    +  +++ + +L PF+  V  G   + M  +N V+G+P+
Sbjct: 189 KHFAGYGAAEG---GRDYNTTSIPPRELRDIYLPPFKAAVDAG-VRTFMSGFNEVDGVPA 244

Query: 286 CADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ 345
            A+  LL   +R EW   G++V+D  S   M+ NH F AD KE A  + +K G+D++   
Sbjct: 245 TANKYLLTDVLRNEWQFDGFVVSDWASTWEMI-NHGFAADEKE-AAHRAIKVGVDMEMAT 302

Query: 346 Y-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD-ICSDENIE 403
             Y +     +++G +   DI+++++ +  V   LG FD    Y++  KQ+     E +E
Sbjct: 303 TTYRDNIAALLKEGALNIEDINQAVRNILRVKFELGLFDNP--YIAEEKQNQFARPEYLE 360

Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYMSPI 461
            A  AA + +VLLKN+Q TLP+NS+    +A++GP A+     +G +   G     ++P+
Sbjct: 361 AANLAATQSMVLLKNEQKTLPINSSS--KIALIGPMADQPYEQLGTWIFDGDTTLTVTPL 418

Query: 462 AGFS---GYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRE 518
             F+   G  NV +  G      +       A E AK +D  +   G +  +  E+  R 
Sbjct: 419 QAFNKTFGQENVLFAEGMPISRTRHQKGFRKAIEQAKNSDVIVFCGGEESILSGEAHSRA 478

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
           ++ LPG Q +LI ++ +  K P++LV+M+  G  +   E + +  A+++A +PG  GG A
Sbjct: 479 NIDLPGVQNELIKELKKTGK-PLVLVVMA--GRPLTIGEISEHADAVVYAWHPGTMGGAA 535

Query: 579 IADVVFGKFNPGGRLPIT----------WYN----------GDYVQMLPLTSMPLR-PVD 617
           +AD+V GK NP G+LP+T          +YN            + QM     +P++ P  
Sbjct: 536 LADIVSGKANPSGKLPVTFPKVVGQIPIYYNHKNTGRPANPDSWTQMY---DIPVKAPQT 592

Query: 618 SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDAS 677
           SLG           P LYPFGYGLSYT F+Y+ LS  K +        + R         
Sbjct: 593 SLGNESHYIDAGFIP-LYPFGYGLSYTSFEYSDLSLDKEV--------YAR--------- 634

Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
                         D+  E +    N G   G +V  VY +         +K++  F+R+
Sbjct: 635 --------------DETIEVRFTLSNTGEFAGEEVAQVYVRDLVGNVTRPVKELKAFERI 680

Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
            ++ G +K +       + L   +     ++  GE  ++VG
Sbjct: 681 DLQKGESKTVTLTI-PVQELAFTNIDMKQVVEPGEFQLWVG 720


>gi|409730324|ref|ZP_11271901.1| beta-glucosidase [Halococcus hamelinensis 100A6]
 gi|448724096|ref|ZP_21706609.1| beta-glucosidase [Halococcus hamelinensis 100A6]
 gi|445786548|gb|EMA37314.1| beta-glucosidase [Halococcus hamelinensis 100A6]
          Length = 747

 Score =  259 bits (662), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 207/695 (29%), Positives = 338/695 (48%), Gaps = 105/695 (15%)

Query: 120 PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWG 178
           P  T+FP  I   +S++  L +++ +   +E  A+      G T+  SP ++VARD RWG
Sbjct: 88  PEGTTFPQSIGMASSWDPDLMRQVMERTRSEMAAI------GTTHALSPVLDVARDLRWG 141

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKG 238
           R+ ET GEDP++V   A  YV GLQ     +           +S+  KH+AA+   +  G
Sbjct: 142 RVEETFGEDPYLVAAMASAYVAGLQGPSIEDG----------ISATLKHFAAHSA-SEGG 190

Query: 239 VDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRG 298
            +R   +  V  +++ ET L P+E  +    A SVM +Y+ ++GIPS ++  LL   +RG
Sbjct: 191 KNRASVN--VGPRELRETHLFPYEAAITTAGAESVMNAYHDIDGIPSASNEWLLTDLLRG 248

Query: 299 EWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGN 353
           E    G +V+D  S+  + + H  ++DS  ++    L+AG+D+     DC ++       
Sbjct: 249 ELGFDGTVVSDYYSVDFLREEHG-VSDSDRESAVMALEAGIDVELPATDCYEHLP----E 303

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGI 413
           A++ G++ E  +D++++ +  +  R G  D S    S+      ++   EL   AARE I
Sbjct: 304 AIENGELSEATLDEAVRRVLRMKFRKGLVDDSTVDASVAADAFNTEAATELTERAARESI 363

Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY---------MSPIAGF 464
           VLLKN+   LPL+     ++AVVGP A+    M+G+YA  P  Y          +P+   
Sbjct: 364 VLLKNENELLPLD--DTDSLAVVGPKADDGQEMMGDYA-YPAHYPEAEVSLDATTPLDAI 420

Query: 465 SGYAN---VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE---------- 511
             +A+   + Y+ GC   +  S +   AA EAA  AD T+   G   +V+          
Sbjct: 421 RVHADGTEIAYEEGC-TTSGPSTDGFDAAVEAAAGADVTLAFVGARSAVDFSDPDAEDVT 479

Query: 512 -------AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
                   E  D  DL LPG QT+L+ +V E    P+++V++S     I +      + A
Sbjct: 480 NPALPTSGEGSDVTDLGLPGVQTELLERVHETGT-PLVVVVVSGKPHSIEW--VAEEVPA 536

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           ++ A  PGEEGG  IADV+FG +NPGG LP++      V  LP+     RP  +     +
Sbjct: 537 VVQAWLPGEEGGTGIADVLFGDYNPGGHLPVSLARS--VGQLPV-HYDRRPNSA----NK 589

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
            + +     LY FG+GLSYT+F+Y+        +V+ + L    ++  +  A+       
Sbjct: 590 DHVYTESEPLYSFGHGLSYTEFEYD------DFEVSTDTLGASGSVTASVTAT------- 636

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
                             NVG   GSDVV +Y+   +   A  +++++GF+RV + AG +
Sbjct: 637 ------------------NVGGRGGSDVVQLYAHAESPDQARPVQELVGFERVSLDAGES 678

Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            RI F  +A + L   D   N  +  G + + VG+
Sbjct: 679 TRISFEIDATQ-LAYHDRDMNLRVHDGSYELRVGH 712


>gi|261405721|ref|YP_003241962.1| glycoside hydrolase family protein [Paenibacillus sp. Y412MC10]
 gi|261282184|gb|ACX64155.1| glycoside hydrolase family 3 domain protein [Paenibacillus sp.
           Y412MC10]
          Length = 765

 Score =  259 bits (661), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 196/711 (27%), Positives = 328/711 (46%), Gaps = 109/711 (15%)

Query: 76  EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
           E V  +  +A    RLG+P      E  HG   +G            T FP  +   +++
Sbjct: 89  EAVNHIQRYAVEQSRLGIPIL-IGEECSHGHMAIG-----------GTVFPVPLSIGSTW 136

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
           N  L++ + +AV+ E R+     + G   +SP ++V RDPRWGR  E  GEDP+++  YA
Sbjct: 137 NVDLYRDMCRAVALETRS-----QGGAVTYSPVLDVVRDPRWGRTEECFGEDPYLISEYA 191

Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDME 254
           V  V GLQ          L+S P  V++  KH+  Y   +  +     H   R    ++ 
Sbjct: 192 VASVEGLQ-------GESLDS-PSSVAATLKHFVGYGSSEGGRNAGPVHMGTR----ELM 239

Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
           E  + PF+  V+ G A+S+M +YN ++G+P   + +LL+  +R EW   G ++ DC +I 
Sbjct: 240 EVDMLPFKKAVEAG-AASIMPAYNEIDGVPCTVNTELLDGILRKEWGFDGMVITDCGAID 298

Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
           ++   H    D   DA  Q ++AG+D++  G+ +      AV+  K++ + +D++++ + 
Sbjct: 299 MLASGHDTAEDGM-DAAVQAIRAGIDMEMSGEMFGKHLQKAVESNKLEVSVLDEAVRRVL 357

Query: 374 TVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
           T+  +LG F+         +  I S++++ LA + A EGIVLLKN+   LPL S +   +
Sbjct: 358 TLKFKLGLFENPYVDPQTAENVIGSEQHVGLARQLAAEGIVLLKNEAKALPL-SKEGGVI 416

Query: 434 AVVGPHANATVAMIGNYAG--IPCRYMSPIAGFSGY-----ANVTYKTGCDDVACKSNNS 486
           AV+GP+A+     +G+Y     P    + + G           V Y  GC  +   S   
Sbjct: 417 AVIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLGEEAQRVLYAPGC-RIKDDSREG 475

Query: 487 IFAASEAAKTADATIILAG-----------LDLSVEA--------------ESLDREDLW 521
              A   A+ AD  +++ G           +DL   A              E +DR  L 
Sbjct: 476 FEFALTCAEQADTVVMVLGGSSARDFGEGTIDLRTGASKVTDDALSDMDCGEGIDRMTLQ 535

Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
           L G Q +L+ ++ ++ K  +++ I    G  IA    + +  AIL A YPG+EGG A+AD
Sbjct: 536 LSGVQLELVQEIHKLGKRMIVVYI---NGRPIAEPWIDEHADAILEAWYPGQEGGHAVAD 592

Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
           ++FG  NP G+L ++     +V  LP+     R        G+ Y   +    YPFGYGL
Sbjct: 593 ILFGDVNPSGKLTMSIPK--HVGQLPVYYNGKRS------RGKRYLEEDSQPRYPFGYGL 644

Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
           SYT+F Y+ +  T  +                               +  D      V+ 
Sbjct: 645 SYTEFSYSDIQMTPEV-------------------------------IGTDGTAVVSVNV 673

Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFN 752
            N G  +GS+VV +Y    A       +++ GFQ++F++ G  ++++F   
Sbjct: 674 TNSGDCEGSEVVQLYVSDAASKYTRPARELKGFQKIFLQPGERRKVEFTIG 724


>gi|393786524|ref|ZP_10374660.1| hypothetical protein HMPREF1068_00940 [Bacteroides nordii
           CL02T12C05]
 gi|392660153|gb|EIY53770.1| hypothetical protein HMPREF1068_00940 [Bacteroides nordii
           CL02T12C05]
          Length = 841

 Score =  259 bits (661), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 226/821 (27%), Positives = 352/821 (42%), Gaps = 171/821 (20%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTL-------------------------------------- 74
           +F D S P   RVKDL+S+MT+                                      
Sbjct: 81  IFEDPSQPVEKRVKDLLSQMTIEEKSCQLATLYGFGRVLKDSLPTPAWKEAIWKDGIANI 140

Query: 75  DEKVQQLGDFAHGVP------------------------RLGLPQYEWWSEALHGVSNVG 110
           DE++  +G  A  VP                        RLG+P  ++ +E +HG+++  
Sbjct: 141 DEQLNGVGRGAKRVPHLIVPFSNHVKAINETQRWFIEETRLGIP-VDFSNEGIHGLNHTK 199

Query: 111 PGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNI 169
                      AT  P  I   +++N  L ++ G+ V  EAR +      G T  ++P +
Sbjct: 200 -----------ATPLPAPIAIGSTWNTELVREAGEIVGKEARVL------GYTNVYAPIL 242

Query: 170 NVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYA 229
           +V RDPRWGR  E  GEDP+++G   V  V G+Q  +G             V++  KH+A
Sbjct: 243 DVVRDPRWGRTLECYGEDPYLIGELGVQMVDGIQS-QG-------------VAATLKHFA 288

Query: 230 AYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADP 289
            Y             D  VT +++ E +L PF+  +++     VM SYN  NG P  +  
Sbjct: 289 VYSSPKGGRDGNCRTDPHVTPRELHEIYLYPFKHVIQQSHPMGVMSSYNDWNGEPVTSSY 348

Query: 290 KLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTN 349
             L + +R E+   GY+V+D  +++ +   H+ +A+  ++AV Q L+AGL++      T+
Sbjct: 349 YFLTKLLREEYGFDGYVVSDSQAVEFVHTKHQ-VAEDYDEAVRQVLEAGLNVR-----TH 402

Query: 350 FTGNA---------VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDIC-SD 399
           FT  A         + + K+    IDK +  +  V  RLG FD   +       ++  +D
Sbjct: 403 FTPPADFILPIRRLLAENKISMATIDKRVSEVLAVKFRLGLFDAPYRDNPKEADEVAGAD 462

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRY 457
           ++ E   E  R+ +VLLKND   LPLN  ++K V V GP A+    MI  Y   G+P   
Sbjct: 463 KHSEFVKEMQRQSLVLLKNDGQLLPLNKKEIKKVLVTGPLADEDNFMISRYGPNGLPT-- 520

Query: 458 MSPIAGFSGY----ANVTYKTGCDDV-----ACKSNNSIFAASEAA---------KTADA 499
           ++ + G   Y      V Y  GC+ +     A +   ++  A E A         ++AD 
Sbjct: 521 ITVLQGIKDYLKGDVEVVYSKGCNIIDKEWPASEVLPAVLTAEEVADMDKAVSEAQSADV 580

Query: 500 TIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
            I + G D     ES  R  L LPG Q +L+  +    K PV+LV+++   + I +   +
Sbjct: 581 IIAVMGEDEYRVGESRSRTSLELPGRQRELLQALHATGK-PVVLVLINGQPLTINWE--D 637

Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYN--GDYVQMLPLTSMPLRPVD 617
            N+ AIL A +P  +GG+ IA+ +FG +NPGG+L +T+    G      P          
Sbjct: 638 QNLPAILEAWFPSFQGGKIIAETLFGDYNPGGKLTVTFPKSVGQIELNFPFKKGSHGTQP 697

Query: 618 SLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDAS 677
           S G  G       G  LYPFGYGLSYT F Y+                   NL  T+ A 
Sbjct: 698 SSGPNGSGSTRVLG-ALYPFGYGLSYTTFAYS-------------------NLEVTAPAK 737

Query: 678 KTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRV 737
            T+               +   D  N G   G +V  +Y +       TY  ++ GFQRV
Sbjct: 738 GTQGE------------VQISFDITNTGKYAGEEVAQLYVRDLVSSVVTYDSRLRGFQRV 785

Query: 738 FVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
            ++    KR+ F       L ++D      + +G   + VG
Sbjct: 786 LLQPNETKRMHFTLKPA-DLELLDRNMEWTVESGTFEVRVG 825


>gi|346226406|ref|ZP_08847548.1| beta-glucosidase [Anaerophaga thermohalophila DSM 12881]
          Length = 775

 Score =  259 bits (661), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 199/708 (28%), Positives = 329/708 (46%), Gaps = 90/708 (12%)

Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARD 174
           DVI G  T+FP  +    S++  L ++  +  + EA A      +G+ + ++P I++ARD
Sbjct: 122 DVIHGLETTFPIPLAEACSWDLELMEQSARIAAEEATA------SGIAWNFAPMIDIARD 175

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR+ E  GEDP++    A   VRG Q +E +++ + +N+    + +  KH+  Y   
Sbjct: 176 PRWGRVMEGAGEDPYLGSLVARARVRGFQGIETYKDFSKINT----MMATSKHFVGYGAV 231

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
              G D +  D  V  + + ET+L PF+  V EG  ++ M ++N +NG+P   +  L  +
Sbjct: 232 Q-AGRDYHSVDMSV--RTLHETYLPPFKAAVDEG-VTAFMTAFNDLNGVPCTGNKYLFKE 287

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGN 353
            +R  W   G +V D  +IQ MV  H F  D K  A    + AG+D+D   + +  +   
Sbjct: 288 ILRDRWGFGGMVVTDYTAIQEMV-AHGFARDLKH-ATELAIDAGIDMDMISEGFVTYLKE 345

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAARE 411
            V++GKV E  ID ++  +  +   LG FD   +Y +  +Q   + + E+++ A E A+ 
Sbjct: 346 LVEEGKVSEKQIDVAVSRILEMKFLLGLFDDPFKYCNAERQKEVVMNPEHLKAAREVAQR 405

Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------------GIPCRYM 458
            IVLL+N  N LPL   + K VA++GP      ++ G +A             G+  +Y 
Sbjct: 406 SIVLLENKNNVLPLKKNEPKRVALIGPFVKERESLTGEWAIKGDPDKSVTLMEGLEEKYK 465

Query: 459 SPIAGFSGYAN---------VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLS 509
                FS YA           T K     V  +S  S   A   A+T+D  ++  G    
Sbjct: 466 DSQVKFS-YAKGTSLPVIDRTTQKVSTTRVPDRSGFS--EAINLARTSDVILVAMGEKFH 522

Query: 510 VEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAG 569
              E+  R D+ LPG Q +L+ ++ +  K P+ILV+ +   +D+++     N+ AI+ A 
Sbjct: 523 WSGEAASRTDITLPGNQRELLKELKKTGK-PIILVLFNGRPLDLSWEA--ENVDAIVEAW 579

Query: 570 YPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPVDSLGYPG 623
           YPG   G A+ADV+ G +NP  +L +T+     V  +P+      T  P    +   Y  
Sbjct: 580 YPGIMAGHAVADVLSGDYNPSAKLVMTFPRN--VGQIPIFYNVKNTGRPFDEDNPADYRS 637

Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
                 N P LYPFGYGLSYT F+Y+                     N    + K    G
Sbjct: 638 SYIDCPNSP-LYPFGYGLSYTSFEYD---------------------NAKISSKKLERGG 675

Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGR 743
           +L             VD  N G+ DG +VV +Y           +K++ GF+++ ++ G 
Sbjct: 676 ILT----------VSVDVTNTGTMDGEEVVQLYIHDKVGSVVRPVKELKGFKKIHLKKGE 725

Query: 744 NKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
            K ++F  +  + L + +     +   GE   ++ +       HL F+
Sbjct: 726 TKTVEFTIDE-ERLKMYNLDMEWVAEPGEFEAWIASSSADESNHLEFS 772


>gi|116621797|ref|YP_823953.1| glycoside hydrolase family protein [Candidatus Solibacter usitatus
           Ellin6076]
 gi|116224959|gb|ABJ83668.1| glycoside hydrolase, family 3 domain protein [Candidatus Solibacter
           usitatus Ellin6076]
          Length = 765

 Score =  258 bits (660), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 210/698 (30%), Positives = 333/698 (47%), Gaps = 118/698 (16%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P   +  E LHG + +G            TSFP  I   A+F+  L + +    + 
Sbjct: 104 RLGIPVI-FHEECLHGHAAIG-----------GTSFPQPIGLGATFDPELVESLFAMTAA 151

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           EARA     R      +P ++VAR+PRWGR+ ET GEDPF+V R  +  VRG Q      
Sbjct: 152 EARA-----RGTHQALTPVVDVAREPRWGRVEETYGEDPFLVSRMGIAAVRGFQGDATFR 206

Query: 210 NATDLNSRPLKVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
           + T       +V +  KH+AA+   ++       +   RV    + ETFL PF+  + +G
Sbjct: 207 DKT-------RVIATLKHFAAHGQPESGTNCAPVNVSMRV----LRETFLFPFKEALDKG 255

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV---DNH-KFLA 324
            A SVM SYN ++G+PS A   LL   +R EW   G++V+D  +I  +    ++H  F+A
Sbjct: 256 CAISVMASYNEIDGVPSHASRWLLRDVLRKEWGFKGFVVSDYYAIYELSYRPESHGHFVA 315

Query: 325 DSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRL 379
             K +A A  ++AG+++     DC  +  +     V +G ++E+ +D+ ++ +     ++
Sbjct: 316 KDKREACALAVQAGVNIELPEPDCYLHLVDL----VHKGVLQESQLDELVEPMLRWKFQM 371

Query: 380 GFFDGSPQYVSLGKQDICS--DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVG 437
           G FD    YV   + +  +  D + ELA +AARE I LLKND   +PL+ + +KT+AV+G
Sbjct: 372 GLFDDP--YVDPAEAERIAGCDAHRELAMQAARETITLLKNDGPVVPLDLSAIKTIAVIG 429

Query: 438 PHANATVAMIGNYAGIPCRYMSPIAGFS----GYANVTYKTGC----------DDVA--- 480
           P+AN +  ++G Y+G+P   ++ + G        A V Y  GC          D+V    
Sbjct: 430 PNANRS--LLGGYSGVPKHDVTVLDGIRERVGSRAKVVYAEGCKITIGGSWVQDEVTPSD 487

Query: 481 -CKSNNSIFAASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQTQLINQV 533
             +    I  A + AK AD  ++  G +     E+       DR  L L G Q +L+  +
Sbjct: 488 PAEDRRQIAEAVKVAKRADVIVLAIGGNEQTSREAWSPKHLGDRPSLDLVGRQEELVRAM 547

Query: 534 AEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRL 593
               K PVI  + +   + I +     ++ AI    Y G+E GRA+A+V+FG  NPGG+L
Sbjct: 548 VATGK-PVIAFLFNGRPISINY--LAQSVPAIFECWYLGQETGRAVAEVLFGDTNPGGKL 604

Query: 594 PITWYNGDYVQMLPLTSMPLRPVDSLGYPG--RTYKFYNGPTLYPFGYGLSYTQFKYNLL 651
           PIT         +P ++  L P      P   R Y F     LY FGYGLSYT F +  L
Sbjct: 605 PIT---------IPRSAGHL-PAFYNHKPSARRGYLFDEVGPLYAFGYGLSYTTFAFQNL 654

Query: 652 SFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSD 711
              K     +++    R L                            VD  N G+ +G +
Sbjct: 655 RLAKK---KMHRESTARVL----------------------------VDVTNTGAREGRE 683

Query: 712 VVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
           VV +Y +         IK++ GF+++ ++ G+ + ++F
Sbjct: 684 VVQLYIRDLVSSVTRPIKELKGFRKITLQPGQTQTVEF 721


>gi|334144838|ref|YP_004538047.1| beta-glucosidase [Novosphingobium sp. PP1Y]
 gi|333936721|emb|CCA90080.1| beta-glucosidase [Novosphingobium sp. PP1Y]
          Length = 889

 Score =  258 bits (660), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 153/403 (37%), Positives = 227/403 (56%), Gaps = 43/403 (10%)

Query: 67  DLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFP 126
           DLV++MTLDEK+ QL + A  +PRL +P Y WW+E+LHG     P           T+FP
Sbjct: 36  DLVAKMTLDEKLGQLLNTAPAIPRLDIPAYNWWTESLHGALGSLP----------TTNFP 85

Query: 127 TVILTTASFNESLWKKIGQAVSTEARAMYNLGR---------AGLTYWSPNINVARDPRW 177
             I   A+F+ SL K +  A+STE R ++ L R          GL  WSPNIN+ RDPRW
Sbjct: 86  EPIGLAATFDASLVKDVAGAISTEVRGLHALARKTGRMGRIGTGLDTWSPNINIFRDPRW 145

Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWK 237
           GR  ET GEDP++  R  V++V G+Q  +      DL      V +  KH+A   V N  
Sbjct: 146 GRGQETYGEDPYLTARMGVSFVEGMQGPD-----PDLPD----VIATPKHFA---VHNGP 193

Query: 238 GVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVR 297
              R+H +  V+  D+E+T+L  F   + EG A SVMC+YNRV+G P+CA  +LL + + 
Sbjct: 194 ESTRHHANVFVSRHDLEDTYLPAFRAAIVEGRAGSVMCAYNRVDGQPACASQELLQEHLV 253

Query: 298 GEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY-------YTNF 350
             W   GY+V+DCD+++ + DNHK+  D    AVA  ++ G+D +C  +        T+ 
Sbjct: 254 DAWGFQGYVVSDCDAVKDISDNHKYAPDGAA-AVAAAMRMGVDSECHTWTLSDTDGLTDR 312

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSL--GKQDICSDENIELAAEA 408
              A+++G +  +D+D++L  L++  +R G   G  +  +      D+ +  +  LA +A
Sbjct: 313 YREALERGLITVSDVDRTLIRLFSARLRNGDLPGVRKLSTFTSSAADVGTPAHGALALKA 372

Query: 409 AREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
           A E +VLLKND   LP  +A +K VAV+GP  +AT  + GNY+
Sbjct: 373 AEESLVLLKND-GILPFQTAGMK-VAVIGPFGDATRVLRGNYS 413



 Score =  113 bits (282), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 90/303 (29%), Positives = 136/303 (44%), Gaps = 55/303 (18%)

Query: 492 EAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPV 541
            AA+ AD  + + GL   +EAE            D+  L +P  Q +L+ Q     K P+
Sbjct: 613 RAAQAADVLVAVVGLTSDLEAEESPIEIPGFKGGDKTTLDIPADQQELLEQAKATGK-PL 671

Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
           I+V M+   +++ +A+ N +  AIL A YPG+ GG AIA+V+ GK NP G+LP+T+Y   
Sbjct: 672 IVVAMNGSPINLHWAKENAD--AILEAWYPGQSGGLAIANVLTGKANPTGKLPLTFYRS- 728

Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
            V+ LP       P D     GRTY+++ G  +YPFGYGLSYT F Y  ++         
Sbjct: 729 -VEDLP-------PFDDYDMKGRTYRYFTGKAVYPFGYGLSYTTFGYGPVA--------- 771

Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
                         AS     G+ V                N G   G D V +Y   P 
Sbjct: 772 -----------VEPASGGAQDGIRVT-----------TQVSNTGQRAGGDAVQLYLDFPD 809

Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGG 781
                 I  + GFQ+V ++ G  +++ F  +     ++       +L  G + + VG+G 
Sbjct: 810 APGTPNIA-LRGFQKVSLQPGETRQVTFTLSPRDLSSVTPDGVRKVL-KGHYRVTVGSGQ 867

Query: 782 VSF 784
             F
Sbjct: 868 PGF 870


>gi|154493932|ref|ZP_02033252.1| hypothetical protein PARMER_03276 [Parabacteroides merdae ATCC
           43184]
 gi|154086192|gb|EDN85237.1| glycosyl hydrolase family 3 C-terminal domain protein
           [Parabacteroides merdae ATCC 43184]
          Length = 955

 Score =  258 bits (660), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 221/809 (27%), Positives = 360/809 (44%), Gaps = 144/809 (17%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WS----- 100
           ++ D ++P   RV+DL+S+M ++EK  Q+    +G  R+    LP  +W    W      
Sbjct: 60  VYEDPTVPIDARVEDLLSQMNVEEKTCQMVTL-YGYKRVLKDDLPTSDWKKQLWKDGIGA 118

Query: 101 --EALHGVSNVG----------------------------------PGTHFDDVIPG--- 121
             E L+G    G                                  P    ++ I G   
Sbjct: 119 IDEHLNGFQQWGLPPSDNPYVWPASRHAWALNEVQRFFIEETRLGIPTDFTNEGIRGVES 178

Query: 122 --ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGR 179
             AT+FPT +    ++N +L  K+G     E R    LG   +  ++P ++V RD RWGR
Sbjct: 179 YIATNFPTQLGLGHTWNRNLVHKVGYITGREGRL---LGYTNV--YAPILDVGRDQRWGR 233

Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
             E  GE P++V    +   +G+Q        TD      +V++  KHY AY  +     
Sbjct: 234 YEEVYGESPYLVAELGIEMAKGMQ--------TDH-----QVAATSKHYIAYSNNKGGRE 280

Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
                D +++ +++E   + P++  +KE     VM SYN  +G P  +    L   +RGE
Sbjct: 281 GMARVDPQMSPREVEMIHVYPWKRVIKEAGILGVMSSYNDYDGFPIQSSYYWLTTRLRGE 340

Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG----QYYTNFTGNAV 355
           +   GY+V+D D+++ +   H   AD KE +V Q++ AGL++ C       Y       +
Sbjct: 341 FGFRGYVVSDSDAVEYLFSKHGTAADMKE-SVLQSVLAGLNIRCTFRSPDSYVLPLRELI 399

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREGI 413
            +G +  + ID  ++ +  V   +G FD  P  + L + D  +   EN  +A +A++E +
Sbjct: 400 AEGAIPMSTIDDRVRDILRVKFLVGLFD-HPYQIDLKETDKEVNCAENQLVALQASKESL 458

Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY----AN 469
           VLLKN    LPL+  K+  +AV GP+A+     + +Y  +     + + G         +
Sbjct: 459 VLLKNQDAVLPLDVNKISKIAVCGPNADEEAYALTHYGPLAVEVTTVLEGIRNKVKPGTD 518

Query: 470 VTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL 515
           V +  GCD V                +  + I  A E AK +D T+++ G       E+ 
Sbjct: 519 VLFTKGCDLVDANWPESELIRYPLTAEEQSEIDKAVENAKKSDVTVVVLGGSNRTCGENK 578

Query: 516 DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEG 575
            R  L LPG Q  L+  V    K PV+LV+++   + I +A+    + AIL A YPG +G
Sbjct: 579 SRSSLDLPGRQLDLLQAVVATGK-PVVLVLINGRPLSINWAD--KYVPAILEAWYPGSQG 635

Query: 576 GRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP---VD---SLGYPGRTYKFY 629
           G AIAD +FG +NPGG+L +T+     V  +P  + P +P   VD   + G  G   +  
Sbjct: 636 GTAIADALFGDYNPGGKLTVTF--PKTVGQIPF-NFPTKPNAQVDGGRNKGLDGNMSRV- 691

Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
           NGP LYPFGYGLSYT F+Y+ +S    I   +  +                        +
Sbjct: 692 NGP-LYPFGYGLSYTTFEYSDISIQPAIVTQVQPVT-----------------------V 727

Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
           RC           N G   G +VV +Y +       TY K ++GF R+ +  G  K + F
Sbjct: 728 RC--------KVTNTGKRAGDEVVQLYVRDILSSVTTYEKNLVGFDRIHLNPGETKELTF 779

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                + L +++   + ++  G+  + VG
Sbjct: 780 TIEP-RDLQLLNSDNHWVVEPGDFKVMVG 807


>gi|330996729|ref|ZP_08320604.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
           11841]
 gi|329572574|gb|EGG54217.1| glycosyl hydrolase family 3 protein [Paraprevotella xylaniphila YIT
           11841]
          Length = 852

 Score =  258 bits (660), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 160/415 (38%), Positives = 227/415 (54%), Gaps = 39/415 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           LF D   P   R+ DL+SR+T++EK+  L + A  + RLG+ +Y   +EALHGV  V PG
Sbjct: 28  LFRDMKAPQHERIMDLLSRLTVEEKISLLVNDAPAIGRLGIDKYNHGNEALHGV--VRPG 85

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
                     T FP  I   A +N  L  +I  A+S EAR  +     G          L
Sbjct: 86  DF--------TVFPQAIGMAAMWNPELLYRISSAISDEARGRWKELEYGKKQIAGASDLL 137

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+WSP +N+ARDPRWGR  ET GEDP++ G   V +V+GLQ           + R LK  
Sbjct: 138 TFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGVAFVKGLQGN---------HPRYLKTV 188

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S  KH+A  + ++    +R   +A+V+E+D+ E +L  FE C+ EG A S+M +YN VN 
Sbjct: 189 STPKHFAVNNEEH----NRSSCNAKVSERDLREYYLPSFERCITEGKAQSIMMAYNAVND 244

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P   +  L+   +RG+W  +GYIV+DC + + M+  H ++  ++E A    +KAGLDL+
Sbjct: 245 VPCTVNTYLIKNVLRGDWGFNGYIVSDCSAPEWMITKHHYVK-TREAAATLAVKAGLDLE 303

Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
           CG Q Y      A +Q  V E DID +   +    M LG FD   Q  Y  +    +   
Sbjct: 304 CGNQVYGEGLLKAYRQYMVSEADIDSAAYRILRGRMMLGLFDDPSQNPYNQIEPSVVGCK 363

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
            + +LA EAAR+ +VLLKN  N LPLN  KVK++AVVG   +A     G+Y+G P
Sbjct: 364 AHQDLALEAARQSMVLLKNKDNFLPLNPQKVKSIAVVG--ISAGHCEFGDYSGTP 416



 Score =  140 bits (352), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 95/290 (32%), Positives = 140/290 (48%), Gaps = 49/290 (16%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A + A   D T+ + G++ S+E E  DR  L LP  Q + I ++ +V    V++++    
Sbjct: 597 AGKVAAECDVTVAVLGINKSIEREGQDRFTLELPIDQQEFIKELYKVNPNTVVVLV---A 653

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
           G  +A    + N+ AIL A YPGE+GG A+A+V+FG +NPGGRLP+T+YN        L 
Sbjct: 654 GSSLAVNWMDENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYYNS-------LD 706

Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
            +P    D+    GRTY+++ G  LY FGYGLSYT+F+Y                     
Sbjct: 707 EIP--AFDNYSVKGRTYQYFEGQPLYEFGYGLSYTKFRY--------------------- 743

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
                     +  GV V      D  +   +  N G  DG +V  VY K P       +K
Sbjct: 744 ----------KSKGVSV----ARDTVKVSFEVSNTGKYDGDEVAQVYVKYPETGTYMPLK 789

Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVG 778
           Q+ GF+RV ++ G+  ++  V    K L   D      + P GE+T  VG
Sbjct: 790 QLHGFKRVHIKKGKTSKVT-VGVPKKDLRYWDEQERKFVTPKGEYTFMVG 838


>gi|441500080|ref|ZP_20982250.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
 gi|441436171|gb|ELR69545.1| Beta-glucosidase [Fulvivirga imtechensis AK7]
          Length = 704

 Score =  258 bits (660), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 203/656 (30%), Positives = 335/656 (51%), Gaps = 79/656 (12%)

Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARD 174
           DVI G  T FP  +   +S+N  L K+  +  + EA A      +GL + ++P +++ARD
Sbjct: 71  DVIHGHRTIFPLPLAEASSWNLDLIKETARLSAKEAAA------SGLNWTFNPMVDIARD 124

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGRI E  GED ++    A   V G Q         DL S P  V +C KH+AAY   
Sbjct: 125 PRWGRIAEGSGEDTYLGSLIAKAKVEGYQ-------GDDL-SDPFTVLACVKHFAAYGAS 176

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
              G D +  D  ++++ + ET+L P++  +  G A++VM S+N ++G+P+     L+ +
Sbjct: 177 Q-AGRDYHTVD--MSDRVLRETYLPPYKAAIDAG-AATVMTSFNELHGVPASGSRYLMTE 232

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGN 353
            +R EW   G++V D  SI  MV  H  +A+ KE A    L AG+D+D  G  Y +    
Sbjct: 233 ILREEWRFKGFVVTDYTSINEMVP-HGVVANEKE-AADLALNAGVDMDMQGGVYNDHLAT 290

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK--QDICSDENIELAAEAARE 411
            V +GKV E  +D++++ +  +  RLG F    +Y+   +  Q + S E ++ A  +ARE
Sbjct: 291 LVNEGKVSEKQVDEAVRRILEMKWRLGLFKDPYRYLDEKRELQVLFSKELMDHALVSARE 350

Query: 412 GIVLLKND----QNTLPLNSAKVKTVAVVGPHANATVAMIGNY--AGIPCRYMSPIAGFS 465
            IVLLKN+    +  LP+ +  VK++A++GP  +  + M+G +  +G   + ++ + G  
Sbjct: 351 SIVLLKNEPYNNKKLLPI-ANDVKSIALIGPLGDNQIDMLGTWHASGDANKVVTVLQGLK 409

Query: 466 G---YANVTYKTGCDDVACKSNNSIF-AASEAAKTADATIILAGLDLSVEAESLDREDLW 521
                A +TY  G D +   S+ S F  A++ A+ AD  I+  G +     E+  R  L 
Sbjct: 410 EAFPKAKITYTKGADFMG--SDKSGFEEATKNARAADLVIMAVGENHQQSGEAASRSGLD 467

Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
           LPG Q +L+  + +  K P++ ++M+   + I +   + NI AI+   + G   G+AIA+
Sbjct: 468 LPGVQQELVEAIYQTGK-PIVALVMAGRPLTIGW--MDENIPAIVNTWHLGTMAGKAIAE 524

Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPL-TSMPL--RPVDS-LGYPGRTYKFYNGPTLYPF 637
           V+ GK+NP G+L IT+     V  +P+  SM    RP D+   Y  +     N P LYPF
Sbjct: 525 VLAGKYNPSGKLTITFPRN--VGQIPIYYSMKNTGRPFDADSKYTSKYLDVSNEP-LYPF 581

Query: 638 GYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEF 697
           GYGLSYT F+Y         +  L+K++   + N T                        
Sbjct: 582 GYGLSYTTFEYG--------EPKLSKIEIKEHENLT-----------------------I 610

Query: 698 KVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
           +V  +N G  +G +VV +Y +         +K++ GF+++ ++ G +K + F  N+
Sbjct: 611 EVMVKNTGEYEGQEVVQLYVRDLVGSVTRPVKELKGFEKISLKPGESKVVTFTINS 666


>gi|319901343|ref|YP_004161071.1| glycoside hydrolase 3 [Bacteroides helcogenes P 36-108]
 gi|319416374|gb|ADV43485.1| glycoside hydrolase family 3 domain protein [Bacteroides helcogenes
           P 36-108]
          Length = 781

 Score =  258 bits (659), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 212/744 (28%), Positives = 340/744 (45%), Gaps = 140/744 (18%)

Query: 81  LGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLW 140
           L  +A    RLG+P   +  E  HG   +G           AT FPT +   ++++ESL 
Sbjct: 118 LQKYAVEETRLGIPVL-FAEECPHGHMAIG-----------ATVFPTALSAASTWDESLM 165

Query: 141 KKIGQAVSTEARAM-YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYV 199
           +++G+A++ EAR    N+G      + P ++VAR+PRW R+ ET GEDP +     V  +
Sbjct: 166 QQMGEAIALEARLQGANIG------YGPVLDVAREPRWSRMEETFGEDPVLTSVMGVALM 219

Query: 200 RGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETF-- 257
           +G+Q         D+ +    + S  KH+AAY      GV     +       M + F  
Sbjct: 220 KGMQG--------DVQNDGKHLYSTLKHFAAY------GVPESGHNGSRANSGMRQLFSE 265

Query: 258 -LRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM 316
            L PF+  V+ G A ++M SYN ++G+P  ++  LL + +R +W   G++ +D  SI+ +
Sbjct: 266 YLPPFKKAVEAG-AGTIMTSYNSIDGVPCTSNKFLLTEVLRNQWGFKGFVYSDLISIEGI 324

Query: 317 VDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTV 375
           V   +   D+KE A A+ L+AGLD+D  G  +      A ++G +   D+D+++  +  +
Sbjct: 325 V-GMRAAKDNKE-AAAKALRAGLDMDLGGDAFGRNLKQAYEEGLITMDDLDRAVSNVLRL 382

Query: 376 LMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAV 435
             ++G F+          + I S E+ ELA   AREG+VLLKND   LPL+   +K +AV
Sbjct: 383 KFQMGLFENPYVSPEQAGKHIRSREHKELARRVAREGVVLLKND-GVLPLDK-HLKRIAV 440

Query: 436 VGPHANATVAMIGNYAGIPCRYM------SPIAGFSGYANVTYKTGC-------DDV--- 479
           +GP+A+     +G+Y     R           A  S    V Y  GC        D+   
Sbjct: 441 IGPNADMMYNQLGDYTAPQDRKEIVTVLDGVRAAVSKTTQVVYVKGCAVRDTTESDIPAA 500

Query: 480 -----------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWL 522
                            + +   + + ++ AA  ++   +L  +D     E  DR  L L
Sbjct: 501 VAAAQRADAVILVVGGSSARDFKTKYISTGAATVSEDIKVLPDMDC---GEGFDRSSLRL 557

Query: 523 PGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADV 582
            G Q +LIN VA   K P++++ ++   +++  A      +A+L A YPGE+GG  IAD+
Sbjct: 558 LGDQEKLINAVAATGK-PLVVIYIAGRAMNMNLAADKA--RALLAAWYPGEQGGAGIADI 614

Query: 583 VFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLS 642
           +FG +NP GRLP++         +P +   L    S G   R Y    G  LY FGYGLS
Sbjct: 615 LFGDYNPAGRLPVS---------IPRSEGQLPVFYSQGTQ-RDYVEEKGTPLYAFGYGLS 664

Query: 643 YTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ 702
           YT+F Y+ L   K   V   +   C                                   
Sbjct: 665 YTKFVYSALEMRKGTDVETLQTVSC--------------------------------TVT 692

Query: 703 NVGSTDGSDVVIVY--------SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
           N G  DG +VV +Y        S+PP  + A        F+R+F++ G ++++ F+    
Sbjct: 693 NTGDRDGEEVVQLYICDEVASVSQPPILLKA--------FRRIFLKKGESRKVTFLLKK- 743

Query: 755 KSLNIVDYAANTLLPAGEHTIFVG 778
             L I D   N ++  G+  + VG
Sbjct: 744 DDLAIYDDEMNYVVEPGDFKVMVG 767


>gi|329851587|ref|ZP_08266344.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
 gi|328840433|gb|EGF90005.1| beta-xylosidase B [Asticcacaulis biprosthecum C19]
          Length = 883

 Score =  258 bits (659), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 169/522 (32%), Positives = 263/522 (50%), Gaps = 62/522 (11%)

Query: 28  NGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG 87
           N S   + VC     ++    + S  + D++     R  DLVSRM+L+EK  QL + A  
Sbjct: 11  NASVLALLVCLSAPTAQAQNPLESPAYQDTTKTAEQRAADLVSRMSLEEKAAQLINDAPA 70

Query: 88  VPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAV 147
           +PRLG+ +Y WW+E LHGV+  G           AT FP  +   A+F+E L  ++   +
Sbjct: 71  IPRLGVREYNWWNEGLHGVAAHG----------YATVFPQAVGMAATFDEPLIHRVADTI 120

Query: 148 STEARAMYNLGR---------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNY 198
           S E RA Y   R          GLT WSPNIN+ RDPRWGR  ET GEDP++  R  V +
Sbjct: 121 SVEFRAKYVASRHRFGGSDWFRGLTVWSPNINIFRDPRWGRGQETYGEDPYLTARIGVAF 180

Query: 199 VRGLQDVEGHENATDLNSRPL--KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEET 256
           V+GLQ  +           P+  +  +  KHYA   V +     R+  +   +  D+E+T
Sbjct: 181 VKGLQGED-----------PVYYRTIATPKHYA---VHSGPEASRHRDNINPSRYDLEDT 226

Query: 257 FLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI-QV 315
           +L  F   + EG A S+MC+YN ++G P+CA+  LL + +R +W   G++V+DCD++  +
Sbjct: 227 YLPAFRATIVEGKAVSIMCAYNAIDGQPACANDDLLVKHLRQDWGFKGFVVSDCDAVGDI 286

Query: 316 MVDNHKFLADSKEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYT 374
                     + E+ V    +AG DL CG     +   +AV++G + E+ +D +L  L++
Sbjct: 287 YYKTSHHYRPTPEEGVTVAYQAGTDLICGNANEADHVASAVRKGILPESLVDTALVRLFS 346

Query: 375 VLMRLGFFDGSPQ-YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
              +LG FD   Q + ++   D  +  N + +   A   +VLLKND   LPL S + +T+
Sbjct: 347 ARFKLGQFDPPAQVFPAITADDYDTQANRDFSQHVAESAMVLLKND-GLLPLKS-EPRTI 404

Query: 434 AVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKTGC-----------DDV 479
           AV+GP+A+   +++GNY G P   ++ +AG       A V Y  G            DD 
Sbjct: 405 AVIGPNADTMDSLVGNYNGDPSHPVTVLAGIKARFPNATVRYAQGSGLIDPVMTAVPDDS 464

Query: 480 ACKSNN--------SIFAASEAAKTADATIILAGLDLSVEAE 513
            C+  +        S FA+ E + TA  +   AG+  + + E
Sbjct: 465 FCRDKDCAAKGVTASHFASPEMSGTAQKSAAEAGIHQAWKGE 506



 Score =  130 bits (327), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 93/308 (30%), Positives = 150/308 (48%), Gaps = 55/308 (17%)

Query: 483 SNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQ 532
           S+     A  AAK +D  I +AGL   VE E +          DR  L LP  Q +++ Q
Sbjct: 593 SDTGAQEAVAAAKESDLVIFVAGLSQRVEGEEMRVETPGFSGGDRTSLDLPPVQQKVLEQ 652

Query: 533 VAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGR 592
           V+   K PV+LV+++   + + +A+ N  + AI+ A YPG +GG A+A ++ G F+P GR
Sbjct: 653 VSATGK-PVVLVLINGSALSVNWADKN--VPAIVEAWYPGGQGGAAVARLIAGDFSPAGR 709

Query: 593 LPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLS 652
           LP+T+Y     Q+   T   ++        GRTY+++ G  LYPFGYGLSYT+F Y    
Sbjct: 710 LPVTFYR-SADQIPAFTDYTMK--------GRTYRYFKGEALYPFGYGLSYTKFSYAPAK 760

Query: 653 FTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDV 712
            +                     A+K    G +             VD  N G+ DG +V
Sbjct: 761 LS---------------------AAKVAGNGEVT----------VSVDVTNSGARDGDEV 789

Query: 713 VIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGE 772
           V +Y   P +   T I+ +  F R+ ++AG  K + F  ++ ++L+ V+   +  +  G+
Sbjct: 790 VQLYLSHPGQ-KDTPIRALARFDRIHLKAGETKTVTFTLDS-RALSTVNADGSRSVKPGK 847

Query: 773 HTIFVGNG 780
             +++G G
Sbjct: 848 VNLWLGGG 855


>gi|336411808|ref|ZP_08592268.1| hypothetical protein HMPREF1018_04286 [Bacteroides sp. 2_1_56FAA]
 gi|335940152|gb|EGN02020.1| hypothetical protein HMPREF1018_04286 [Bacteroides sp. 2_1_56FAA]
          Length = 859

 Score =  258 bits (659), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 216/769 (28%), Positives = 338/769 (43%), Gaps = 146/769 (18%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-------------------------- 83
           ++F + ++SLP  +RV+DL+SRMTL+EK+ Q+                            
Sbjct: 22  TNFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNY 81

Query: 84  -FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
            F  G+                      RLG+P +   +E+LHG            V  G
Sbjct: 82  GFIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDG 129

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
           +T FP  I   ++FN  L  ++  A++ E      L   G+T   +P I+V RD RWGR+
Sbjct: 130 STIFPQAIALGSTFNPILAYEMTSAIAKE------LSAQGITQSLTPVIDVCRDLRWGRV 183

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            E  GEDPF+V R  V+ VRG  D +              VS   KH+ A+      G++
Sbjct: 184 EECFGEDPFLVSRMGVSQVRGYLDNQ--------------VSPMIKHFGAHGAPQ-GGLN 228

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
                    ++++   +L+ FE  VKE    +VM SYN  N  P+ +   L+ + +R  W
Sbjct: 229 LASVSC--GQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRW 286

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
           D  GY+ +D  +I ++   HK   +S E A+ Q L AGLD +            V+ G +
Sbjct: 287 DFQGYVYSDWGAIGMLNYFHKTAQNSAEAAI-QALTAGLDAEASDNSYAELQQLVENGML 345

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
               ID+++  + T    +G F+          + + +  ++ LA + A E IVLL+N+ 
Sbjct: 346 DVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405

Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------GIP-CRYMSPIAGFSGYANVTY 472
           N LPL   K+K++AV+GP  NA     G+Y        G+     +   AG      + Y
Sbjct: 406 NILPLQMNKLKSIAVIGP--NADQVQFGDYTWSRDNKDGVTLLEALKERAG--NQLTLNY 461

Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---------ESLDREDLWLP 523
             GC D+     +    A + AK +D  I++ G   +  A         E  D  DL L 
Sbjct: 462 AKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLT 520

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
           G Q  L+  +    K PVI+V++S  G  +A +    NI  I+   YPGE+GG A+AD++
Sbjct: 521 GVQEDLVEAIHATGK-PVIVVLLS--GKPLAMSWIKENIPGIVVQWYPGEQGGLALADML 577

Query: 584 FGKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
            GK NP G+L  ++         Y   LP      R   S   PG+ Y F +   L+ FG
Sbjct: 578 LGKVNPSGKLNYSFPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFG 637

Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
           +GLSYT F+Y  LS T + +                             D  C+D  E  
Sbjct: 638 HGLSYTDFEY--LSATTSKE-----------------------------DYACEDVIEVT 666

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
           +  +N G  DG +V  VY +         ++++ GF++V ++ G  K++
Sbjct: 667 IAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQV 715


>gi|198277570|ref|ZP_03210101.1| hypothetical protein BACPLE_03792 [Bacteroides plebeius DSM 17135]
 gi|198270068|gb|EDY94338.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           plebeius DSM 17135]
          Length = 753

 Score =  258 bits (659), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 220/764 (28%), Positives = 362/764 (47%), Gaps = 111/764 (14%)

Query: 64  RVKDLVSRMTLDEKVQQLG-----DFAHGVPRLGLPQYEWWS-------EALHGVSNVG- 110
           +V  L+S+MTL+EK+ Q+      DF     R+   + E  S       E ++ +  +  
Sbjct: 38  KVDSLLSQMTLEEKLGQMNQLSPWDFEELAARIR--KGEVGSILNVVNPEEINKIQKIAV 95

Query: 111 -------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGL 162
                  P     DVI G  T FP  +   A+FN  + ++  +  + EA A       G+
Sbjct: 96  EESRLGIPILVARDVIHGYKTIFPIPLGQAATFNPEIAEQGARVAAIEASA------DGI 149

Query: 163 TY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
            + ++P I+V+RDPRWGRI E+ GEDP++      N V G   ++G++   D  + P  +
Sbjct: 150 RWTFAPMIDVSRDPRWGRIAESCGEDPYL------NAVIGTAMIKGYQG--DSLNDPTAI 201

Query: 222 SSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVN 281
           ++C KH+ AY         R +    + E+ +   +L PF+     G A+  M S+N  +
Sbjct: 202 AACAKHFVAYGAAEG---GRDYNSTFIPERVLRNVYLPPFKAAANAGCAT-FMTSFNDND 257

Query: 282 GIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL 341
           G+PS A+  +L   +R EW   G +V D  S   MV NH F  D K DA  +++ AG+D+
Sbjct: 258 GVPSTANSFVLKDVLRKEWKYDGMVVTDWASALEMV-NHGFCTDGK-DAAEKSVNAGVDM 315

Query: 342 D-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
           +   + +      ++ + KV    ID +++ +  +  RLG FD    Y+   +    +++
Sbjct: 316 EMVSETFIQNLKQSISENKVSMETIDNAVRNILRLKFRLGLFDNP--YIVTPQSVKYAEK 373

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYM 458
           +++ A  AA + ++LLKN+  +LPL   KVKT+A++GP A+A    +G +   G      
Sbjct: 374 HLQAAKTAAEQSVILLKNENQSLPLTD-KVKTLAIIGPMADAPYEQMGTWVFDGEKEHTQ 432

Query: 459 SPIAG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
           +P+      +     V ++ G      KS   I  A   A+ +DA ++  G +  +  E+
Sbjct: 433 TPLTAIKKMYGDKVKVLFEKGLAYSRDKSTAGIARAISVARQSDAVVVFVGEESILSGEA 492

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
               +L L G Q+QLI ++A   K PV+ V+M+  G  +A A+      A+L++ +PG  
Sbjct: 493 HSLVNLNLQGAQSQLIKELAATGK-PVVTVVMA--GRQLAIADEVKVSDAVLYSFHPGTM 549

Query: 575 GGRAIADVVFGKFNPGGRLPITW--YNGD----YVQMLPLTSMPLRPVDSL-----GYPG 623
           GG AIAD++FGK NP G+ P+T+   +G     Y Q    T  P  P + L        G
Sbjct: 550 GGPAIADILFGKVNPSGKTPVTFPRMSGQVPIYYAQH--KTGRPANPTEMLIDEIPVEAG 607

Query: 624 RT----YKFY----NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
           +T      FY    N P L+PFGYGLSYT F+Y+                   NL+ TSD
Sbjct: 608 QTSVGCRSFYLDAGNSP-LFPFGYGLSYTTFEYS-------------------NLSLTSD 647

Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
                        L   D        +N G+ DG++VV +Y +         +K++  FQ
Sbjct: 648 ------------KLTAQDTLSISFTLKNTGNYDGTEVVQLYIQDKVGSVTRPVKELKRFQ 695

Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
           RV ++AG + ++         L    Y  N  +  G+  ++VG+
Sbjct: 696 RVTLKAGESTQVSLNL-PVSELAFWGYDMNYTVEPGDFRLWVGS 738


>gi|189460899|ref|ZP_03009684.1| hypothetical protein BACCOP_01546 [Bacteroides coprocola DSM 17136]
 gi|189432473|gb|EDV01458.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           coprocola DSM 17136]
          Length = 718

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 215/770 (27%), Positives = 349/770 (45%), Gaps = 92/770 (11%)

Query: 43  SKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEA 102
           S  G+  +  +F D ++    R+ DL++RMTLDEKV  LG+    VPRLG+ Q     E 
Sbjct: 15  STTGIIHAQNVFNDPAINEEQRLDDLIARMTLDEKVDALGNNTQ-VPRLGI-QASGSVEG 72

Query: 103 LHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN---LGR 159
           LHG+   GP T+ D      T FP       +++  L  ++   +STE R ++      +
Sbjct: 73  LHGIVLGGP-TYGDRANTPTTGFPQAYGLGETWDTDLLHRVATYISTENRYLFQNAKYRK 131

Query: 160 AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
           +GL  W+PN+++ RDPRWGR  E  GED F+  R AV +++G+Q           + +  
Sbjct: 132 SGLIMWTPNVDLGRDPRWGRTEECYGEDAFLTSRLAVAFIKGIQGD---------HPKYW 182

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           + +S  KH+    + N     R    +  +++   E +  PF   V EG + ++M +YN 
Sbjct: 183 RNASLMKHF----LSNSNEYGRTFSSSNYSDKLFREYYAYPFYKGVTEGGSQALMTAYNA 238

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
            NG P    P L N  V  EW L+G ++ D  + ++++ +HK   + +  A A  +KAG+
Sbjct: 239 YNGTPCIMHPVLRN-IVMKEWGLNGTLLTDGGAFRLLLSDHKRFDNDRAAAAAACIKAGI 297

Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDIC 397
                +Y  +    A+ +  +   DI+K+++    + ++LG  D +    Y ++G  D  
Sbjct: 298 TKFLDEY-KDAVYEALHRKLISVEDIEKAIRGNLRISLKLGLLDHAEDNPYAAIGVTDTI 356

Query: 398 SD----ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
           +     E   L  EA  + IVLLKN  + LPL+  K+K +AV+G    AT  +   YAG 
Sbjct: 357 APWSKPETKALVREATLKSIVLLKNQDHLLPLDRHKIKKIAVIG--QRATEVLQDWYAGK 414

Query: 454 PCRYMSPIAGFSGYANVTYKTGCD-DVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
           P   ++ +        +  + G D +V     N + +A   A  AD  I+  G   +  A
Sbjct: 415 PFYTVNVLDA------IREEAGNDIEVRYVKTNRMDSARTVAAWADVAIVCVGNHPTCNA 468

Query: 513 ------------ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
                       E++DR+ L     Q  L+ QVA+     + ++I S      A    N 
Sbjct: 469 GWEQAPVISEGKEAVDRQSL--QLDQEDLLLQVAQTNPNTIGVLISS---FPYAINRANQ 523

Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
            + A+L      +E G A++DV+FG +NP GRL  TW          +T +P   +D   
Sbjct: 524 TVPALLHLTQCSQELGHAVSDVIFGHYNPAGRLTQTWVKN-------ITDLP-HMMDYDI 575

Query: 621 YPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
             GRTY ++    LYPFGYGLSYT+F Y+  +    +    + L+ C NL          
Sbjct: 576 THGRTYMYFKEKPLYPFGYGLSYTRFNYSGTTLNDRVIERGDTLRVCFNL---------- 625

Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
                                +N G  DG +VV +Y           IKQ+  FQR+ +R
Sbjct: 626 ---------------------KNSGDMDGDEVVQLYVSARKHTDKDPIKQLKAFQRISLR 664

Query: 741 AGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNF 790
            G  K+++      +     +  +  +LP  E T+ +G       +   F
Sbjct: 665 KGETKKVELTVPYTELQVWDEKQSRFILPDKEMTLEIGASSSDIRLRTTF 714


>gi|427387354|ref|ZP_18883410.1| hypothetical protein HMPREF9447_04443 [Bacteroides oleiciplenus YIT
           12058]
 gi|425725515|gb|EKU88386.1| hypothetical protein HMPREF9447_04443 [Bacteroides oleiciplenus YIT
           12058]
          Length = 786

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 232/816 (28%), Positives = 349/816 (42%), Gaps = 149/816 (18%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSN 108
           ++ D S P   RVKDL+S+MT++EK  Q+    +G  R+    LP  +W +E    G++N
Sbjct: 41  VYEDPSAPLEARVKDLLSQMTMEEKTCQMATL-YGSGRVLKDSLPTEQWKNEIWKDGIAN 99

Query: 109 VG---------------------------------------PGTHFDDVIPG-----ATS 124
           +                                        P    ++ I G     AT 
Sbjct: 100 IDEQANGLGKFGSSLSYPYVNSVENRQAIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATM 159

Query: 125 FPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETP 184
           FP      A++N+ L  +I +  + EA+A   LG   +  +SP +++A+DPRWGR+ E  
Sbjct: 160 FPAQCGQGATWNKELISEIAKVTAEEAKA---LGYTNI--YSPILDIAQDPRWGRVVECY 214

Query: 185 GEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHF 244
           GEDPF+VG      ++GLQ       A  L S P       KH+A Y +           
Sbjct: 215 GEDPFLVGELGKRMIKGLQ-------AEGLVSTP-------KHFAVYSIPVGGRDAGTRT 260

Query: 245 DARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHG 304
           D  V  ++M   ++ PF     E  A  VM SYN  +G P       L + +R EW   G
Sbjct: 261 DPHVAPREMRTLYIEPFRKAFCEAGALGVMSSYNDYDGEPITGSYHFLTEILRHEWGFKG 320

Query: 305 YIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAV 355
           Y+V+D ++++ +   H   A++  D  AQ + AGL++      TNFT           A+
Sbjct: 321 YVVSDSEAVEFLYSKHNVAANAV-DGAAQVINAGLNVR-----TNFTLPENFIRPLRQAI 374

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREGI 413
            +GKV E  ID  +  +  V   +G FD +P      K +  + S E+  ++  AA E I
Sbjct: 375 SEGKVSEQTIDSRVADVLRVKFMMGLFD-NPYKGDAKKPEKVVHSKEHQAVSMRAALESI 433

Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANV 470
           VLLKN+ N LPL S   K VAV+GP+A     +I  Y        +   G   Y   A+V
Sbjct: 434 VLLKNENNILPL-SKSTKKVAVIGPNAAEVDNLICRYGPANAPIKTVYQGIKDYLPDADV 492

Query: 471 TYKTGCD------------DVACKSNNS--IFAASEAAKTADATIILAGLDLSVEAESLD 516
            Y  G D            DV    +    I  A   AK +D  I++ G +     E   
Sbjct: 493 RYAKGADIIDKYFPESELYDVPLDKDEQAMIDEAVALAKESDVAIMVLGGNEKTVREEYS 552

Query: 517 REDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGG 576
           R +L L G Q +L+  V    K PV+L+++      I +AE    I  I+ A +PGE  G
Sbjct: 553 RTNLDLCGRQEKLLQAVYATGK-PVVLLLVDGRAATINWAE--HYIPGIVHAWFPGEFMG 609

Query: 577 RAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLY 635
            A+A V+FG +NPGG+L +T+     V  +P  + P +P  DS G+   T       TLY
Sbjct: 610 DAVAKVLFGDYNPGGKLAVTFPRS--VGQIPF-AFPFKPGSDSKGFVRVT------GTLY 660

Query: 636 PFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYF 695
           PFGYGLSYT F Y+ L     +                               +      
Sbjct: 661 PFGYGLSYTTFAYSDLKIENPV-------------------------------IGVQGSV 689

Query: 696 EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACK 755
           +     +N G   G +VV +Y         TY+K + GF+RV +  G  K + FV    +
Sbjct: 690 KLSCKVKNTGKVAGDEVVQLYLHDEMSSVTTYVKVLRGFERVHLEPGEEKTVNFVLTP-Q 748

Query: 756 SLNIVDYAANTLLPAGEHTIFVGNGGVSFPIHLNFN 791
            L + +   + ++  G   + VG+      +   F 
Sbjct: 749 ELGLWNKDNHFVVEPGTFAVMVGSSSQDIRLQDKFE 784


>gi|257051950|ref|YP_003129783.1| glycoside hydrolase family 3 domain protein [Halorhabdus utahensis
           DSM 12940]
 gi|256690713|gb|ACV11050.1| glycoside hydrolase family 3 domain protein [Halorhabdus utahensis
           DSM 12940]
          Length = 783

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 208/726 (28%), Positives = 323/726 (44%), Gaps = 117/726 (16%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P  E   E L G              PG T FP  I   ++++ +L + I  ++  
Sbjct: 103 RLGIPALEH-EECLTGYRG-----------PGGTIFPQSIGLASTWSPALVESITDSIRK 150

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQ-DVEGH 208
              A+       +   SP ++V+RD RWGR+ ET GEDP +VG     YV GLQ D +G 
Sbjct: 151 RLAAV-----GAVQALSPVLDVSRDMRWGRVEETYGEDPQLVGALGAAYVSGLQNDGDG- 204

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEG 268
                       + +  KH+AA+      G +R     ++ E+++ E  L PFE+ ++E 
Sbjct: 205 ------------IDATLKHFAAHG-SGEGGKNRSSV--QIGERELREVHLYPFEVAIREA 249

Query: 269 DASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKE 328
           DA +VM +Y+ ++G+P  +   LL   +RGEW   G++VAD  S+ ++   H  +AD++ 
Sbjct: 250 DARAVMNAYHDIDGVPCASSEWLLTDVLRGEWGFDGHVVADYFSVDLLKTEHG-IADTQR 308

Query: 329 DAVAQTLKAGLDL-----DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
           +A    L+AGLD+     DC  Y  N    AV+ G++ E  +D +++ +    +  G FD
Sbjct: 309 EAGVAALEAGLDIELPATDC--YGENLL-KAVEDGELSEATVDTAVRRVLRAKIESGVFD 365

Query: 384 GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
                     +   +DE  ELAA AARE + LL+ND + LPL    + +VA+VGP A+  
Sbjct: 366 DPYVDPEAASEPFDTDEQTELAARAARESMTLLEND-DLLPLAGEDLDSVALVGPQADDG 424

Query: 444 VAMIGNYAG--------------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFA 489
            A +G+Y                +  R      G +   +V Y  G   +   S     A
Sbjct: 425 RAQVGDYTHAARFDTEEDGDFECVTPRDALEAKGETAGFDVEYVEGA-TMTGPSTEEFDA 483

Query: 490 ASEAAKTADATIILAGL----------------DLSVEAESLDREDLWLPGYQTQLINQV 533
           A E    AD  +   G                 D+    E+ D  DL LPG Q +LI+++
Sbjct: 484 AEETVADADVAVACVGARSDIDFADRENPSELPDVPTSGENCDVTDLELPGVQAELIDRL 543

Query: 534 AEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRL 593
           AE       LV++   G   A  E    + A+L A  PG+ GG AIADV+FG++NP G L
Sbjct: 544 AETD---TPLVVVQVSGKPHAIPEIAETVPALLHAWLPGQAGGTAIADVLFGEYNPSGHL 600

Query: 594 PITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSF 653
           P++       Q +  +  P             + + +G  LY FG+GLSYT F+Y  L  
Sbjct: 601 PVSIPKSVGQQPVYYSRKP-------NSANEEHVYMDGEPLYSFGHGLSYTDFEYGELEL 653

Query: 654 TKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVV 713
            +     +  L                                  V   N G   G DVV
Sbjct: 654 EEGTVEPMGSLSAS-------------------------------VTVTNAGERAGDDVV 682

Query: 714 IVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEH 773
            +Y        A  +++++GF+RV +  G +KR+ F F+A + L   D   +  +  G +
Sbjct: 683 QLYQHAENPSQARPVQELLGFERVHLEPGESKRVTFTFDATQ-LAYYDLNMHLAVEEGPY 741

Query: 774 TIFVGN 779
            + VG 
Sbjct: 742 ELRVGE 747


>gi|299140913|ref|ZP_07034051.1| periplasmic beta-glucosidase [Prevotella oris C735]
 gi|298577879|gb|EFI49747.1| periplasmic beta-glucosidase [Prevotella oris C735]
          Length = 767

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 204/696 (29%), Positives = 320/696 (45%), Gaps = 102/696 (14%)

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT-YWSPNINVARDPRWGRI 180
           ATSFP       +++ +L ++I    + EA A+      G T  ++P ++V+RDPRWGR+
Sbjct: 119 ATSFPAQCGQGVTWDRALIRQIANVTAQEASAL------GYTNVYAPILDVSRDPRWGRV 172

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            E   E P++ G      V GLQ     EN         ++ S  KH+A Y +      +
Sbjct: 173 VECYSESPYLAGELGKQMVLGLQ-----EN---------RIVSTPKHFAVYSLPVGGRDE 218

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
               D  V  ++M+   L PF   ++EG A  VM SYN  +G P    P  L + +R +W
Sbjct: 219 GTRTDPHVAPKEMKTLLLEPFRKAIQEGGALGVMSSYNDYDGEPITGSPYFLTELLRHQW 278

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT--------- 351
             HGY+V+D ++++ +   H  +A ++E+  A  + AGLD+      TNF+         
Sbjct: 279 GFHGYVVSDSEAVEFLSSKHH-VAANREEGAAMAINAGLDVR-----TNFSMPETFILPL 332

Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAA 409
             A+  G V    +D  +K +  V   LG FD +P   ++ + D  + S  + +L+  AA
Sbjct: 333 RQALTDGLVSMQILDARVKDVLYVKFWLGLFD-NPYRGNVNEVDQVVHSKAHQQLSLRAA 391

Query: 410 REGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY-- 467
            E IVLLKN+ N LPL S  +K +AV+GP+A+AT A +  Y        S ++G      
Sbjct: 392 LESIVLLKNENNLLPL-SKSLKRIAVIGPNADATTAHVCRYGPANAPIKSVLSGIRESMP 450

Query: 468 -ANVTYKTGCD------------DVACKSNNS--IFAASEAAKTADATIILAGLDLSVEA 512
            A V Y  GC             +VA  +     I  A   A+ +D  +++ G       
Sbjct: 451 GAEVRYAKGCSIVDKHFPESELYEVALDTTEQRMIDEAVGVARQSDVAVVVLGGSEETVR 510

Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
           E   R DL L G Q QL+  V    K PV+LV++      I +A  N  + AI+   +PG
Sbjct: 511 EEYSRTDLNLMGRQEQLLRAVYATGK-PVVLVLLDGRAATINWA--NQYVPAIVHGWFPG 567

Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNG 631
           E  G A+A V+FG +NPGG+L +T+     V  +P  + P +P  DS G P R     +G
Sbjct: 568 EFTGTAVAKVLFGDYNPGGKLAVTFPKS--VGQIPY-AFPFKPGADSKG-PVRV----DG 619

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
             LYPFGYGLSYT F Y+    +K +                               +  
Sbjct: 620 -ALYPFGYGLSYTTFAYSDFHISKPV-------------------------------IGI 647

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
               E     +N G  +G ++V +Y +       TY K + GF+R+ ++AG    ++F+ 
Sbjct: 648 QGETEVSCKVRNTGQREGDEIVQLYIRDDISSVTTYQKSLRGFERIHLKAGEETTVRFML 707

Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPIH 787
              + L++ +     ++  G  TI +G       +H
Sbjct: 708 TP-RDLSLWNKHEEFVVEPGTFTIMIGRSSEDICLH 742


>gi|325299987|ref|YP_004259904.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
 gi|324319540|gb|ADY37431.1| Beta-glucosidase [Bacteroides salanitronis DSM 18170]
          Length = 864

 Score =  258 bits (658), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 158/441 (35%), Positives = 233/441 (52%), Gaps = 41/441 (9%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGAT 123
           R  DLV R+TL+EK   + + +  +PRLG+  Y+WW+EALHGV   G           AT
Sbjct: 37  RANDLVGRLTLEEKASLMQNTSPAIPRLGIKAYDWWNEALHGVGRAGI----------AT 86

Query: 124 SFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYWSPNINVARDP 175
            FP  I   ASF++ L  ++  AVS EARA Y   R         GLT+W+PN+N+ RDP
Sbjct: 87  VFPQTIGMAASFDDELLYQVFTAVSDEARAKYTQFRKEGDLKRYQGLTFWTPNVNIFRDP 146

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL-KVSSCCKHYAAYDVD 234
           RWGR  ET GEDP++  +  +  VRGLQ   G E+A      P  K+ +C KH+A +   
Sbjct: 147 RWGRGQETYGEDPYLTSQMGMAVVRGLQ---GPEDA------PYDKLHACAKHFAVHSGP 197

Query: 235 NWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
            W   +R+ F+A  +  +D+ ET++  F+  V++     VMC+YNR+ G P C + +LL 
Sbjct: 198 EW---NRHEFNAENIAPRDLWETYMPAFKDLVQKAHVKEVMCAYNRLEGEPCCGNNRLLT 254

Query: 294 QTVRGEWDLHGYIVADCDSIQVM--VDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT 351
             +R EW   G +V+DC +I       +H+   D K  A A  + +G DL+CG  Y +  
Sbjct: 255 HILRDEWGYQGIVVSDCGAISDFWRKGDHETHPD-KAHASAGAVLSGTDLECGSNYKSLP 313

Query: 352 GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAARE 411
             AV+ G + E+ +D S+K L      LG  D    + ++    +    + +LA   ARE
Sbjct: 314 -EAVKAGLIAESQLDISVKRLLKARFELGEMDKDVCWDTIPYSVVDCQAHKDLALRMARE 372

Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---A 468
            IVLL+N  N LPL   K   +A+VGP+AN ++   GNY G P    +           +
Sbjct: 373 SIVLLQNRNNILPLR--KDMKIALVGPNANDSIMHWGNYNGFPSHTETLYEALKKRLPAS 430

Query: 469 NVTYKTGCDDVACKSNNSIFA 489
            + Y+ GCD  +  +  S+FA
Sbjct: 431 QLIYEFGCDRTSPVALESVFA 451



 Score =  108 bits (271), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 82/300 (27%), Positives = 129/300 (43%), Gaps = 54/300 (18%)

Query: 489 AASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAK 538
           A ++  K AD  +   G+  ++E E +          DR  + LP  Q QL+ ++ ++ K
Sbjct: 593 ATADKVKDADVILFAGGISPTLEGEEMPVDAEGFRGGDRTSIELPAIQRQLVGELKKLGK 652

Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
            P++ +  S  G  +  A  +     ++ A YPG+ GG AIADV+FG +NP G+LP+T+Y
Sbjct: 653 -PIVFINYS--GSAMGLAPESEICDGMIQAWYPGQAGGTAIADVLFGDYNPAGKLPVTFY 709

Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
                + LP         +     GRTY++     L+ FG+GLSYT F Y         +
Sbjct: 710 RN--TEQLP-------DFEDYAMKGRTYRYMTETPLFRFGHGLSYTTFDYG------KAR 754

Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSK 718
           ++ N       L  T                         +   N G+ DG + V VY +
Sbjct: 755 LSQNTFSKGETLTLT-------------------------IPVSNTGTRDGEETVQVYLR 789

Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
            P +  A     +  F+RV+V  G  K IKF  +    L       N  L +GE+ +  G
Sbjct: 790 RPGDADAPS-HTLRAFKRVYVPKGGTKEIKFTLSDDNFLWFDTSTNNMNLISGEYELLYG 848


>gi|332881172|ref|ZP_08448831.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           329 str. F0087]
 gi|357047867|ref|ZP_09109460.1| glycosyl hydrolase family 3 protein [Paraprevotella clara YIT
           11840]
 gi|332680886|gb|EGJ53824.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           329 str. F0087]
 gi|355529206|gb|EHG98645.1| glycosyl hydrolase family 3 protein [Paraprevotella clara YIT
           11840]
          Length = 851

 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 159/415 (38%), Positives = 226/415 (54%), Gaps = 39/415 (9%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPG 112
           LF D   P   R+ DL+SR+T++EK+  L + A  + RLG+ +Y   +EALHGV  V PG
Sbjct: 28  LFRDMKAPQHERIMDLLSRLTVEEKISLLVNDAPAIGRLGIDKYNHGNEALHGV--VRPG 85

Query: 113 THFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAG----------L 162
                     T FP  I   A +N  L  +I  A+S EAR  +     G          L
Sbjct: 86  DF--------TVFPQAIGMAAMWNPELLYRISSAISDEARGRWKELEYGKKQIAGASDLL 137

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           T+WSP +N+ARDPRWGR  ET GEDP++ G   V +V+GLQ           + R LK  
Sbjct: 138 TFWSPTVNMARDPRWGRTPETYGEDPYLSGVLGVAFVKGLQGD---------HPRYLKTV 188

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           S  KH+A  + ++    +R   +A+V+E+D+ E +L  FE C+ EG A S+M +YN VN 
Sbjct: 189 STPKHFAVNNEEH----NRSSCNAKVSERDLREYYLPSFERCITEGKAQSIMMAYNAVND 244

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +P   +  L+   +RG+W  +GYIV+DC + + M+  H ++  ++E A    +K GLDL+
Sbjct: 245 VPCTVNTYLIKNVLRGDWGFNGYIVSDCSAPEWMITKHHYV-KTREAAATLAVKVGLDLE 303

Query: 343 CG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSD 399
           CG Q Y      A +Q  V E DID +   +    M LG FD   Q  Y  +    +   
Sbjct: 304 CGNQVYGEGLLKAYRQYMVSEADIDSAAYRILRGRMMLGLFDAPSQNPYNQIEPSVVGCK 363

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIP 454
            + +LA EAAR+ +VLLKN  N LPLN  KVK++AVVG   +A     G+Y+G P
Sbjct: 364 AHQDLALEAARQSMVLLKNKDNFLPLNPKKVKSIAVVG--ISAGHCEFGDYSGTP 416



 Score =  140 bits (353), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 91/289 (31%), Positives = 139/289 (48%), Gaps = 47/289 (16%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A + A   D T+ + G++ S+E E  DR  L LP  Q + I ++ +V    V++++    
Sbjct: 596 AGKVAAECDVTVAVLGINKSIEREGQDRFSLELPVDQQEFIKELYKVNPNTVVVLV---A 652

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
           G  +A    + N+ AIL A YPGE+GG A+A+V+FG +NPGGRLP+T+YN        L 
Sbjct: 653 GSSMAVNWMDENVPAILNAWYPGEQGGNAVAEVLFGDYNPGGRLPLTYYNS-------LD 705

Query: 610 SMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRN 669
            +P    D+    GRTY+++ G  LY FGYGLSYT+F+Y      K+  VN+ +      
Sbjct: 706 EIP--AFDNYSVKGRTYQYFEGQPLYEFGYGLSYTKFRY------KSKGVNVEQ------ 751

Query: 670 LNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIK 729
                                  D  +   +  N G  DG +V  VY K P       +K
Sbjct: 752 -----------------------DTVKVSFEVSNTGKYDGDEVAQVYVKYPETGTYMPLK 788

Query: 730 QVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           Q+ GF+RV ++ G+  ++             +     + P GE+T  VG
Sbjct: 789 QLHGFKRVHIKKGKTSKVTIGVPRKDLRYWYEQERKFITPKGEYTFMVG 837


>gi|448410571|ref|ZP_21575276.1| beta-glucosidase [Halosimplex carlsbadense 2-9-1]
 gi|445671607|gb|ELZ24194.1| beta-glucosidase [Halosimplex carlsbadense 2-9-1]
          Length = 760

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 206/694 (29%), Positives = 316/694 (45%), Gaps = 104/694 (14%)

Query: 120 PGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGR 179
           P  T+FP  I   ++++  L   +   +  +  A   +G A     SP ++VARD RWGR
Sbjct: 102 PEGTTFPQGIGMASTWDPDLMAAVTDTIGDQLEA---IGTA--HALSPVLDVARDLRWGR 156

Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
           + ET GEDP++V   A  YV GLQ           +S    +S+  KH+  + V    G 
Sbjct: 157 VEETYGEDPYLVAEMATAYVDGLQG----------DSPADGISATLKHFVGHAV-GAGGK 205

Query: 240 DRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGE 299
           +R   D  V+ + + E  + PFE  ++EG+A SVM +Y+ ++G+P   D  LL   +RGE
Sbjct: 206 NRSSVD--VSRRTLREVHMFPFEAAIQEGNAESVMNAYHDIDGVPCAKDEWLLTDVLRGE 263

Query: 300 WDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNA 354
           W   G +V+D  S+  + + H   A  +E AV+  ++AG+D+     DC +Y       A
Sbjct: 264 WGFDGTVVSDYFSVDFLKEEHGVAATQQEAAVS-AVEAGVDVELPNTDCYEYLA----EA 318

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
           V+ G + E  +D+S++ +       G F+     V         +  + LA EAAR+ +V
Sbjct: 319 VRDGDLAEESLDESVRRVLRAKFEKGLFEEYTVDVDAATDPYEDEAAVGLAREAARDSLV 378

Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY------------MSPIA 462
           +LKN+ + LPL+ A   +VAVVGP A+    M+G+YA     Y            +S I 
Sbjct: 379 VLKNESDLLPLDDA--DSVAVVGPKADDKKGMLGDYA-YAAHYPEEEYEFEADTPLSAIE 435

Query: 463 GFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE----------- 511
              G A+V Y  GC      S + I  A EAA+ AD  +   G   +V+           
Sbjct: 436 NRVG-ADVNYAQGC-TATGNSTDKIGRAVEAAENADVALAFVGARSAVDFSDADGVKAEQ 493

Query: 512 ------AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
                  E  D  DL LPG Q +L+ QV E    PV++V++S  G   A  E +    A+
Sbjct: 494 PMVPTSGEGCDVTDLGLPGVQNELVAQVEET-DTPVVIVLVS--GKPHAIPEIDAGADAV 550

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRT 625
           + A  PGEE G AI DVVF   + GG LP++           +  +P+            
Sbjct: 551 VQAWLPGEEAGNAIVDVVFEGHDSGGHLPVSMPKS-------VGQLPVHYSRKPNTYSED 603

Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVL 685
           Y + +   +YPFG+GLSY +F+Y+ L  +                               
Sbjct: 604 YVYDDAQPVYPFGHGLSYAEFEYSDLDLSDVDVDPSGT---------------------- 641

Query: 686 VNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNK 745
                    F   V  +N    DGSDVV +Y        A  +++++GF+RV + AG + 
Sbjct: 642 ---------FSASVTVENTAERDGSDVVQLYVSAENPDLARPVQELVGFRRVELDAGEST 692

Query: 746 RIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGN 779
            I F   A   L   D  AN  + AG++ + VG+
Sbjct: 693 EITFDL-AASQLAYHDRNANLAVEAGDYELRVGH 725


>gi|423223593|ref|ZP_17210062.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392638218|gb|EIY32065.1| hypothetical protein HMPREF1062_02248 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 863

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 167/460 (36%), Positives = 235/460 (51%), Gaps = 43/460 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+SL    R + LV  +TL+EK   + D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 23  YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAGL-- 80

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   ASFN  +  ++  AVS EARA      +        GLT W
Sbjct: 81  --------ATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERYQGLTMW 132

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +P +N+ RDPRWGR  ET GEDP++  R  V  V+GLQ           + +  K+ +C 
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQG--------PADGKYDKLHACA 184

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ F+A  +  +D+ ET+L PFE  VKEG    VMC+YNR  G P
Sbjct: 185 KHFAVHSGPEW---NRHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYNRFEGDP 241

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN--HKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +RGEW   G +V+DC +I    ++  H    D+ E A A  + +G DL+
Sbjct: 242 CCGSDRLLMQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDA-ESASAAAVISGTDLE 300

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK---QDICSD 399
           CG  Y      +V++G + E  +D S+K L      LG  D  P+ VS  K     + S 
Sbjct: 301 CGSSYKALI-ESVKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPFSVVASA 358

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
            +  LA   ARE + LL N  N LPL    + TVAV+GP+AN +V   GNY G+P   ++
Sbjct: 359 AHDSLALNMARESMTLLMNKDNFLPLKRGGL-TVAVMGPNANDSVMQWGNYNGMPAHTVT 417

Query: 460 PIAGFSGYA----NVTYKTGCDDVACKSNNSIFAASEAAK 495
            + G          + Y+ GC  V      S F+  ++ K
Sbjct: 418 ILDGVRNLLGTDDKLIYEQGCPWVERTLIQSAFSQCKSDK 457



 Score =  124 bits (312), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 95/312 (30%), Positives = 144/312 (46%), Gaps = 56/312 (17%)

Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
           D+  K +  I  + E  K AD  I  +G+  S+E E +          DR D+ LP  Q 
Sbjct: 581 DLGFKKDVDIRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQR 640

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
           +LI+ +    K    +++++  G  I         +AIL A YPG++GG+A+A+V+FG +
Sbjct: 641 ELIDALHRAGKK---IILVNCSGSPIGLEPETQKCEAILQAWYPGQQGGKAVAEVLFGDY 697

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
           NP G+LP+T+Y    V  LP         +     GRTY++     L+PFGYGLSYT F 
Sbjct: 698 NPAGKLPVTFYRN--VSQLP-------DFEDYNMTGRTYRYMQDVPLFPFGYGLSYTTFG 748

Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
           Y      KT+   L+K                       N+L      +  V   N G  
Sbjct: 749 YG-----KTV---LDK-----------------------NELTAGQSLKLTVPVTNTGKR 777

Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
           +G +VV VY +   + A   IK +  F+RV + AG+   ++F     K L   D  +NT+
Sbjct: 778 NGEEVVQVYLRKQGD-AEGPIKTLRAFKRVSIPAGKTVNVEFDLKD-KELEWWDDQSNTV 835

Query: 768 -LPAGEHTIFVG 778
            +  G + I VG
Sbjct: 836 RVCPGNYDIMVG 847


>gi|150003144|ref|YP_001297888.1| glycoside hydrolase family protein [Bacteroides vulgatus ATCC 8482]
 gi|149931568|gb|ABR38266.1| glycoside hydrolase family 3, candidate beta-glycosidase
           [Bacteroides vulgatus ATCC 8482]
          Length = 785

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 206/703 (29%), Positives = 326/703 (46%), Gaps = 126/703 (17%)

Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAM-YNLGRAGLTYWSPNINVARDPRWGR 179
           G T FPT +   +++NE L  K+G+A++ EAR    N+G      + P ++VAR+PRW R
Sbjct: 150 GTTVFPTALSAASTWNEGLMLKMGEAIALEARLQGANIG------YGPVLDVAREPRWSR 203

Query: 180 ITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGV 239
           + ET GEDP +     V  ++G+Q          + +    + +  KH+AAY V      
Sbjct: 204 MEETFGEDPVLTTIMGVAMMKGMQG--------KVQNDGKHLYATLKHFAAYGVP----- 250

Query: 240 DRYHFDARVT--EQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVR 297
           +  H  +R     + +   +L PF   VKEG A ++M SYN ++G+P  A+ +LL   +R
Sbjct: 251 ESGHNGSRANCGMRQLLSEYLPPFRKAVKEG-AGTLMTSYNAIDGVPCTANKELLTDVLR 309

Query: 298 GEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQ 356
            +W   G++ +D  SI+ +V   +   D+KE AV + LKAGLD+D G   +      A +
Sbjct: 310 NQWGFKGFVYSDLISIEGIV-GMRAAKDNKEAAV-KALKAGLDMDLGGNAFGKNLKKAYE 367

Query: 357 QGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLL 416
           +G +   D+D+++  +  +  ++G F+       L K+ + S E+ ELA + AREG+VLL
Sbjct: 368 EGLITMADLDRAVGNVLRLKFQMGLFENPYVSPELAKKLVHSKEHKELARQVAREGVVLL 427

Query: 417 KNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPI------AGFSGYANV 470
           KN+   LPL S  +  +AV+GP+A+     +G+Y     R           A  S    V
Sbjct: 428 KNE-GVLPL-SKHIGHLAVIGPNADEMYNQLGDYTAPQVREEVATVLDGIRAAVSESTRV 485

Query: 471 TYKTGC---DDVA------------------------CKSNNSIFAASEAAKTADATIIL 503
           TY  GC   D  A                         +   + + ++ AA  ++    L
Sbjct: 486 TYVKGCAVRDTTATDIPAAVAAAQKADAVVLVVGGSSARDFKTKYISTGAATVSEDAKTL 545

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
             +D     E  DR  L L G Q +LI+ VA   K P+++V +    +++  A      +
Sbjct: 546 PDMDC---GEGFDRSSLRLLGDQEKLISAVASTGK-PLVVVYIQGRTMNMNLAAEKA--Q 599

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG 623
           A+L A YPGE+GG  IAD++FG ++P GRLP++         +P +   L    S G   
Sbjct: 600 ALLTAWYPGEQGGMGIADILFGDYSPAGRLPVS---------VPRSEGQLPVFYSQGTQ- 649

Query: 624 RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPG 683
           R Y    G  LY FGYGLSYT+F Y+ L   K  ++   +   C                
Sbjct: 650 RDYVESKGTPLYAFGYGLSYTRFTYSGLELQKGTEMETLQTVAC---------------- 693

Query: 684 VLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY--------SKPPAEIAATYIKQVIGFQ 735
                              N G+ DG +VV +Y        S+PP  + A        FQ
Sbjct: 694 ----------------TVTNTGNRDGEEVVQLYIGDKVASVSQPPLLLKA--------FQ 729

Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           R+F++ G ++++ F       L I D   N ++  GE  + VG
Sbjct: 730 RIFLKKGESRQVIFHLKK-DDLGIYDSEMNYVVEPGEFKVMVG 771


>gi|332665860|ref|YP_004448648.1| beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
 gi|332334674|gb|AEE51775.1| Beta-glucosidase [Haliscomenobacter hydrossis DSM 1100]
          Length = 887

 Score =  257 bits (657), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 151/424 (35%), Positives = 229/424 (54%), Gaps = 34/424 (8%)

Query: 52  FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           F   D++L + +RVKDLVSR+TL+EKV Q+ + A  +PRLG+P Y+WW+E LHGV+    
Sbjct: 40  FPMWDTNLSFEVRVKDLVSRLTLEEKVGQMLNAAPAIPRLGIPAYDWWNEVLHGVAR--- 96

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN----LGR-----AGL 162
            T F       T +P  I   A ++ +    +    + E RA++N    LGR      GL
Sbjct: 97  -TPFH-----VTVYPQAIGMAAGWDSTSLAMMAHYSALEGRAVFNKATALGRNNERYLGL 150

Query: 163 TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           TYW+PNIN+ RDPRWGR  ET GEDPF+       +VRGLQ  +          + LK +
Sbjct: 151 TYWTPNINIFRDPRWGRGQETYGEDPFLTSMLGRAFVRGLQGDD---------PKYLKAA 201

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           +C KH+A   V +     R+  +   +  D+ +T+L  F+  V +     VMC+YN  +G
Sbjct: 202 ACAKHFA---VHSGPEPSRHSDNFSPSNYDLWDTYLPAFKELVTKAKVEGVMCAYNAFHG 258

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
            P C    L+N  +R +W   GY+ +DC +I      HK   D+   +V   L  G D++
Sbjct: 259 QPCCGSDVLMNDILRKQWQFKGYVTSDCWAIDDFFKFHKTHPDATSASVDAVLH-GTDVE 317

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD--GSPQYVSLGKQDICSDE 400
           CG        + V++G + E  +D SL  L+T   RLG FD     +Y    +  + + E
Sbjct: 318 CGTDVYKSLLDGVKKGMIAEAQLDISLIRLFTTRYRLGMFDPVSMVKYAQTPESILETAE 377

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSP 460
           +   + + A++ IVLLKN+ NTLPL S  +K +AV+GP+A+  + ++GNY G P   ++ 
Sbjct: 378 HKAHSLKMAQQSIVLLKNEGNTLPL-SKNIKKIAVLGPNADNRIVVLGNYNGQPSEIITA 436

Query: 461 IAGF 464
           + G 
Sbjct: 437 LQGI 440



 Score =  133 bits (335), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 99/324 (30%), Positives = 147/324 (45%), Gaps = 60/324 (18%)

Query: 465 SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL--------- 515
            G ANV  + G  +   KSN S  A     K ADA + + G+   +E E +         
Sbjct: 592 EGKANVHLRAGLLE---KSNLS--AIVNRVKDADAIVYVGGISPQLEGEEMRVDFPGFNG 646

Query: 516 -DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
            DR  + LP  QT+L+  +    K P++ V+M+  G  IA    + NI AI+ A Y G+ 
Sbjct: 647 GDRTSILLPAVQTELLKMLKGTGK-PLVFVVMT--GSAIALPYEDQNIPAIVNAWYGGQS 703

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTL 634
            G AIADV+FG +NP GRLP+T+Y  D       + +P     S     RTY+++ G  L
Sbjct: 704 AGTAIADVLFGDYNPAGRLPVTFYKAD-------SDLP--DFKSYDMNNRTYRYFKGDAL 754

Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
           YPFG+GLSYT F+Y                            SK + PG     ++    
Sbjct: 755 YPFGHGLSYTSFQY----------------------------SKLKTPG----KIKSGAS 782

Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNAC 754
           F+      N G  DG +VV +Y   P       I+ + GF R+ ++AG +K + F  +  
Sbjct: 783 FKVSATLTNTGKKDGDEVVQLYLAYPEVAGKAPIRALKGFNRIRLKAGESKTVSFTLSP- 841

Query: 755 KSLNIVDYAANTLLPAGEHTIFVG 778
           +   +V+       P G+  I +G
Sbjct: 842 EQCQLVNEEGALYQPKGKMEISLG 865


>gi|393782958|ref|ZP_10371138.1| hypothetical protein HMPREF1071_02006 [Bacteroides salyersiae
           CL02T12C01]
 gi|392671316|gb|EIY64790.1| hypothetical protein HMPREF1071_02006 [Bacteroides salyersiae
           CL02T12C01]
          Length = 759

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 225/811 (27%), Positives = 356/811 (43%), Gaps = 160/811 (19%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG------------------------- 87
           L+ DS  P   RV+DL+ RMTL EK  QL +   G                         
Sbjct: 29  LYKDSLAPIESRVEDLLRRMTLHEKTLQLQNKPVGRIDEIESIFQGQSYGCTHEMGKTAE 88

Query: 88  ---------------VPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTT 132
                            RLG+P       A+ G+  +        +   +T FP  I   
Sbjct: 89  ECAGIYNELQKYMLTKTRLGIPILT----AVEGIQGI--------LQNNSTLFPHSIAQG 136

Query: 133 ASFNESLWKKIGQAVSTEARAMYNLGRAGL-TYWSPNINVARDPRWGRITETPGEDPFVV 191
           ++FN  L +++  A   EA AM      G+    SP  ++AR+ RWGR+ ET GEDPF++
Sbjct: 137 STFNPELIERMTDAAGKEAAAM------GIHQVLSPVFDIARELRWGRVEETYGEDPFLI 190

Query: 192 GRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQ 251
               + +V+G Q    H+           ++   KH+ A+      G++         E+
Sbjct: 191 SEMGIGFVKGYQK---HQ-----------ITCTPKHFVAHGTPA-GGLNCAFVSG--GER 233

Query: 252 DMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCD 311
           +    +L PF   +KE +   +M  Y+  +GIP  A P  +   +R E    GY+ +D  
Sbjct: 234 EFRSIYLYPFARVIKETNPLCIMSCYSAYDGIPVSASPYYMTDVLRDELGFKGYVYSDWG 293

Query: 312 SIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKY 371
           S+  ++  H +   ++E+A   +L AG+DLD    Y       V++GK+ E  IDK+++ 
Sbjct: 294 SVDRVMTFH-YAVPTREEAAKVSLIAGVDLDVDSDYETLE-QQVKEGKIDEAYIDKAVRR 351

Query: 372 LYTVLMRLGFFD----GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNS 427
           +  V   LG FD    G P+ V   K+ + SD++I LA E A E  +LL+N  N LPL+ 
Sbjct: 352 VLYVKFALGLFDRPYYGDPKLV---KKVVRSDKHIALAKEVADESTILLENKNNILPLDL 408

Query: 428 AKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYANVTYKT-------GCDDVA 480
           +K K++AVVGP++N TV   G+Y+         +  + G   V  K        GC+   
Sbjct: 409 SKYKSIAVVGPNSNQTV--FGDYSWTTPDTKEGVTLYQGLQQVLGKKKTILQADGCNWWN 466

Query: 481 CKSNNSIFAASEAAKTADATIILAGLD---------LSVEAESLDREDLWLPGYQTQLIN 531
              +  I  A +A + +D  I+  G            S   E  D   L LPG Q++L+ 
Sbjct: 467 RADSKDIEQAVKAVEQSDLAIVAVGTRSTFLGRGPRYSTAGEGFDLSSLELPGNQSELLK 526

Query: 532 QVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGG 591
            V    K P+I+V++S   + +++A+ N +   + W  Y GE+ GR++AD++ G  NP G
Sbjct: 527 AVKATGK-PMIVVLISGKPLVMSWAKENADAVLVQW--YAGEQQGRSLADILVGNVNPSG 583

Query: 592 RLPIT-------------WYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           R+ ++             +Y  D VQ         RP  +   P   Y F +   L+ FG
Sbjct: 584 RVNVSFPRSTGNTPCFYNYYPTDRVQRFD------RP-GTYEEPAGHYIFEHPYALWEFG 636

Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
           YGLSYT F Y+  +   +I                SD       G +V            
Sbjct: 637 YGLSYTNFNYSGCTLNDSIY---------------SDQ------GTIVA----------T 665

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
           V+ +N G  DG +VV +Y +      +T IKQ+  F++VF++AG  K++         L 
Sbjct: 666 VEVENTGKRDGKEVVQLYVRDKISSVSTPIKQLKAFKKVFIKAGEKKKVTLEV-PMSELA 724

Query: 759 IVDYAANTLLPAGEHTIFVGNGGVSFPIHLN 789
           + D     ++  GE  I +G+   S  IH N
Sbjct: 725 LYDVRMKPVVEPGEFEIQIGSS--SDRIHFN 753


>gi|260593561|ref|ZP_05859019.1| xylosidase/arabinosidase [Prevotella veroralis F0319]
 gi|260534549|gb|EEX17166.1| xylosidase/arabinosidase [Prevotella veroralis F0319]
          Length = 771

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 206/682 (30%), Positives = 324/682 (47%), Gaps = 94/682 (13%)

Query: 121 GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWS--PNINVARDPRWG 178
           G T +PT I   +SF+  +  KI +  + E RAM         +W+  PN+ VARD RWG
Sbjct: 144 GNTVYPTNIGLASSFDVDMAYKIARQTAEEMRAMN-------MHWNFNPNVEVARDARWG 196

Query: 179 RITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHY--AAYDVDNW 236
           R  ET GEDP++V    V   +G Q     +N  D       V  C KH+   +Y ++  
Sbjct: 197 RCGETFGEDPYLVTLMGVATNKGYQ--RNLDNVQD-------VLGCVKHFVGGSYSINGT 247

Query: 237 KGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
            G         V+E+ + E F  PF+  +++G   +VM S+N +NG+P   +  L+   +
Sbjct: 248 NGAP-----CEVSERTLREVFFPPFKAAIQQGGDWNVMMSHNDLNGVPCHTNSWLMTDVL 302

Query: 297 RGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGNAV 355
           R EW   G+IV+D   I+  VD H+  A++KE A  Q++ AG+D+   G  +       V
Sbjct: 303 RKEWGFRGFIVSDWMDIEHCVDQHRTAANNKE-AFYQSIMAGMDMHMHGPEWQTAVVELV 361

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
           ++G++ E+ ID+S++ + TV  RLG F+          + I   E+   A EA+R  IVL
Sbjct: 362 KEGRIPESRIDESVRRILTVKFRLGLFEHPYSDAKTRDRVITDPEHKRTALEASRNSIVL 421

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC--RYMSPIAGFSGYANVTYK 473
           LKN+ + LPL++ K K V V G +AN    M G+++ +    +  + + G    +  T  
Sbjct: 422 LKNENDLLPLDAQKYKKVLVTGINANDQNIM-GDWSELQPEDQVWTVLRGLKSVSPTTDF 480

Query: 474 TGCD---DVACKSNNSIFAASEAAKTADATIILAG-------LDLSVEAESLDREDLWLP 523
              D   D    S   + AA  AAK  D  I+  G        +     E  DR++L L 
Sbjct: 481 KFVDQGWDPRNMSQAQVNAAVAAAKDCDLNIVCCGEYMMRFRWNERTSGEDTDRDNLDLV 540

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
           G Q QLI ++ E  K P I+VI+S   + + +A    ++ AI+ A  PG+ GG+AIA+++
Sbjct: 541 GLQNQLIQRLNETGK-PTIVVIISGRPLSLRYAA--EHVPAIINAWEPGQFGGQAIAEII 597

Query: 584 FGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFY-------NGPTLYP 636
           +GK NP  +L +T         +P ++  +    S  Y  +   F+       N P LYP
Sbjct: 598 YGKVNPSAKLAMT---------IPRSAGQI----STWYNHKRSAFFHPAVCTDNKP-LYP 643

Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           FGYGLSYT F+Y+ L  +K I  N  K Q   +                           
Sbjct: 644 FGYGLSYTSFRYSNLKLSKQIIPNDGKTQIIAS--------------------------- 676

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
             V  +N G  DG ++  +Y        +  +K++  F RV ++AG  + ++F     K 
Sbjct: 677 --VTIENTGQRDGVEICQLYINDLVSSVSRPVKELKDFLRVELKAGEKRTVEFTITPDK- 733

Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
           L   D   N ++ AGE  + +G
Sbjct: 734 LAFYDLNMNPIVEAGEFEVMIG 755


>gi|255532174|ref|YP_003092546.1| glycoside hydrolase family protein [Pedobacter heparinus DSM 2366]
 gi|255345158|gb|ACU04484.1| glycoside hydrolase family 3 domain protein [Pedobacter heparinus
           DSM 2366]
          Length = 799

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 224/797 (28%), Positives = 354/797 (44%), Gaps = 143/797 (17%)

Query: 35  FVCDPGRFSKLGLQMSSF----LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPR 90
           F  DP  + K  + ++      ++ D   P + R+ +L+S+MTL+EK  Q+    +G  R
Sbjct: 25  FKADPPIYRKGWIDLNKNGKKDIYEDPLQPLNARIDNLLSQMTLEEKTCQMATL-YGWKR 83

Query: 91  L---GLPQYEW----WS-------EALHGVSNVG-------------------------- 110
           +    LP  EW    W        E L+G    G                          
Sbjct: 84  VLKDSLPTKEWKTAIWKDGIANIDEHLNGFLTWGVTSTSELVTDIKKHVWAMNETQRFFI 143

Query: 111 -------PGTHFDDVIPG-----ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG 158
                  P    ++ I G     AT FPT +    ++N +L +K+G+    EARA   LG
Sbjct: 144 EQTRLGIPVDFTNEGIRGVEAYEATGFPTQLNMGMTWNRNLIRKMGRITGQEARA---LG 200

Query: 159 RAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
              +  ++P ++VARD RWGR+ E  GEDP++V R  V    G+Q     EN        
Sbjct: 201 YTNV--YAPILDVARDQRWGRLEEVYGEDPYLVARLGVEMTLGMQ-----ENN------- 246

Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
            +++S  KH+A Y  +          D +V+ +++E+  L PF+  ++E     VM SYN
Sbjct: 247 -QIASTAKHFAVYSANKGAREGLARTDPQVSPREVEDIMLYPFKKVIQEAGIMGVMSSYN 305

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
             NGIP       L Q +R ++   GY+V+D D+++ + + H   A+ KE AV Q   AG
Sbjct: 306 DYNGIPITGSEYWLTQRLRKDFGFGGYVVSDSDALEYLYNKHHVAANLKE-AVFQAFMAG 364

Query: 339 LDLDCG----QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV---SL 391
           L++            +    V +G++    I+  +K +  V  +LG FD    YV   + 
Sbjct: 365 LNVRTTFRPPDSIIIYARQLVNEGRIPIETINSRVKDVLRVKFKLGLFDQP--YVKDAAA 422

Query: 392 GKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA 451
            ++ + S  +  +A +A++E IVLLKN+   LPL S  +K +AV+GP+A        +Y 
Sbjct: 423 SEKLVNSIAHQAVALQASKESIVLLKNNNQILPL-SRSLKKIAVIGPNAADNDYAHTHYG 481

Query: 452 GIPCRYMSPIAGFS---GYANVTYKTGCDDVACK-SNNSIFA-------------ASEAA 494
            +  +  + + G     G   V Y  GC+ V      + IF              A   A
Sbjct: 482 PLQSKSTNILEGIRNKIGADKVWYAKGCELVDKNWPESEIFPEDPDATAIALIEDAVNTA 541

Query: 495 KTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIA 554
             AD  I++ G +     E+  R  L LPG+Q  LI  + +  K PV+ V++    + I 
Sbjct: 542 MKADVAIVVLGGNTKTAGENKSRTTLELPGFQLNLIKAIQKTGK-PVVAVMIGTQPMGIN 600

Query: 555 FAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLR 614
           +   +  I  I++AGYPG +GG A+ADV+FG +NPGG+L +T+     V  LPL + P +
Sbjct: 601 W--IDKYIDGIVYAGYPGVKGGIAVADVLFGDYNPGGKLTLTFPKS--VGQLPL-NFPSK 655

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
           P ++    G   K      LYPFG+GLSYT F Y+ L  +   Q                
Sbjct: 656 P-NAQTDEGELAKIKG--LLYPFGFGLSYTTFAYSNLKISPIEQ---------------- 696

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                            D      VD  N    +G ++V +Y +       TY K + GF
Sbjct: 697 ---------------EKDGNISISVDITNTAKLEGDEIVQLYIRDVLSTVTTYEKILRGF 741

Query: 735 QRVFVRAGRNKRIKFVF 751
           +R+ ++    K +KF  
Sbjct: 742 ERISLKPNETKTLKFTL 758


>gi|393779898|ref|ZP_10368130.1| glycosyl hydrolase family 3, N-terminal domain protein
           [Capnocytophaga sp. oral taxon 412 str. F0487]
 gi|392609318|gb|EIW92128.1| glycosyl hydrolase family 3, N-terminal domain protein
           [Capnocytophaga sp. oral taxon 412 str. F0487]
          Length = 770

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 209/733 (28%), Positives = 354/733 (48%), Gaps = 99/733 (13%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLG---LPQYEWWSE-----------ALHGVSNV 109
           RV  ++  MTL+EK+ Q+  F+      G     +Y+ + E           ++ G+ N+
Sbjct: 46  RVDSVLRLMTLEEKIGQMTQFSADWSVTGPVMADKYQPYLEKGLVGSIFNATSVAGIRNL 105

Query: 110 G-----------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
                       P     DVI G  T FP  +  + S++ +L +K  +  + EA A    
Sbjct: 106 QKIAVEQTRLGIPILFGQDVIHGYKTIFPIPLAESCSWDLTLMRKTAELAAREASA---- 161

Query: 158 GRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
              G+ + ++P +++ RD RWGR  E  GEDP++    A   V+G Q   G +N   L+S
Sbjct: 162 --DGINWTFAPMVDITRDARWGRAMEGAGEDPYLGSLIAEARVKGFQ---GGDNWQMLSS 216

Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
            P  + +C KH+A Y      G D  +  A ++   +   +L P+E  +      S+M S
Sbjct: 217 -PHTLLACGKHFAGYGAAE-SGKD--YNTAELSMHTLRNVYLPPYEATLN-ARVGSIMAS 271

Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
            N +NG+P+ AD  LL + +R EW  +G +V+D   I  +V  H    D K+ A   +  
Sbjct: 272 LNEINGVPATADKWLLTEVLRKEWGFNGLLVSDYTGINELV-RHGVAKDDKQ-AANLSAN 329

Query: 337 AGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV--SLGK 393
           AG+++D  G  +  +    V++GKV E  IDK+++++  +   LG FD   +Y+  +  K
Sbjct: 330 AGIEMDMNGATFIKYLSALVKEGKVTEAQIDKAVRHILEIKFLLGLFDDPYRYLDETRAK 389

Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA-- 451
           ++  +++ +++A +A    +VLLKN+   LP+     KT+AV+GP  N T  + G++   
Sbjct: 390 ENTFTEKYLKVARQAVASSVVLLKNEAEVLPIKKDSGKTIAVIGPMMNNTSDINGSWTCL 449

Query: 452 GIPCRYMSPIAGFSGYANVT-----YKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
           G   + +S + G +     T     Y  GC      S   +  A   A+ AD  ++  G 
Sbjct: 450 GDGKQSVSLLTGLTEKYKATNVKLLYAEGCG-FTTISTEQLKEAVAMARKADRVLVAVGE 508

Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
             S   ES  R D+ LP  Q QL+  +  + K P+ ++  S   +D+++   N N++AIL
Sbjct: 509 QSSWSGESAVRTDIRLPQAQRQLLEALKTINK-PIAIITFSGRPLDLSWE--NENVQAIL 565

Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPL----RPV 616
            A +PG +GG  IADV+ G  NP G L +++     V  +P+      T  P+      V
Sbjct: 566 QAWFPGTQGGYGIADVIAGDVNPSGHLTMSFPRS--VGQIPIYYNYKSTGRPVHTNNEEV 623

Query: 617 DSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDA 676
           D   +    Y   +   LYPFGYGLSYT F  +         V+LNK    ++L   +D+
Sbjct: 624 DHRPHYNAGYLDSSITPLYPFGYGLSYTTFAIS--------NVHLNK----KSLKRYNDS 671

Query: 677 SKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQR 736
                  ++VN              QN G+T+G  VV +Y++      +  +K++ GFQ+
Sbjct: 672 -------IIVN-----------ASVQNTGTTEGEIVVQLYTRQLVASVSRPVKELKGFQK 713

Query: 737 VFVRAGRNKRIKF 749
           + ++AG +K+++F
Sbjct: 714 ISLKAGESKQVRF 726


>gi|224536538|ref|ZP_03677077.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521794|gb|EEF90899.1| hypothetical protein BACCELL_01413 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 863

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 167/460 (36%), Positives = 235/460 (51%), Gaps = 43/460 (9%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT 113
           + D+SL    R + LV  +TL+EK   + D +  V RLG+  Y WW+EALHGV+  G   
Sbjct: 23  YKDASLSPERRAELLVKELTLEEKAHLMMDGSRSVERLGIKPYNWWNEALHGVARAGL-- 80

Query: 114 HFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA--------GLTYW 165
                   AT FP  I   ASFN  +  ++  AVS EARA      +        GLT W
Sbjct: 81  --------ATVFPQPIGMAASFNPEMVYEVFNAVSDEARAKNTYYASQDSRERYQGLTMW 132

Query: 166 SPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCC 225
           +P +N+ RDPRWGR  ET GEDP++  R  V  V+GLQ           + +  K+ +C 
Sbjct: 133 TPTVNIYRDPRWGRGIETYGEDPYLTSRMGVMVVKGLQG--------PADGKYDKLHACA 184

Query: 226 KHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIP 284
           KH+A +    W   +R+ F+A  +  +D+ ET+L PFE  VKEG    VMC+YNR  G P
Sbjct: 185 KHFAVHSGPEW---NRHSFNAENIKPRDLYETYLPPFEALVKEGKVEEVMCAYNRFEGDP 241

Query: 285 SCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDN--HKFLADSKEDAVAQTLKAGLDLD 342
            C   +LL Q +RGEW   G +V+DC +I    ++  H    D+ E A A  + +G DL+
Sbjct: 242 CCGSDRLLMQILRGEWGFDGIVVSDCGAIADFYNDRGHHTHPDA-ESASAAAVISGTDLE 300

Query: 343 CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK---QDICSD 399
           CG  Y      +V++G + E  +D S+K L      LG  D  P+ VS  K     + S 
Sbjct: 301 CGSSYKALI-ESVKKGLISEETVDTSVKRLMKARFALGEMD-EPEKVSWTKIPFSVVASA 358

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
            +  LA   ARE + LL N  N LPL    + TVAV+GP+AN +V   GNY G+P   ++
Sbjct: 359 AHDSLALNMARESMTLLMNKDNFLPLKRGGL-TVAVMGPNANDSVMQWGNYNGMPAHTVT 417

Query: 460 PIAGFSGYA----NVTYKTGCDDVACKSNNSIFAASEAAK 495
            + G          + Y+ GC  V      S F+  ++ K
Sbjct: 418 ILDGVRNLLGTDDKLIYEQGCPWVERTLIQSAFSQCKSDK 457



 Score =  124 bits (311), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 95/312 (30%), Positives = 144/312 (46%), Gaps = 56/312 (17%)

Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
           D+  K +  I  + E  K AD  I  +G+  S+E E +          DR D+ LP  Q 
Sbjct: 581 DLGFKKDVDIRKSVERVKDADIVIFASGISPSLEGEEMGVNLPGFKKGDRTDIELPAVQR 640

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
           +LI+ +    K    +++++  G  I         +AIL A YPG++GG+A+A+V+FG +
Sbjct: 641 ELIDALHRAGKK---IILVNCSGSPIGLEPETQKCEAILQAWYPGQQGGKAVAEVLFGDY 697

Query: 588 NPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFK 647
           NP G+LP+T+Y    V  LP         +     GRTY++     L+PFGYGLSYT F 
Sbjct: 698 NPAGKLPVTFYRN--VSQLP-------DFEDYNMTGRTYRYMQDVPLFPFGYGLSYTTFG 748

Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
           Y      KT+   L+K                       N+L      +  V   N G  
Sbjct: 749 YG-----KTV---LDK-----------------------NELTAGQSLKLTVPVTNTGKR 777

Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
           +G +VV VY +   + A   IK +  F+RV + AG+   ++F     K L   D  +NT+
Sbjct: 778 NGEEVVQVYLRKQGD-AEGPIKTLRAFKRVSIPAGKTVNVEFDLKD-KELEWWDDQSNTV 835

Query: 768 -LPAGEHTIFVG 778
            +  G + I VG
Sbjct: 836 RVCPGNYDIMVG 847


>gi|219118959|ref|XP_002180246.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217408503|gb|EEC48437.1| beta-xylosidase [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 682

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 199/611 (32%), Positives = 298/611 (48%), Gaps = 62/611 (10%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLG---------DFAHGVPRLGLPQYEWWSEALH 104
           +CD SL    R++DL+S +TLDEKV  +G              V R+GLP Y W  E   
Sbjct: 72  YCDMSLSIDERLEDLLSHLTLDEKVDMIGADPTQDVCMTHTMNVSRIGLPDYYWLVE--- 128

Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL------- 157
             +N   G+        AT F   +   ASFN S W   G    TE RA+ N+       
Sbjct: 129 --TNTAVGSACIAENKCATEFSGPLSIAASFNRSSWFLKGSVFGTEQRALMNVHGERFHT 186

Query: 158 --GRA-GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDL 214
             GR  GLT + PNIN  RDPR+GR +E PGEDPF+ G+YA + V+G+Q+        D 
Sbjct: 187 HSGRHIGLTAFGPNINQQRDPRFGRSSELPGEDPFLSGQYAAHMVQGMQE-------RDA 239

Query: 215 NSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVM 274
           N  P KV +  KH+ AY  +  +G D Y+    ++  D+ +T+L  +EM + +G A+ VM
Sbjct: 240 NGYP-KVLAYLKHFTAYSREEGRGNDDYN----ISMYDLFDTYLPQYEMGMVQGGATGVM 294

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLH-GYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           CSYN VNGIP+CA+  LLN+ +R  W+    ++  DC ++  +       A  +  A A 
Sbjct: 295 CSYNAVNGIPACANDYLLNKILRQRWNRSDAHVTTDCGAVNNL-RGKPIQAADEAQAAAM 353

Query: 334 TLKAGLDLDCGQ--YYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS--PQYV 389
            L  G D++ G   +  N T  A+  G   E  ++++++  Y      G FD     ++ 
Sbjct: 354 ALMNGADIEMGSTLFVHNLT-TAITLGYATEEAVNQAIRRSYRPHFIAGRFDDPTLSEWF 412

Query: 390 SLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGN 449
           SLG  DI S ++ E+  EAA +G+VLLK++ + LP+  A    +AV+GP       ++ +
Sbjct: 413 SLGLDDIQSKKHQEIQLEAALQGLVLLKHEDSILPI--AAGTKLAVLGPLGMTRSGLMSD 470

Query: 450 Y--------AGIPC-RYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
           Y         G  C   ++   GF      T      DV  ++ + +    + A   D  
Sbjct: 471 YESDQSCFGGGHDCIPTLAESIGFINGKEFTVAAAGVDVDSRNTSDVERILQLAADRDLI 530

Query: 501 IILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNT 560
           ++  G   + E E  DR+D  LPG Q  L   V  + K PV+LV+++ G   IA      
Sbjct: 531 VLCLGNTKTQEQEGFDRKDTALPGQQYALFEAVLTLRK-PVVLVLVNGG--QIALDGMTG 587

Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLG 620
              AI+ A  P   GG A+A  +FG+ N  G+LP T Y    +Q     S  ++      
Sbjct: 588 YPSAIIEAFNPNGIGGTALAASLFGQENRWGKLPYTIYPYSVMQ-----SFDMKDHSMSA 642

Query: 621 YPGRTYKFYNG 631
            PGRTY+++ G
Sbjct: 643 PPGRTYRYFTG 653


>gi|256819849|ref|YP_003141128.1| glycoside hydrolase [Capnocytophaga ochracea DSM 7271]
 gi|256581432|gb|ACU92567.1| glycoside hydrolase family 3 domain protein [Capnocytophaga
           ochracea DSM 7271]
          Length = 804

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 195/655 (29%), Positives = 327/655 (49%), Gaps = 74/655 (11%)

Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARD 174
           DVI G  T FP  +  + S++ +L +K  +  + EA A       G+ + ++P +++ RD
Sbjct: 158 DVIHGYKTIFPIPLAESCSWDLALMRKTAELAAREASA------DGINWTFAPMVDITRD 211

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
            RWGR  E  GEDP++    A   V+G Q   G +N   L+S P  + +C KH+A Y   
Sbjct: 212 ARWGRAMEGAGEDPYLGSLIAEARVKGFQ---GGDNWQTLSS-PHTLLACGKHFAGYGAA 267

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
              G D  +  A ++   +   +L P+E  +K G   S+M S N +NG+P+ AD  LL +
Sbjct: 268 E-SGKD--YNTAELSMHTLRNVYLPPYEATLKAG-VGSIMASLNEINGVPATADKWLLTE 323

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGN 353
            +R EW  +G +V+D   I  +V  H    D K+ A   +  AG+++D  G  +  +   
Sbjct: 324 VLRKEWGFNGLLVSDYTGINELV-RHGVAKDDKQVA-NLSANAGIEMDMNGATFIKYLSA 381

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV--SLGKQDICSDENIELAAEAARE 411
            V++GKV E  IDK+++++  +   LG FD   +Y+  +  K++  ++E +++A +A   
Sbjct: 382 LVKEGKVTENQIDKAVRHILEMKFLLGLFDDPYRYLDETRAKENTFTEEYLKVARQAVAS 441

Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYMSPIAGFSGY-- 467
            +VLLKN+   LP+     KT+AV+GP  N T  + G++   G   + +S + G +    
Sbjct: 442 SVVLLKNEAEALPIKKNSDKTIAVIGPMMNNTSDINGSWTCLGDGKQSVSLLTGLTEKYK 501

Query: 468 ---ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
                + Y  GC      S   +  A   A+ AD  ++  G   S   ES  R D+ LP 
Sbjct: 502 GTNVKLLYAEGCGFTTI-STEQLKEAVAIARKADRVLVAVGEQSSWAGESAVRTDIRLPQ 560

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q QL+  +  + K P+ ++  S   +D+++   N N++AIL A +PG +GG  IADV+ 
Sbjct: 561 AQRQLLEALKAINK-PIAIITFSGRPLDLSWE--NENVQAILQAWFPGTQGGNGIADVIA 617

Query: 585 GKFNPGGRLPITWYNGDYVQMLPL------TSMPL----RPVDSLGYPGRTYKFYNGPTL 634
           G  NP G L +++     V  +P+      T  P+      VD   +    Y   +   L
Sbjct: 618 GDVNPSGHLTMSFPRS--VGQIPIYYNYKSTGRPVHTNNEEVDHRPHYNAGYLDSSITPL 675

Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
           YPFGYGLSYT F  +         V+LNK    +++   +D+       ++VN       
Sbjct: 676 YPFGYGLSYTTFAIS--------NVHLNK----KSIKRYNDS-------IIVN------- 709

Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
                  QN G T+G  VV +Y++      +  +K++ GFQ++ ++AG +K+++F
Sbjct: 710 ----ASVQNTGRTEGEIVVQLYTRQLVASVSRPVKELKGFQKIPLKAGESKQVRF 760


>gi|153809437|ref|ZP_01962105.1| hypothetical protein BACCAC_03751 [Bacteroides caccae ATCC 43185]
 gi|423292726|ref|ZP_17271288.1| hypothetical protein HMPREF1069_06331 [Bacteroides ovatus
           CL02T12C04]
 gi|149127897|gb|EDM19119.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           caccae ATCC 43185]
 gi|392661162|gb|EIY54749.1| hypothetical protein HMPREF1069_06331 [Bacteroides ovatus
           CL02T12C04]
          Length = 859

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 219/798 (27%), Positives = 344/798 (43%), Gaps = 142/798 (17%)

Query: 51  SFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD--------------------------- 83
           SF + +  LP  +RV DL+ RMTL+EK+ Q+                             
Sbjct: 24  SFSYKNPLLPTELRVNDLLGRMTLEEKIAQIRHLHSWDVFDGQILNQEKLDKMCGGIGYG 83

Query: 84  FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGA 122
           F  G P                     RLG+P +   +E+LHGV           V  G 
Sbjct: 84  FFEGFPLTAASCRKTFREIQTYMVEKTRLGIPGFPV-AESLHGV-----------VHEGT 131

Query: 123 TSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITE 182
           T +P  I   ++FN  L  +  + ++ E   M           +P I+V RD RWGR+ E
Sbjct: 132 TIYPQNIAMGSTFNPELAYEKTKHIAGELNTM-----GVKQVLAPCIDVVRDLRWGRVEE 186

Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY 242
           + GEDPF+  + AV  V+G  +   H            +S   KHY  +  +   G++  
Sbjct: 187 SFGEDPFLCSKMAVAEVKGYME---H-----------GISPMLKHYGPHG-NPLGGLNLA 231

Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
             +  V  +D+ + +L+PFE  + E +  +VM SYN  N IP+ A   +L   +R  +  
Sbjct: 232 SVECGV--RDLFDIYLKPFEAVLAETEIMAVMSSYNSWNRIPNSASRFMLTDILRNRFGF 289

Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKE 362
            GY+ +D   + ++   HK   D  E A  Q L AG+D++          + ++ G+   
Sbjct: 290 RGYVYSDWGVVSMLKTFHKTAVDDFE-AARQVLTAGMDVEASSSCYAVLADKIRNGEFDI 348

Query: 363 TDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNT 422
           + ID++++ +      LG F+   Q  ++ +  + S E+++L+   A E  VLLKND   
Sbjct: 349 SYIDQAVRRVLRAKFELGLFEDPYQEQAVYRLPLRSKESVKLSRRIADESTVLLKNDGQL 408

Query: 423 LPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRY--MSPIAGFSGY----ANVTYKTGC 476
           LPLN   +K+VAV+GP  NA     G+Y     +   ++P+ G          + Y  GC
Sbjct: 409 LPLNVRNLKSVAVIGP--NADNVQFGDYTWSKKKEDGVTPLQGIKNLLGDRVKINYAKGC 466

Query: 477 DDVACKSNNSIFAASEAAKTADATIILAG----------LDLSVEAESLDREDLWLPGYQ 526
             +A    + I  A +AA+ +D  +I  G           + S   E +D  D+ L G Q
Sbjct: 467 -SLASLDTSGIAEAVDAARHSDVALIFVGSSSTAFVRHTQEPSTSGEGIDLSDISLTGAQ 525

Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
            QLI +V  V K PV++++++  G   A      NI AIL   Y GE+ G +IAD++FG 
Sbjct: 526 EQLIREVFAVGK-PVVVILVA--GKPFAIPWVKENIPAILAQWYAGEQEGNSIADILFGN 582

Query: 587 FNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
            NP G+L  ++         Y   LP      +   +   PGR Y F N   L+ FGYGL
Sbjct: 583 VNPSGKLTFSFPQSTGHLPVYYNYLPTDKGYYKEPGTYEKPGRDYVFSNSSPLWAFGYGL 642

Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
           SYTQF+Y                     L   +D    +      ND  C       V  
Sbjct: 643 SYTQFEY---------------------LKAVTDKELYQA-----NDTVC-----VTVQL 671

Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
           +N G   G +V+ VY +       T +KQ+ GF++V +  G+ +    +        + D
Sbjct: 672 KNTGKRTGKEVIQVYMRDVVSSVMTPVKQLKGFRKVDLLPGQTRETTIMI-PVHEFYLTD 730

Query: 762 YAANTLLPAGEHTIFVGN 779
              N  L +G+  + VG 
Sbjct: 731 DLGNRYLESGKFELQVGT 748


>gi|365121645|ref|ZP_09338561.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363645135|gb|EHL84409.1| hypothetical protein HMPREF1033_01907 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 868

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 166/456 (36%), Positives = 238/456 (52%), Gaps = 47/456 (10%)

Query: 47  LQMSSFLFCDSSLPYS-------IRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWW 99
           L  S+F F   + PY         R  DL++RMTL EK  Q+ +   G+ RLG+  Y+WW
Sbjct: 12  LFFSAFSFRAENPPYKNPELSPDERALDLLNRMTLKEKFAQMHNNTGGIERLGVRPYDWW 71

Query: 100 SEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA------ 153
           +EALHG++  G           AT FP  I   A+F+++   ++   VS E RA      
Sbjct: 72  NEALHGIARAGK----------ATVFPQAIGLAATFDDTAVYEMFDMVSDEGRAKYHDFQ 121

Query: 154 ---MYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHEN 210
              MYN G  GLT+W+PNIN+ RDPRWGR  ET GEDPF+  +  +  V+GLQ       
Sbjct: 122 RKGMYN-GYKGLTFWTPNINIFRDPRWGRGMETYGEDPFLTTKMGLAVVKGLQ------- 173

Query: 211 ATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGD 269
             D   +  K  +C KHYA +    W   +R+ ++A  ++ +D+ ET+L  F+  V EG 
Sbjct: 174 -GDGTQKYDKAHACAKHYAVHSGPEW---NRHSYNAENISIRDLRETYLPAFKALVTEGK 229

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI-QVMVDNHKFLADSKE 328
              VMC+YNR  G P C++  LL   ++ EW     IV+DC +I             S  
Sbjct: 230 VKEVMCAYNRFEGEPCCSNKTLLINILKDEWGFDDVIVSDCGAIADFYTKGRHETHASAA 289

Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-- 386
           DA A  + +G DL+CG  Y      A+++G + ET I++S+  L      LG FD     
Sbjct: 290 DASADAVISGTDLECGGSYWALD-EALEKGLITETKINESVFRLLRARFELGMFDDDSLV 348

Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
            + S+    +C D++   A E AR+ +VLL N  NTLPL S  +K VAV+GP+AN +V +
Sbjct: 349 SWSSIPYSVVCCDKHKAKALEMARKSMVLLSNKNNTLPL-SKSIKKVAVMGPNANDSVML 407

Query: 447 IGNYAGIPCRYMSPIAGFSGY---ANVTYKTGCDDV 479
             NY G P R ++ + G        +V Y+ GCD V
Sbjct: 408 WANYNGTPDRSVTILEGIKAKLPEGSVIYEKGCDYV 443



 Score =  124 bits (312), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 93/282 (32%), Positives = 137/282 (48%), Gaps = 53/282 (18%)

Query: 478 DVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQT 527
           DV         A +E  K ADA I + G+  S+E E +          DR ++ LP  Q 
Sbjct: 583 DVGLSRQIDYKAVAEKVKDADAIIFVGGISSSLEGEEMGVKYPGFRNGDRTNIDLPQVQK 642

Query: 528 QLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKF 587
            ++  + E  K PVI V+ S  G  +A +  + N+ AIL A YPG+EGG A+ADV+FG +
Sbjct: 643 NMMKALKETGK-PVIFVLCS--GSTMALSWEDKNMDAILQAWYPGQEGGTAVADVLFGDY 699

Query: 588 NPGGRLPITWY-NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQF 646
           NP GRLP+T+Y + D +      +M      S G  GRTY+++ G  LYPFG+GLSYT F
Sbjct: 700 NPAGRLPLTFYASSDDLPDFENYNM------SEG-QGRTYRYFKGKPLYPFGHGLSYTGF 752

Query: 647 KYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGS 706
            Y+        +  LNK                         +  +D     ++ +N G 
Sbjct: 753 SYS--------KAKLNK-----------------------KSMSVNDSVFLSLNLKNTGL 781

Query: 707 TDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
            DG +VV VY +   +      K + G++RV V+AG+   +K
Sbjct: 782 RDGDEVVQVYIRNLQDPEGPS-KSLRGYKRVSVKAGQTVPVK 822


>gi|397691065|ref|YP_006528319.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
 gi|395812557|gb|AFN75306.1| glycoside hydrolase family 3 protein [Melioribacter roseus P3M]
          Length = 769

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 211/725 (29%), Positives = 339/725 (46%), Gaps = 115/725 (15%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P   +  E LHG++              ATS+P  I   A+FN  L +KI  A++ 
Sbjct: 110 RLGIPVI-FHEECLHGLA-----------AKDATSYPVPIGLAATFNPELIEKIFSAIAE 157

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           +AR+     R      +P ++V RDPRWGR+ ET GED ++V +  +  V+GLQ  +G  
Sbjct: 158 DARS-----RGAHQALTPVVDVVRDPRWGRVEETFGEDTYLVSQMGIASVKGLQG-DGSL 211

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGD 269
           N  +      KV +  KH+AA+      G +     A  +E+ + +TFL PF+  + +  
Sbjct: 212 NNNN------KVIATLKHFAAHGQPE-SGTN--CAPANFSERFLRDTFLMPFKEAIDKAG 262

Query: 270 ASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKF----LAD 325
             SVM SYN ++GIPS A+  LL + +R EW+  G++V+D  +I  +    +     +A 
Sbjct: 263 VISVMASYNEIDGIPSHANKWLLRKVLRDEWNFKGFVVSDYYAITELFHKEETVSHGVAA 322

Query: 326 SKEDAVAQTLKAGLDL-----DCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLG 380
           +K +A    L+AG+++     DC   Y N T   V+ G   E+DID  +  +      LG
Sbjct: 323 NKVEAAKLALEAGVNIEFPNPDC---YPNLT-EMVKGGLADESDIDALVLPMLKYKFELG 378

Query: 381 FFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHA 440
            FD        G+ +   +++ ELA +AARE I LLKN+ N LPL     K +AV+GP+A
Sbjct: 379 LFDNPYVEAEPGQFENKLEQDRELALQAARETITLLKNEGNLLPLKD--FKKIAVIGPNA 436

Query: 441 NATVAMIGNYAGIPCRYMSPIAGFSGY----ANVTYKTGC----------DDV----ACK 482
           + T  ++G Y G P  Y S   G          V Y  GC          D+V      +
Sbjct: 437 DRT--LLGGYHGTPKYYTSVYQGIKDKVGKNGEVFYSEGCKITVGGSWNDDEVILPDPAE 494

Query: 483 SNNSIFAASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQTQLINQVAEV 536
               I  A   A+ +D  +++ G +     E+       DR  L L G Q +L+ ++ + 
Sbjct: 495 DEKLINEAVAVAQKSDVAVLVLGGNEQTSREAWNKKHLGDRPSLELVGRQNKLVEEILKT 554

Query: 537 AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT 596
            K PV++++ +     I F     N+ AIL   Y G+E GRA+ADV+FG +NP G+LP++
Sbjct: 555 GK-PVVVLLFNGRPNSIGF--IKDNVPAILECWYLGQETGRAVADVLFGDYNPSGKLPVS 611

Query: 597 WYNGDYVQMLPLTSMPLRPVDSLGYPG--RTYKFYNGPTLYPFGYGLSYTQFKYNLLSFT 654
                    +P ++  + P      P   R Y F +   L+ FGYGLSYT+F ++ L  +
Sbjct: 612 ---------IPRSAGHI-PAHYSHKPSARRGYLFDDVSPLFAFGYGLSYTKFSFDNLRLS 661

Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
           K                               + +  D+     ++ +N G+  G +VV 
Sbjct: 662 K-------------------------------DTISADEKVSVSIEVKNEGAIAGEEVVQ 690

Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHT 774
           +Y +         +K++ GF+++ +  G+   + F     + L   +      +  GE  
Sbjct: 691 LYIRDKVSSVTRPVKELKGFRKITLAPGQTSTVVFEL-LPEHLAFTNVDMKFTVEPGEFE 749

Query: 775 IFVGN 779
           I VGN
Sbjct: 750 IMVGN 754


>gi|387789382|ref|YP_006254447.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
           3403]
 gi|379652215|gb|AFD05271.1| beta-glucosidase-like glycosyl hydrolase [Solitalea canadensis DSM
           3403]
          Length = 771

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 219/775 (28%), Positives = 362/775 (46%), Gaps = 121/775 (15%)

Query: 65  VKDLVSRMTLDEKVQQLGDFAHGVPRL----------------------GLPQYEWWSEA 102
           V DL+S+MTL+EK+ QL     G   L                      G+   E   +A
Sbjct: 36  VNDLMSKMTLEEKIGQLNLVTPGGGILTGAVVSQSVEKKIMNGSVGGMFGIIGPEKIRKA 95

Query: 103 LHGVSNVG----PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
                N      P     DVI G  T+FP  +   AS+N  L +K  Q  + EA A    
Sbjct: 96  QELAVNKSRLKIPMIFGSDVIHGHKTTFPIPLGLAASWNIELIEKSAQIAAKEATA---- 151

Query: 158 GRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
              GL + +SP ++VARDPRWGRI E  GEDP++    A   V+G Q    + +AT+L  
Sbjct: 152 --DGLNWVFSPMVDVARDPRWGRIAEGSGEDPYLGSLIAKAMVKGYQGDNTYSSATNL-- 207

Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
                 +C KH+A Y      G D    D  ++ Q M E +L P++  V+ G   SVM S
Sbjct: 208 -----MACVKHFALYGAAE-AGRDYNSVD--MSRQKMYEFYLPPYKAAVEAG-VGSVMSS 258

Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
           +N V G+P+  +  LL   +R +W  +G +V+D  S+  M+++      + ++  A  +K
Sbjct: 259 FNEVEGVPATGNQWLLTDLLRKQWGFNGMVVSDYTSVNEMMEHG---MGNLQEVSALAIK 315

Query: 337 AGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK-- 393
           AGLD+D  G+ Y +    ++Q+GKV ETDI+ + + +     +LG F    ++++  +  
Sbjct: 316 AGLDMDMVGEGYLSTLQKSLQEGKVSETDINLACRRILEAKYKLGLFSDPYKFINEKRAA 375

Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI 453
            +I + +++  + EAA    VLLKN++  LPL   K  T+A++GP A++   M+G +A +
Sbjct: 376 TEILTTQSLSFSREAATRSFVLLKNEKQVLPLK--KTGTIALIGPLADSKRNMLGTWA-V 432

Query: 454 PCRYMSPIAGFSG-------YANVTYKTGCD------------------DVACKSNNSIF 488
              + + ++   G       +A V Y  G +                  D+  +S+  + 
Sbjct: 433 SGNWKTSVSVKEGLMNAVGTHAKVLYAKGANISDDSAFARRVNTFGVEIDIDKRSSKELL 492

Query: 489 -AASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMS 547
             A   A+ +D  I+  G    +  E+  R D+ +P  Q +L+  + +  K PV++V+ +
Sbjct: 493 DEALSIAQQSDVIIVAVGEAADMSGEAASRTDINIPESQKELLKALVQTGK-PVVMVLFN 551

Query: 548 AGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW-YNGDYVQML 606
             G  +  +  N ++ AIL    PG + G AIADV+FG +NP G++ +T+  N   V M 
Sbjct: 552 --GRPLTLSWENEHLNAILDVWAPGHQAGNAIADVLFGDYNPSGKITVTFPKNVGQVPMY 609

Query: 607 PLTSMPLRPVDSLGYPGRTYKFYNGPT---LYPFGYGLSYTQFKYNLLSFTKTIQVNLNK 663
                  RP D       T K+ + P    +YPFGYGLSYT F+Y  ++  +        
Sbjct: 610 YNHKNTGRPYDDRNR--FTSKYLDMPDNAPMYPFGYGLSYTTFQYGDVTIDQ-------- 659

Query: 664 LQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEI 723
                          T  PG         +    KV   N G+ DG + V +Y +     
Sbjct: 660 --------------DTIKPG---------ETITAKVTITNTGNYDGVETVQLYIQDVIAS 696

Query: 724 AATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
            A  +K + GF+++ ++ G +K ++FV +  + L   +     +  AG+  +F+G
Sbjct: 697 VAPPVKTLKGFKQISLKKGESKVVEFVISE-EDLRFYNANLEHVSEAGDFNLFIG 750


>gi|319901526|ref|YP_004161254.1| glycoside hydrolase 3 [Bacteroides helcogenes P 36-108]
 gi|319416557|gb|ADV43668.1| glycoside hydrolase family 3 domain protein [Bacteroides helcogenes
           P 36-108]
          Length = 750

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 211/736 (28%), Positives = 338/736 (45%), Gaps = 112/736 (15%)

Query: 64  RVKDLVSRMTLDEKV---QQLGDFAHGVPRLGLPQYEWWSEALHGV-------------- 106
           ++++L+S MTL+EK+    Q+  + +    +GL +       L+ V              
Sbjct: 34  KIENLLSDMTLEEKLGQMNQISSYGNIEDMIGLIKKGEVGSILNEVDAVRVNALQRVAVE 93

Query: 107 -SNVG-PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT 163
            S +G P     DVI G  T FP  +   A+F+  + K   +  + EA ++      G+ 
Sbjct: 94  ESRLGIPLLMARDVIHGFKTIFPIPLGQAATFDPEVAKDGARIAAIEASSV------GVR 147

Query: 164 Y-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           + ++P I+++RDPRWGRI E+ GED ++        V+G Q          LNS P  ++
Sbjct: 148 WTFAPMIDISRDPRWGRIAESCGEDVYLSSVMGSAMVKGFQ-------GDSLNS-PTSIA 199

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           +C KH+  Y         R +    ++E+ +   +  PFE   K G A+  M S+N  +G
Sbjct: 200 ACAKHFVGYGAAEG---GRDYNSTFISERSLRNVYFPPFEAAAKAGVAT-FMTSFNDNDG 255

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           +PS  +  +L   +RGEW   G +V D +S + M+  H F AD K DA    + AG+D++
Sbjct: 256 VPSTGNKFILKDVLRGEWGFDGLVVTDWNSAREMI-AHGFAADDK-DAATLAVNAGVDME 313

Query: 343 CGQY--YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
              Y  + N     ++ GKVKE  ID+++K +  V  RLG FD    YV   +  +  DE
Sbjct: 314 MVSYAFFKNLP-EQIKSGKVKEEVIDEAVKNILRVKFRLGLFDNP--YVDEKRPSVMYDE 370

Query: 401 -NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRY 457
            ++  A  AA E ++LLKN++  LPL    V+TVAVVGP A+A    +G +   G     
Sbjct: 371 SHLAAAKRAAEESVILLKNEREVLPLKET-VRTVAVVGPMADAPYEQLGTWVFDGEKSHT 429

Query: 458 MSPIAG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAE 513
            +P+A     +     V Y+ G      K+   I  A      AD  I   G +  +  E
Sbjct: 430 QTPLAAIRSIYGDKVQVVYEPGLTYSRDKNVAGIAKAVSVTAHADVVIAFVGEEAILSGE 489

Query: 514 SLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
           +    DL L G Q++LI  +A+  K P++ V+M+  G  +   +      A+L++ +PG 
Sbjct: 490 AHSLADLNLQGAQSELIAALAKTGK-PLVTVVMA--GRQLTIGKEAEESDAVLYSFHPGT 546

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL----------TSMPLRPVDSLGYPG 623
            GG AIAD++FGK  P G+ P+T+     V  +PL           S+  +P++ +  P 
Sbjct: 547 MGGPAIADLLFGKAVPSGKTPVTFLKA--VGQIPLYYAHNNSGRPASLNYKPLEEI--PV 602

Query: 624 RTYKFYNGPT----------LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYT 673
              +   G +          LYPFGYGLSYT FKY                         
Sbjct: 603 EAGQTSEGSSSSYMDAGVQPLYPFGYGLSYTTFKYG------------------------ 638

Query: 674 SDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIG 733
                   P +   +L   D      D +N G  +G++VV +Y +         +K++  
Sbjct: 639 -------KPKISSRELSSKDVLTVVFDLENTGRYEGTEVVQLYVQDKVASVTRPVKELKR 691

Query: 734 FQRVFVRAGRNKRIKF 749
           F RV +++G  K + F
Sbjct: 692 FTRVTLKSGEKKTVTF 707


>gi|429745624|ref|ZP_19279029.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           380 str. F0488]
 gi|429168470|gb|EKY10301.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           380 str. F0488]
          Length = 770

 Score =  256 bits (655), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 195/655 (29%), Positives = 327/655 (49%), Gaps = 74/655 (11%)

Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARD 174
           DVI G  T FP  +  + S++ +L +K  +  + EA A       G+ + ++P +++ RD
Sbjct: 124 DVIHGYKTIFPIPLAESCSWDLALMRKTAELAAREASA------DGINWTFAPMVDITRD 177

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
            RWGR  E  GEDP++    A   V+G Q   G +N   L+S P  + +C KH+A Y   
Sbjct: 178 ARWGRAMEGAGEDPYLGSLIAEARVKGFQ---GGDNWQMLSS-PHTLLACGKHFAGYGAA 233

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
              G D  +  A ++   +   +L P+E  +  G   S+M S N +NG+P+ AD  LL +
Sbjct: 234 E-SGKD--YNTAELSMHTLRNVYLPPYEATLNAG-VGSIMASLNEINGVPATADKWLLTE 289

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGN 353
            +R EW  +G +V+D   I  +V  H    D K+ A   +  AG+++D  G  +  +   
Sbjct: 290 VLRKEWGFNGLLVSDYTGINELV-RHGVAKDDKQ-AANLSANAGIEMDMNGATFIKYLSA 347

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV--SLGKQDICSDENIELAAEAARE 411
            V++GKV E  IDK+++++  +   LG FD   +Y+  +  K++  ++E +++A +A   
Sbjct: 348 LVKEGKVTEAQIDKAVRHILEMKFLLGLFDDPYRYLDETRAKENTFTEEYLKVARQAVAS 407

Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYMSPIAGFSGY-- 467
            +VLLKN+   LP+     KT+AV+GP  N T  + G++   G   + +S + G +    
Sbjct: 408 SVVLLKNEAEVLPIKKDSGKTIAVIGPMMNNTSDINGSWTCLGDGKQSVSLLTGLTEKYK 467

Query: 468 ---ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
                + Y  GC      S   +  A   A+ AD  ++  G   S   ES  R D+ LP 
Sbjct: 468 GTNVKLLYAEGCG-FTTISTEQLKEAVAIARKADRVLVAVGEQSSWAGESAVRTDIRLPQ 526

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q QL+  +  + K P+ +V  S   +D+++   N N++AIL A +PG +GG  IADV+ 
Sbjct: 527 AQRQLLEALKAINK-PIAIVTFSGRPLDLSWE--NENVQAILQAWFPGTQGGNGIADVIA 583

Query: 585 GKFNPGGRLPITWYNGDYVQMLPL------TSMPL----RPVDSLGYPGRTYKFYNGPTL 634
           G  NP G L +++     V  +P+      T  P+      VD   +    Y   +   L
Sbjct: 584 GDVNPSGHLTMSFPRS--VGQIPIYYNYKSTGRPVYTNNEEVDHRPHYNAGYLDSSITPL 641

Query: 635 YPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDY 694
           YPFGYGLSYT F  +         V+LNK    +++   +D+       ++VN       
Sbjct: 642 YPFGYGLSYTTFAIS--------NVHLNK----KSIKRYNDS-------IIVN------- 675

Query: 695 FEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
                  QN G+T+G  VV +Y++      +  +K++ GFQ++ ++AG +K+++F
Sbjct: 676 ----ASVQNTGTTEGEIVVQLYTRQLVASVSRPVKELKGFQKISLKAGESKQVRF 726


>gi|149280000|ref|ZP_01886125.1| putative beta-glucosidase [Pedobacter sp. BAL39]
 gi|149229197|gb|EDM34591.1| putative beta-glucosidase [Pedobacter sp. BAL39]
          Length = 793

 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 212/722 (29%), Positives = 345/722 (47%), Gaps = 109/722 (15%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P   +  E  HG   +G            T FPT I  +++++ +L K++  A++ 
Sbjct: 133 RLGIPML-FSEECPHGHMAIG-----------TTVFPTSIGQSSTWDPALIKEMAAAIAM 180

Query: 150 EARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHE 209
           E R      + G   + P +++AR+PRW R+ ET GEDP +  R     V G Q      
Sbjct: 181 ETRL-----QGGHIGYGPVLDLAREPRWSRVEETYGEDPVLNSRMGEAMVSGFQ------ 229

Query: 210 NATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVT--EQDMEETFLRPFEMCVKE 267
             T++ S  + + S  KH+ AY V      +  H    VT   +++ +++L PF+  VK 
Sbjct: 230 -GTNIGS-GVNILSTLKHFTAYGVP-----EGGHNGGSVTVGNRELFQSYLPPFKAAVKA 282

Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSK 327
           G A SVM +YN V+GIP  ++  LL   +RG+W  +G++V+D +SI  +  NH  +A S 
Sbjct: 283 G-ALSVMTAYNSVDGIPCSSNRYLLTDILRGQWGFNGFVVSDLNSISGLEGNH-HVASSA 340

Query: 328 EDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
            +A A  + AGLD D   Y Y      AV  G VK   +D +L  +  +   +G F+   
Sbjct: 341 TEAAALAMNAGLDADLSGYGYGPALVKAVNGGLVKMATVDTALARVLRLKFNMGLFENPY 400

Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
                 ++ + + +++ LA + A+E +VLLKN++N LPL+ A +K +AV+GP+A+     
Sbjct: 401 VNPKQAEKQVMNAKHVTLARKVAQESVVLLKNEKNILPLSKA-LKNIAVIGPNADNVYNQ 459

Query: 447 IGNYA-----GIPCRYMSPI-AGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADAT 500
           +G+Y      G     ++ I A  S    V Y+ GC      S     A + A+K+  A 
Sbjct: 460 LGDYTAPQADGKVITVLNGIRAKVSKETGVFYQKGCAIRDTASAGIAAAVALASKSDVAI 519

Query: 501 IILAG---LDLSVE---------------------AESLDREDLWLPGYQTQLINQVAEV 536
           ++L G    D   E                      E  DR  L L G Q +L+  V + 
Sbjct: 520 VVLGGSSARDFKTEYQNTGAAEVKASAVAVSDMESGEGFDRSTLDLMGRQMELLRAVVKT 579

Query: 537 AKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPIT 596
              PV++V++    + + +A    N+ A++ A YPG+EGG AIADV+FG +NP GRL ++
Sbjct: 580 GT-PVVVVLIKGRPLTLNWAA--ENVAAMVDAWYPGQEGGNAIADVLFGDYNPAGRLSVS 636

Query: 597 WYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKT 656
                 V  LP+     RP+         Y   +   LY FGYGLSY+ F+Y+ L     
Sbjct: 637 VPKS--VGQLPVYYNKKRPLP------HNYVELDEQPLYSFGYGLSYSTFEYSNL----- 683

Query: 657 IQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVY 716
                                KT   G    D+R    F    D +N GS DG +VV +Y
Sbjct: 684 ---------------------KTNVSG-RGKDVRVQVTF----DLKNTGSRDGDEVVQLY 717

Query: 717 SKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIF 776
            +       T ++Q+  F+R+ +++G+ +++ F  +A + L +++      +  G+ ++ 
Sbjct: 718 LRDEQSSVVTPMQQLKQFRRLSLKSGQQQQLSFELSA-EDLQLMNQQMEWQVEPGDFSLM 776

Query: 777 VG 778
           VG
Sbjct: 777 VG 778


>gi|300778434|ref|ZP_07088292.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
 gi|300503944|gb|EFK35084.1| beta-glucosidase [Chryseobacterium gleum ATCC 35910]
          Length = 740

 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 215/762 (28%), Positives = 351/762 (46%), Gaps = 109/762 (14%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFD------- 116
           +V +L+S+MTL+EKV Q+  ++ G      PQ+   +  L  +     G+  +       
Sbjct: 26  KVAELLSKMTLEEKVGQMVQYS-GFEYATGPQHSNSAAVLDEIKKGKVGSMLNVAGSEET 84

Query: 117 --------------------DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMY 155
                               DVI G  T+FP  I   AS++  + +K  +  +TEA A Y
Sbjct: 85  RAFQKLAMQSRLKIPLLFGQDVIHGYRTTFPVNIGQAASWDLGMIEKSERIAATEA-AAY 143

Query: 156 NLGRAGLTYWS--PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATD 213
            +      +W+  P +++ARDPRWGR+ E  GED ++  +  +  ++G Q     +    
Sbjct: 144 GI------HWTFAPMVDIARDPRWGRVMEGSGEDTYLGTKIGLARIKGFQG----KGLGS 193

Query: 214 LNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSV 273
           L++    V +C KH+AAY      G D    D  + +  + ET+L PF+   + G  ++ 
Sbjct: 194 LDA----VMACAKHFAAYGA-AVGGRDYNSVDMSLRQ--LNETYLPPFKAAAEAG-VATF 245

Query: 274 MCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQ 333
           M S+N +NGIP+ A+  +    ++G+W+  G++V+D  SI  M+  H +  D+ + A  +
Sbjct: 246 MNSFNDINGIPATANQYIQRNLLKGKWNYKGFVVSDWGSIGEMIP-HGYAKDAAQ-AAER 303

Query: 334 TLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLG 392
            ++ G D+D   + Y       V++GKV    +D +   + T   ++G FD   ++ +  
Sbjct: 304 AVQGGSDMDMESRVYMAELPKLVKEGKVDAKLVDDAAGRILTKKFQMGLFDDPYRFSNEK 363

Query: 393 KQDICSD--ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
           +Q   +D  EN +   E   + IVLLKN  N LPL S   KTVA++GP    TVA  G +
Sbjct: 364 RQKEQTDNQENRKFGREFGSKSIVLLKNHGNILPL-SKNTKTVALIGPFGKETVANHGFW 422

Query: 451 A-GIPCRYMSPIAGFSGYAN-------VTYKTGCDDVACKSNNSIFAASEAAKTADATII 502
           +          ++ F G  N       + Y  GC+ V  +       A E A+ AD  I+
Sbjct: 423 SVAFKDDNQRIVSQFDGIKNQLDKNSTLLYAKGCN-VDDQDKTQFAEAIETARRADVVIM 481

Query: 503 LAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
             G   ++  E+  R ++   G Q  L+ ++A+  K P+IL+I +  G  + F   + NI
Sbjct: 482 TLGEGHAMSGEAKSRSNIGFTGVQEDLLQEIAKTGK-PIILMINA--GRPLIFNWASDNI 538

Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPV 616
            AI++  + G E G +IADV+FGK NPGG+LP+T+   +    +P+      T  P +  
Sbjct: 539 PAIMYTWWLGTEAGNSIADVLFGKVNPGGKLPMTFPRTE--GQIPVYYNHYNTGRPAKNN 596

Query: 617 DSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDA 676
               Y        N P  YPFGYGLSYT FKY+ +  +                      
Sbjct: 597 TDRNYVSAYIDLDNDPK-YPFGYGLSYTDFKYSDMVLSSA-------------------- 635

Query: 677 SKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQR 736
                      +L  +      V   N G  DG +VV +Y +         +K++ GFQ+
Sbjct: 636 -----------NLTGNQTLNISVTVSNTGKYDGEEVVQLYVRDLFGKVVRPVKELKGFQK 684

Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           VF++ G +K+I F     + L   D   N     GE  I +G
Sbjct: 685 VFIKKGESKKIDFKLTP-EDLKFFDDELNFDWEGGEFDIMIG 725


>gi|265765465|ref|ZP_06093740.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
 gi|263254849|gb|EEZ26283.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
          Length = 814

 Score =  256 bits (654), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 240/853 (28%), Positives = 360/853 (42%), Gaps = 147/853 (17%)

Query: 8   LLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKD 67
           L+CF +     +F   A +  G            F  L        + + S P   RV+ 
Sbjct: 5   LICFLMLSVFFIFPVRAKNTFGKKKDKVTRL--HFYDLNKNGRMDTYENPSAPVEYRVEH 62

Query: 68  LVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT-- 113
           L+S+MTL+EKV Q+            G+     P+L     E+   +L G     P T  
Sbjct: 63  LLSQMTLEEKVGQMLTSLGWPMYERVGEDIRLTPQLEKEIGEYHIGSLWGFMRADPWTQR 122

Query: 114 ------------------------HFDDVIP--------------GATSFPTVILTTASF 135
                                   H    IP              G T FPT I   +++
Sbjct: 123 TLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIGTTVFPTSIGQASTW 182

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
           N  L +++G+ ++ EA A     +     + P +++ARDPRW R+ ET GEDP++ G   
Sbjct: 183 NPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGAMG 237

Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEE 255
              VRG Q     E   D  S    V +  KH+A+Y    W         A + E+++EE
Sbjct: 238 TALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGHNGGTAHIGERELEE 286

Query: 256 TFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQV 315
               PF   V  G A SVM SYN ++G P      LL   ++  W   G++V+D  ++  
Sbjct: 287 AIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGG 345

Query: 316 MVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYT 374
           + ++   +A +  +A  + + AG+D D G   Y      AV++G V    IDK+++ + +
Sbjct: 346 LREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILS 403

Query: 375 VLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVA 434
           +  ++G FD          Q + S E+  LA E AR+ IVLLKN    LPL    ++T+A
Sbjct: 404 LKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKK-DIRTLA 462

Query: 435 VVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKTGCDDVACKSNNSIF 488
           V+GP+A+    M+G+Y      G     +  I    S    V Y  GC  V   S     
Sbjct: 463 VIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGC-AVRDSSRTGFK 521

Query: 489 AASEAAKTADATIILAG----LDLSVE-------------------AESLDREDLWLPGY 525
            A E A+ ADA +++ G     D S E                    E  DR  L L G 
Sbjct: 522 DAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGR 581

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q +L+ +++ + K PV+LV++   G  +         +AI+ A YPG +GG A+ADV+FG
Sbjct: 582 QLELLEEISRLGK-PVVLVLIK--GRPLLMEGAIQEAEAIVDAWYPGMQGGNAVADVLFG 638

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
            +NP GRL ++      V  LP+     R     G   R Y    G   YPFGYGLSYT 
Sbjct: 639 DYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YIEEPGTPRYPFGYGLSYTT 691

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
           F Y  +     +QV                           +D R D      V  QN G
Sbjct: 692 FSYTDMK----VQVTEGS-----------------------DDCRVD----VTVTIQNQG 720

Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
           + DG +V  +Y +       T  KQ+  F R+ ++AG ++ + F  +  KSL +      
Sbjct: 721 TADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALYMQEGE 779

Query: 766 TLLPAGEHTIFVG 778
            ++  G  TI VG
Sbjct: 780 WVVEPGRFTIMVG 792


>gi|429756169|ref|ZP_19288778.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           324 str. F0483]
 gi|429171889|gb|EKY13478.1| glycosyl hydrolase family 3 protein [Capnocytophaga sp. oral taxon
           324 str. F0483]
          Length = 755

 Score =  256 bits (654), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 210/738 (28%), Positives = 354/738 (47%), Gaps = 109/738 (14%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLG---LPQYEWWSEA------LHGVSNVG---- 110
           RV  ++  MTL+EK+ Q+  F+      G     +Y+ + E        +  S VG    
Sbjct: 31  RVDSVLRLMTLEEKIGQMTQFSADWSVTGPVMADKYQPYLEKGLVGSIFNATSVVGIRKL 90

Query: 111 ------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
                       P     DVI G  T FP  +  + S++ +L +K  +  + EA A    
Sbjct: 91  QKIAVEQTRLGIPILFGQDVIHGYKTIFPIPLAESCSWDLALMRKTAELAAREATA---- 146

Query: 158 GRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
              G+ + ++P +++ RD RWGR  E  GEDP++    A   V+G Q   G +N   L+S
Sbjct: 147 --DGINWTFAPMVDITRDARWGRAMEGAGEDPYLGSLIAEARVKGFQ---GGDNWQTLSS 201

Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
            P  + +C KH+A Y      G D  +  A ++   +   +L P+E  +  G   S+M S
Sbjct: 202 -PHTLLACGKHFAGYGAAE-SGKD--YNTAELSMHTLRNVYLPPYEATLNAG-VGSIMAS 256

Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
            N +NG+P+ AD  LL + +R EW  +G +V+D   I  +V  H    D K+ A   +  
Sbjct: 257 LNEINGVPATADKWLLTEELRKEWGFNGLLVSDYTGINELV-RHGVAKDDKQ-AANLSAN 314

Query: 337 AGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV--SLGK 393
           AG+++D  G  +  +    V++GK  E  IDK+++++  +   LG FD   +Y+  +  K
Sbjct: 315 AGIEMDMNGATFIKYLSALVKEGKATEAQIDKAVRHILEMKFLLGLFDDPYRYLDETRAK 374

Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA-- 451
           ++  ++E +++A +A    +VLLKN+   LP+     KT+AV+GP  N T  + G++   
Sbjct: 375 ENTFTEEYLKVARQAVASSVVLLKNEAEVLPIKKNSGKTIAVIGPMMNNTSDINGSWTCL 434

Query: 452 GIPCRYMSPIAGFSGY-----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
           G   + +S ++G +         + Y  GC      S   +  A   A+ AD  ++  G 
Sbjct: 435 GDGKQSVSLLSGLTQKYKGTNVKLLYAEGCG-FTTISTEQLKEAVAIARKADRVLVAVGE 493

Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
             S   ES  R D+ LP  Q QL+  +  + K P+ ++  S   +D+++   N N++AIL
Sbjct: 494 QSSWAGESAVRTDIRLPQAQRQLLEALKAINK-PITIITFSGRPLDLSWE--NENVQAIL 550

Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMP-------- 612
            A +PG +GG  IADV+ G  NP G L +++     V  +P+      T  P        
Sbjct: 551 QAWFPGTQGGNGIADVIAGDVNPSGHLTMSFPRS--VGQIPIYYNYKNTGRPVYTNNEEV 608

Query: 613 -LRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN 671
            LRP  + GY   +        LYPFGYGLSYT F  +         V+LNK    +++ 
Sbjct: 609 DLRPHYNAGYLDSSIT-----PLYPFGYGLSYTTFAIS--------NVHLNK----KSMK 651

Query: 672 YTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQV 731
             +D+       ++VN              QN G+T+G  V+ +Y++      +  +K++
Sbjct: 652 RYNDS-------IIVN-----------ASVQNTGTTEGEIVLQLYTRQLVASVSRPVKEL 693

Query: 732 IGFQRVFVRAGRNKRIKF 749
            GFQ++ ++AG +K+++F
Sbjct: 694 KGFQKISLKAGESKQVRF 711


>gi|60680320|ref|YP_210464.1| beta-glucosidase [Bacteroides fragilis NCTC 9343]
 gi|60491754|emb|CAH06512.1| putative beta-glucosidase [Bacteroides fragilis NCTC 9343]
          Length = 814

 Score =  256 bits (653), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 240/853 (28%), Positives = 360/853 (42%), Gaps = 147/853 (17%)

Query: 8   LLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKD 67
           L+CF +     +F   A +  G            F  L        + + S P   RV+ 
Sbjct: 5   LICFLMLSVFFIFPVRAKNTFGKKKDKVTRL--HFYDLNKNGRMDTYENPSAPVEYRVEH 62

Query: 68  LVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT-- 113
           L+S+MTL+EKV Q+            G+     P+L     E+   +L G     P T  
Sbjct: 63  LLSQMTLEEKVGQMLTSLGWPMYERVGEDIRLTPQLEKEIGEYHIGSLWGFMRADPWTQR 122

Query: 114 ------------------------HFDDVIP--------------GATSFPTVILTTASF 135
                                   H    IP              G T FPT I   +++
Sbjct: 123 TLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIGTTVFPTSIGQASTW 182

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
           N  L +++G+ ++ EA A     +     + P +++ARDPRW R+ ET GEDP++ G   
Sbjct: 183 NPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMG 237

Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEE 255
              VRG Q     E   D  S    V +  KH+A+Y    W         A + E+++EE
Sbjct: 238 TALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGHNGGTAHIGERELEE 286

Query: 256 TFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQV 315
               PF   V  G A SVM SYN ++G P      LL   ++  W   G++V+D  ++  
Sbjct: 287 AIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGG 345

Query: 316 MVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYT 374
           + ++   +A +  +A  + + AG+D D G   Y      AV++G V    IDK+++ + +
Sbjct: 346 LREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILS 403

Query: 375 VLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVA 434
           +  ++G FD          Q + S E+  LA E AR+ IVLLKN    LPL    ++T+A
Sbjct: 404 LKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKK-DIRTLA 462

Query: 435 VVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKTGCDDVACKSNNSIF 488
           V+GP+A+    M+G+Y      G     +  I    S    V Y  GC  V   S     
Sbjct: 463 VIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGC-TVRDSSRTGFK 521

Query: 489 AASEAAKTADATIILAG----LDLSVE-------------------AESLDREDLWLPGY 525
            A E A+ ADA +++ G     D S E                    E  DR  L L G 
Sbjct: 522 DAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGR 581

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q +L+ +++ + K PV+LV++   G  +         +AI+ A YPG +GG A+ADV+FG
Sbjct: 582 QLELLEEISRLGK-PVVLVLIK--GRPLLMEGAIQEAEAIVDAWYPGMQGGNAVADVLFG 638

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
            +NP GRL ++      V  LP+     R     G   R Y    G   YPFGYGLSYT 
Sbjct: 639 DYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YIEEPGTPRYPFGYGLSYTT 691

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
           F Y  +     +QV                           +D R D      V  QN G
Sbjct: 692 FSYTDMK----VQVTEGS-----------------------DDCRVD----VTVTIQNQG 720

Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
           + DG +V  +Y +       T  KQ+  F R+ ++AG ++ + F  +  KSL +      
Sbjct: 721 TADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALYMQEGE 779

Query: 766 TLLPAGEHTIFVG 778
            ++  G  TI VG
Sbjct: 780 WVVEPGRFTIMVG 792


>gi|224025503|ref|ZP_03643869.1| hypothetical protein BACCOPRO_02243 [Bacteroides coprophilus DSM
           18228]
 gi|224018739|gb|EEF76737.1| hypothetical protein BACCOPRO_02243 [Bacteroides coprophilus DSM
           18228]
          Length = 787

 Score =  256 bits (653), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 213/741 (28%), Positives = 344/741 (46%), Gaps = 119/741 (16%)

Query: 90  RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVST 149
           RLG+P   +  E  HG   +G           AT FPT +   +++NESL +++G+ +  
Sbjct: 125 RLGIPVL-FAEECPHGHMAIG-----------ATVFPTSMGQASTWNESLIRQMGEVIGL 172

Query: 150 EARAM-YNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGH 208
           EAR    N+G      + P +++AR+PRW R+ ET GEDP++ G     +V+G+Q  +  
Sbjct: 173 EARLQGANIG------YGPVLDIAREPRWSRVEETFGEDPYLTGILGTAFVQGMQGKDFK 226

Query: 209 ENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFD---ARVTEQDMEETFLRPFEMCV 265
           +           V S  KH AAY      GV R   +   A +  + + + +L  F+  V
Sbjct: 227 DGR--------HVYSTLKHLAAY------GVPRGGHNGGPADMGLRALLDEYLPGFQRAV 272

Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
           + G A++VM SYN ++G+P  ++  L++  +R  W   G++ +D  SI  +   H  +A 
Sbjct: 273 EVGKAATVMTSYNSIDGVPCTSNKFLIDSLLRKRWGFDGFVYSDLASIDGIAGAH--VAA 330

Query: 326 SKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG- 384
           + EDA  Q ++AG D+D G         AVQ GKVKE+ I++++  +  +  R+G F+  
Sbjct: 331 NLEDAAIQAVEAGTDMDLGANAYRRLVKAVQTGKVKESAINRAVSNVLRLKFRMGLFEQP 390

Query: 385 --SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANA 442
             SP+  +  +   C D  + LA + AREG VLLKN+   LPL   KVK +AV+GP+A+ 
Sbjct: 391 YVSPEEAA--RLVNCEDHRM-LARKIAREGTVLLKNN-GILPL--GKVKRIAVIGPNADV 444

Query: 443 TVAMIGNYAGIPCRYMSPIAGFSGYAN------VTYKTGCDDVACKSNNSIFAASEAAKT 496
               +G+Y   P      +       N      + Y  GC  +   + ++I  A EAA+ 
Sbjct: 445 MYNYLGDYTA-PQERSKVVTLLDALRNRMPDVRIDYVKGC-AIRDTTQSNIKEAVEAARK 502

Query: 497 ADATIILAG----LDLSVE----------------------AESLDREDLWLPGYQTQLI 530
           AD  I+  G     D   +                       E  DR  L L G Q +LI
Sbjct: 503 ADLVILAVGGSSARDFKTKYINTGAATVDSENSGILSDMECGEGFDRATLDLLGDQEKLI 562

Query: 531 NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
             +A   K P++ V ++   +++  A   ++  A+L A YPGE+GG  I DV+ G++NP 
Sbjct: 563 RAIAATEK-PLVTVYIAGRPLNMNLASEVSD--ALLTAWYPGEQGGNGIVDVLTGEYNPS 619

Query: 591 GRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNL 650
           GRLP++     +V  +P+        D +  PG+         LY FGYGLSYT F Y+ 
Sbjct: 620 GRLPMSVPR--HVGQIPVHYSQGTLRDYMDCPGK--------PLYTFGYGLSYTTFAYSN 669

Query: 651 LSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGS 710
           L  + T +                 AS+      ++  + C           N G  DG 
Sbjct: 670 LKLSATAKA----------------ASQPAGDNEVMQTITC--------TVTNTGDRDGD 705

Query: 711 DVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPA 770
           +VV +Y        A    ++ GFQ++F++ G ++ + F     + L+I D   N     
Sbjct: 706 EVVQLYLNDEVSSVAVPPIRLKGFQKIFLKKGESREVTFQLTR-QDLSIYDRNMNFTAEP 764

Query: 771 GEHTIFVGNGGVSFPIHLNFN 791
           G   + +G    + P+  +F 
Sbjct: 765 GRFNVMIGGSSDNLPLKGSFE 785


>gi|268316106|ref|YP_003289825.1| glycoside hydrolase [Rhodothermus marinus DSM 4252]
 gi|262333640|gb|ACY47437.1| glycoside hydrolase family 3 domain protein [Rhodothermus marinus
           DSM 4252]
          Length = 754

 Score =  256 bits (653), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 229/768 (29%), Positives = 360/768 (46%), Gaps = 116/768 (15%)

Query: 65  VKDLVSRMTLDEKVQQL----GDFAHGVP-------------RLGLPQYEWWSEALHGV- 106
           ++ L++RMTL+EK+ QL    G  A   P             R+G     + +EA+  + 
Sbjct: 33  IEALLARMTLEEKLGQLTLYNGGMAETGPVVREGEPDAIRRGRVGAVMNFFGAEAVCAMQ 92

Query: 107 ------SNVG-PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG 158
                 S +G P     DVI G  T FP  +   A+F+ +L ++  +  + EA A+    
Sbjct: 93  RQAVEESRLGIPLLFALDVIHGFRTIFPVPLAEAATFDPALVEQAARVAAGEASAV---- 148

Query: 159 RAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSR 217
             GL + ++P +++ARD RWGRI E  GEDP++    A   VRG Q         DL   
Sbjct: 149 --GLNWTFAPMVDIARDARWGRIVEGSGEDPYLGAVMAAARVRGFQ-------GRDLRD- 198

Query: 218 PLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
           P  + +  KH+AAY      G D    D  V+E+ + E +L PFE  V+ G A S+M ++
Sbjct: 199 PTTILATAKHFAAYGAAE-AGRDYNTVD--VSERTLREVYLPPFEAAVRAG-ALSIMSAF 254

Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
           N + G+P+ AD  LL   +R EW   G +V+D  S+  ++  H   ADS E    + L+A
Sbjct: 255 NEIGGVPATADRWLLTDVLRHEWGFEGLVVSDYTSVWELL-FHGIAADSAEVG-RKALEA 312

Query: 338 GLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV--SLGKQ 394
           G+D+D     Y       V+ G++ E  +D++++ +  V  RLG F+   +Y   +  +Q
Sbjct: 313 GVDMDMVSGIYVRKLAEEVRAGRLSEAVVDEAVRRVLRVKYRLGLFEDPYRYCRDASREQ 372

Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--G 452
            + S  +  LA E AR+ IVLLKN+   LPL    ++ VAV+G  AN + +++G +A  G
Sbjct: 373 VLLSPAHRRLAREVARKAIVLLKNEGELLPLAD-TLQRVAVIGALANDSASVLGPWAAAG 431

Query: 453 IPCRYMSPIAGFSGY---ANVTYKTGCDDV-----------ACKSNNSIFAASEA-AKTA 497
            P   ++ + G       A V Y  G  +V           A   + S FA +EA A+ A
Sbjct: 432 RPEDAVTILEGIRAALPGATVRYAPGYAEVPSGSFQEMVAAALSPDTSGFAEAEAVARWA 491

Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
           +  I++ G    +  E+  R  + LPG Q  L  ++  + + PV++V+M+  G  +A  E
Sbjct: 492 EVVILVLGEHRELSGEAASRASVELPGVQLALAWRLLALGR-PVVVVLMN--GRPLAIPE 548

Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD 617
              +  AI+ A + G E G A+ADV+ GK +PGGRLP+++      + L     P     
Sbjct: 549 LAASAPAIVEAWFLGTEMGHAVADVLLGKASPGGRLPVSFPRATGQEPLYYNHKP----- 603

Query: 618 SLGYPGR-----TYKFYNGP--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
             G P R     T K+ + P   LYPFGYGL+YT F Y+ L  ++               
Sbjct: 604 -TGRPPRAEEKYTSKYVDVPWTPLYPFGYGLTYTTFAYDSLRLSRRRLG----------- 651

Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
                                DD  E  V   N G   G +VV +Y +         +K+
Sbjct: 652 --------------------LDDTLEVVVSVTNTGRRRGEEVVQLYVRDEVASVTRPVKE 691

Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           + GF RV +  G  K ++F     ++L         ++  G  T++VG
Sbjct: 692 LKGFARVELAPGETKAVQFRLP-VRALRFWGLEGGWVVEPGWFTLWVG 738


>gi|423271149|ref|ZP_17250120.1| hypothetical protein HMPREF1079_03202 [Bacteroides fragilis
           CL05T00C42]
 gi|423274973|ref|ZP_17253919.1| hypothetical protein HMPREF1080_02572 [Bacteroides fragilis
           CL05T12C13]
 gi|392699073|gb|EIY92255.1| hypothetical protein HMPREF1079_03202 [Bacteroides fragilis
           CL05T00C42]
 gi|392704252|gb|EIY97391.1| hypothetical protein HMPREF1080_02572 [Bacteroides fragilis
           CL05T12C13]
          Length = 859

 Score =  255 bits (652), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 215/769 (27%), Positives = 337/769 (43%), Gaps = 146/769 (18%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-------------------------- 83
           ++F + ++SLP  +RV+DL+SRMTL+EK+ Q+                            
Sbjct: 22  TNFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNY 81

Query: 84  -FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
            F  G+                      RLG+P +   +E+LHG            V  G
Sbjct: 82  GFIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDG 129

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
           +T FP  I   ++FN  L  ++  A++ E      L   G+T   +P I+V RD RWGR+
Sbjct: 130 STIFPQAIALGSTFNPILAYEMTSAIAKE------LSAQGITQSLTPVIDVCRDLRWGRV 183

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            E  GEDPF+V R  V+ VRG  D +              VS   KH+ A+      G++
Sbjct: 184 EECFGEDPFLVSRMGVSQVRGYLDNQ--------------VSPMIKHFGAHGAPQ-GGLN 228

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
                    ++++   +L+ FE  VKE    +VM SYN  N  P+ +   L+ + +R  W
Sbjct: 229 LASVSC--GQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRW 286

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
           D  GY+ +D  +I ++   HK   +S E A+ Q L AGLD +            V+ G +
Sbjct: 287 DFQGYVYSDWGAIGMLNYFHKTAQNSAEAAI-QALTAGLDAEASDNSYAELQQLVENGML 345

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
               ID+++  + T    +G F+          + + +  ++ LA + A E IVLL+N+ 
Sbjct: 346 DVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405

Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------GIPCRYMSPIAGFSG-YANVTY 472
           N LPL   K+K++AV+GP  NA     G+Y        G+    +  +    G    + Y
Sbjct: 406 NILPLQMNKLKSIAVIGP--NADQVQFGDYTWSRDNKDGVTL--LEALKERVGNQLTLNY 461

Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---------ESLDREDLWLP 523
             GC D+     +    A + AK +D  I++ G   +  A         E  D  DL L 
Sbjct: 462 AKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLT 520

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
           G Q  L+  +    K PVI+V++S  G   A +    NI  I+   YPGE+GG A+AD++
Sbjct: 521 GVQEDLVEAIHATGK-PVIVVLLS--GKPFAMSWIKENIPGIVVQWYPGEQGGLALADML 577

Query: 584 FGKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
            GK NP G+L  ++         Y   LP      R   S   PG+ Y F +   L+ FG
Sbjct: 578 LGKVNPSGKLNYSFPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFG 637

Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
           +GLSYT F+Y  LS T + +                             D  C+D  E  
Sbjct: 638 HGLSYTDFEY--LSATTSKE-----------------------------DYACEDVIEVT 666

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
           +  +N G  DG +V  VY +         ++++ GF++V ++ G  K++
Sbjct: 667 IAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQV 715


>gi|423214254|ref|ZP_17200782.1| hypothetical protein HMPREF1074_02314 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392693199|gb|EIY86434.1| hypothetical protein HMPREF1074_02314 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 735

 Score =  255 bits (652), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 213/772 (27%), Positives = 355/772 (45%), Gaps = 105/772 (13%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG--------------VP-RLGLPQYE 97
           L+ D+  P   RV DL+SRMTL+EK+ QL  +  G              VP  +G   Y 
Sbjct: 29  LYKDAKAPIEKRVDDLLSRMTLEEKILQLNQYTMGRNNNVNNIGEEVKKVPAEIGSLIYY 88

Query: 98  WWSEALHG--------VSNVGPGTHFD-DVIPG-ATSFPTVILTTASFNESLWKKIGQAV 147
             + AL           S +G    F  D I G  T +P  +    S+N  L +K     
Sbjct: 89  DTNPALRNNVQKKAMEESRLGIPIIFGYDAIHGFRTVYPISLGQACSWNPELVEKACAVT 148

Query: 148 STEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVE 206
           + EAR       +G+ + +SP I+VARDPRWGR+ E  GEDP+  G +A   VRG Q   
Sbjct: 149 AQEARM------SGVDWTFSPMIDVARDPRWGRVAEGYGEDPYANGVFAAASVRGYQG-- 200

Query: 207 GHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVK 266
                 D  S   ++++C KHY  Y         R +    ++ Q + +T+L P+EM VK
Sbjct: 201 ------DDMSAEDRIAACLKHYIGYGASE---AGRDYVYTEISAQTLWDTYLLPYEMGVK 251

Query: 267 EGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADS 326
            G A+++M S+N ++G+P  A+   + + ++  W   G+IV+D  +I+ +   ++ LA +
Sbjct: 252 AG-AATLMSSFNDISGVPGSANHYTMTEILKERWGHDGFIVSDWGAIEQL--KNQGLAAN 308

Query: 327 KEDAVAQTLKAGLDLDCGQY-YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS 385
           K++A      AGL++D   + Y  +    V++GK+    +D+S++ +  V  RLG F+  
Sbjct: 309 KKEAAVYAFNAGLEMDMMSHAYDRYMKELVEEGKITMAQVDESVRRVLRVKFRLGLFERP 368

Query: 386 PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVA 445
              V+  K+     +++++AA+ A E +VLLKN+   LPL     K +AVVGP A     
Sbjct: 369 YTPVTNEKERFFRPQSMDIAAQLAAESMVLLKNENGILPLTDK--KKIAVVGPMAKNGWD 426

Query: 446 MIGNYAG------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADA 499
           ++G++ G      +   Y      F G A + Y  GC      +      A EAA+ +D 
Sbjct: 427 LLGSWCGHGKDTDVAMLYNGLATEFVGKAELRYALGC-STQGDNRKGFEEALEAARWSDV 485

Query: 500 TIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
            ++  G  ++   E+  R  + LP  Q +L  ++ +  K P++LV+++   +++   E  
Sbjct: 486 VVLCLGEMMTWSGENASRSSIALPQIQEELAKELKKAGK-PIVLVLVNGRPLELNRLEPI 544

Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS--MPL---R 614
           ++  AIL    PG  G   +A ++ G+ NP G+L +T+         P ++  +P+   R
Sbjct: 545 SD--AILEIWQPGVNGALPMAGILSGRINPSGKLAMTF---------PYSTGQIPIYYNR 593

Query: 615 PVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTS 674
                G+ G  YK      LY FG+GLSYT+FKY  ++ + T      KL          
Sbjct: 594 RKSGRGHQG-FYKDITSEPLYSFGHGLSYTEFKYGTVTPSVTTVKRGGKLS--------- 643

Query: 675 DASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGF 734
                                  +V   N G  DG + V  +   P       +K++  F
Sbjct: 644 ----------------------VEVSVSNTGKRDGLETVHWFISDPYCSITRPVKELKHF 681

Query: 735 QRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPI 786
           ++  ++AG  K  +F  +  +    V+      L  GE+ I V +  V   +
Sbjct: 682 EKQLIKAGETKVFRFDVDLERDFGFVNGNGKRFLEIGEYYIQVKDQKVKIDL 733


>gi|375357172|ref|YP_005109944.1| putative beta-glucosidase [Bacteroides fragilis 638R]
 gi|301161853|emb|CBW21397.1| putative beta-glucosidase [Bacteroides fragilis 638R]
          Length = 814

 Score =  255 bits (652), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 240/853 (28%), Positives = 360/853 (42%), Gaps = 147/853 (17%)

Query: 8   LLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKD 67
           L+CF +     +F   A +  G            F  L        + + S P   RV+ 
Sbjct: 5   LICFLMLSVFFIFPVRAKNTFGKKKDKVTRL--HFYDLNKNGRMDTYENPSAPVEYRVEH 62

Query: 68  LVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT-- 113
           L+S+MTL+EKV Q+            G+     P+L     E+   +L G     P T  
Sbjct: 63  LLSQMTLEEKVGQMLTSLGWPMYERVGEDIRLTPQLEKEIGEYHIGSLWGFMRADPWTQR 122

Query: 114 ------------------------HFDDVIP--------------GATSFPTVILTTASF 135
                                   H    IP              G T FPT I   +++
Sbjct: 123 TLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIGTTVFPTSIGQASTW 182

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
           N  L +++G+ ++ EA A     +     + P +++ARDPRW R+ ET GEDP++ G   
Sbjct: 183 NPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMG 237

Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEE 255
              VRG Q     E   D  S    V +  KH+A+Y    W         A + E+++EE
Sbjct: 238 TALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGHNGGTAHIGERELEE 286

Query: 256 TFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQV 315
               PF   V  G A SVM SYN ++G P      LL   ++  W   G++V+D  ++  
Sbjct: 287 AIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGG 345

Query: 316 MVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYT 374
           + ++   +A +  +A  + + AG+D D G   Y      AV++G V    IDK+++ + +
Sbjct: 346 LREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILS 403

Query: 375 VLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVA 434
           +  ++G FD          Q + S E+  LA E AR+ IVLLKN    LPL    ++T+A
Sbjct: 404 LKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKK-DIRTLA 462

Query: 435 VVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKTGCDDVACKSNNSIF 488
           V+GP+A+    M+G+Y      G     +  I    S    V Y  GC  V   S     
Sbjct: 463 VIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGC-AVRDSSRTGFK 521

Query: 489 AASEAAKTADATIILAG----LDLSVE-------------------AESLDREDLWLPGY 525
            A E A+ ADA +++ G     D S E                    E  DR  L L G 
Sbjct: 522 DAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGR 581

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q +L+ +++ + K PV+LV++   G  +         +AI+ A YPG +GG A+ADV+FG
Sbjct: 582 QLELLEEISRLGK-PVVLVLIK--GRPLLMEGAIQEAEAIVDAWYPGMQGGNAVADVLFG 638

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
            +NP GRL ++      V  LP+     R     G   R Y    G   YPFGYGLSYT 
Sbjct: 639 DYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YIEEPGTPRYPFGYGLSYTT 691

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
           F Y  +     +QV                           +D R D      V  QN G
Sbjct: 692 FSYTDMK----VQVTEGS-----------------------DDCRVD----VTVTIQNQG 720

Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
           + DG +V  +Y +       T  KQ+  F R+ ++AG ++ + F  +  KSL +      
Sbjct: 721 TADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALYMQEGE 779

Query: 766 TLLPAGEHTIFVG 778
            ++  G  TI VG
Sbjct: 780 WVVEPGRFTIMVG 792


>gi|285808617|gb|ADC36136.1| glycoside hydrolase family 3 protein [uncultured bacterium 253]
          Length = 752

 Score =  255 bits (651), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 225/751 (29%), Positives = 345/751 (45%), Gaps = 97/751 (12%)

Query: 64  RVKDLVSRMTLDEKVQQL--------GDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHF 115
           ++  L+ RMTL EK+ QL        G F    P L        +  + G  N     H 
Sbjct: 35  KIDALLKRMTLAEKLGQLQQLDGEGNGSFRPEHPDLIRKGLLGSTLNVRGAKNTNQLQHV 94

Query: 116 D--------------DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA 160
                          DVI G  T FP  +   +S++ +  ++     + EARA      A
Sbjct: 95  AMDESRLKIPVLFGFDVIHGYRTIFPIPLAEASSWDPTSAERSTSIAAREARA------A 148

Query: 161 GLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
           G+ + ++P +++ARDPRWGRITE  GED F+   +A   VRG Q        TD  S P 
Sbjct: 149 GVRWTFAPMLDIARDPRWGRITEGAGEDQFLGAAFARARVRGFQ-------GTDY-SAPD 200

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
           K+ +C KH+ AY      G D    D  ++E  + E +  PF+  V  G   +VM  +N 
Sbjct: 201 KMLACAKHWVAYGATE-GGRDYNTTD--MSENTLREIYFPPFKAAVDAG-VGTVMSGFND 256

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
           +NG+P  A+   L + +RGEW   G++V+D  S++ ++ NH  LA   +DA    L AG+
Sbjct: 257 LNGVPVSANHFTLTEVLRGEWKFDGFVVSDYTSVKELI-NHG-LAFGDQDAARLALNAGV 314

Query: 340 DLDCGQYYTNFTG-NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICS 398
           D++      N  G   +++GKV    ID++++ +  +  RLG F       +     + +
Sbjct: 315 DMEMVSRLFNQQGPQLLKEGKVSPATIDEAVRRILRIKFRLGLFANPYADEARETTSLLT 374

Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG--IPCR 456
            EN   A   A   +VLLKN+  TLPL S  ++++AV+GP A+   A +G ++G   P  
Sbjct: 375 SENRAAARALADRSMVLLKNEGGTLPL-SKGIRSIAVIGPLADDHRAPLGWWSGDGKPED 433

Query: 457 YMSPIAGF----SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA 512
            ++P+ G     S    V Y  GCD V   S   I  A   A+ ++  I+  G    +  
Sbjct: 434 TVTPLMGIRAKVSPATKVNYAKGCD-VQGDSTGDIAEAVAVARESELAIVFVGESAEMVG 492

Query: 513 ESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPG 572
           E+  +  L L G Q  L+  V    K P I+V+++   + + +   NT   A+L A   G
Sbjct: 493 EAASKSSLDLTGCQMDLVKAVQATGK-PTIVVLINGRPLTVGWIFDNT--PAVLEAWMGG 549

Query: 573 EEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPL---RPVDSLGYPGRTYKFY 629
            E G AIADV+FG  NPGG+LP+TW     V  +P+    +   RP ++      T K+ 
Sbjct: 550 TEAGNAIADVLFGDANPGGKLPVTWPR--TVGQVPIYYNHMNTGRPPEANNR--YTSKYL 605

Query: 630 NGP--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVN 687
           + P    + FGYGLSYTQFK   L  +                     A +    G L  
Sbjct: 606 DVPWTPQFCFGYGLSYTQFKITNLQLS---------------------APRISATGKLTA 644

Query: 688 DLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
                      V+ +NVG   G +VV +Y    A      +K++ GFQR+ ++ G  KR+
Sbjct: 645 S----------VEVENVGKRAGDEVVQLYIHDVAASMTRPVKELKGFQRITLQPGEKKRV 694

Query: 748 KFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           +FV  + + L   +         GE  + VG
Sbjct: 695 EFVLTS-EELGFWNREMRFAAEPGEFKVMVG 724


>gi|294146775|ref|YP_003559441.1| beta-glucosidase [Sphingobium japonicum UT26S]
 gi|292677192|dbj|BAI98709.1| beta-glucosidase [Sphingobium japonicum UT26S]
          Length = 791

 Score =  255 bits (651), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 220/737 (29%), Positives = 340/737 (46%), Gaps = 109/737 (14%)

Query: 78  VQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNE 137
           V  L  +A    RLG+P   +  E LHG + VG           ATSFP  I   +S++ 
Sbjct: 125 VNALQRWATTQTRLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDP 172

Query: 138 SLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVN 197
            L +++   ++ E R+     R      SP +++ARDPRWGRI ET GEDP++VG   V 
Sbjct: 173 DLLREVNAVIAREIRS-----RGVSLVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVA 227

Query: 198 YVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDMEET 256
            V GLQ   G   +  L   P KV +  KH   +   ++   V      A V+E+++ E 
Sbjct: 228 AVEGLQ---GKGRSRLLP--PGKVFATLKHLTGHGQPESGTNVG----PAPVSERELREN 278

Query: 257 FLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM 316
           F  PFE  VK     +VM SYN ++G+PS A+  LL   +RGEW   G +V+D  ++  +
Sbjct: 279 FFPPFEQVVKRTGIEAVMASYNEIDGVPSHANRWLLRDVLRGEWGFRGAVVSDYSAVDQL 338

Query: 317 VDNHKFLADSKEDAVAQTLKAGLDLDC--GQYYTNFTGNAVQQGKVKETDIDKSLKYLYT 374
           +  H   AD  E A  + L AG+D D   G  Y    G  V++GK+ E  +D++++++  
Sbjct: 339 MSIHHVAAD-LEQAAGRALDAGVDADLPDGLSYATL-GRQVREGKIGEALVDRAVRHMLE 396

Query: 375 VLMRLGFFDGSPQYVSLGKQDICSD-ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
           +  R G F+ +P   +   + I +D     LA +AA+  I+LLKND   LPL      ++
Sbjct: 397 LKFRAGLFE-NPYADAAASEKITNDARARALALKAAQRSIILLKND-GMLPLKPE--GSI 452

Query: 434 AVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG----YANVTYKTGCD---------DVA 480
           AV+GP  +A VA +G Y G P   +S + G        A + +  G           D  
Sbjct: 453 AVIGP--SAAVARLGGYYGQPPHSVSILEGIRAKVGNRAKIVFAQGVRITENDDWWADKV 510

Query: 481 CKSNNS-----IFAASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQTQL 529
            +S+ +     I  A EAA+  D  ++  G       E        DR  L L G Q +L
Sbjct: 511 TRSDPAENRRLIAQAVEAARHVDRIVLTLGDTEQSSREGWADNHLGDRPSLDLVGEQQEL 570

Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
            + +  + K P+ +V+++  G   +  + +    AIL   Y GE+GG A+ADV+FG  NP
Sbjct: 571 FDALKALGK-PIAVVLIN--GRPASTVKVSEQADAILEGWYLGEQGGHAVADVLFGDVNP 627

Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG--RTYKFYNGPTLYPFGYGLSYTQFK 647
           GG+LP+T         +P ++  L P+     P   R Y F     LYPFG+GLSYT F 
Sbjct: 628 GGKLPVT---------IPRSAGQL-PMFYNVKPSARRGYLFDTTDPLYPFGFGLSYTSFD 677

Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
            +                                P +    +         VD +N G  
Sbjct: 678 LS-------------------------------APRLSAAKISVGGMTRVSVDVRNSGRR 706

Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
           +G +VV +Y +         IK++ GFQRV ++ G  + + F     ++L + +   + +
Sbjct: 707 EGDEVVQLYVRDKVGSVTRPIKELKGFQRVTLKPGEVRTVTFTI-GPEALQMWNDHMDRV 765

Query: 768 LPAGEHTIFVGNGGVSF 784
           +  G+  I  GN  V+ 
Sbjct: 766 VEPGDFEIMTGNSSVAL 782


>gi|110640149|ref|YP_680359.1| b-glucosidase [Cytophaga hutchinsonii ATCC 33406]
 gi|110282830|gb|ABG61016.1| candidate b-glucosidase, Glycoside Hydrolase Family 3 protein
           [Cytophaga hutchinsonii ATCC 33406]
          Length = 745

 Score =  255 bits (651), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 191/659 (28%), Positives = 311/659 (47%), Gaps = 84/659 (12%)

Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDP 175
           DVI G  T FP  +   AS++  L +K     + E+ +     R     ++P +++ RD 
Sbjct: 103 DVIHGYKTIFPIPLGLAASWDSVLVEKTAMIAAQESYS-----RCINWTFAPMVDICRDA 157

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RWGRI E+PGEDP++    A  Y+ G Q   G+  A     +P ++ +C KH+AAY    
Sbjct: 158 RWGRIAESPGEDPYLASVLARAYINGFQ---GNNPA-----QPGRILACSKHFAAYGAAE 209

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
                R +    ++   +   +L+PF   V+ G A++ M S+N +NG+P+  +  LL   
Sbjct: 210 G---GRDYNTVSMSRSTLWNMYLKPFHASVQAG-AATFMTSFNDLNGVPASGNAYLLKDV 265

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNA 354
           +R +W   G++V+D +S+  M+  H +  D K DA  +   AGLD++   Q Y +     
Sbjct: 266 LRNQWKFPGFVVSDWNSVTEMI-THGYCTDEK-DAALKAFSAGLDMEMTSQAYAHHLKTL 323

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
           + + K+ E  +D+ +K +  + +  G F+ +P +    K  +     + LA ++A +  V
Sbjct: 324 IAEKKITEQQLDELVKNILRIKLYAGIFE-NPYFKEKEKFTLLDSAALTLAKKSAVKSFV 382

Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYMSPIAGFS---GYAN 469
           LLKN  NTLPL  A  K +AV+GP A A    +G +   G      +P+A      G  N
Sbjct: 383 LLKNHNNTLPL--AATKKIAVIGPLAEAPKEQLGTWIFDGDKTNSQTPLAALKKMYGAEN 440

Query: 470 VTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQL 529
           + Y  G      +S++   AA +AAK +D  +  AG +  +  E+  R D+ LPG Q +L
Sbjct: 441 IKYVQGLTHSRDESHDDFNAAYKAAKKSDVVLFFAGEEAILSGEAHSRADIRLPGAQERL 500

Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
           I ++ +  K P++LVIM+  G  I       N+ A++ A +PG   G A+ADV+ GK N 
Sbjct: 501 IRKLHKAGK-PIVLVIMA--GRPITIEHILPNVSAVVMAWHPGTMAGPALADVLSGKENF 557

Query: 590 GGRLPITWYNGDYVQMLPL---TSMPLRPVDSLGYPG---------------RTYKFYNG 631
            GRLP+TW     V  +P+    +   RP DS+ + G                ++    G
Sbjct: 558 SGRLPVTW--PKTVGQIPIYYNHTNTGRPADSVSFVGIKDIPIEAWQSSLGNNSHYLDAG 615

Query: 632 PT-LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
            T  YPFGYGLSYT+F                    C N              +  N L 
Sbjct: 616 YTPQYPFGYGLSYTKFV-------------------CTN------------SSIEKNTLT 644

Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
             D     +   N GS  G + + +Y +         ++++  F +V ++AG  K ++F
Sbjct: 645 VKDSLIVTLSVSNAGSRSGIETIQLYVQDVTASLVRPVRELKAFAQVELKAGETKTVRF 703


>gi|340616356|ref|YP_004734809.1| xylosidase/arabinosidase [Zobellia galactanivorans]
 gi|339731153|emb|CAZ94417.1| Xylosidase/arabinosidase, family GH3 [Zobellia galactanivorans]
          Length = 738

 Score =  255 bits (651), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 221/790 (27%), Positives = 367/790 (46%), Gaps = 130/790 (16%)

Query: 42  FSKLGLQMSSFLFC-DSSLPYSIRVKDLVSRMTLDEKVQQLG-DFAHGVPRLGLPQYEWW 99
           F+ L L M +  F  D + P   +++ L+S+M+L+EKV QL   + +   RLG+P     
Sbjct: 11  FTLLALVMFNMGFAQDKARPSDKKIEKLISKMSLEEKVHQLATQYPNANMRLGIPNLSA- 69

Query: 100 SEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR 159
           +E LHG+            +  AT FP  I   ++++  L +++G  V+ E+RA      
Sbjct: 70  NECLHGIK-----------MDSATVFPQAIAMASTWDTELIERMGHTVAKESRAF----- 113

Query: 160 AGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
            G+   ++P + V RD RWGR  E+ GEDP++VG+   +Y+RGLQ + G E   + +   
Sbjct: 114 -GIHQCYTPMLAVVRDVRWGRTEESYGEDPYLVGKIGSSYIRGLQGM-GAERFDENH--- 168

Query: 219 LKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYN 278
             + +  KH+ A D +   G +    D  ++E  ++   L PF M ++E +  ++M +++
Sbjct: 169 --IMATAKHFVA-DGEPMAGDNGAAHD--ISEYTLQNVHLYPFRMAIEEAEVGAIMPAHH 223

Query: 279 RVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAG 338
            +NGIP  A+  ++   +R EW   G +V+D   ++ +     ++ D  E A  + L+AG
Sbjct: 224 LLNGIPCHANKHVMQTVLRDEWGWDGLVVSDNGDMRSLKRVFNYVPDY-EHAAKKGLEAG 282

Query: 339 LDLDCGQY--------YTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD------- 383
           +  +   +        + ++  +AV +  V    +D ++K++      LG FD       
Sbjct: 283 IHQELALFQGWSDHRMFGDYLISAVNKKIVPVALVDDAVKHVLQAKFDLGLFDTDIKNDE 342

Query: 384 ----------GSPQ--------------YVSLGKQD----ICSDENIELAAEAAREGIVL 415
                     G P               YV + K+D    +    + +LA E A++ IVL
Sbjct: 343 RFDVLKNPDNGEPDKVSQHDAEMFKKALYVGIPKKDWKKTVFDQSHNDLALEVAQKSIVL 402

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA-GIPCRYMSPIAGFSGY----ANV 470
           LKN+ + LPL   K K ++VVGP  N     +G Y+   P  Y++ + G   Y      V
Sbjct: 403 LKNEGDLLPLKKEKYKKISVVGP--NGKAMRLGGYSPDNPKYYINIVEGIQNYLGSDREV 460

Query: 471 TYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLI 530
            ++ GCD     +N  I  A   A+++D TI+  G       E+ DR+DL LPG Q +L+
Sbjct: 461 AFEEGCDFTDSTAN--IPKAVALAESSDITIVAIGGSEETCRENEDRDDLSLPGPQQKLV 518

Query: 531 NQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPG 590
             +    K P ++V+++   + I +   N+  +AI+   Y G+E G+AIA+++FGK NP 
Sbjct: 519 EAIHATGK-PYVVVLLNGRPLSIEWIAENS--QAIVEGWYLGQETGKAIANILFGKVNPS 575

Query: 591 GRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG--PTLYPFGYGLSYTQFKY 648
           G+LPIT+     V  +PL    L         GR  + YN     L+PFGYGLSYT F  
Sbjct: 576 GKLPITFPRN--VGQVPLFYNKLE-------TGRPRQIYNSDPEPLFPFGYGLSYTSF-- 624

Query: 649 NLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTD 708
                         +L   R  N T          +  N+L         +   N G+  
Sbjct: 625 --------------ELGEPRLSNET----------IAANELTT-----VNIPITNTGTRS 655

Query: 709 GSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL 768
           G  VV +Y            K++  F+RV ++ G  K+I     A + L   +    T+ 
Sbjct: 656 GETVVQLYVHDVLSERVRPQKELRNFKRVALKPGETKQISIKIGA-QQLEYWNDGKWTIE 714

Query: 769 PAGEHTIFVG 778
           P G+  I VG
Sbjct: 715 P-GQFDIMVG 723


>gi|423258860|ref|ZP_17239783.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
           CL07T00C01]
 gi|423264169|ref|ZP_17243172.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
           CL07T12C05]
 gi|387776440|gb|EIK38540.1| hypothetical protein HMPREF1055_02060 [Bacteroides fragilis
           CL07T00C01]
 gi|392706435|gb|EIY99558.1| hypothetical protein HMPREF1056_00859 [Bacteroides fragilis
           CL07T12C05]
          Length = 805

 Score =  255 bits (651), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 232/807 (28%), Positives = 348/807 (43%), Gaps = 145/807 (17%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSE 101
           + + S P   RV+ L+S+MTL+EKV Q+            G+     P+L     E+   
Sbjct: 40  YENPSAPVEYRVEHLLSQMTLEEKVGQMLTSLGWPMYERVGEDIRLTPQLEKEIGEYHIG 99

Query: 102 ALHGVSNVGPGT--------------------------HFDDVIP--------------G 121
           +L G     P T                          H    IP              G
Sbjct: 100 SLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIG 159

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
            T FPT I   +++N  L +++G+ ++ EA A     +     + P +++ARDPRW R+ 
Sbjct: 160 TTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVE 214

Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
           ET GEDP++ G      VRG Q     E   D  S    V +  KH+A+Y    W     
Sbjct: 215 ETYGEDPYLNGAMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGH 263

Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
               A + E+++EE    PF   V  G A SVM SYN ++G P      LL   ++  W 
Sbjct: 264 NGGTAHIGERELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQ 322

Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKV 360
             G++V+D  ++  + ++   +A +  +A  + + AG+D D G   Y      AV++G V
Sbjct: 323 FKGFVVSDLYAVGGLREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDV 380

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
               IDK+++ + ++  ++G FD          Q + S E+  LA E AR+ IVLLKN  
Sbjct: 381 AVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIVLLKNKD 440

Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKT 474
             LPL    ++T+AV+GP+A+    M+G+Y      G     +  I    S    V Y  
Sbjct: 441 KLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAK 499

Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAG----LDLSVE------------------- 511
           GC  V   S      A E A+ ADA +++ G     D S E                   
Sbjct: 500 GC-AVRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMES 558

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
            E  DR  L L G Q +L+ +++ + K PV+LV++   G  +         +AI+ A YP
Sbjct: 559 GEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIK--GRPLLMEGAIQEAEAIVDAWYP 615

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G +GG A+ADV+FG +NP GRL ++      V  LP+     R     G   R Y    G
Sbjct: 616 GMQGGNAVADVLFGDYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YIEEPG 668

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
              YPFGYGLSYT F Y  +     +QV                           +D R 
Sbjct: 669 TPRYPFGYGLSYTTFSYTDMK----VQVTEGS-----------------------DDCRV 701

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
           D      V  QN G+ DG +V  +Y +       T  KQ+  F R+ ++AG ++ + F  
Sbjct: 702 D----VTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTL 757

Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVG 778
           +  KSL +       ++  G  TI VG
Sbjct: 758 DK-KSLALYMQEGEWVVEPGRFTIMVG 783


>gi|410096731|ref|ZP_11291716.1| hypothetical protein HMPREF1076_00894 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409225348|gb|EKN18267.1| hypothetical protein HMPREF1076_00894 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 746

 Score =  255 bits (651), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 216/762 (28%), Positives = 349/762 (45%), Gaps = 108/762 (14%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFA--------HGVPR-------LGLPQYEWWSEALHGV-- 106
           RV  L+ +MTL EK+ Q+   +         G+ R       L L   E  ++A      
Sbjct: 32  RVNALLGQMTLQEKIGQMNQLSPFGGLEEMAGLIREGNVGSLLNLTDPELVNKAQRIAVE 91

Query: 107 -SNVG-PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLT 163
            S +G P     DVI G  T FP  +   A+FN  L +   +  + EA A       G+ 
Sbjct: 92  ESRLGIPLLMSRDVIHGYKTIFPIPLGQAATFNPQLVEDGARVAAVEASA------DGIR 145

Query: 164 Y-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVS 222
           + ++P I+++RDPRWGRI E+ GEDP++     V  V+G Q         D  + P  V+
Sbjct: 146 WTFAPMIDISRDPRWGRIAESCGEDPYLSSVMGVAMVKGFQG--------DSLNNPTAVA 197

Query: 223 SCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNG 282
           +C KH+  Y         R +    + E+ +   +  PFE   K G  ++ M S+N  +G
Sbjct: 198 ACAKHFVGYGASEG---GRDYNSTFIPERQLRNVYFPPFEAAAKAG-CATFMTSFNDNDG 253

Query: 283 IPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD 342
           IPS  +  +L   +RGEW+  G +V D  S   M+ +H F  D KE A+ +++ AG++++
Sbjct: 254 IPSTGNSFILKDVLRGEWNYDGLVVTDWASSAEMI-SHGFCKDEKEAAM-KSVNAGINME 311

Query: 343 --CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDE 400
              G +  N     V++ KV E  ID++++ +  +  RLG FD    Y    +Q   +  
Sbjct: 312 MVSGTFIRNLE-ELVKEKKVSEAAIDEAVRNILRLKFRLGLFDNP--YTDTDQQVKYAPT 368

Query: 401 NIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYM 458
           ++  A EAA + ++LLKND+ TLP    K++T+AV+GP A+A    +G +   G      
Sbjct: 369 HLAKAKEAAEQSVILLKNDRETLPFTD-KIRTLAVIGPLADAAHDQMGTWVFDGEKAHTQ 427

Query: 459 SPIAG----FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAES 514
           + +      +     + Y+ G      K    I  A  AA  ADA ++ AG +  +  E+
Sbjct: 428 TVLTALKEMYGDKVRIIYEPGLGYSRDKHTAGIAKAVNAAMHADAVLVCAGEESILSGEA 487

Query: 515 LDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEE 574
               DL L G Q++LI  +A+  K P++ V+M+  G  +   +      A+L+A +PG  
Sbjct: 488 HSLADLHLQGAQSELIAALAKTGK-PLVTVVMA--GRPLTIGQEVEQSDAVLYAFHPGTM 544

Query: 575 GGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPVDSL-----GYPG 623
           GG A+AD++FGK  P G+ P+T+     V  +P+      T  P    ++L        G
Sbjct: 545 GGPALADLLFGKAVPSGKTPVTFPK--MVGQIPVYYAHNNTGRPASRQETLIDDIPQEAG 602

Query: 624 RT----YKFYNGP---TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDA 676
           +T      FY       L+PFGYGLSYT F Y+ L                         
Sbjct: 603 QTSLGCTSFYMDAGFDPLFPFGYGLSYTTFGYDNLQLA---------------------- 640

Query: 677 SKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQR 736
                     N L  D   E   D  N G  +G+++V +Y +  A      +K++ GF+R
Sbjct: 641 ---------TNQLAVDGTLEISFDLTNTGKYEGTEIVQLYIQDKAGSITRPVKELKGFRR 691

Query: 737 VFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           + ++ G  K + F     + L   +     ++  GE  ++VG
Sbjct: 692 IPLKQGETKTVSFSL-PVEELAFWNIDRQRVVEPGEFNLWVG 732


>gi|336399370|ref|ZP_08580170.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
 gi|336069106|gb|EGN57740.1| Beta-glucosidase [Prevotella multisaccharivorax DSM 17128]
          Length = 862

 Score =  255 bits (651), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 164/464 (35%), Positives = 240/464 (51%), Gaps = 41/464 (8%)

Query: 45  LGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALH 104
           +G+      + D  L +  R KDL SR+TL+EK   + D +  +PRLG+  + WWSEALH
Sbjct: 16  VGVNAQQSPYQDPGLSFEARAKDLCSRLTLEEKASLMCDVSPAIPRLGIKPFNWWSEALH 75

Query: 105 GVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA---- 160
           G +N G      DV    T FP  I   ASFN ++  ++  A S EAR  YN   A    
Sbjct: 76  GYANNG------DV----TVFPEPIGMAASFNPTMVYQVFTATSDEARGKYNQSMAEGKE 125

Query: 161 -----GLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLN 215
                 L+ W+PN+N+ RDPRWGR  ET GEDP++     V  V+GLQ  E        +
Sbjct: 126 DTRFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGVEVVKGLQGPE--------S 177

Query: 216 SRPLKVSSCCKHYAAYDVDNWKGVDRYHFD-ARVTEQDMEETFLRPFEMCVKEGDASSVM 274
           ++  K+ +C KH+A +    +    R+  + A ++ +D+ ET+L  F+  V++     VM
Sbjct: 178 TKYRKLYACAKHFAVHSGPEYT---RHTANLADISPRDLWETYLPAFKATVQQAGVREVM 234

Query: 275 CSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQT 334
           C+Y R++  P C + +LL Q +R EW     +V+DC +I     NH   +D+   A   T
Sbjct: 235 CAYQRLDDEPCCGNSRLLQQILRDEWGFRHMVVSDCGAIADFYTNHHVSSDAVHAAAKGT 294

Query: 335 LKAGLDLDCGQYYTNFT-GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK 393
           L AG D++CG  Y       AV++G V E ++DK +  L      LG  D  P+ VS  K
Sbjct: 295 L-AGTDVECGFGYAYMKLPEAVRRGLVSEAEVDKHVIRLLKGRFELGVMD-DPKLVSWTK 352

Query: 394 ---QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
              + + SD + +LA   AR+ + LL+N  N LPL  AK + +AVVGP+A     + GNY
Sbjct: 353 ISPKVVDSDAHRQLALNMARQTMTLLQNRNNVLPL--AKGEKIAVVGPNAADGPMLWGNY 410

Query: 451 AGIPCRYMSPIAGFSGYA--NVTYKTGCDDVACKSNNSIFAASE 492
            G P R  + + G    A  ++ Y  GCD V      S+ A  E
Sbjct: 411 NGTPSRTTTILEGIRAKAGKDIPYLQGCDLVNKNVLTSLLAECE 454



 Score =  109 bits (273), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 80/280 (28%), Positives = 129/280 (46%), Gaps = 50/280 (17%)

Query: 503 LAGLDLSVEAESL---DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETN 559
           L G ++ V  E     DR  + LP  Q   +  +    K    +V ++  G  IA     
Sbjct: 613 LEGEEMPVHVEGFKGGDRTSIELPAVQRDFLKALKAAGK---TVVFVNCSGSAIALTPEV 669

Query: 560 TNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSL 619
            +  AIL A Y GEEGGRA+ADV++G +NPGG+LP+T+Y          ++  L   D  
Sbjct: 670 ESCDAILQAWYAGEEGGRAVADVLYGDYNPGGKLPVTFYR---------STTQLPAFDDY 720

Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
              GRTY++++   L+PFGYGLSYT+F     S +                         
Sbjct: 721 SMKGRTYRYFSD-ALFPFGYGLSYTRFAIGKGSLSAPA---------------------- 757

Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
                    ++ D      V   NVG   G +VV VY +   + A   +K +  F+RV +
Sbjct: 758 ---------MKADGKVTLTVPVSNVGKRTGDEVVQVYVRDVND-ADGPLKSLKAFRRVSL 807

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTL-LPAGEHTIFVG 778
           +AG ++++     A ++ ++ D A+NT+    G++ ++ G
Sbjct: 808 KAGESRKVTIPLTA-ETFSLFDSASNTVRTKPGKYVVYYG 846


>gi|53714352|ref|YP_100344.1| beta-glucosidase [Bacteroides fragilis YCH46]
 gi|52217217|dbj|BAD49810.1| periplasmic beta-glucosidase precursor [Bacteroides fragilis YCH46]
          Length = 859

 Score =  254 bits (650), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 214/769 (27%), Positives = 338/769 (43%), Gaps = 146/769 (18%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-------------------------- 83
           ++F + ++SLP  +RV+DL+SRMTL+EK+ Q+                            
Sbjct: 22  TNFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNY 81

Query: 84  -FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
            F  G+                      RLG+P +   +E+LHG            V  G
Sbjct: 82  GFIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDG 129

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
           +T FP  I   ++FN  L  ++  A++ E  A       G+T   +P I+V RD RWGR+
Sbjct: 130 STIFPQAIALGSTFNPILAYEMTSAIAKELTAQ------GITQSLTPVIDVCRDLRWGRV 183

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            E  GEDP++V R  V+ VRG  D +              VS   KH+ A+      G++
Sbjct: 184 EECFGEDPYLVSRMGVSQVRGYLDNQ--------------VSPMIKHFGAHGAPQ-GGLN 228

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
                    ++++   +L+ FE  VKE    +VM SYN  N  P+ +   L+ + +R  W
Sbjct: 229 LASVSC--GQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRW 286

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
           D  GY+ +D  +I ++   HK   +S E A+ Q L AGLD +            V+ G +
Sbjct: 287 DFQGYVYSDWGAIGMLNYFHKTAQNSAEAAI-QALTAGLDAEASDNSYAELQQLVENGML 345

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
               ID+++  + T    +G F+          + + +  ++ LA + A E IVLL+N+ 
Sbjct: 346 DVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405

Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------GIPCRYMSPIAGFSG-YANVTY 472
           N LPL   K+K++AV+GP  NA     G+Y        G+    +  +    G    + Y
Sbjct: 406 NILPLQMNKLKSIAVIGP--NADQVQFGDYTWSRDNKDGVTL--LEALKERVGNQLTLNY 461

Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---------ESLDREDLWLP 523
             GC D+     +    A + AK +D  I++ G   +  A         E  D  DL L 
Sbjct: 462 AKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLT 520

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
           G Q  L+  +    K PVI+V++S  G  +A +    NI  I+   YPGE+GG A+AD++
Sbjct: 521 GVQEDLVEAIHATGK-PVIVVLLS--GKPLAMSWIKENIPGIVVQWYPGEQGGLALADML 577

Query: 584 FGKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
            GK NP G+L  ++         Y   LP      R   S   PG+ Y F +   L+ FG
Sbjct: 578 LGKVNPSGKLNYSFPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFG 637

Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
           +GLSYT F+Y  LS T + +                             D  C+D  E  
Sbjct: 638 HGLSYTDFEY--LSATTSKE-----------------------------DYACEDVIEVT 666

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
           +  +N G  DG +V  VY +         ++++ GF++V ++ G  K++
Sbjct: 667 IAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQV 715


>gi|365121891|ref|ZP_09338802.1| hypothetical protein HMPREF1033_02148 [Tannerella sp.
           6_1_58FAA_CT1]
 gi|363644131|gb|EHL83433.1| hypothetical protein HMPREF1033_02148 [Tannerella sp.
           6_1_58FAA_CT1]
          Length = 855

 Score =  254 bits (650), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 167/431 (38%), Positives = 236/431 (54%), Gaps = 41/431 (9%)

Query: 40  GRFSKLGL--QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYE 97
           G F  L L  Q    L+ D + P   RV DL+SRMT++EKV  +   A G+PRL + +Y 
Sbjct: 16  GLFMALTLHAQNEQPLYKDMNAPIHDRVMDLLSRMTVEEKVSLMIHNAPGIPRLEIDKYY 75

Query: 98  WWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYN- 156
             +EALHG+  V PG          T FP  I   AS+N  L  KI  A+S EAR  +N 
Sbjct: 76  HGNEALHGI--VRPGKF--------TVFPQAIGMAASWNPELIYKISTAISDEARGKWNA 125

Query: 157 --LGRAGL-------TYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEG 207
             LG+  L       ++WSP +N+ARDPRWGR  ET GEDP + G     +V+GLQ   G
Sbjct: 126 LGLGKKQLDGSSDLLSFWSPTVNMARDPRWGRTPETYGEDPHLTGTLGCAFVKGLQ---G 182

Query: 208 HENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKE 267
           +      + + LK  +  KH+AA + ++    +R H +A ++E+D+ E +L  FE C+ E
Sbjct: 183 N------HPKYLKAVATPKHFAANNEEH----NRAHCNAVISERDLREYYLPSFEKCIVE 232

Query: 268 GDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSK 327
           G A S+M +YN VNGIP   +  L+ + +R +W   GY+V DC +   MV  HK++ D +
Sbjct: 233 GKAQSIMTAYNAVNGIPCTVNTYLIKKVLREDWGFQGYVVTDCSAPAWMVTQHKYVKDYE 292

Query: 328 EDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP 386
             AV    KAG D++C    YT    NA    +V + DID    +L    M LG FD   
Sbjct: 293 TAAVLMA-KAGSDMECADNVYTQPLLNAYYNYRVSDADIDSIAYHLLRGRMLLGLFDDPE 351

Query: 387 Q--YVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
           +  Y  +  + +   E+ ELA E AR+ +VLLKN+ N LP+N  K+K++AVVG   NA  
Sbjct: 352 KNPYNKISPEKVGCKEHQELALETARQSLVLLKNENNFLPINPKKIKSIAVVG--INADR 409

Query: 445 AMIGNYAGIPC 455
              G+Y+G P 
Sbjct: 410 CEFGDYSGTPV 420



 Score =  145 bits (365), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 107/342 (31%), Positives = 158/342 (46%), Gaps = 57/342 (16%)

Query: 456 RYMSPIAGFSG----YANVTYKTGCDDVACKSNNSIFA-ASEAAKTADATIILAGLDLSV 510
           RY   +  F G    +A + +K    D+  +   ++F  A +AAK  D T+ + G+D S+
Sbjct: 562 RYKIKVEYFDGGGDCFARLYWK--APDLDSRDRINLFGEAGKAAKECDITVAVLGIDKSI 619

Query: 511 EAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGY 570
           E E  DR  L LP  Q + I ++ ++   P  +V++ AG   IA    + NI AI+ A Y
Sbjct: 620 EREGQDRYTLELPADQQEFIREIYKI--NPKTVVVLVAGS-SIAINWIDENIPAIIDAWY 676

Query: 571 PGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP-GRTYKFY 629
           PGE+GG A+A+ +FGK+NPGGRLP+T+YN        +  +P  P D      GRTY+++
Sbjct: 677 PGEQGGTAVAEALFGKYNPGGRLPLTFYNS-------MDELP--PFDDYAVKKGRTYQYF 727

Query: 630 NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
            G  LY FGYGLSYT+F Y                   R LN  S               
Sbjct: 728 TGKPLYEFGYGLSYTKFNY-------------------RKLNIASK-------------- 754

Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
              D    +    N G  DG +V  VY + P       IKQ+ GF+RV ++ G+ + +  
Sbjct: 755 --QDTINIQFSISNTGKYDGDEVAQVYVQYPETGTYMPIKQLKGFKRVHIKKGQTQNVSI 812

Query: 750 VFNACKSLNIVDYAANTLL-PAGEHTIFVGNGGVSFPIHLNF 790
                K L   D      + P+G +   VG+      +   F
Sbjct: 813 SIPK-KELRYWDEKTRKFVTPSGNYIFQVGSSSQRINLQKTF 853


>gi|423269263|ref|ZP_17248235.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
           CL05T00C42]
 gi|423273173|ref|ZP_17252120.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
           CL05T12C13]
 gi|392701685|gb|EIY94842.1| hypothetical protein HMPREF1079_01317 [Bacteroides fragilis
           CL05T00C42]
 gi|392708205|gb|EIZ01313.1| hypothetical protein HMPREF1080_00773 [Bacteroides fragilis
           CL05T12C13]
          Length = 805

 Score =  254 bits (650), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 232/807 (28%), Positives = 348/807 (43%), Gaps = 145/807 (17%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSE 101
           + + S P   RV+ L+S+MTL+EKV Q+            G+     P+L     E+   
Sbjct: 40  YENPSAPVEYRVEHLLSQMTLEEKVGQMLTSLGWPMYERVGEDIRLTPQLEKEIGEYHIG 99

Query: 102 ALHGVSNVGPGT--------------------------HFDDVIP--------------G 121
           +L G     P T                          H    IP              G
Sbjct: 100 SLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIG 159

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
            T FPT I   +++N  L +++G+ ++ EA A     +     + P +++ARDPRW R+ 
Sbjct: 160 TTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVE 214

Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
           ET GEDP++ G      VRG Q     E   D  S    V +  KH+A+Y    W     
Sbjct: 215 ETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGH 263

Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
               A + E+++EE    PF   V  G A SVM SYN ++G P      LL   ++  W 
Sbjct: 264 NGGTAHIGERELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQ 322

Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKV 360
             G++V+D  ++  + ++   +A +  +A  + + AG+D D G   Y      AV++G V
Sbjct: 323 FKGFVVSDLYAVGGLREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDV 380

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
               IDK+++ + ++  ++G FD          Q + S E+  LA E AR+ IVLLKN  
Sbjct: 381 AVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIVLLKNKD 440

Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKT 474
             LPL    ++T+AV+GP+A+    M+G+Y      G     +  I    S    V Y  
Sbjct: 441 KLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAK 499

Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAG----LDLSVE------------------- 511
           GC  V   S      A E A+ ADA +++ G     D S E                   
Sbjct: 500 GC-AVRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMES 558

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
            E  DR  L L G Q +L+ +++ + K PV+LV++   G  +         +AI+ A YP
Sbjct: 559 GEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIK--GRPLLMEGAIQEAEAIVDAWYP 615

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G +GG A+ADV+FG +NP GRL ++      V  LP+     R     G   R Y    G
Sbjct: 616 GMQGGNAVADVLFGDYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YIEEPG 668

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
              YPFGYGLSYT F Y  +     +QV                           +D R 
Sbjct: 669 TPRYPFGYGLSYTTFSYTDMK----VQVTEGS-----------------------DDCRV 701

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
           D      V  QN G+ DG +V  +Y +       T  KQ+  F R+ ++AG ++ + F  
Sbjct: 702 D----VTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTL 757

Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVG 778
           +  KSL +       ++  G  TI VG
Sbjct: 758 DK-KSLALYMQEGEWVVEPGRFTIMVG 783


>gi|255690204|ref|ZP_05413879.1| xylosidase/arabinosidase [Bacteroides finegoldii DSM 17565]
 gi|260624223|gb|EEX47094.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 954

 Score =  254 bits (650), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 223/749 (29%), Positives = 350/749 (46%), Gaps = 109/749 (14%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQL--GDFAHGVPRLGLPQYEWWSEALHGVSNVGP 111
           + D+SLP   RV+ L++ MT  +K++ +  G    G+P L +P      EA+HG S    
Sbjct: 170 YMDASLPVDERVESLLAAMTPADKMELIREGWGIPGIPHLYVPPITK-VEAVHGFSYGS- 227

Query: 112 GTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINV 171
                    GAT FP  +   A++N  L +++  A+  E   + N  +A    WSP ++V
Sbjct: 228 ---------GATIFPQALAMGATWNRQLTEEVAMAIGDET-VIANTKQA----WSPVLDV 273

Query: 172 ARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY 231
           A+D RWGR  ET GEDP +V +    +++G Q       +  L + P       KH+  +
Sbjct: 274 AQDARWGRCEETFGEDPVLVSQMGGAWIKGYQ-------SKGLFTTP-------KHFGGH 319

Query: 232 DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKL 291
                 G D +  D  ++E++M E  L PF   ++  D  S+M +Y+   GIP     +L
Sbjct: 320 GAP-LGGRDSH--DIGLSEREMREVHLVPFRHVIRNYDCQSLMMAYSDYMGIPIAKSTEL 376

Query: 292 LNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNF- 350
           L + +R EW  +G+IV+DC +I  +     + A  K +A  Q L AG+  +CG  Y N  
Sbjct: 377 LQRILRQEWGFNGFIVSDCGAIGNLTARKHYTAKDKIEAANQALAAGIATNCGDTYNNKE 436

Query: 351 TGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-QYVSLGK--QDICSDENIELAAE 407
              A + G++   ++D   + +   + R   F+ +P + +   K      SD +  +A  
Sbjct: 437 VIQAAKDGRINMENLDNVCRTMLATMFRNELFEKNPCKPLDWNKIYPGWNSDSHKAMAHR 496

Query: 408 AAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGI--PCRYMSPIAGF- 464
           AA E IV+L+N  N LPL S +++T+AV+GP A+      G+Y     P +  S + G  
Sbjct: 497 AACESIVMLENKDNLLPL-SKELRTIAVLGPGADDLQP--GDYTPKLQPGQLKSVLTGIK 553

Query: 465 ---SGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE---------- 511
              S    V Y+ GCD       + I  A + A  AD  +++ G D S+           
Sbjct: 554 AAVSKQTKVLYEKGCDFTETGMTD-IPKAVKTASQADVVVMVLG-DCSISEATKDVRKTC 611

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
            E+ D   L LPG Q +L+  V    K PVIL++ +    D+  A  +   KAIL    P
Sbjct: 612 GENNDLATLVLPGKQQELLEAVCATGK-PVILILQAGRPYDLLKA--SEMCKAILVNWLP 668

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G+EGG A ADV+FG +NPGGRLP+T+    +V  LPL         +    GR Y++ + 
Sbjct: 669 GQEGGPATADVLFGDYNPGGRLPMTFPR--HVGQLPLYY-------NFKTSGRRYEYVDM 719

Query: 632 P--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDL 689
               LY FGYGLSYT F+Y+ L           K+Q   N N T +A+            
Sbjct: 720 EYYPLYRFGYGLSYTSFEYSGL-----------KVQEKPNGNVTVEAT------------ 756

Query: 690 RCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
                       +NVG   G +V  +Y         T + ++  F R+ +  G +K + F
Sbjct: 757 -----------VKNVGGRAGDEVAQLYVTDMYASVKTRVMELKDFARIHLNPGESKTVSF 805

Query: 750 VFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                  L++++   + ++  GE  I VG
Sbjct: 806 ELTPY-DLSLLNDHMDRVVEKGEFKICVG 833


>gi|119476117|ref|ZP_01616469.1| periplasmic beta-glucosidase [marine gamma proteobacterium
           HTCC2143]
 gi|119450744|gb|EAW31978.1| periplasmic beta-glucosidase [marine gamma proteobacterium
           HTCC2143]
          Length = 748

 Score =  254 bits (650), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 216/766 (28%), Positives = 358/766 (46%), Gaps = 111/766 (14%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVP---------RLGLPQY-------------EWWSE 101
           RV+ L+++MTL EK+ Q+   AHG            L L Q              E    
Sbjct: 20  RVEILLAKMTLAEKIGQMAQ-AHGSEDGVSDDQRRALELGQLGSVLNIVSIDVICELQRI 78

Query: 102 ALHGVSNVGPGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA 160
           AL       P     DVI G  T FP  +   AS+N  L ++  +  + EA  +      
Sbjct: 79  ALEDSRLGIPLLIGRDVIHGYKTIFPIPLGQAASWNPELIEQGARVAALEAATV------ 132

Query: 161 GLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPL 219
           G+ + ++P I++ RDPRWGRI E+ GEDP++ G      VRG Q         DL++   
Sbjct: 133 GVNWTFAPMIDITRDPRWGRIAESLGEDPYLCGELGAAMVRGFQ-------GKDLSAIG- 184

Query: 220 KVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNR 279
            +++C KH+A Y      GVD  +  A + E ++   +L PF+  +  G  +S M ++N 
Sbjct: 185 SIAACAKHFAGYGAAE-GGVD--YNTAIIAENELRNVYLPPFKAALDSG-VASFMTAFND 240

Query: 280 VNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGL 339
           +NG+P+  +  LL Q +R EW   G +V+D +SI V +  H F A+ KE A  +   AG+
Sbjct: 241 LNGVPASGNEFLLKQILREEWCYQGMVVSDWESI-VQLTEHGFTANDKEAAF-EAANAGI 298

Query: 340 DLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDIC 397
           D++     Y+    + + +G++    +D+ +K +  +  RLG F+   PQ   L    + 
Sbjct: 299 DMEMVSNTYSQHLESLIIEGRISLAQVDEMVKNILRLKFRLGLFENPYPQPDKLPA--LV 356

Query: 398 SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY-----AG 452
           + ++ + A + A E +VLLKN   +LPL  + + ++A++GP A+     +G +     A 
Sbjct: 357 NHDHRQAAKKLALESVVLLKNSHQSLPLRLSALSSIALIGPLADDAYEQLGTWIFDGDAD 416

Query: 453 IPCRYMSPIAGFSGYA-NVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVE 511
                +  I  F+G +  V      +     +   I     AA+++DA ++  G +  + 
Sbjct: 417 DSETVLQAINAFAGDSLTVNVDRALETTRSNTFIDIDRTMAAAQSSDAIVLCLGEESILS 476

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
            E+  R D+ LPG Q QLI+ +A+ AK P+IL++M+  G  +       ++ AIL+A +P
Sbjct: 477 GEAHSRADISLPGAQEQLIHLLAKTAK-PMILIVMA--GRPLTLEPIIDHVDAILYAWHP 533

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITW---------YNGDYVQMLPLTS---------MPL 613
           G   G A+ D++FG+ +P G+LPIT+         Y G      P ++          P 
Sbjct: 534 GTMAGTALTDLLFGEVSPSGKLPITFPRMVGQVPIYYGKKNTGKPPSAESVVHMNDIAPR 593

Query: 614 RPVDSLGYPGRTYKFYNGPT-LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNY 672
               SLG     +    G T L+PFG+GLSYT F Y                    NL+ 
Sbjct: 594 AAQTSLGM--SAFHLDAGFTPLFPFGFGLSYTSFTY-------------------ENLHL 632

Query: 673 TSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVI 732
           +S            + +  D      VD  N G  +G +VV +Y++  A      +K++ 
Sbjct: 633 SS------------STMNIDGVITVTVDVINCGEREGQEVVQLYTRDLAANVTRPVKELK 680

Query: 733 GFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
            FQ+V + AG  +++KF+  A  +L   D   N ++  G   ++ G
Sbjct: 681 QFQKVHLSAGERQQVKFLLKAS-ALAFYDRKMNRIIEPGVFHLWTG 725


>gi|60682370|ref|YP_212514.1| hydrolase [Bacteroides fragilis NCTC 9343]
 gi|60493804|emb|CAH08594.1| putative exported hydrolase [Bacteroides fragilis NCTC 9343]
          Length = 859

 Score =  254 bits (650), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 215/769 (27%), Positives = 336/769 (43%), Gaps = 146/769 (18%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-------------------------- 83
           ++F + ++SLP  +RV+DL+SRMTL+EK+ Q+                            
Sbjct: 22  TNFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNY 81

Query: 84  -FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
            F  G+                      RLG+P +   +E+LHG            V  G
Sbjct: 82  GFIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDG 129

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
           +T FP  I   ++FN  L  ++  A++ E      L   G+T   +P I+V RD RWGR+
Sbjct: 130 STIFPQAIALGSTFNPILAYEMTSAIAKE------LSAQGITQSLTPVIDVCRDLRWGRV 183

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            E  GEDPF+V R  V+ VRG  D +              VS   KH+ A+      G++
Sbjct: 184 EECFGEDPFLVSRMGVSQVRGYLDNQ--------------VSPMIKHFGAHGAPQ-GGLN 228

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
                    ++++   +L+ FE  VKE    +VM SYN  N  P+ +   L+ + +R  W
Sbjct: 229 LA--SVLCGQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRW 286

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
           D  GY+ +D  +I ++   HK   +S E A+ Q L AGLD +            V+ G +
Sbjct: 287 DFQGYVYSDWGAIGMLNYFHKTAQNSAEAAI-QALTAGLDAEASDNSYAELQQLVENGML 345

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
               ID+++  + T    +G F+          + + +  ++ LA + A E IVLL+N  
Sbjct: 346 DVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKVVHAPAHVSLARKIAEESIVLLQNKN 405

Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------GIPCRYMSPIAGFSG-YANVTY 472
           N LPL   K+K++AV+GP  NA     G+Y        G+    +  +    G    + Y
Sbjct: 406 NILPLQMNKLKSIAVIGP--NADQVQFGDYTWSRDNKDGVTL--LEALKERVGNQLTLNY 461

Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---------ESLDREDLWLP 523
             GC D+     +    A + AK +D  I++ G   +  A         E  D  DL L 
Sbjct: 462 AKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLT 520

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
           G Q  L+  +    K PVI+V++S  G   A +    NI  I+   YPGE+GG A+AD++
Sbjct: 521 GVQEDLVEAIHATGK-PVIVVLLS--GKPFAMSWIKENIPGIVVQWYPGEQGGLALADML 577

Query: 584 FGKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
            GK NP G+L  ++         Y   LP      R   S   PG+ Y F +   L+ FG
Sbjct: 578 LGKVNPSGKLNYSFPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFG 637

Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
           +GLSYT F+Y  LS T + +                             D  C+D  E  
Sbjct: 638 HGLSYTDFEY--LSATTSKE-----------------------------DYACEDVIEVT 666

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
           +  +N G  DG +V  VY +         ++++ GF++V ++ G  K++
Sbjct: 667 IAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQV 715


>gi|423221630|ref|ZP_17208100.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392645869|gb|EIY39591.1| hypothetical protein HMPREF1062_00286 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 864

 Score =  254 bits (650), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 156/440 (35%), Positives = 232/440 (52%), Gaps = 38/440 (8%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
             F + D+SL    R  DL+ R+TL+EK   + + +  +PRL +  Y WW+EALHG++  
Sbjct: 25  EKFPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHGLART 84

Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY-------NLGR-AG 161
           G           AT FP  I   ASF++SL  ++  AVS EARA         NL R   
Sbjct: 85  GL----------ATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLTRYQA 134

Query: 162 LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
           LT W+PN+N+ RDPRWGR  ET GEDP++  R  V  V GLQ  +         +R  K+
Sbjct: 135 LTVWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPD--------TARYNKL 186

Query: 222 SSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
            +C KHYA +    W   +R+ F+A  ++ +D+ ET+L  F+  V+E     VMC+YNR 
Sbjct: 187 HACAKHYAVHSGPEW---NRHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCAYNRF 243

Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD-SKEDAVAQTLKAGL 339
            G P C   +LL Q +R EW   G +V+DC ++       K         A A  +  G 
Sbjct: 244 EGEPCCGSNRLLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAVLNGT 303

Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSD 399
           D++CG  Y +   +AV+ G + E  ID S+K L      LG  D +  +  +    + S 
Sbjct: 304 DVECGNSYKSLP-DAVKAGLITENQIDISVKRLLKARFELGEMDEN-VWTGISSDVVDSP 361

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           ++ +LA + ARE + LL+N+ N LPL  +K   +A++GP+AN +V   GNY G+P   ++
Sbjct: 362 KHRQLALQMARETMTLLQNNNNILPL--SKQAKIALIGPNANDSVMQWGNYNGLPSHTIT 419

Query: 460 PIAGFSGY---ANVTYKTGC 476
            + G   Y   +N+ Y+  C
Sbjct: 420 LLEGMQRYLPTSNLIYEPVC 439



 Score =  122 bits (306), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 92/326 (28%), Positives = 150/326 (46%), Gaps = 61/326 (18%)

Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
           F   AN+++     D+   +   +    E  K  D  I   G+  ++E E +        
Sbjct: 573 FDKTANLSF-----DMGVNAQIDVKGLLERIKDVDVVIFAGGISPALEGEEMPVDAAGFR 627

Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
             DR ++ LP  Q +++  +    K    +V ++  G  IA    + N +AIL A YPG+
Sbjct: 628 GGDRTEIELPAVQRRVVEALKTAGKR---IVFVNFSGAAIALEPESLNCEAILQAWYPGQ 684

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
            GG+A+A+V+FG +NP G+LP+T+Y         L  +P    +     GRTY++     
Sbjct: 685 AGGQAVAEVLFGDYNPAGKLPLTFYRN-------LAQIP--DFEDYNMTGRTYRYMKETP 735

Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
           L+PFG+GLSYT FKY  L      ++N +K+   +NLN                      
Sbjct: 736 LFPFGHGLSYTTFKYGKL------KMNDDKIAAGQNLNLV-------------------- 769

Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
                +   N GS DG +VV VY K   +     +K +  F+RV + AG+   +KF  + 
Sbjct: 770 -----IPVTNTGSRDGDEVVQVYLKKMDDTEGP-VKTLRAFKRVRIPAGKTVEVKFSLDD 823

Query: 754 CKSLNIVDYAANTL-LPAGEHTIFVG 778
            + L   D  +NT+ +  G +T+ +G
Sbjct: 824 TQ-LEWWDEQSNTMRVCPGNYTVMIG 848


>gi|163849391|ref|YP_001637435.1| glycoside hydrolase family 3 [Chloroflexus aurantiacus J-10-fl]
 gi|222527388|ref|YP_002571859.1| glycoside hydrolase family protein [Chloroflexus sp. Y-400-fl]
 gi|163670680|gb|ABY37046.1| glycoside hydrolase family 3 domain protein [Chloroflexus
           aurantiacus J-10-fl]
 gi|222451267|gb|ACM55533.1| glycoside hydrolase family 3 domain protein [Chloroflexus sp.
           Y-400-fl]
          Length = 702

 Score =  254 bits (650), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 211/732 (28%), Positives = 337/732 (46%), Gaps = 117/732 (15%)

Query: 64  RVKDLVSRMTLDEKVQQLGD-FAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFD------ 116
           RV  L+ +MTL+EK+ QL     HG+P L L +       ++    +  G  FD      
Sbjct: 7   RVNTLLGQMTLEEKIGQLNQPMIHGLPGLDLLRQGKAGSIINAFGALS-GQGFDHLNSAE 65

Query: 117 ----------------------DVIPGA-TSFPTVILTTASFNESLWKKIGQAVSTEARA 153
                                 D+I G  T FP  +   ASFN SL ++I Q  + EA A
Sbjct: 66  QCNALQRAALESRLGIPLLFGRDIIHGQRTVFPIPLAQAASFNPSLVEQINQIAAREASA 125

Query: 154 MYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
           +      G+ + ++P +++ARD RWGRI E  GEDP +  R A   VRG Q         
Sbjct: 126 L------GIRWTFAPMLDIARDARWGRIAEGYGEDPLLTSRMAAAAVRGFQG-------- 171

Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
           D  S+P ++ +C KHY  Y         R +  A ++E  + + +L PF   V  G   +
Sbjct: 172 DDVSQPDRLVACAKHYVGYGAAEG---GRDYEQAEISEPTLRDVYLPPFRAAVAAG-VGT 227

Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
           +M ++  +NG+P+ A+ +LL   +R EW   G++V+D +S+  +V +   +A+ +  A A
Sbjct: 228 IMSAFLDLNGMPATANRRLLTDVLRNEWGFDGFVVSDWESVGELVQHG--IAEDRAHAAA 285

Query: 333 QTLKAGLDLD--CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVS 390
             L+AG+D+D   G Y      N V+ G+V   +ID++++ +  +  R G F+       
Sbjct: 286 LALRAGVDMDMVSGAYLETLAEN-VRCGRVTLAEIDEAVRRILRIKCRAGLFEHPLTDPE 344

Query: 391 LGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
               DI + +  ELA +AARE +VLLKN+++ LPL     + + V GP  +AT  + G +
Sbjct: 345 RAIHDILTPKARELARQAARETMVLLKNERHLLPLRD--FRRILVAGPFVHATGELFGTW 402

Query: 451 AGIPCRYMSPIAGFSGYAN--VTYKTGCDDVACKSNNSIFAAS-----EAAKTADATIIL 503
                          G A   V        +A    +  FAA+       A  ADA ++L
Sbjct: 403 T------------MDGRAEDAVPLDQAFQAIAPAGTDLWFAAAPDLALSRAHYADAVVLL 450

Query: 504 AGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIK 563
            G   +   E+ +  DL LP  Q + I  +A + K PV+LV+ +  G  +A        +
Sbjct: 451 VGEHPARSGENANVSDLGLPPGQLEWITAMAAIGK-PVVLVVFA--GRPLAITRAVAQAQ 507

Query: 564 AILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPL-RPVDSLGYP 622
           A+++A +PG EG  A+A+++FG   P GRLP++         L     P  RP+++ G P
Sbjct: 508 AVIYAWHPGLEGAAALAEILFGLATPTGRLPVSMPRTTGQAPLYYAHKPSGRPLEADG-P 566

Query: 623 GRTYKFYNGPT--LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTR 680
            RT ++ + PT  L+PFGYGLSYT F Y+ L  +           H R            
Sbjct: 567 FRT-RYVDIPTAPLFPFGYGLSYTSFSYSDLRLSSA---------HMRG----------- 605

Query: 681 CPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVR 740
                          E      N G   GS+VV +Y +         ++++  FQR+ ++
Sbjct: 606 -------------TLEISALITNTGERTGSEVVQLYVRDLVGSLTRPVRELKDFQRITLQ 652

Query: 741 AGRNKRIKFVFN 752
            G  +R+ F+  
Sbjct: 653 PGEARRVSFILR 664


>gi|329922637|ref|ZP_08278189.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
 gi|328941979|gb|EGG38262.1| glycosyl hydrolase family 3 N-terminal domain protein
           [Paenibacillus sp. HGF5]
          Length = 765

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 198/711 (27%), Positives = 324/711 (45%), Gaps = 109/711 (15%)

Query: 76  EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
           E V  +  +A    RLG+P      E  HG   +G            T FP  +   +++
Sbjct: 89  EAVNHIQRYAIEQSRLGIPIL-IGEECSHGHMAIG-----------GTVFPVPLSIGSTW 136

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
           N  L++ + +AV+ E R+     + G   +SP ++V RDPRWGR  E  GEDP+++  YA
Sbjct: 137 NLDLYRDMCRAVALETRS-----QGGAVTYSPVLDVVRDPRWGRTEECFGEDPYLISEYA 191

Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDME 254
           V  V GLQ          L+S P  V++  KH+  Y   +  +     H   R    ++ 
Sbjct: 192 VASVEGLQ-------GESLDS-PSSVAATLKHFVGYGSSEGGRNAGPVHMGTR----ELM 239

Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
           E  + PF+  V+ G A+S+M +YN ++G+P   + +LL+  +R EW   G ++ DC +I 
Sbjct: 240 EVDMLPFKKAVEAG-AASIMPAYNEIDGVPCTVNTELLDGILRKEWGFDGMVITDCGAID 298

Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
           ++   H    D   DA  Q ++AG+DL+  G+ +      AV+  K++ + +D++++ + 
Sbjct: 299 MLASGHDTAEDGM-DAAVQAIRAGIDLEMSGEMFGKHLQKAVESNKLEVSVLDEAVRRVL 357

Query: 374 TVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
           T+  +LG F+         +  I S ++I LA + A EGIVLLKN+   LPL S +   +
Sbjct: 358 TLKFKLGLFENPYVDPQTAENVIGSGQHIGLARQLAAEGIVLLKNEAKALPL-SKEGGVI 416

Query: 434 AVVGPHANATVAMIGNYAG--IPCRYMSPIAGFSGY-----ANVTYKTGCDDVACKSNNS 486
           AV+GP+A+     +G+Y     P    + + G           V Y  GC  +   S   
Sbjct: 417 AVIGPNADQGYNQLGDYTSPQPPAAVTTVLGGIRAKLGEEAQRVLYAPGC-RIKDDSREG 475

Query: 487 IFAASEAAKTADATIILAG-----------LDLSVEA--------------ESLDREDLW 521
              A   A+ AD  +++ G           +DL   A              E +DR  L 
Sbjct: 476 FEFALSCAEQADTVVMVLGGSSARDFGEGTIDLRTGASKVTDDALSDMDCGEGIDRMTLQ 535

Query: 522 LPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIAD 581
           L G Q  L  ++ ++ K  +++ I    G  IA    + +  AIL A YPG+EGG AIAD
Sbjct: 536 LSGVQLDLAQEIHKLGKRMIVVYI---NGRPIAEPWIDEHADAILEAWYPGQEGGHAIAD 592

Query: 582 VVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
           ++FG  NP G+L ++     +V  LP+     R        G+ Y   +    YPFGYGL
Sbjct: 593 ILFGDVNPSGKLTMSIPK--HVGQLPVYYNGKRS------RGKRYLEEDSQPRYPFGYGL 644

Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
           SYT+F Y+ +  T  +                               +  D      V+ 
Sbjct: 645 SYTEFSYSDIQMTPEV-------------------------------IGTDGTAVVSVNV 673

Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFN 752
            N G  +GS+VV +Y    A       +++ GFQ++ ++ G  ++++F   
Sbjct: 674 TNSGDCEGSEVVQLYVSDAASKYTRPARELKGFQKISLQPGERRKVEFTIG 724


>gi|224538282|ref|ZP_03678821.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224520107|gb|EEF89212.1| hypothetical protein BACCELL_03173 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 864

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 156/440 (35%), Positives = 232/440 (52%), Gaps = 38/440 (8%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
             F + D+SL    R  DL+ R+TL+EK   + + +  +PRL +  Y WW+EALHG++  
Sbjct: 25  EKFPYQDTSLTAEERADDLLKRLTLEEKASLMMNGSPAIPRLSIKAYGWWNEALHGLART 84

Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMY-------NLGR-AG 161
           G           AT FP  I   ASF++SL  ++  AVS EARA         NL R   
Sbjct: 85  GL----------ATVFPQAIGMGASFDDSLLYEVFTAVSDEARAKSRRLDSKGNLTRYQA 134

Query: 162 LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
           LT W+PN+N+ RDPRWGR  ET GEDP++  R  V  V GLQ  +         +R  K+
Sbjct: 135 LTVWTPNVNIFRDPRWGRGQETYGEDPYLTSRLGVAVVNGLQGPD--------TARYNKL 186

Query: 222 SSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
            +C KHYA +    W   +R+ F+A  ++ +D+ ET+L  F+  V+E     VMC+YNR 
Sbjct: 187 HACAKHYAVHSGPEW---NRHSFNAENISPRDLWETYLPAFKTLVQEAKVKEVMCAYNRF 243

Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD-SKEDAVAQTLKAGL 339
            G P C   +LL Q +R EW   G +V+DC ++       K         A A  +  G 
Sbjct: 244 EGEPCCGSNRLLTQILRDEWGFDGVVVSDCGAVSDFWQKRKHETHPDAASASADAVLNGT 303

Query: 340 DLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSD 399
           D++CG  Y +   +AV+ G + E  ID S+K L      LG  D +  +  +    + S 
Sbjct: 304 DVECGNSYKSLP-DAVKAGLITENQIDISVKRLLKARFELGEMDEN-VWTGISSDVVDSP 361

Query: 400 ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMS 459
           ++ +LA + ARE + LL+N+ N LPL  +K   +A++GP+AN +V   GNY G+P   ++
Sbjct: 362 KHRQLALQMARETMTLLQNNNNILPL--SKQAKIALIGPNANDSVMQWGNYNGLPSHTIT 419

Query: 460 PIAGFSGY---ANVTYKTGC 476
            + G   Y   +N+ Y+  C
Sbjct: 420 LLEGMQRYLPTSNLIYEPVC 439



 Score =  122 bits (307), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 92/326 (28%), Positives = 150/326 (46%), Gaps = 61/326 (18%)

Query: 464 FSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESL-------- 515
           F   AN+++     D+   +   +    E  K  D  I   G+  ++E E +        
Sbjct: 573 FDKTANLSF-----DMGVNAQIDVKGLLERIKDVDVVIFAGGISPALEGEEMPVDAAGFR 627

Query: 516 --DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGE 573
             DR ++ LP  Q +++  +    K    +V ++  G  IA    + N +AIL A YPG+
Sbjct: 628 GGDRTEIELPAVQRRVVEALKTAGKR---IVFVNFSGAAIALEPESQNCEAILQAWYPGQ 684

Query: 574 EGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT 633
            GG+A+A+V+FG +NP G+LP+T+Y         L  +P    +     GRTY++     
Sbjct: 685 AGGQAVAEVLFGDYNPAGKLPLTFYRN-------LAQIP--DFEDYNMTGRTYRYMKETP 735

Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
           L+PFG+GLSYT FKY  L      ++N +K+   +NLN                      
Sbjct: 736 LFPFGHGLSYTTFKYGKL------KMNDDKIAAGQNLN---------------------- 767

Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
                +   N GS DG +VV VY K   +     +K +  F+RV + AG+   +KF  + 
Sbjct: 768 ---LAIPVTNTGSRDGDEVVQVYLKKMDDTEGP-VKTLRAFKRVRIPAGKTVEVKFSLDD 823

Query: 754 CKSLNIVDYAANTL-LPAGEHTIFVG 778
            + L   D  +NT+ +  G +T+ +G
Sbjct: 824 TQ-LEWWDEQSNTMRVCPGNYTVMIG 848


>gi|265766195|ref|ZP_06094236.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
 gi|263253863|gb|EEZ25328.1| periplasmic beta-glucosidase [Bacteroides sp. 2_1_16]
          Length = 859

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 214/768 (27%), Positives = 336/768 (43%), Gaps = 144/768 (18%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-------------------------- 83
           ++F + ++SLP  +RV+DL+SRMTL+EK+ Q+                            
Sbjct: 22  TNFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNY 81

Query: 84  -FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
            F  G+                      RLG+P +   +E+LHG            V  G
Sbjct: 82  GFIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDG 129

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
           +T FP  I   ++FN  L  ++  A++ E  A       G+T   +P I+V RD RWGR+
Sbjct: 130 STIFPQAIALGSTFNPILAYEMTSAIAKELTAQ------GITQSLTPVIDVCRDLRWGRV 183

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            E  GEDP++V R  V+ VRG  D +              VS   KH+ A+      G++
Sbjct: 184 EECFGEDPYLVSRMGVSQVRGYLDNQ--------------VSPMIKHFGAHGAPQ-GGLN 228

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
                    ++++   +L+ FE  VKE    +VM SYN  N  P+ +   L+ + +R  W
Sbjct: 229 LASVSC--GQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRW 286

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
           D  GY+ +D  +I ++   HK   +S E A+ Q L AGLD +            V+ G +
Sbjct: 287 DFQGYVYSDWGAIGMLNYFHKTAQNSAEAAI-QALTAGLDAEASDNSYAELQQLVENGML 345

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
               ID+++  + T    +G F+          + + +  ++ LA + A E IVLL+N+ 
Sbjct: 346 DVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405

Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------GIPCRYMSPIAGFSGYANVTYK 473
           N LPL   K+K++AV+GP  NA     G+Y        G+     +     S    + Y 
Sbjct: 406 NILPLQMNKLKSIAVIGP--NADQVQFGDYTWSRDNKDGVTL-LEALKERVSNQLTLNYA 462

Query: 474 TGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---------ESLDREDLWLPG 524
            GC D+     +    A + AK +D  I++ G   +  A         E  D  DL L G
Sbjct: 463 KGC-DLVTDDCSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLTG 521

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q  L+  +    K PVI+V++S  G   A +    NI  I+   YPGE+GG A+AD++ 
Sbjct: 522 VQEDLVEAIHATGK-PVIVVLLS--GKPFAMSWIKENIPGIVVQWYPGEQGGLALADMLL 578

Query: 585 GKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGY 639
           GK NP G+L  ++         Y   LP      R   S   PG+ Y F +   L+ FG+
Sbjct: 579 GKVNPSGKLNYSFPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFGH 638

Query: 640 GLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKV 699
           GLSYT F+Y  LS T + +                             D  C+D  E  +
Sbjct: 639 GLSYTDFEY--LSATTSKE-----------------------------DYACEDVIEVTI 667

Query: 700 DFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
             +N G  DG +V  VY +         ++++ GF++V ++ G  K++
Sbjct: 668 AIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQV 715


>gi|423260853|ref|ZP_17241755.1| hypothetical protein HMPREF1055_04032 [Bacteroides fragilis
           CL07T00C01]
 gi|423266988|ref|ZP_17245970.1| hypothetical protein HMPREF1056_03657 [Bacteroides fragilis
           CL07T12C05]
 gi|387774614|gb|EIK36724.1| hypothetical protein HMPREF1055_04032 [Bacteroides fragilis
           CL07T00C01]
 gi|392697691|gb|EIY90874.1| hypothetical protein HMPREF1056_03657 [Bacteroides fragilis
           CL07T12C05]
          Length = 859

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 212/769 (27%), Positives = 335/769 (43%), Gaps = 146/769 (18%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-------------------------- 83
           ++F + ++SLP  +RV+DL+SRMTL+EK+ Q+                            
Sbjct: 22  TNFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNY 81

Query: 84  -FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
            F  G+                      RLG+P +   +E+LHG            V  G
Sbjct: 82  GFIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDG 129

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
           +T FP  I   ++FN  L  ++  A++ E      L   G+T   +P I+V RD RWGR+
Sbjct: 130 STIFPQAIALGSTFNPILAYEMTSAIAKE------LSAQGITQSLTPVIDVCRDLRWGRV 183

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            E  GEDP++V R  V+ VRG  D +              VS   KH+ A+      G++
Sbjct: 184 EECFGEDPYLVSRMGVSQVRGYLDNQ--------------VSPMIKHFGAHGAPQ-GGLN 228

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
                    ++++   +L+ FE  VKE    +VM SYN  N  P+ +   L+ + +R  W
Sbjct: 229 LASVSC--GQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRW 286

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
           D  GY+ +D  +I ++   HK   +S E A+ Q L AGLD +            V+ G +
Sbjct: 287 DFQGYVYSDWGAIGMLNYFHKTAQNSAEAAI-QALTAGLDAEASDNSYAELQQLVENGML 345

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
               ID+++  + T    +G F+          + + +  ++ LA + A E IVLL+N+ 
Sbjct: 346 DVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405

Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------GIPCRYMSPIAGFSG-YANVTY 472
           N LPL   K+K++AV+GP  NA     G+Y        G+    +  +    G    + Y
Sbjct: 406 NILPLQMNKLKSIAVIGP--NADQVQFGDYTWSRDNKDGVTL--LEALKERVGNQLTLNY 461

Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---------ESLDREDLWLP 523
             GC D+     +    A + AK +D  I++ G   +  A         E  D  DL L 
Sbjct: 462 AKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLT 520

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
           G Q  L+  +    K PVI+V++S  G   A +    NI  I+   YPGE+GG A+AD++
Sbjct: 521 GVQEDLVEAIHATGK-PVIVVLLS--GKPFAMSWIKENIPGIVVQWYPGEQGGLALADML 577

Query: 584 FGKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
            GK NP G+L  ++         Y   LP      R   S   PG+ Y F +   L+ FG
Sbjct: 578 LGKVNPSGKLNYSFPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFG 637

Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
           +GLSYT F+Y   + +K                                D  C+D  E  
Sbjct: 638 HGLSYTDFEYLSATISK-------------------------------EDYACEDVIEVT 666

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
           +  +N G  DG +V  VY +         ++++ GF++V ++ G  K++
Sbjct: 667 IAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQV 715


>gi|404404031|ref|ZP_10995615.1| glycoside hydrolase family protein [Alistipes sp. JC136]
          Length = 740

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 203/679 (29%), Positives = 330/679 (48%), Gaps = 80/679 (11%)

Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARD 174
           DVI G  T  P  +  + S++    +   +  + EA A      AGL + ++P +++ARD
Sbjct: 111 DVIHGYKTISPVPLAESCSWDMETIEASARMAAVEASA------AGLQWTFAPMVDIARD 164

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR+ E  GEDP++    A   VRG Q         DL S P  + +C KH+A Y   
Sbjct: 165 PRWGRVMEGAGEDPYLGSHIARARVRGFQ-------GDDL-SAPNTILACAKHFAGYGAS 216

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
              G D    D  +++Q + E +L PF+       A++ M S+N ++G+P+  +  L+ Q
Sbjct: 217 E-GGRDYNTVD--ISDQRLRELYLPPFKAAADA-GAATFMNSFNELSGVPATGNRFLVKQ 272

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGN 353
            +R EW   G IV+D  S+  M+ +   +A+ K+ A    +K   D+D  G  Y +    
Sbjct: 273 ILRNEWGWDGVIVSDWGSVAEMIPHG--IAEDKKQAALLAVKNECDIDMEGNCYPSSLEE 330

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV--SLGKQDICSDENIELAAEAARE 411
            V++GKV E +ID+S++ +  +   LG FD   +Y      K+   S  + E A + AR+
Sbjct: 331 LVKEGKVSEKEIDRSVRRILRLKYELGLFDDPYRYCDEQREKEVTLSAAHREAARDMARK 390

Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY--AGIPCRYMSPIAGFSGYA- 468
            IVLL+N ++ LPL   K +++AVVGP A++ V M+G +   G P   ++ + G    A 
Sbjct: 391 SIVLLENRKSVLPL--GKPRSIAVVGPLADSPVDMLGEWRAKGDPKEVVTILRGIEKTAG 448

Query: 469 ---NVTYKTGCDDVACKSNNSIFA-ASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
               VT+  GCD     S+ S FA A  AA++AD  I   G    +  E   R +L LPG
Sbjct: 449 AGTRVTHAKGCD--VTGSDRSGFAEAVRAARSADVVIACLGESADMSGEGYCRSELGLPG 506

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q +L+ ++ +  K P++L++  + G  +  A    NI+ I+   + G E G A+ADV+F
Sbjct: 507 VQQELLKELKKTGK-PIVLLL--SNGRPLTLAWEKENIETIVETWFLGTEAGNAVADVLF 563

Query: 585 GKFNPGGRLPITW-YNGDYVQMLPLTSMPLRPVDSLGYPGRTY--KFYNGP--TLYPFGY 639
           GK+NP G+L +++ YN   + +        RP +    P + Y   + + P   LYPFGY
Sbjct: 564 GKYNPSGKLVMSFPYNVGQIPVYYNHKHTGRPFE----PNQRYVMHYIDAPVDALYPFGY 619

Query: 640 GLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKV 699
           GLSYT+F+Y                                 P +  + +   D     V
Sbjct: 620 GLSYTRFEYGE-------------------------------PTLSSDRMAAGDTITATV 648

Query: 700 DFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNI 759
              N G  DG +VV +Y +         +K++ GF+++F++ G +  + F     + L  
Sbjct: 649 KVTNAGDYDGEEVVQLYIRDLKAQITRPVKELKGFRKIFLKKGESADVTFDITRAE-LEY 707

Query: 760 VDYAANTLLPAGEHTIFVG 778
           V    + +   GE  +F+G
Sbjct: 708 VLADGSVVSDPGEFELFIG 726


>gi|254514842|ref|ZP_05126903.1| periplasmic beta-glucosidase [gamma proteobacterium NOR5-3]
 gi|219677085|gb|EED33450.1| periplasmic beta-glucosidase [gamma proteobacterium NOR5-3]
          Length = 740

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 211/755 (27%), Positives = 343/755 (45%), Gaps = 114/755 (15%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG---VPR-------------- 90
           + +  +  D  L    RV +L+  M LDEK+ Q+     G   +P               
Sbjct: 4   ETAQTIAVDEQLSIDSRVAELLGSMGLDEKIGQMSQLQAGGGWIPDELADSIRRGQVGSV 63

Query: 91  LGLPQYEWWSEALHGV---SNVG-PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQ 145
           L  P     +E        S +G P     DVI G  T FP  +   AS+N S+  + G 
Sbjct: 64  LNEPDVNIVNELQRLAVEESRLGIPLLIGRDVIHGFKTIFPIPLGQAASWNPSV-VEAGA 122

Query: 146 AVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQD 204
            VS E        RAG+ + ++P I++ RDPRWGRI E+ GEDP++  +     VRG Q 
Sbjct: 123 RVSAEEAV-----RAGINWTFAPMIDITRDPRWGRIAESLGEDPYLCSKLGAAMVRGFQ- 176

Query: 205 VEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMC 264
                  +D  S P  +++C KH+A Y         R +  A + E +M   +LRPF+  
Sbjct: 177 -------SDDLSAPDAIAACAKHFAGYGAAEGG---RDYNTANIPENEMRNVYLRPFKAA 226

Query: 265 VKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLA 324
            + G  ++ M ++  +NG+P+  +  L+++ +R EW   G +V+D +S+ V +  H F  
Sbjct: 227 AEAG-VATFMSAFCDLNGVPATGNRWLMDEILRQEWSYQGMVVSDWESV-VEMSVHGFTH 284

Query: 325 DSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFD 383
           D  E A  +   AG+D++     Y +     V + K+    ID+ +  +  +   LG F+
Sbjct: 285 DD-EQAAYEAAMAGIDMEMASSSYRDHLEGLVGENKITLEQIDRMVARVLRLKFELGLFE 343

Query: 384 GSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANAT 443
             P        ++ +  N++ A +AA +  VLLKN   TLPL  AK+ ++A++GP A+  
Sbjct: 344 -QPYTDPAQHPELLNKANLKAAKQAATQSCVLLKNAHQTLPLVPAKLDSIALIGPLADDG 402

Query: 444 VAMIGNYA-------GIPCRY-MSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAK 495
              +G +         + CR  +  + G +    + Y+   +     S ++  AA  AA+
Sbjct: 403 YEQMGTWVFDGDAAHSVTCRQALDELLGRT--VEIHYEKALETTRAASPDNFAAAKNAAQ 460

Query: 496 TADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAF 555
            +DA II+ G +  +  E+  R ++ LPG+Q  LI  VA   K P+I+VIM+  G  +  
Sbjct: 461 QSDAAIIVVGEEAFMSGEAHSRANIDLPGHQQALIEAVASAGK-PIIVVIMA--GRPLTI 517

Query: 556 AETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------- 608
                +  A+L+A +PG  GG AIAD++ G  +P G+LP+T+     V  +P+       
Sbjct: 518 EPVLEHADAVLYAWHPGTMGGPAIADLLLGLESPSGKLPVTFPR--VVGQVPIHYAQKNT 575

Query: 609 -------------TSMPLRPVDSLGYPGRTYKFYNG-PTLYPFGYGLSYTQFKYNLLSFT 654
                         + P  P  SLG    ++    G   L+PFGYGLSY +F+Y      
Sbjct: 576 GRPATQESCVDINEAPPRAPQTSLGM--TSFHLDAGFKPLFPFGYGLSYGRFQY------ 627

Query: 655 KTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVI 714
               V +    H                      +R     +   D  N+GS  G +VV 
Sbjct: 628 ----VKITTSHHS---------------------IRMGQSLDISADVVNMGSHAGEEVVQ 662

Query: 715 VYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKF 749
           +Y +         IK++ GF+RV ++ G  +RI F
Sbjct: 663 LYIRDLVGSVTRPIKELKGFRRVRLKPGERQRISF 697


>gi|237721943|ref|ZP_04552424.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
 gi|229448812|gb|EEO54603.1| glycoside hydrolase family 3 protein [Bacteroides sp. 2_2_4]
          Length = 792

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 226/816 (27%), Positives = 355/816 (43%), Gaps = 156/816 (19%)

Query: 42  FSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW 98
           F+K G++    ++ D S P   R+ DL+S+MTL+EK  Q+    +G  R+     P   W
Sbjct: 39  FNKNGIKD---VYEDPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDACPTAGW 94

Query: 99  WSEALH-GVSNV-----GPGTHFDDV-------------------------IP------- 120
            +E    G+ N+     G G    ++                         IP       
Sbjct: 95  LAEIWKDGIGNIDEQANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEG 154

Query: 121 -------GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVAR 173
                   AT FP      A++N+ L ++I +  + EA+A   LG   +  +SP +++A+
Sbjct: 155 IRGLCHDRATMFPAQCGQGATWNKKLIREIAKVTANEAKA---LGYTNI--YSPILDIAQ 209

Query: 174 DPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDV 233
           DPRWGR+ E+ GEDP++ G      + GLQ+ EG             + +  KH+A Y +
Sbjct: 210 DPRWGRVVESYGEDPYLAGELGKQMILGLQN-EG-------------IVATPKHFAVYSI 255

Query: 234 DNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLN 293
                      D  V  ++M+  +L PF   ++E  A  VM SYN  +G P       L 
Sbjct: 256 PVGGRDGGTRTDPHVAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLT 315

Query: 294 QTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT-- 351
           + +R +W   GY+V+D ++++ +   H+ +  ++E+  AQ + AGL++      TNFT  
Sbjct: 316 EILRQQWGFKGYVVSDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPP 369

Query: 352 -------GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIE 403
                    A+ +GKV    +D+ +  +  V   +G FD   P      +  + +D +  
Sbjct: 370 QDFILPLRRAIDEGKVSLHTLDQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKA 429

Query: 404 LAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAG 463
           ++ +AA E +VLLKN+   LPL S   K +AV+GP+A     +   Y        +   G
Sbjct: 430 VSMKAALESVVLLKNENQMLPL-SKNFKKIAVIGPNAEEVKELTCRYGPANASIKTVYQG 488

Query: 464 FSGY---ANVTYKTGCDDV--------------ACKSNNSIFAASEAAKTADATIILAGL 506
              Y   + V Y  GCD +                +    I  A E AK +D  I++ G 
Sbjct: 489 IKEYLPNSEVRYAKGCDIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGG 548

Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
           +     E   R +L L G Q QL+  V    K PV+LV++      I +A  N  I AI+
Sbjct: 549 NEKTVREEFSRTNLDLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYIPAII 605

Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRT 625
            A +PGE  G AIA V+FG +NPGGRL +T+     V  +P  + P +P  DS G     
Sbjct: 606 HAWFPGEFMGDAIAKVLFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDSKG----- 657

Query: 626 YKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCP 682
            K      LYPFGYGLSYT F Y+ L  +K +   Q N+                     
Sbjct: 658 -KVRVDGALYPFGYGLSYTTFGYSDLKISKPVIGPQENIT-------------------- 696

Query: 683 GVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAG 742
                 L C          +N G   G +VV +Y +       TY K + GF+R+ ++ G
Sbjct: 697 ------LSC--------TVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPG 742

Query: 743 RNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
             + + F     + L + D      +  G  ++ VG
Sbjct: 743 EEQTVSFTLTP-QDLGLWDKNNRFTVEPGSFSVMVG 777


>gi|423250669|ref|ZP_17231684.1| hypothetical protein HMPREF1066_02694 [Bacteroides fragilis
           CL03T00C08]
 gi|423253995|ref|ZP_17234925.1| hypothetical protein HMPREF1067_01569 [Bacteroides fragilis
           CL03T12C07]
 gi|392651626|gb|EIY45288.1| hypothetical protein HMPREF1066_02694 [Bacteroides fragilis
           CL03T00C08]
 gi|392654553|gb|EIY48200.1| hypothetical protein HMPREF1067_01569 [Bacteroides fragilis
           CL03T12C07]
          Length = 859

 Score =  254 bits (649), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 214/769 (27%), Positives = 337/769 (43%), Gaps = 146/769 (18%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-------------------------- 83
           ++F + ++SLP  +RV+DL+SRMTL+EK+ Q+                            
Sbjct: 22  TNFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNY 81

Query: 84  -FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
            F  G+                      RLG+P +   +E+LHG            V  G
Sbjct: 82  GFIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDG 129

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
           +T FP  I   ++FN  L  ++  A++ E      L   G+T   +P I+V RD RWGR+
Sbjct: 130 STIFPQAIALGSTFNPILAYEMTSAIAKE------LSAQGITQSLTPVIDVCRDLRWGRV 183

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            E  GEDP++V R  V+ VRG  D +              VS   KH+ A+      G++
Sbjct: 184 EECFGEDPYLVSRMGVSQVRGYLDNQ--------------VSPMIKHFGAHGAPQ-GGLN 228

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
                    ++++   +L+ FE  VKE    +VM SYN  N  P+ +   L+ + +R  W
Sbjct: 229 LASVSC--GQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRW 286

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
           D  GY+ +D  +I ++   HK   +S E A+ Q L AGLD +            V+ G +
Sbjct: 287 DFQGYVYSDWGAIGMLNYFHKTAQNSAEAAI-QALTAGLDAEASDNSYAELQQLVENGML 345

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
               ID+++  + T    +G F+          + + +  ++ LA + A E IVLL+N+ 
Sbjct: 346 DVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405

Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------GIPCRYMSPIAGFSG-YANVTY 472
           N LPL   K+K++AV+GP  NA     G+Y        G+    +  +    G    + Y
Sbjct: 406 NILPLQMNKLKSIAVIGP--NADQVQFGDYTWSRDNKDGVTL--LEALKERVGNQLTLNY 461

Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---------ESLDREDLWLP 523
             GC D+     +    A + AK +D  I++ G   +  A         E  D  DL L 
Sbjct: 462 AKGC-DLVTDDRSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLT 520

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
           G Q  L+  +    K PVI+V++S  G   A +    NI  I+   YPGE+GG A+AD++
Sbjct: 521 GVQEDLVEAIHATGK-PVIVVLLS--GKPFAMSWIKENIPGIVVQWYPGEQGGLALADML 577

Query: 584 FGKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
            GK NP G+L  ++         Y   LP      R   S   PG+ Y F +   L+ FG
Sbjct: 578 LGKVNPSGKLNYSFPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFG 637

Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
           +GLSYT F+Y  LS T + +                             D  C+D  E  
Sbjct: 638 HGLSYTDFEY--LSATTSKE-----------------------------DYACEDVIEVT 666

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
           +  +N G  DG +V  VY +         ++++ GF++V ++ G  K++
Sbjct: 667 IAIRNTGDYDGLEVPQVYVRDMVSSVVIPVQELKGFEKVLIKKGETKQV 715


>gi|423248809|ref|ZP_17229825.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
           CL03T00C08]
 gi|423253758|ref|ZP_17234689.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
           CL03T12C07]
 gi|392655387|gb|EIY49030.1| hypothetical protein HMPREF1067_01333 [Bacteroides fragilis
           CL03T12C07]
 gi|392657750|gb|EIY51381.1| hypothetical protein HMPREF1066_00835 [Bacteroides fragilis
           CL03T00C08]
          Length = 805

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 232/807 (28%), Positives = 348/807 (43%), Gaps = 145/807 (17%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSE 101
           + + S P   RV+ L+S+MTL+EKV Q+            G+     P+L     E+   
Sbjct: 40  YENPSAPVEYRVEHLLSQMTLEEKVGQMLTSLGWPMYERVGEDIRLTPQLEKEIGEYHIG 99

Query: 102 ALHGVSNVGPGT--------------------------HFDDVIP--------------G 121
           +L G     P T                          H    IP              G
Sbjct: 100 SLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIG 159

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
            T FPT I   +++N  L +++G+ ++ EA A     +     + P +++ARDPRW R+ 
Sbjct: 160 TTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVE 214

Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
           ET GEDP++ G      VRG Q     E   D  S    V +  KH+A+Y    W     
Sbjct: 215 ETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGH 263

Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
               A + E+++EE    PF   V  G A SVM SYN ++G P      LL   ++  W 
Sbjct: 264 NGGTAHIGERELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQ 322

Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKV 360
             G++V+D  ++  + ++   +A +  +A  + + AG+D D G   Y      AV++G V
Sbjct: 323 FKGFVVSDLYAVGGLREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDV 380

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
               IDK+++ + ++  ++G FD          Q + S E+  LA E AR+ IVLLKN  
Sbjct: 381 AVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIVLLKNKD 440

Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKT 474
             LPL    ++T+AV+GP+A+    M+G+Y      G     +  I    S    V Y  
Sbjct: 441 KLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAK 499

Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAG----LDLSVE------------------- 511
           GC  V   S      A E A+ ADA +++ G     D S E                   
Sbjct: 500 GC-AVRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMES 558

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
            E  DR  L L G Q +L+ +++ + K PV+LV++   G  +         +AI+ A YP
Sbjct: 559 GEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIK--GRPLLMEGAIQEAEAIVDAWYP 615

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G +GG A+ADV+FG +NP GRL ++      V  LP+     R     G   R Y    G
Sbjct: 616 GMQGGNAVADVLFGDYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YVEEPG 668

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
              YPFGYGLSYT F Y  +     +QV                           +D R 
Sbjct: 669 TPRYPFGYGLSYTTFSYTDMK----VQVTEGS-----------------------DDCRV 701

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
           D      V  QN G+ DG +V  +Y +       T  KQ+  F R+ ++AG ++ + F  
Sbjct: 702 D----VTVTIQNQGTADGDEVAQLYFQDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTL 757

Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVG 778
           +  KSL +       ++  G  TI VG
Sbjct: 758 DK-KSLALYMQEGEWVVEPGRFTIMVG 783


>gi|255689965|ref|ZP_05413640.1| beta-glucosidase [Bacteroides finegoldii DSM 17565]
 gi|260624572|gb|EEX47443.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           finegoldii DSM 17565]
          Length = 688

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 190/676 (28%), Positives = 326/676 (48%), Gaps = 83/676 (12%)

Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARD 174
           D I G  T +P  +    S+N  L ++     + EAR       +G+ + +SP I+VARD
Sbjct: 70  DAIHGFRTVYPISLAQACSWNPDLVEQACAVSAQEARM------SGVDWTFSPMIDVARD 123

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR+ E  GEDP+  G +    VRG Q         D  S   +V++C KHY  Y   
Sbjct: 124 PRWGRVAEGYGEDPYANGVFGAASVRGYQG--------DNMSAENRVAACLKHYVGYGAS 175

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
                 R +    +++Q + +T+L P+EM VK G A+++M S+N ++G+P  A+P  + +
Sbjct: 176 E---AGRDYVYTEISQQTLWDTYLLPYEMGVKAG-AATLMSSFNDISGVPGSANPYTMTE 231

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY-YTNFTGN 353
            ++  W   G+IV+D  +I+ +   ++ LA +K++A      AGL++D   + Y      
Sbjct: 232 ILKNRWRHDGFIVSDWGAIEQL--KNQGLAATKKEAARYAFTAGLEMDMMSHAYDRHLQE 289

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGI 413
            V++GKV    +D++++ +  +  RLG F+      +  K+     +++++AA  A E +
Sbjct: 290 LVEEGKVSMAQVDEAVRRVLLLKFRLGLFERPYTPATTEKERFFRPKSMDIAARLAAESM 349

Query: 414 VLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAG------IPCRYMSPIAGFSGY 467
           VLLKN+ N LPL     K +AV+GP A     ++G++ G      +   Y    A F+G 
Sbjct: 350 VLLKNENNVLPLTDK--KKIAVIGPMAKNGWDLLGSWRGHGKDTDVAMLYDGLAAEFAGK 407

Query: 468 ANVTYKTGCDDVACKSNNSIFA-ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
           A + Y  GC+      N   FA A EAA+ +D  ++  G  ++   E+  R  + LP  Q
Sbjct: 408 AELRYALGCNTQG--DNREGFAEALEAARWSDVVVLCLGEMMTWSGENASRSSIALPQMQ 465

Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
            +L  ++ +  K PV+LV+++   +++   E  ++  AIL    PG  G   +A ++ G+
Sbjct: 466 EELAKELKKAGK-PVVLVLVNGRPLELNRLEPVSD--AILEIWQPGVNGALPMAGILSGR 522

Query: 587 FNPGGRLPITWYNGDYVQMLPLTS--MPL---RPVDSLGYPGRTYKFYNGPTLYPFGYGL 641
            NP G+L +T          P ++  +P+   R     G+ G  YK      LYPFG+GL
Sbjct: 523 INPSGKLAMT---------FPYSTGQIPIYYNRRKSGRGHQG-FYKDITSDPLYPFGHGL 572

Query: 642 SYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDF 701
           SYT+FKY       T+  +  K++    L+                          +V  
Sbjct: 573 SYTEFKYG------TVTPSATKVKRGEKLSA-------------------------EVTV 601

Query: 702 QNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVD 761
            N+G+ DG++ V  +   P       +K++  F++  ++AG  K  +F  +  +    V+
Sbjct: 602 TNIGARDGAETVHWFISDPYCSITRPVKELKHFEKQLIKAGETKTFRFDIDLERDFGFVN 661

Query: 762 YAANTLLPAGEHTIFV 777
                 L  GE+ I V
Sbjct: 662 EDGKRFLETGEYNIHV 677


>gi|390167927|ref|ZP_10219905.1| beta-glucosidase, partial [Sphingobium indicum B90A]
 gi|389589522|gb|EIM67539.1| beta-glucosidase, partial [Sphingobium indicum B90A]
          Length = 771

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 220/737 (29%), Positives = 341/737 (46%), Gaps = 109/737 (14%)

Query: 78  VQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNE 137
           V  L  +A    RLG+P   +  E LHG + VG           ATSFP  I   +S++ 
Sbjct: 105 VNALQRWATTQTRLGIPIL-FHEEGLHGYAAVG-----------ATSFPQSIAMASSWDP 152

Query: 138 SLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVN 197
            L +++   ++ E R+     R      SP +++ARDPRWGRI ET GEDP++VG   V 
Sbjct: 153 DLLREVNAVIAREIRS-----RGVSLVLSPVVDIARDPRWGRIEETYGEDPYLVGEMGVA 207

Query: 198 YVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAY-DVDNWKGVDRYHFDARVTEQDMEET 256
            V GLQ   G   +  L   P KV +  KH   +   ++   V      A V+E+++ E 
Sbjct: 208 AVEGLQ---GKGRSRLLP--PGKVFATLKHLTGHGQPESGTNVG----PAPVSERELREN 258

Query: 257 FLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM 316
           F  PFE  VK     +VM SYN ++G+PS A+  LL   +RGEW   G +V+D  ++  +
Sbjct: 259 FFPPFEQVVKRTGIEAVMASYNEIDGVPSHANRWLLRDVLRGEWGFRGAVVSDYSAVDQL 318

Query: 317 VDNHKFLADSKEDAVAQTLKAGLDLDC--GQYYTNFTGNAVQQGKVKETDIDKSLKYLYT 374
           ++ H   AD  E A  + L AG+D D   G  Y    G  V++GK+ E  +D++++++  
Sbjct: 319 MNIHHVAAD-LEQAAGRALDAGVDADLPDGLSYATL-GRQVREGKIGEALVDRAVRHMLE 376

Query: 375 VLMRLGFFDGSPQYVSLGKQDICSD-ENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTV 433
           +  R G F+ +P   +   + I +D     LA +AA+  I+LLKND   LPL      ++
Sbjct: 377 LKFRAGLFE-NPYADAAASEKITNDGRARALALKAAQRSIILLKND-GMLPLKPE--GSI 432

Query: 434 AVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG----YANVTYKTGCD---------DVA 480
           AV+GP  +A VA +G Y G P   +S + G        A + +  G           D  
Sbjct: 433 AVIGP--SAAVARLGGYYGQPPHSVSILEGIRAKVGNRAKIVFAQGVRITENDDWWADKV 490

Query: 481 CKSNNS-----IFAASEAAKTADATIILAGLDLSVEAESL------DREDLWLPGYQTQL 529
            +S+ +     I  A EAA+  D  ++  G       E        DR  L L G Q +L
Sbjct: 491 TRSDPAENRRLIAQAVEAARHVDRIVLTLGDTEQSSREGWADNHLGDRPSLDLMGEQQEL 550

Query: 530 INQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNP 589
            + +  + K P+ +V+++  G   +  + +    AIL   Y GE+GG A+ADV+FG  NP
Sbjct: 551 FDALKALGK-PIAVVLIN--GRPASTVKVSEQADAILEGWYLGEQGGHAVADVLFGDVNP 607

Query: 590 GGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPG--RTYKFYNGPTLYPFGYGLSYTQFK 647
           GG+LP+T         +P ++  L P+     P   R Y F     LYPFG+GLSYT F 
Sbjct: 608 GGKLPVT---------IPRSAGQL-PMFYNVKPSARRGYLFDTTDPLYPFGFGLSYTSFD 657

Query: 648 YNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGST 707
            +                                P +    +         VD +N G  
Sbjct: 658 LS-------------------------------APRLSAAKIGVGGTTRVSVDVRNSGRR 686

Query: 708 DGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL 767
           +G +VV +Y +         IK++ GFQRV ++ G  + + F     ++L + +   + +
Sbjct: 687 EGDEVVQLYVRDKVGSVTRPIKELKGFQRVTLKPGEVRTVTFTV-GPEALQMWNDHMDRV 745

Query: 768 LPAGEHTIFVGNGGVSF 784
           +  G+  I  GN  V+ 
Sbjct: 746 VEPGDFEIMTGNSSVAL 762


>gi|262405837|ref|ZP_06082387.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|294647798|ref|ZP_06725350.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294806192|ref|ZP_06765039.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345510348|ref|ZP_08789916.1| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
 gi|262356712|gb|EEZ05802.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
 gi|292636706|gb|EFF55172.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CC 2a]
 gi|294446448|gb|EFG15068.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           xylanisolvens SD CC 1b]
 gi|345454537|gb|EEO48843.2| glycoside hydrolase family beta-glycosidase [Bacteroides sp. D1]
          Length = 800

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 232/862 (26%), Positives = 371/862 (43%), Gaps = 165/862 (19%)

Query: 5   VSSLLCFSLSIALLVFSTNAVDANG---------SSSPVFVCDPGRFSKLGLQMSSFLFC 55
           +  LLC +L ++     + ++ AN          +S  ++      F+K G++    ++ 
Sbjct: 1   MKKLLCLALLVSAGSIYSGSISANNKPTDNKSGNNSKDIYKKTWIDFNKNGIKD---VYE 57

Query: 56  DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSNV-- 109
           D S P   R+ DL+S+MTL+EK  Q+    +G  R+     P   W +E    G+ N+  
Sbjct: 58  DPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNIDE 116

Query: 110 ---GPGTHFDDV-------------------------IP--------------GATSFPT 127
              G G    ++                         IP               AT FP 
Sbjct: 117 QANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDLTNEGIRGLCHDRATMFPA 176

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
                A++N+ L ++I +  + EA+A   LG   +  +SP +++A+DPRWGR+ E+ GED
Sbjct: 177 QCGQGATWNKKLIREIAKVTANEAKA---LGYTNI--YSPILDIAQDPRWGRVVESYGED 231

Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
           P++ G      + GLQ  EG             + +  KH+A Y +           D  
Sbjct: 232 PYLAGELGKQMILGLQS-EG-------------IVATPKHFAVYSIPVGGRDGGTRTDPH 277

Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
           V  ++M+  +L PF   ++E  A  VM SYN  +G P       L + +R +W   GY+V
Sbjct: 278 VAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVV 337

Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAVQQG 358
           +D ++++ +   H+ +  ++E+  AQ + AGL++      TNFT          +A+ +G
Sbjct: 338 SDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPPQDFILPLRHAINEG 391

Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
           KV    +D+ +  +  V   +G FD   P      +  + +D +  ++ +AA E +VLLK
Sbjct: 392 KVSLHTLDQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESVVLLK 451

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKT 474
           N    LPL S   K +AV+GP+A     +   Y        +   G   Y   + V Y  
Sbjct: 452 NKNQMLPL-SKNFKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAK 510

Query: 475 GCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
           GCD +                +    I  A E AK +D  I++ G +     E   R +L
Sbjct: 511 GCDIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFSRTNL 570

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
            L G Q QL+  V    K PV+LV++      I +A  N  + AI+ A +PGE  G AIA
Sbjct: 571 DLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIA 627

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGY 639
            V+FG +NPGGRL +T+     V  +P  + P +P  DS G      K      LYPFGY
Sbjct: 628 KVLFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDSKG------KVRVDGVLYPFGY 678

Query: 640 GLSYTQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           GLSYT F Y+ L  +K +   Q N+                           L C     
Sbjct: 679 GLSYTTFGYSDLKISKPVIGPQENIT--------------------------LSC----- 707

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
                +N G   G +VV +Y +       TY K + GF+R+ ++ G  + + F     + 
Sbjct: 708 ---TVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QD 763

Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
           L + D      +  G  ++ VG
Sbjct: 764 LGLWDKNNQFTVEPGSFSVMVG 785


>gi|451821117|ref|YP_007457318.1| periplasmic beta-glucosidase BglX [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
 gi|451787096|gb|AGF58064.1| periplasmic beta-glucosidase BglX [Clostridium
           saccharoperbutylacetonicum N1-4(HMT)]
          Length = 750

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 214/718 (29%), Positives = 341/718 (47%), Gaps = 94/718 (13%)

Query: 76  EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG-ATSFPTVILTTAS 134
           EK  +L   A    RLG+P        L G+          DVI G  T FP  +    S
Sbjct: 95  EKSNELQKIAVEESRLGIP-------ILFGL----------DVIHGYRTIFPIPLAEACS 137

Query: 135 FNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGR 193
           F+    K+  +  + EA A      AGL + ++P ++++RDPRWGR+ E  GEDP++   
Sbjct: 138 FDIEKIKESARIAAKEASA------AGLHWTFAPMVDISRDPRWGRVAEGAGEDPYLGSV 191

Query: 194 YAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDM 253
            A   V G Q  E  +N       P  + +C KH+A Y   +  G D    D  +  Q +
Sbjct: 192 IAKARVEGFQG-ESLDN-------PESILACAKHFAGYGAPDG-GRDYNTVDMSL--QTL 240

Query: 254 EETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSI 313
            + +L PF+   + G   + M ++N +NGIP   +  LL   +R ++  +G++V+D +SI
Sbjct: 241 HDVYLPPFKAAAEAG-VGTFMSAFNDLNGIPCTVNKYLLTDVLREKFGFNGFVVSDANSI 299

Query: 314 -QVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQ-YYTNFTGNAVQQGKVKETDIDKSLKY 371
            +V+V  H +  D+K  A  + L AGLD+D  Q  Y N     V++G + E  +D++++ 
Sbjct: 300 PEVVV--HGYAEDNKA-ASKKALNAGLDMDMSQGTYRNELPELVKEGDILEEVLDEAVRR 356

Query: 372 LYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAK 429
           +  V   LG FD +P      K++  +   E++E A + +R  IVLLKN+ N LPL    
Sbjct: 357 VLRVKFLLGLFD-NPYRTDAKKEEKTLLCKEHLEAARDISRRSIVLLKNENNALPLKK-D 414

Query: 430 VKTVAVVGPHANATVAMIGNYA--GIPCRYMSPIAGF----SGYANVTYKTGCDDVACKS 483
           +K +AVVGP A     M+G ++  G P   ++ I+G     S    + Y  GC  +  + 
Sbjct: 415 LKKIAVVGPLAENAAEMLGTWSHTGNPSDVVTIISGIKAAVSTETEILYAEGC-KITGEE 473

Query: 484 NNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVIL 543
                 A   AK +D  I + G +  +  E+  R D+ LPG Q +L+ ++ ++ K P+I+
Sbjct: 474 CIDFEGAVRVAKESDVIIAVVGENSDMSGEAASRIDINLPGKQEELLKELRKIGK-PLIV 532

Query: 544 VIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITW-YNGDY 602
           V+++   + I +     N+ A++ A   G + G AIADV+FG +NP G+L  T+ Y+   
Sbjct: 533 VLINGRPLTIPW--EAENVDALVEAWQLGTQSGNAIADVLFGDYNPSGKLVATFPYSVGQ 590

Query: 603 VQMLPLTSMPLRPVDSLGYPGRTYKFYNGPT--LYPFGYGLSYTQFKYNLLSFTKTIQVN 660
           V +     M  RP   + +   T K+ +GP   LYPFG+GLSYT FKY  LS        
Sbjct: 591 VPIYYNNPMTGRPAGKIKF---TSKYIDGPAEPLYPFGFGLSYTTFKYENLS-------- 639

Query: 661 LNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP 720
                                  +L  + +  D    KV   N G   G +VV +Y    
Sbjct: 640 -----------------------ILSAENKIGDTVAVKVYVTNTGEVSGEEVVQLYVSDV 676

Query: 721 AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                  +K++  F++V ++    K I F  N  K L   D   N ++  G   ++VG
Sbjct: 677 VASRVRPVKELKSFEKVLLQPKECKTIIFKLN-TKDLGFHDENMNYVVEPGLFKVYVG 733


>gi|402494058|ref|ZP_10840805.1| b-glucosidase [Aquimarina agarilytica ZC1]
          Length = 708

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 209/746 (28%), Positives = 346/746 (46%), Gaps = 94/746 (12%)

Query: 62  SIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWS--EALHGVSNVGPGTHFDDVI 119
           S R KDL  ++  D K  ++G F + + +  + + +  +  E+  G+    P     DVI
Sbjct: 16  SSRSKDLPEQLKQDVKNGKIGAFLNVMNKAYVDELQRIAIEESPQGI----PLIFARDVI 71

Query: 120 PG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRW 177
            G  T FP  +   AS++    K   +  + EA +       G+ + ++P +++A+D RW
Sbjct: 72  HGFKTIFPIPLGLAASWDAETAKSAARVSAIEASSF------GIRWTFAPMLDIAQDSRW 125

Query: 178 GRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWK 237
           GRI E+PGEDP++    A  YV G Q+        DL S+P  +++C KH+  Y      
Sbjct: 126 GRIAESPGEDPYLASILAKAYVEGFQN-------NDL-SQPTSLAACAKHFIGYGA---- 173

Query: 238 GVDRYHFDARVTEQDM-EETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTV 296
            +    ++  +  Q +   T+L+PFE  +  G A +VM S+N +NG+P+  +  LLN  +
Sbjct: 174 AIGGRDYNTAIIHQPLLHNTYLKPFEAALAAG-APTVMTSFNEINGVPASGNKWLLNDIL 232

Query: 297 RGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAV 355
           RG+ D  G++V+D +S   M+D H +  + K  A   +  AGLD++   + Y N     +
Sbjct: 233 RGKLDFKGFVVSDWNSTTGMID-HGYAKNEKHTA-ELSFNAGLDMEMTSKSYENHLKELL 290

Query: 356 QQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVL 415
           ++ K+ ET +D  +  +  V  +L  F  +P        +    ++++LA +A  +  VL
Sbjct: 291 EEKKITETQLDFLVANILRVKFQLDLFK-NPYRSKTFTGNYYDQKHLDLAKKAVIKSSVL 349

Query: 416 LKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYMSPIAGFSG-YANVTY 472
           LKN+   LPLN  K   VAV+GP ANA +  +G +   G      +P + F+    N  +
Sbjct: 350 LKNNA-ILPLN--KNTKVAVIGPLANAPLEQLGTWIFDGDKKHTQTPTSAFTNNKVNFKF 406

Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQ 532
             G       S      A E A+ +D  +   G +  +  E+  R  + LPG Q  LI  
Sbjct: 407 TEGLSYSRDTSTQGFKKALEIAEASDVILFFGGEEAILSGEAHSRASIDLPGKQEALIKA 466

Query: 533 VAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGR 592
           +A+  K P++LVIM  GG  ++      ++ A+L A +PG  GG AI ++++GK  P GR
Sbjct: 467 LAKTGK-PIVLVIM--GGRPLSITNIIDDVDAVLMAWHPGTMGGPAIYEMLWGKSEPQGR 523

Query: 593 LPITWYNGDYVQMLPL------TSMPLRP-----VDSL------GYPGRTYKFYN--GPT 633
           LP++W        LPL      T  P  P     +DS+         G T  + +     
Sbjct: 524 LPVSW--PKTAGQLPLFYNHKSTGRPFDPKSFVQMDSIPVGAWQSSLGNTTHYLDLGAAP 581

Query: 634 LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDD 693
            +PFGYGL YT+F Y  L  +KT                                +  ++
Sbjct: 582 HFPFGYGLGYTRFSYKNLKISKTT-------------------------------ISKNE 610

Query: 694 YFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNA 753
                V   N G   GSD+V +Y +         +K++  F+ +F+  G  K ++F    
Sbjct: 611 TVSLSVTITNTGKNAGSDIVQLYIQDIVGSLTRPVKELKRFKPIFLEKGETKTVEFTITP 670

Query: 754 CKSLNIVDYAANTLLPAGEHTIFVGN 779
            K L  V+     +L +G+  +FVGN
Sbjct: 671 -KDLMFVNNTLQPVLESGDFNVFVGN 695


>gi|423303939|ref|ZP_17281938.1| hypothetical protein HMPREF1072_00878 [Bacteroides uniformis
           CL03T00C23]
 gi|423307339|ref|ZP_17285329.1| hypothetical protein HMPREF1073_00079 [Bacteroides uniformis
           CL03T12C37]
 gi|392686630|gb|EIY79933.1| hypothetical protein HMPREF1072_00878 [Bacteroides uniformis
           CL03T00C23]
 gi|392690354|gb|EIY83622.1| hypothetical protein HMPREF1073_00079 [Bacteroides uniformis
           CL03T12C37]
          Length = 736

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 214/775 (27%), Positives = 356/775 (45%), Gaps = 109/775 (14%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFA--------------HGVP-RLGLPQY- 96
           ++ D+  P   RV DLVSRMTL+EKVQQL  +                 +P  LG   Y 
Sbjct: 28  IYKDAKAPIEERVNDLVSRMTLEEKVQQLNQYTLGRNNNENNRGEEVKKIPATLGSLIYF 87

Query: 97  --------EWWSEALHGVSNVGPGTHFD-DVIPG-ATSFPTVILTTASFNESLWKKIGQA 146
                   E   +A+   S +G    F  DVI G  T +P  +    S+N  L ++    
Sbjct: 88  DEDANLRNEAQRKAMEE-SRLGIPILFGYDVIHGFRTIYPISLGQACSWNPQLVEQACAV 146

Query: 147 VSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDV 205
            + EAR       +G+ + +SP I+VARD RWGR+ E  GEDP+       N V G+  +
Sbjct: 147 AAQEARM------SGVDWTFSPMIDVARDGRWGRVAEGYGEDPYT------NAVFGVASI 194

Query: 206 EGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCV 265
           +G++     +S+  +V++C KHY  Y         R +    ++ Q + +T++ P+E  V
Sbjct: 195 KGYQGEDMSDSK--RVAACLKHYIGYGASE---AGRDYVYTEISNQTLWDTYIPPYEAGV 249

Query: 266 KEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLAD 325
           K G A+++M S+N ++G P  A+   + + ++  W   G++V+D  ++  ++D     AD
Sbjct: 250 KAG-AATLMSSFNDISGTPGSANHYTMTEILKNRWKHDGFVVSDWSAVPQLID-QGHAAD 307

Query: 326 SKEDAVAQTLKAGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDG 384
            KE A      AGL++D  G  Y       V++GK+    +D ++K +  +  RLG FD 
Sbjct: 308 RKE-AARLAFNAGLEMDMMGHCYDKHMAKLVEEGKISMQLVDDAVKRVLRIKFRLGLFDN 366

Query: 385 SPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATV 444
                S  K+     +++ +A + A E IVLLKN+   LPL +    T+AV+GP    + 
Sbjct: 367 PYTPTSTEKERFLLPQSLTIAEKLAEETIVLLKNENKVLPLANGNKPTIAVMGPLVQNSA 426

Query: 445 AMIGNYAG-------IPCRYMSPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEA-AKT 496
            ++G++ G       +P +  +  A F+G A + Y  GCD     ++ S F+ + A A+ 
Sbjct: 427 ELLGSWYGHGHAEDVLPIK-KALDAEFAGKAELIYTEGCDFDG--NDTSKFSEALAVARK 483

Query: 497 ADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFA 556
           AD  ++  G       E+  R  + LP  Q + I ++ +  K P++L +  A G  +  +
Sbjct: 484 ADIILLCMGEKKKWSGENASRSIIELPAIQEKFIAEMKKAGK-PIVLAL--ANGRPLGLS 540

Query: 557 ETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL---TSMPL 613
           +      AI+    PG  GG+ +A V+ G+ NP G+L IT+        +P+        
Sbjct: 541 KVEPLCDAIVEMWQPGVPGGKPLAGVLSGRVNPSGKLSITFPRS--TGQIPIYYNQRKTA 598

Query: 614 RPVDSLGYPGRTYKFYNGPT--LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLN 671
           RP        ++ K+ N P+  LY FGYGLSYT F Y          +NL K        
Sbjct: 599 RP--------QSGKYQNIPSTPLYEFGYGLSYTTFNYG--------NINLPK-------- 634

Query: 672 YTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQV 731
                            +R  +    ++   NVG  DG++VV  +   P        K++
Sbjct: 635 ---------------ETIRRGEKLVMEIPVTNVGKRDGAEVVHWFISDPFSTITRPCKEL 679

Query: 732 IGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVGNGGVSFPI 786
             F++  ++AG     +F  +  + L  V+      L  GE+ + V +  V F +
Sbjct: 680 KHFEKQLIKAGETHIFRFEIDPMRDLAFVNANGEHFLENGEYYVIVKDQKVKFTV 734


>gi|420148909|ref|ZP_14656095.1| glycosyl hydrolase family 3, N-terminal domain protein
           [Capnocytophaga sp. oral taxon 335 str. F0486]
 gi|394754508|gb|EJF37885.1| glycosyl hydrolase family 3, N-terminal domain protein
           [Capnocytophaga sp. oral taxon 335 str. F0486]
          Length = 770

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 210/733 (28%), Positives = 352/733 (48%), Gaps = 99/733 (13%)

Query: 64  RVKDLVSRMTLDEKVQQLGDFAHGVPRLG---LPQYEWWSEA------LHGVSNVG---- 110
           RV  ++  MTL+EK+ Q+  F+      G     +Y+ + E        +  S VG    
Sbjct: 46  RVDSVLRLMTLEEKIGQMTQFSADWSVTGPVMADKYQPYLEKGLVGSIFNATSVVGMRKL 105

Query: 111 ------------PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL 157
                       P     DVI G  T FP  +  + S++ +L +K  +  + EA A    
Sbjct: 106 QKIAVEQTRLGIPILFGQDVIHGYKTIFPIPLAESCSWDLALMRKTAELAAREATA---- 161

Query: 158 GRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNS 216
              G+ + ++P +++ RD RWGR  E  GEDP++    A   V+G Q   G +N   L+S
Sbjct: 162 --DGINWTFAPMVDITRDARWGRAMEGAGEDPYLGSLIAEARVKGFQ---GGDNWQTLSS 216

Query: 217 RPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
            P  + +C KH+A Y      G D  +  A ++   +   +L P+E  +  G   S+M S
Sbjct: 217 -PHTLLACGKHFAGYGAAE-SGKD--YNTAELSMHTLRNVYLPPYEATLNAG-VGSIMAS 271

Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
            N +NG+P+ A   LL + +R EW  +G +V+D   I  +V  H    D K+ A   +  
Sbjct: 272 LNEINGVPATAYKWLLTEVLRKEWGFNGLLVSDYTGINELV-RHGVAKDDKQ-AANLSAN 329

Query: 337 AGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV--SLGK 393
           AG+++D  G  +  +    V++GKV E  IDK+++++  +   LG FD   +Y+  +  K
Sbjct: 330 AGIEMDMNGATFIKYLSALVKEGKVTEAQIDKAVRHILEMKFLLGLFDDPYRYLDETRAK 389

Query: 394 QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA-- 451
           ++  ++E +++A +A    +VLLKN+   LP+     KT+AV+GP  N T  + G++   
Sbjct: 390 ENTFTEEYLKVARQAVASSVVLLKNEAEVLPIKKDSGKTIAVIGPMMNNTSDINGSWTCL 449

Query: 452 GIPCRYMSPIAGFSGY-----ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGL 506
           G   + +S + G +         + Y  GC      S   +  A   A+ AD  ++  G 
Sbjct: 450 GDGKQSVSLLTGLTEKYKGTNVKLLYAEGCG-FTTISTEQLKEAVAIARKADRVLVAVGE 508

Query: 507 DLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAIL 566
             S   ES  R D+ LP  Q QL+  +  + K P+ ++  S   +D+++   N N++AIL
Sbjct: 509 QSSWSGESAVRTDIRLPQAQRQLLEALKAINK-PIAIITFSGRPLDLSWE--NENVQAIL 565

Query: 567 WAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPL----RPV 616
            A +PG +GG  IADV+ G  NP G+L +++     V  +P+      T  P+      V
Sbjct: 566 QAWFPGTQGGYGIADVIAGDVNPSGQLTMSFPRS--VGQIPIYYNYKSTGRPVYTNNEEV 623

Query: 617 DSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDA 676
           D   +    Y   +   LYPFGYGLSYT F  N         V+LNK    +++   +D+
Sbjct: 624 DHRPHYNAGYLDSSITPLYPFGYGLSYTTFAIN--------NVHLNK----KSIKRYNDS 671

Query: 677 SKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQR 736
                  ++VN              QN G+T+G  VV +Y++      +  +K++ GFQ+
Sbjct: 672 -------IIVN-----------ASVQNTGTTEGEIVVQLYTRQLVASVSRPVKELKGFQK 713

Query: 737 VFVRAGRNKRIKF 749
           + ++AG +K++ F
Sbjct: 714 IPLKAGESKQVHF 726


>gi|313145353|ref|ZP_07807546.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
 gi|313134120|gb|EFR51480.1| periplasmic beta-glucosidase [Bacteroides fragilis 3_1_12]
          Length = 802

 Score =  254 bits (648), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 234/814 (28%), Positives = 346/814 (42%), Gaps = 159/814 (19%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSE------------ 101
           + + S+P   RV+ L+S+MTL+EKV Q+      +  LG P YE   E            
Sbjct: 37  YENPSVPVEERVEHLLSQMTLEEKVGQM------LTSLGWPMYERVGEEIRLTARLEKEI 90

Query: 102 ------ALHGVSNVGPGT--------------------------HFDDVIP--------- 120
                 AL G     P T                          H    IP         
Sbjct: 91  SEYHIGALWGFMRADPWTQRTLHTGLNPSLAARASNRLQAFVMEHSRLGIPLFLAEECPH 150

Query: 121 -----GATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDP 175
                G T FPT I   +++N  L +++G+ ++TEA A     +     + P +++ARDP
Sbjct: 151 GHMAIGTTVFPTSIGQASTWNPELIRQMGRVIATEASA-----QGAHIGYGPVLDLARDP 205

Query: 176 RWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDN 235
           RW R+ ET GEDP++ G      VRG Q          L  R   V +  KH+A+Y    
Sbjct: 206 RWSRVEETYGEDPYLNGVMGAALVRGFQ-------GDTLRGRK-SVIATLKHFASY---G 254

Query: 236 WKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQT 295
           W         A + E+++EE    PF   V  G A SVM SYN ++G P      LL   
Sbjct: 255 WTEGGHNGGTAHLGERELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDI 313

Query: 296 VRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNA 354
           ++  W   G++V+D  +I  + ++   +A S  +A  + + AG+D D G   Y      A
Sbjct: 314 LKDRWQFKGFVVSDLYAIGGLREHG--VAGSDYEAAVKAVNAGVDSDLGTNVYAEQLVAA 371

Query: 355 VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIV 414
           V++G V    +DK+++ +  +   +G FD          Q + S E+I LA E AR+ IV
Sbjct: 372 VRKGDVAMETVDKAVRRILFLKFHMGLFDAPFVDDKRPAQLVASPEHIGLAREVARQSIV 431

Query: 415 LLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY------- 467
           LLKN+   LPL    ++T+AV+GP+A+    M+G+Y   P    S +    G        
Sbjct: 432 LLKNEDKLLPLKK-DIRTLAVIGPNADNGYNMLGDYTA-PQADGSVVTVLEGIRQKVSKD 489

Query: 468 ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAG----LDLSVE------------ 511
             V Y  GC  V   S      A EAA++AD  +++ G     D S E            
Sbjct: 490 TRVLYAKGCA-VRDSSRTGFADAIEAARSADVVVMVVGGSSARDFSSEYEETGAAKVSAN 548

Query: 512 -------AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKA 564
                   E  DR  L L G Q +L+ +V ++ K P++LV++   G  +          A
Sbjct: 549 RVSDMESGEGYDRATLHLMGRQLELLEEVRKLGK-PMVLVLIK--GRPLLMEGVIQEADA 605

Query: 565 ILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGR 624
           IL A YPG +GG A+ADV+FG +NP GRL ++      V  LP+     R  +   Y   
Sbjct: 606 ILDAWYPGMQGGNAVADVLFGDYNPAGRLTLSVPRS--VGQLPVYYNTKRKGNRSRYIEE 663

Query: 625 TYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGV 684
                 G   YPFGYGLSYT F Y  +    + + N     HCR                
Sbjct: 664 A-----GTPRYPFGYGLSYTTFSYTGMKVRVSEESN-----HCR---------------- 697

Query: 685 LVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRN 744
                      +  V  +N G+ DG +VV +Y +       T  +Q+  F RV ++AG  
Sbjct: 698 ----------VDVSVTVRNQGTVDGDEVVQLYLRDEVGSFTTPDRQLRAFSRVRLKAGET 747

Query: 745 KRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           + I F  +  KSL +        +  G  T+  G
Sbjct: 748 REITFTLDK-KSLALYMRDGEWAVEPGRFTVMAG 780


>gi|423281958|ref|ZP_17260843.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
           615]
 gi|404582445|gb|EKA87139.1| hypothetical protein HMPREF1204_00381 [Bacteroides fragilis HMW
           615]
          Length = 805

 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 232/807 (28%), Positives = 348/807 (43%), Gaps = 145/807 (17%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSE 101
           + + S P   RV+ L+S+MTL+EKV Q+            G+     P+L     E+   
Sbjct: 40  YENPSAPVEYRVEHLLSQMTLEEKVGQMLTSLGWPMYERVGEDIRLTPQLEKEIGEYHIG 99

Query: 102 ALHGVSNVGPGT--------------------------HFDDVIP--------------G 121
           +L G     P T                          H    IP              G
Sbjct: 100 SLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIG 159

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
            T FPT I   +++N  L +++G+ ++ EA A     +     + P +++ARDPRW R+ 
Sbjct: 160 TTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVE 214

Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
           ET GEDP++ G      VRG Q     E   D  S    V +  KH+A+Y    W     
Sbjct: 215 ETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGH 263

Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
               A + E+++EE    PF   V  G A SVM SYN ++G P      LL   ++  W 
Sbjct: 264 NGGTAHIGERELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQ 322

Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKV 360
             G++V+D  ++  + ++   +A +  +A  + + AG+D D G   Y      AV++G V
Sbjct: 323 FKGFVVSDLYAVGGLREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDV 380

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
               IDK+++ + ++  ++G FD          Q + S E+  LA E AR+ IVLLKN  
Sbjct: 381 AVATIDKAVRRILSLKFQMGLFDDPFVDEKQAAQLVASSEHTGLAREVARQSIVLLKNKD 440

Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKT 474
             LPL    ++T+AV+GP+A+    M+G+Y      G     +  I    S    V Y  
Sbjct: 441 KLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAK 499

Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAG----LDLSVE------------------- 511
           GC  V   S      A E A+ ADA +++ G     D S E                   
Sbjct: 500 GC-AVRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMES 558

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
            E  DR  L L G Q +L+ +++ + K PV+LV++   G  +         +AI+ A YP
Sbjct: 559 GEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIK--GRPLLMEGAIQEAEAIVDAWYP 615

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G +GG A+ADV+FG +NP GRL ++      V  LP+     R     G   R Y    G
Sbjct: 616 GMQGGNAVADVLFGDYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YIEEPG 668

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
              YPFGYGLSYT F Y  +     +QV                           +D R 
Sbjct: 669 TPRYPFGYGLSYTTFSYTDMK----VQVTEGS-----------------------DDCRV 701

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
           D      V  QN G+ DG +V  +Y +       T  KQ+  F R+ ++AG ++ + F  
Sbjct: 702 D----VTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTL 757

Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVG 778
           +  KSL +       ++  G  TI VG
Sbjct: 758 DK-KSLALYMQEGEWVVEPGLFTIMVG 783


>gi|404449838|ref|ZP_11014826.1| periplasmic beta-glucosidase [Indibacter alkaliphilus LW1]
 gi|403764685|gb|EJZ25578.1| periplasmic beta-glucosidase [Indibacter alkaliphilus LW1]
          Length = 763

 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 205/699 (29%), Positives = 328/699 (46%), Gaps = 98/699 (14%)

Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARD 174
           DVI G  T F   +  +++++  L +K  +  + EA A       G+ + +SP ++V+RD
Sbjct: 112 DVIHGYETLFSIPLGLSSTWDMELIEKSARIAAIEASA------DGINWTFSPMVDVSRD 165

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGR++E  GEDPF+  + A   +RG Q     ++ T  N+    + +C KH+A Y   
Sbjct: 166 PRWGRVSEGNGEDPFLGAKIAQAMIRGYQ----GDDLTAYNT----IMACVKHFALYGAP 217

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
              G D    D  ++ Q M   +  P++  V+ G   SVM ++N V+GIP+ A+  L+  
Sbjct: 218 E-AGRDYNTVD--MSRQRMYNEYFLPYQAAVEAG-VGSVMTAFNDVDGIPASANKWLMTD 273

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGN 353
            +R +W   G++V D  +I  M  +   L D  ++  A  L AG+D+D  G+ +      
Sbjct: 274 VLREQWGFDGFVVTDYTAINEMTSHG--LGD-LQNVSALALLAGVDMDMVGEGFLTTLEK 330

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGK--QDICSDENIELAAEAARE 411
           ++++GK+ E+ ID ++K +     +LG FD   +Y  LG+  ++I + E+ + A E A +
Sbjct: 331 SLEEGKISESHIDTAVKRILVAKYKLGLFDDPYRYSDLGRSEKEIFTQEHRKTAREIAAQ 390

Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYAN-- 469
             VLLKN+ + LPL   K   +A+VGP A+    M G ++ +  R+   I+   G  N  
Sbjct: 391 SFVLLKNEGSILPLK--KSGKIALVGPMADNRENMSGTWS-VAGRFTEAISLKDGLENAL 447

Query: 470 ---VTYKTG-----CDDVACKSNNSIFA----------------ASEAAKTADATIILAG 505
              VT  T       +D   +   SIF                 A E A+ +D  I   G
Sbjct: 448 GNEVTLLTARGANVVEDAEYEERVSIFGKPTYRDERPEETLISEALEIARESDVIIAAMG 507

Query: 506 LDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAI 565
               +  E+  R D+ LP  Q +L+  + +  K PV+LV+ +  G  +A      ++  I
Sbjct: 508 ESAEMSGEAASRSDIELPANQRRLLEALLDTGK-PVVLVLFT--GRPLAIKWEAEHVSGI 564

Query: 566 LWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPVDSL 619
           L   + G E G AIADV+FG  NP G+L  T+     V  +P+      T  PL      
Sbjct: 565 LNVWFAGSEAGDAIADVLFGDVNPSGKLTATFPQN--VGQIPIFYNHKNTGRPLPEGQWF 622

Query: 620 GYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKT 679
                 Y   +   LYPFGYGLSYT+F Y+ L  +                         
Sbjct: 623 QKFRSNYLDVSNEPLYPFGYGLSYTEFDYSGLQLS------------------------- 657

Query: 680 RCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFV 739
                   +L  D+  +  VD +N GS DGS+VV +Y +         +K++ GF++VFV
Sbjct: 658 ------AEELSGDETLQITVDVRNAGSLDGSEVVQLYVRDLVASITRPVKELKGFEKVFV 711

Query: 740 RAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           +AG  + + F     + L   +  A  +  AGE  I VG
Sbjct: 712 KAGETRSVTFELTK-RDLMFYNQDAEFVWEAGEFEIMVG 749


>gi|298482587|ref|ZP_07000772.1| xylosidase [Bacteroides sp. D22]
 gi|336405443|ref|ZP_08586122.1| hypothetical protein HMPREF0127_03435 [Bacteroides sp. 1_1_30]
 gi|295085727|emb|CBK67250.1| Beta-glucosidase-related glycosidases [Bacteroides xylanisolvens
           XB1A]
 gi|298271294|gb|EFI12870.1| xylosidase [Bacteroides sp. D22]
 gi|335938024|gb|EGM99918.1| hypothetical protein HMPREF0127_03435 [Bacteroides sp. 1_1_30]
          Length = 800

 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 232/862 (26%), Positives = 370/862 (42%), Gaps = 165/862 (19%)

Query: 5   VSSLLCFSLSIALLVFSTNAVDANG---------SSSPVFVCDPGRFSKLGLQMSSFLFC 55
           +  LLC +L ++     + ++ AN          +S  ++      F+K G++    ++ 
Sbjct: 1   MKKLLCLALLVSAGSIYSESISANNKPTDNKSGNNSKDIYKKTWIDFNKNGIKD---VYE 57

Query: 56  DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSNV-- 109
           D S P   R+ DL+S+MTL+EK  Q+    +G  R+     P   W +E    G+ N+  
Sbjct: 58  DPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNIDE 116

Query: 110 ---GPGTHFDDV-------------------------IP--------------GATSFPT 127
              G G    ++                         IP               AT FP 
Sbjct: 117 QANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPA 176

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
                A++N+ L ++I +  + EA+A   LG   +  +SP +++A+DPRWGR+ E+ GED
Sbjct: 177 QCGQGATWNKKLIREIAKVTANEAKA---LGYTNI--YSPILDIAQDPRWGRVVESYGED 231

Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
           P++ G      + GLQ  EG             + +  KH+A Y +           D  
Sbjct: 232 PYLAGELGKQMILGLQS-EG-------------IVATPKHFAVYSIPVGGRDGGTRTDPH 277

Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
           V  ++M+  +L PF   ++E  A  VM SYN  +G P       L + +R +W   GY+V
Sbjct: 278 VAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVV 337

Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAVQQG 358
           +D ++++ +   H+ +  ++E+  AQ + AGL++      TNFT           A+ +G
Sbjct: 338 SDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPPQDFILPLRRAIDEG 391

Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
           KV    +D+ +  +  V   +G FD   P      +  + +D +  ++ +AA E +VLLK
Sbjct: 392 KVSLHTLDQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESVVLLK 451

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKT 474
           N    LPL S   K +AV+GP+A     +   Y        +   G   Y   + V Y  
Sbjct: 452 NKNQMLPL-SKNFKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAK 510

Query: 475 GCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
           GCD +                +    I  A E AK +D  I++ G +     E   R +L
Sbjct: 511 GCDIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFSRTNL 570

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
            L G Q QL+  V    K PV+LV++      I +A  N  + AI+ A +PGE  G AIA
Sbjct: 571 DLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIA 627

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGY 639
            V+FG +NPGGRL +T+     V  +P  + P +P  DS G      K      LYPFGY
Sbjct: 628 KVLFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDSKG------KVRVDGVLYPFGY 678

Query: 640 GLSYTQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           GLSYT F Y+ L  +K +   Q N+                           L C     
Sbjct: 679 GLSYTTFGYSDLKISKPVIGPQENIT--------------------------LSC----- 707

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
                +N G   G +VV +Y +       TY K + GF+R+ ++ G  + + F     + 
Sbjct: 708 ---TVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVSFTLTP-QD 763

Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
           L + D      +  G  ++ VG
Sbjct: 764 LGLWDKNNQFTVEPGSFSVMVG 785


>gi|189462809|ref|ZP_03011594.1| hypothetical protein BACCOP_03507 [Bacteroides coprocola DSM 17136]
 gi|189430425|gb|EDU99409.1| glycosyl hydrolase family 3 N-terminal domain protein [Bacteroides
           coprocola DSM 17136]
          Length = 754

 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 195/661 (29%), Positives = 312/661 (47%), Gaps = 87/661 (13%)

Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARD 174
           DVI G  T FP  +   ASFN  L ++  +  + EA A       G+ + ++P I+V+RD
Sbjct: 110 DVIHGYKTIFPICLGQAASFNPDLVRESARVAAIEASA------DGIRWTFAPMIDVSRD 163

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGRI E+ GEDP++        + G Q         D  + P  +++C KH+  Y   
Sbjct: 164 PRWGRIAESCGEDPYLTAVLGKAMIEGFQG--------DSLNDPTSIAACAKHFVGYGAA 215

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
                 R +    + E+ +   +L PFE   K   A++ M S+N  +G+PS  +  +L  
Sbjct: 216 E---SGRDYNSTFLPERLLRNVYLPPFEAAAKA-GAATFMTSFNDNDGVPSTGNKFILKN 271

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD--CGQYYTNFTG 352
            +R EW   G +V D  S   M+  H F  D+  DA  ++L AG+D+D   G +  N   
Sbjct: 272 VLREEWKYDGMVVTDWASATEMI-THGFCKDAA-DAAKKSLDAGVDMDMVSGAFSGNLE- 328

Query: 353 NAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREG 412
           N V++ K+ E  ID++++ +  +  RLG F+    YVS  +    S E++  A +A  + 
Sbjct: 329 NLVKENKISEKQIDEAVRNILRLKFRLGLFENP--YVSTPQSVKYSPEHLAKAKQAVEQS 386

Query: 413 IVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--GIPCRYMSPIAG----FSG 466
           ++LLKN   TLPLN+ +V TVAVVGP A+A    +G +   G      +P+A     +  
Sbjct: 387 VILLKNTNQTLPLNADEVHTVAVVGPLADAPHDQMGTWVFDGEKAHTQTPLAALRAVYGD 446

Query: 467 YANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQ 526
              + Y+        K    +  A  AAK AD  +   G +  +  E+    DL L G Q
Sbjct: 447 KVRIIYEPALAYSRDKQTTGLAKAVNAAKQADVVLAFVGEESILSGEAHSLADLNLQGLQ 506

Query: 527 TQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGK 586
           ++LI ++++  K P++ V+M+  G  +  A+      A+L+A +PG  GG A+AD++FGK
Sbjct: 507 SELIEKLSQTGK-PLVTVVMA--GRPLTIAKEVEESDAVLYAFHPGTMGGPALADILFGK 563

Query: 587 FNPGGRLPITWYNGDYVQMLPL------TSMPLRPVDSL-----------GYPGRTYKFY 629
            NP G+ P+T+     V  LP+      T  P    + L               R++   
Sbjct: 564 VNPSGKTPVTFPK--MVGQLPMYYAHNNTGRPALEKEMLLDEIPMEAGQTSVGCRSFFLD 621

Query: 630 NGPT-LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVND 688
            G T L+PFGYGLSYT F Y  L                                ++   
Sbjct: 622 AGSTPLFPFGYGLSYTTFSYGNLK-------------------------------IVSGK 650

Query: 689 LRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIK 748
           L   D  +  V+ +N G  +G++VV +Y +         +K++  FQRV ++ G +K++ 
Sbjct: 651 LTVSDTLKVSVELKNTGRYEGTEVVQLYVQDKVGSVTRPVKELKRFQRVNLQPGESKQVM 710

Query: 749 F 749
           F
Sbjct: 711 F 711


>gi|333377431|ref|ZP_08469165.1| hypothetical protein HMPREF9456_00760 [Dysgonomonas mossii DSM
           22836]
 gi|332884165|gb|EGK04433.1| hypothetical protein HMPREF9456_00760 [Dysgonomonas mossii DSM
           22836]
          Length = 743

 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 211/778 (27%), Positives = 359/778 (46%), Gaps = 119/778 (15%)

Query: 53  LFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDF--------------AHGVPRLGLPQYEW 98
           LF   ++ Y  R++ L+ +MTL+EK+ Q+                  H    + +     
Sbjct: 18  LFAQVNIEY--RIEALLKQMTLEEKIGQMNQLHCEDWNKLKEETEKGHVGSVMSITDPNL 75

Query: 99  WSEALHGV---SNVG-PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARA 153
           ++E        S +G P  +  DVI G  T FP  +   A+FN  + +K  Q  +TEA A
Sbjct: 76  FNEIQKIAVEESRLGIPLINARDVIHGFKTIFPIPLGQAATFNPEIVEKSSQIAATEASA 135

Query: 154 MYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENAT 212
                 AG+ + ++P I++  DPRWGRI E  GEDP++V       +RG Q    H    
Sbjct: 136 ------AGIRWTFAPMIDITHDPRWGRIAEGFGEDPYLVSEMGKASIRGFQGRSLH---- 185

Query: 213 DLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASS 272
                P  + +C KH+ AY         R +    V+E+ +   +LRPFE  V+ G A+ 
Sbjct: 186 ----NPRSILACAKHFVAYGAAEG---GRDYNSTFVSERRLRNLYLRPFEEAVQSGVAT- 237

Query: 273 VMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVA 332
           +M S+N  +GIP+     LL   +R EW+ +G++++D  S+  M   H +  + KE A+ 
Sbjct: 238 IMTSFNDNDGIPASGSKFLLTDILRNEWEFNGFVISDWASVIEMA-KHGYCKNGKEAAM- 295

Query: 333 QTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSL 391
           + + AGLD++   + Y N     +++G+V  +DID +++ +  +   LG F+    Y+  
Sbjct: 296 KAVNAGLDMEMVSETYINHLPQLLKEGEVSLSDIDNAVRNILRIKFELGIFEQP--YIQD 353

Query: 392 GKQDIC-SDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNY 450
            +++I  ++ ++E A EA  +  +LLKN+ N LPLN   +K + V GP ANA    +G +
Sbjct: 354 EREEIYYAESHLEAAQEAVEQSTILLKNENNVLPLNMNNIKRILVTGPMANAPHDQLGTW 413

Query: 451 A--GIPCRYMSPIAGF---SGY-ANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILA 504
              G      +P+      SG+   + Y+         S  +     E AK  D  +   
Sbjct: 414 VFDGDKKYTRTPLISLQEQSGHIIEIIYEPALSISRDTSKYNFSKVVELAKKVDVILAFV 473

Query: 505 GLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG----GVDIAFAETNT 560
           G +  +  E+     L L G Q+ LI ++A   K P++   M+      G ++A ++   
Sbjct: 474 GEEAILSGEAHSLTTLNLLGAQSALIEELANTGK-PLVTTFMAGRPLSIGKEVALSD--- 529

Query: 561 NIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPL------------ 608
              A+L++ +PG  GG A+  ++ GK  P G+LP+T+     V  +P+            
Sbjct: 530 ---AVLYSFHPGTMGGPALVSLLTGKVIPSGKLPVTFPKN--VGQIPIYYNHNNTGRPAD 584

Query: 609 ---TSMPLRPVD----SLGYPGRTYKFYNGPT-LYPFGYGLSYTQFKYNLLSFTKTIQVN 660
              T++   P++    SLG   ++Y    G   LYPFGYGLSYT F Y+ L         
Sbjct: 585 GNETTLYQIPIEAEQTSLG--NKSYYLDAGKDPLYPFGYGLSYTTFIYSNL--------- 633

Query: 661 LNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPP 720
             +L H                    N ++ DD  E   D  N G  D ++V+ +Y +  
Sbjct: 634 --QLSH--------------------NKIKKDDTLEVSFDLSNTGKYDATEVIQIYFRDI 671

Query: 721 AEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
                  +K+++ F R+ ++AG+  ++K      K L   +     ++ +G+  +FVG
Sbjct: 672 VANIIRPVKELVHFDRINLQAGKTMKVKVEIPVSK-LAFWNIDMQKVVESGQFELFVG 728


>gi|430736195|gb|AGA60127.1| glycoside hydrolase [Aminobacter sp. Gsoil204]
          Length = 772

 Score =  253 bits (647), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 201/662 (30%), Positives = 320/662 (48%), Gaps = 89/662 (13%)

Query: 117 DVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARD 174
           DVI G  T FP  +   AS++    +K  +  +TEA A       G+ + ++P ++VARD
Sbjct: 134 DVIHGHRTIFPISLGEAASWDLKAIEKAARISATEASA------EGIHWTFAPMVDVARD 187

Query: 175 PRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVD 234
           PRWGRI+E  GED ++  R A   VRG Q         DL +    V +  KH+AAY   
Sbjct: 188 PRWGRISEGAGEDVYLGSRIAEARVRGFQ-------GNDLKAVD-TVLATAKHFAAYGAA 239

Query: 235 NWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQ 294
                 R +    ++E+ + + +L PF+       A++ M S+N V+GIP+  +  LL  
Sbjct: 240 Q---AGRDYGTVDISERTLRDVYLPPFKAAADA-GAATFMTSFNDVDGIPASGNHHLLTD 295

Query: 295 TVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC-GQYYTNFTGN 353
            +R +W   G++V D  SI  MV  H +  D ++ A  Q + AG+D+D  G  +      
Sbjct: 296 VLRDKWGFKGFVVTDYTSINEMV-AHGYSKDLQQ-AGEQAINAGVDMDLQGAVFMEHLAK 353

Query: 354 AVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD--ICSDENIELAAEAARE 411
           +V +GKV    ID ++K +  +  RLG F+   +Y    ++   +   + +E A + AR+
Sbjct: 354 SVAEGKVDVARIDAAVKAILEMKYRLGLFEDPYRYSDEAREKATVYRPDFLEAARDVARK 413

Query: 412 GIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGYA--- 468
            +VLLKN  N LPL +A  K++AV+GP  ++   MIG+++    R   P+    G     
Sbjct: 414 SMVLLKNANNALPL-AASAKSIAVIGPLGDSKADMIGSWSAAGDRKTRPVTLLEGMQARA 472

Query: 469 ----NVTYKTGCD---DVACKSNNSIFAASEA-AKTADATIILAGLDLSVEAESLDREDL 520
               +V Y  G     + A K++   FA + A A+ +D  +   G    +  E+  R  L
Sbjct: 473 PKGQSVAYVRGASYAFEDAGKTDG--FAEAIALAQKSDVIVAAMGERWDMTGEAASRTSL 530

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
            LPG Q  L+ ++ +  K P+ILV+MS     I +A  + N+ AIL A YPG  GG AIA
Sbjct: 531 DLPGNQQALLQELKKTGK-PIILVLMSGRPNSIEWA--DANVDAILEAWYPGTMGGHAIA 587

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPVDSLGYPGRTY--KFYNGP 632
           DV++G +NP G+LP T+     V  +PL      T  P+ P      P   Y  ++ N P
Sbjct: 588 DVLYGDYNPSGKLPATFPRN--VGQVPLYYDMKNTGRPIDPAK----PDAKYVSRYLNTP 641

Query: 633 T--LYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLR 690
              LYPFGYGLSYT F Y+ ++ +K                                 ++
Sbjct: 642 NTPLYPFGYGLSYTSFTYSPVTLSKA-------------------------------RIK 670

Query: 691 CDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFV 750
             +     V   N G+ DG +VV +Y +         ++++ GF+++ ++ G +K + F 
Sbjct: 671 PGEPLTASVTVTNSGARDGEEVVQLYVRDLVGSVTRPVRELKGFRKIPLKKGESKTVSFT 730

Query: 751 FN 752
             
Sbjct: 731 LT 732


>gi|299144988|ref|ZP_07038056.1| xylosidase [Bacteroides sp. 3_1_23]
 gi|298515479|gb|EFI39360.1| xylosidase [Bacteroides sp. 3_1_23]
          Length = 800

 Score =  253 bits (646), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 232/862 (26%), Positives = 371/862 (43%), Gaps = 165/862 (19%)

Query: 5   VSSLLCFSLSIALLVFSTNAVDANG---------SSSPVFVCDPGRFSKLGLQMSSFLFC 55
           +  LLC +L ++     + ++ AN          +S  ++      F+K G++    ++ 
Sbjct: 1   MKKLLCLALLVSAGSIYSGSISANNKPTDNKSGNNSKDIYKKTWIDFNKKGIKD---VYE 57

Query: 56  DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSNV-- 109
           D S P   R+ DL+S+MTL+EK  Q+    +G  R+     P   W +E    G+ N+  
Sbjct: 58  DPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNIDE 116

Query: 110 ---GPGTHFDDV-------------------------IP--------------GATSFPT 127
              G G    ++                         IP               AT FP 
Sbjct: 117 QANGLGKFGSEISYPYANSVKNRHTIQRWFMEQTRLGIPVDFTNEGIRGLCHDRATMFPA 176

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
                A++N+ L ++I +  + EA+A   LG   +  +SP +++A+DPRWGR+ E+ GED
Sbjct: 177 QCGQGATWNKKLIREIAKVTADEAKA---LGYTNI--YSPILDIAQDPRWGRVVESYGED 231

Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
           P++VG      + GLQ+ EG             + +  KH+A Y +           D  
Sbjct: 232 PYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRTDPH 277

Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
           V  ++M+  +L PF   ++E  A  VM SYN  +G P       L + +R +W   GY+V
Sbjct: 278 VAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVV 337

Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAVQQG 358
           +D ++++ +   H+ +  ++E+  AQ + AGL++      TNFT           A+ +G
Sbjct: 338 SDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPPQDFILPLRRAINEG 391

Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
           KV    +D+ +  +  V   +G FD   P      +  + +D +  ++ +AA E IVLLK
Sbjct: 392 KVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPEAVVHNDAHKAVSMKAALESIVLLK 451

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKT 474
           N+   LPL S     +AV+GP+      +   Y        +   G   Y   + V Y  
Sbjct: 452 NENQMLPL-SKNFSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYVK 510

Query: 475 GCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
           GCD +                +    I  A E AK +D  I++ G +     E   R +L
Sbjct: 511 GCDIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDVAILVLGGNEKTVREEFSRTNL 570

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
            L G Q QL+  V    K PV+LV++      I +A  N  + AI+ A +PGE  G AIA
Sbjct: 571 DLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIA 627

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGY 639
            V+FG +NPGGRL +T+     V  +P  + P +P  DS G      K      LYPFGY
Sbjct: 628 KVLFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDSKG------KVRVDGVLYPFGY 678

Query: 640 GLSYTQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           GLSYT F Y+ L  +K +   Q N+                           L C     
Sbjct: 679 GLSYTTFGYSDLKISKPVIGPQENIT--------------------------LSC----- 707

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
                +N G   G +VV +Y +       TY K + GF+R+ ++ G  + + F     + 
Sbjct: 708 ---TVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QD 763

Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
           L + D      +  G  ++ VG
Sbjct: 764 LGLWDKNNRFTVEPGSFSVMVG 785


>gi|448360576|ref|ZP_21549207.1| beta-glucosidase [Natrialba asiatica DSM 12278]
 gi|445653189|gb|ELZ06061.1| beta-glucosidase [Natrialba asiatica DSM 12278]
          Length = 777

 Score =  253 bits (646), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 211/744 (28%), Positives = 334/744 (44%), Gaps = 124/744 (16%)

Query: 76  EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
           E+  +L  +     RLG+P      E L G              P  T+FP +I   +++
Sbjct: 81  ERTNELQTYLREETRLGIPAIPH-EECLSGYMG-----------PEGTTFPQMIGMASTW 128

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
           + +L + + + +  +  A   +G A     SP ++VARD RWGR+ ET GEDP++V   A
Sbjct: 129 SPALLETVTETIRDQLEA---IGTA--HALSPVLDVARDLRWGRVEETFGEDPYLVAAMA 183

Query: 196 VNYVRGLQ-DVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
             YV GLQ D +G             +S+  KH+  +      G +R      +  +++ 
Sbjct: 184 CGYVDGLQGDGDG-------------ISATLKHFVGHAA-GAGGKNRSSVS--IGRRELR 227

Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
           ET + PFE  V+  +A SVM +Y+ ++GIP  +D +LL   +RGEW   G +V+D  S++
Sbjct: 228 ETHMFPFEAAVRTANAESVMNAYHDIDGIPCASDERLLTDILRGEWSFDGTVVSDYYSVE 287

Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDL-----DCGQYYTNFTGNAVQQGKVKETDIDKSL 369
            +   H   AD +E  VA  ++AG+D+     DC   Y +   NAV+ G++ E  +D++ 
Sbjct: 288 YLRSEHGVAADEREAGVA-AVEAGIDVELPATDC---YGDHLVNAVEAGELAEETVDEAA 343

Query: 370 KYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAK 429
           + +     R G  D              +DE   L   AARE + LL+ND + LPL   +
Sbjct: 344 RRVLRAKARKGLLDDPTVDADAATAPFGTDEARALTERAARESMTLLQNDGDLLPLTGEE 403

Query: 430 VKTVAVVGPHANATVAMIGNYAGIPCRY---------MSPIAGFSGYA-----NVTYKTG 475
             +VAVVGP A+    ++G+YA  P  Y          +P+             V ++ G
Sbjct: 404 TNSVAVVGPKADDAQELLGDYA-YPAHYPEEEIEFDATTPLDAVRARGEEHGFEVRHERG 462

Query: 476 CDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRE----------------- 518
           C      +     AA+ AA  AD T+   G   +V+    DR+                 
Sbjct: 463 CTTTGPDTEG-FDAAANAAADADVTLAFVGARSAVDFSDSDRDRINKPSVATSGEGCDVV 521

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
           DL LPG Q +L+ +V E    PV +V++S  G   A       + A++ A  PGE GG  
Sbjct: 522 DLGLPGVQRELVERVHETGT-PVAVVVVS--GRPHAMERIAATVPAVVQAWLPGERGGEG 578

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLPLT--SMPLRPVDSLGYPGRTYKFYNGPTLYP 636
           IA V+FG+ NP G LP++         +P T   +P+            Y +     LYP
Sbjct: 579 IAAVLFGEHNPAGHLPVS---------VPRTVGQLPVHYNRKPNTATEEYVYTESDPLYP 629

Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCP-GVLVNDLRCDDYF 695
           FG+GLSYT+F Y  LS +                      + +  P G +V         
Sbjct: 630 FGHGLSYTEFAYGDLSLS----------------------TDSLSPAGTIVA-------- 659

Query: 696 EFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACK 755
              V  +N G T G DV+ +Y+       A  +++++GF+RV +  G  KR+ F  +A +
Sbjct: 660 --TVTVENAGDTAGDDVLQLYASAENPDLARPVQELVGFERVSLDPGETKRVSFAVDASQ 717

Query: 756 SLNIVDYAANTLLPAGEHTIFVGN 779
            L   D   N ++  G +   +G+
Sbjct: 718 -LAYYDRDFNLVVEEGPYEFRIGH 740


>gi|409195436|ref|ZP_11224099.1| glycoside hydrolase family protein [Marinilabilia salmonicolor JCM
           21150]
          Length = 867

 Score =  253 bits (646), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 163/466 (34%), Positives = 236/466 (50%), Gaps = 41/466 (8%)

Query: 40  GRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWW 99
           G F+  G+  ++ ++ D+S     R  DL+  +TL+EKV  + D    + RLG+ +Y WW
Sbjct: 12  GIFTLAGVGCNTEIWKDNSYSPEERADDLLKELTLEEKVSLMVDRNTAIERLGIEEYNWW 71

Query: 100 SEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNL-- 157
           +EALHGV+  G           AT FP  +   A+F+  +   +  A S EARA ++   
Sbjct: 72  NEALHGVARAGQ----------ATVFPQPVGMAAAFDRDMVLDVFSAASDEARAKHHFFK 121

Query: 158 -----GR-AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
                GR  GLT W+PNINV RDPRWGR  E  GEDPF+ G      V+GLQ        
Sbjct: 122 ERGERGRYQGLTMWTPNINVFRDPRWGRGMEAYGEDPFMNGVLGTAVVKGLQ-------- 173

Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDA 270
            D + +  K+ +C KHYA +    W   +R+ F+A  +  +D+ ET+L  F+  V +GD 
Sbjct: 174 GDRSGKYDKLHACAKHYAVHSGPEW---NRHSFNAENIRPRDLHETYLPAFKKLVIDGDV 230

Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMV--DNHKFLADSKE 328
             VMC+YNR  G P C + +LL   +R EW   G +V+DC +I      D H    D+K 
Sbjct: 231 RMVMCAYNRFEGEPCCGNNQLLRDILRNEWGFDGVVVSDCWAINDFFNKDAHAMYPDAK- 289

Query: 329 DAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSP-- 386
            A    + AG DL+CG  Y +    AV+QG + E  +D SL+ L      LG  D     
Sbjct: 290 TASTDAVLAGTDLNCGDSYPSLV-EAVEQGLITEEQLDISLRRLLIARFELGEMDPDEEV 348

Query: 387 QYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
           ++  +    + S  + E+A EAAR+ + LL N    LPL    + TVAV+GP+AN ++  
Sbjct: 349 EWSKIPHSVVSSPTHSEMALEAARKSMTLLMNKNGALPLKKEGL-TVAVMGPNANDSLMQ 407

Query: 447 IGNYAGIPCRYMSPIAGFSGYA----NVTYKTGCDDVACKSNNSIF 488
            GNY G P    + + G          V Y+ G   V  +   S+F
Sbjct: 408 WGNYNGTPATTTTILQGIRNALGNDDQVIYEQGTQWVDDRIFKSVF 453



 Score =  125 bits (314), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 90/301 (29%), Positives = 141/301 (46%), Gaps = 58/301 (19%)

Query: 491 SEAAKTADATIIL--AGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAK 538
           S  AK ADA +++  +G+   +E E +          DR D+ LP  Q +++  + +  K
Sbjct: 595 SSVAKVADADVVVFASGISPFLEGEEMGVDLPGFKGGDRTDIALPAIQKEMLKALHKAGK 654

Query: 539 GPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWY 598
               +++++  G  I F E      AIL A YPG+ GG+A+A+V+FG +NP GRLP+T+Y
Sbjct: 655 E---IILVNCSGSAIGFEEATDYSSAILQAWYPGQAGGQAVAEVLFGDYNPAGRLPVTFY 711

Query: 599 NGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQ 658
               V  LP                RTY+++ G  LYPFGYGLSYT F Y+    ++T  
Sbjct: 712 KS--VDQLP-------DFQDYNMTNRTYRYFEGEPLYPFGYGLSYTTFSYDQPELSQT-- 760

Query: 659 VNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSK 718
                     +++   +AS                    KV   N G  DG +VV +Y +
Sbjct: 761 ----------SISTEEEAS-------------------LKVSVANTGDYDGEEVVQLYLQ 791

Query: 719 PPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFV 777
            P +     +  + GFQRVF+  G    ++F     + L   +  A  + P AG++ + V
Sbjct: 792 KPDDTEGPSLT-LRGFQRVFIPKGETVEVEFQLTE-EVLEWWNADAQRMTPLAGDYRLLV 849

Query: 778 G 778
           G
Sbjct: 850 G 850


>gi|383117091|ref|ZP_09937838.1| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
 gi|382973702|gb|EES87886.2| hypothetical protein BSHG_0805 [Bacteroides sp. 3_2_5]
          Length = 805

 Score =  253 bits (646), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 229/807 (28%), Positives = 346/807 (42%), Gaps = 145/807 (17%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSE 101
           + + S P   RV+ L+S+MTL+EKV Q+            G+     P+L     E+   
Sbjct: 40  YENPSAPVEYRVEHLLSQMTLEEKVGQMLTSLGWPMYERVGEDIRLTPQLEKEIGEYHIG 99

Query: 102 ALHGVSNVGPGT--------------------------HFDDVIP--------------G 121
           +L G     P T                          H    IP              G
Sbjct: 100 SLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIG 159

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
            T FPT I   +++N  L +++G+ ++ EA A     +     + P +++ARDPRW R+ 
Sbjct: 160 TTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVE 214

Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
           ET GEDP++ G      VRG Q     E   D  S    V +  KH+A+Y    W     
Sbjct: 215 ETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGH 263

Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
               A + E+++EE    PF   V  G A SVM SYN ++G P      LL   ++  W 
Sbjct: 264 NGGTAHIGERELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQ 322

Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKV 360
             G++V+D  ++  + ++   +A +  +A  + + AG+D D G   Y      AV++G V
Sbjct: 323 FKGFVVSDLYAVGGLREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDV 380

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
               IDK+++ + ++  ++G FD          Q + S E+  LA E AR+ IVLLKN  
Sbjct: 381 AVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIVLLKNKD 440

Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKT 474
             LPL    ++T+AV+GP+A+    M+G+Y      G     +  I    S    V Y  
Sbjct: 441 KLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAK 499

Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAG----LDLSVE------------------- 511
           GC  V   S      A E A+ AD  +++ G     D S E                   
Sbjct: 500 GC-AVRDSSRTGFKDAIETARNADTVVMVMGGSSARDFSSEYEETGAAKVTINQISDMES 558

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
            E  DR  L L G Q +L+ +++ + K PV+LV++   G  +         +AI+ A YP
Sbjct: 559 GEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIK--GRPLLMEGAIQEAEAIVDAWYP 615

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G +GG A+ADV+FG +NP GRL ++      V  LP+     R     G   R Y    G
Sbjct: 616 GMQGGNAVADVLFGDYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YVEEPG 668

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
              YPFGYGLSYT F Y  +                                V V +   
Sbjct: 669 TPRYPFGYGLSYTTFSYTDMK-------------------------------VQVTEGSD 697

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
           D + +  V  QN G+ DG +V  +Y +       T  KQ+  F R+ ++AG ++ + F  
Sbjct: 698 DCWVDVTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTL 757

Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVG 778
           +  KSL +       ++  G  TI VG
Sbjct: 758 DK-KSLALYMQEGEWVVEPGRFTIMVG 783


>gi|374311316|ref|YP_005057746.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
 gi|358753326|gb|AEU36716.1| Beta-glucosidase [Granulicella mallensis MP5ACTX8]
          Length = 773

 Score =  253 bits (646), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 226/802 (28%), Positives = 371/802 (46%), Gaps = 121/802 (15%)

Query: 30  SSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHG-- 87
           SS  +F       +  G   +  ++  S+LP   R+ DL+ RMTL+EKV+QL DF  G  
Sbjct: 25  SSVTLFPAYSQSIASSGKTKTVLIYEQSNLPLETRLADLLGRMTLEEKVRQL-DFYSGTD 83

Query: 88  ----------VPRLGLPQYEWWSEALHGVSNVG--------------------------- 110
                     +P    P     ++AL G    G                           
Sbjct: 84  SLLDRGSKNSLPSKQSPFSTAKADALFGSLGAGAIHDLDPTPEQYNTIQRWVIEHNRLHI 143

Query: 111 PGTHFDDVIPG---ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSP 167
           P    ++ + G    T FP  +   ++++ S+ +K G A++ EARA       G+   +P
Sbjct: 144 PALFIEEGLHGFDTGTVFPAPLNLASTWDPSVAEKTGSAIAAEARAT----GVGMIL-AP 198

Query: 168 NINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKH 227
            +++ARDPRWGRI E  GEDP++ G+  + YVRG Q   G    TD N     V +  KH
Sbjct: 199 VLDLARDPRWGRIEEDFGEDPYLTGQMGLAYVRGAQ---GESLNTDHN-----VVAEPKH 250

Query: 228 YAAY-DVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSC 286
           +AA+   +        H    + E+++    L+ FE   ++G A + M +Y+ ++GIP  
Sbjct: 251 FAAHGSPEGGTNTSPVH----IGERELRSVMLKSFEPAFRQGHAMATMAAYHEIDGIPVT 306

Query: 287 ADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQY 346
           ADP LL   +R EW   G +++D  +I+ +   H+ +A S + A    +K+G+D+    +
Sbjct: 307 ADPYLLKTILRQEWGFQGMVLSDLGAIRRLYQLHQ-VASSPKAASCLAIKSGVDMQFYDF 365

Query: 347 YTNFTGNA----VQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENI 402
             +    A    V +G + + D+D++   +  +   LG FD      +L  +   S  ++
Sbjct: 366 DHDVFQKALIDCVHEGSLPQADVDRAASAVLRLKFTLGLFDRPYVDPTLNAKAYRSKPHL 425

Query: 403 ELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA----GIPCRYM 458
           +++ ++ARE +VLLKN+   LP  S  ++ +AV+GP  NA VA  G+Y     G+    +
Sbjct: 426 DVSLQSARESLVLLKNENGLLPF-SKSIQRIAVIGP--NADVARYGDYEEEANGLHISIL 482

Query: 459 SPIAGFSGYANVTYKTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDRE 518
             +   + +A V + +G D         I AA   AK+AD  I+  G    +  E+ DR 
Sbjct: 483 QGVKAEAPHAQVEFDSGKD---------IAAAVAKAKSADVVILGLGEWRGISGEAFDRT 533

Query: 519 DLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
            L LPG Q +L+  +    K PV+LV+ +   + I +A+   ++ AI+ A YPGE GG+A
Sbjct: 534 SLDLPGEQEKLLEAITATNK-PVVLVLENGRPLTIGWAK--AHVGAIVEAWYPGEFGGQA 590

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLP--LTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
           IA+ +FG  NP GRL IT+     V  +P    + P R  DS     + Y   +   L+P
Sbjct: 591 IAETLFGDNNPAGRLTITFPK--TVGQIPDYYNTDPSRAYDSDLTRRKVYVDNDSQPLFP 648

Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           FGYGLSYT F Y                  C +L  T  A+K+            D    
Sbjct: 649 FGYGLSYTTFHY------------------C-DLQVTPPAAKS----------NEDVSVT 679

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
           F V   N G+  G +V  VY +       T ++ +  F R+ ++   ++ +       + 
Sbjct: 680 FTV--TNTGTKAGDEVSQVYLREQFSSVETPVRSLKAFTRMPLQPQESRTVTLKIPRSE- 736

Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
           L + +      +  G++T++VG
Sbjct: 737 LAVWNADEKWAVEGGKYTVWVG 758


>gi|423214394|ref|ZP_17200922.1| hypothetical protein HMPREF1074_02454 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692809|gb|EIY86045.1| hypothetical protein HMPREF1074_02454 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 800

 Score =  253 bits (646), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 232/862 (26%), Positives = 371/862 (43%), Gaps = 165/862 (19%)

Query: 5   VSSLLCFSLSIALLVFSTNAVDANG---------SSSPVFVCDPGRFSKLGLQMSSFLFC 55
           +  LLC +L ++     + ++ AN          +S  ++      F+K G++    ++ 
Sbjct: 1   MKKLLCLALLVSAGSIYSGSISANNKPTDNKSGNNSKDIYKKTWIDFNKNGIKD---VYE 57

Query: 56  DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSNV-- 109
           D S P   R+ DL+S+MTL+EK  Q+    +G  R+     P   W +E    G+ N+  
Sbjct: 58  DPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNIDE 116

Query: 110 ---GPGTHFDDV-------------------------IP--------------GATSFPT 127
              G G    ++                         IP               AT FP 
Sbjct: 117 QANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPA 176

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
                A++N+ L ++I +  + EA+A   LG   +  +SP +++A+DPRWGR+ E+ GED
Sbjct: 177 QCGQGATWNKKLIREIAKVTANEAKA---LGYTNI--YSPILDIAQDPRWGRVVESYGED 231

Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
           P++ G      + GLQ  EG             + +  KH+A Y +           D  
Sbjct: 232 PYLAGELGKQMILGLQS-EG-------------IVATPKHFAVYSIPVGGRDGGTRTDPH 277

Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
           V  ++M+  +L PF   ++E  A  VM SYN  +G P       L + +R +W   GY+V
Sbjct: 278 VAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVV 337

Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAVQQG 358
           +D ++++ +   H+ +  ++E+  AQ + AGL++      TNFT          +A+ +G
Sbjct: 338 SDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPPQDFILPLRHAINEG 391

Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
           KV    +D+ +  +  V   +G FD   P      +  + +D +  ++ +AA E +VLLK
Sbjct: 392 KVSLHTLDQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESVVLLK 451

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKT 474
           N    LPL S   K +AV+GP+A     +   Y        +   G   Y   + V Y  
Sbjct: 452 NKNQMLPL-SKNFKKIAVIGPNAEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAK 510

Query: 475 GCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
           GCD +                +    I  A E AK +D  I++ G +     E   R +L
Sbjct: 511 GCDIIDKYFPESELYNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFSRTNL 570

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
            L G Q QL+  V    K PV+LV++      I +A  N  + AI+ A +PGE  G AIA
Sbjct: 571 DLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIA 627

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGY 639
            V+FG +NPGGRL +T+     V  +P  + P +P  DS G      K      LYPFGY
Sbjct: 628 KVLFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDSKG------KVRVDGVLYPFGY 678

Query: 640 GLSYTQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           GLSYT F Y+ L  +K +   Q N+                           L C     
Sbjct: 679 GLSYTTFGYSDLKISKPVIGPQENIT--------------------------LSC----- 707

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
                +N G   G +VV +Y +       TY K + GF+R+ ++ G  + + F     + 
Sbjct: 708 ---TVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QD 763

Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
           L + D      +  G  ++ VG
Sbjct: 764 LGLWDKNNQFTVEPGSFSVMVG 785


>gi|375359159|ref|YP_005111931.1| putative exported hydrolase [Bacteroides fragilis 638R]
 gi|423283738|ref|ZP_17262622.1| hypothetical protein HMPREF1204_02160 [Bacteroides fragilis HMW
           615]
 gi|301163840|emb|CBW23395.1| putative exported hydrolase [Bacteroides fragilis 638R]
 gi|404580776|gb|EKA85484.1| hypothetical protein HMPREF1204_02160 [Bacteroides fragilis HMW
           615]
          Length = 859

 Score =  253 bits (646), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 214/769 (27%), Positives = 337/769 (43%), Gaps = 146/769 (18%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD-------------------------- 83
           ++F + ++SLP  +RV+DL+SRMTL+EK+ Q+                            
Sbjct: 22  TNFKYKNASLPVEVRVQDLLSRMTLEEKIAQMRHIHAYSIMENGKLNEEKLEKMIGGQNY 81

Query: 84  -FAHGVP---------------------RLGLPQYEWWSEALHGVSNVGPGTHFDDVIPG 121
            F  G+                      RLG+P +   +E+LHG            V  G
Sbjct: 82  GFIEGITLPGKECLTLMNEVQKYMREKTRLGIPVFTL-TESLHG-----------SVHDG 129

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRI 180
           +T FP  I   ++FN  L  ++  A++ E  A       G+T   +P I+V RD RWGR+
Sbjct: 130 STIFPQAIALGSTFNPILAYEMTSAIAKELTAQ------GITQSLTPVIDVCRDLRWGRV 183

Query: 181 TETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVD 240
            E  GEDP++V R  V+ VRG  D +              VS   KH+ A+      G++
Sbjct: 184 EECFGEDPYLVSRMGVSQVRGYLDNQ--------------VSPMIKHFGAHGAPQ-GGLN 228

Query: 241 RYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEW 300
                    ++++   +L+ FE  VKE    +VM SYN  N  P+ +   L+ + +R  W
Sbjct: 229 LASVSC--GQRELLSIYLKTFETVVKEAKPWAVMSSYNSWNNEPNSSSHYLMTELLRDRW 286

Query: 301 DLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFTGNAVQQGKV 360
           D  GY+ +D  +I ++   HK   +S E A+ Q L AGLD +            V+ G +
Sbjct: 287 DFQGYVYSDWGAIGMLNYFHKTAQNSAEAAI-QALTAGLDAEASDNSYAELQQLVENGML 345

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
               ID+++  + T    +G F+          + + +  ++ LA + A E IVLL+N+ 
Sbjct: 346 DVKYIDQAVARILTAKFNMGLFEYPLPMEKNYDKVVHAPAHVSLARKIAEESIVLLQNEN 405

Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-------GIPCRYMSPIAGFSG-YANVTY 472
           N LPL   K+K++AV+GP  NA     G+Y        G+    +  +    G    + Y
Sbjct: 406 NILPLQMNKLKSIAVIGP--NADQVQFGDYTWSRDNKDGVTL--LEALKERVGNQLTLNY 461

Query: 473 KTGCDDVACKSNNSIFAASEAAKTADATIILAGLDLSVEA---------ESLDREDLWLP 523
             GC D+     +    A + AK +D  I++ G   +  A         E  D  DL L 
Sbjct: 462 AKGC-DLVTDDCSGFKEAVDVAKKSDVCIVVVGSASASLARDYSNATCGEGFDLSDLTLT 520

Query: 524 GYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVV 583
           G Q  L+  +    K PVI+V++S  G   A +    NI  I+   YPGE+GG A+AD++
Sbjct: 521 GVQEDLVEAIHATGK-PVIVVLLS--GKPFAMSWIKENIPGIVVQWYPGEQGGLALADML 577

Query: 584 FGKFNPGGRLPITWYNGD-----YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
            GK NP G+L  ++         Y   LP      R   S   PG+ Y F +   L+ FG
Sbjct: 578 LGKVNPSGKLNYSFPQSVGHLPCYYNYLPTDKGFYRSPGSKNKPGKDYVFSSPKALWAFG 637

Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
           +GLSYT F+Y  LS T + +                             D  C+D  E  
Sbjct: 638 HGLSYTDFEY--LSATTSKE-----------------------------DYACEDVIEVT 666

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRI 747
           +  +N G  DG +V  VY +         ++++ GF++V ++ G  K++
Sbjct: 667 IAIRNTGDYDGLEVPQVYVRDMVSSVVMPVQELKGFEKVLIKKGETKQV 715


>gi|402307522|ref|ZP_10826545.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
 gi|400378572|gb|EJP31427.1| glycosyl hydrolase family 3, N-terminal domain protein [Prevotella
           sp. MSX73]
          Length = 858

 Score =  253 bits (645), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 162/463 (34%), Positives = 247/463 (53%), Gaps = 43/463 (9%)

Query: 45  LGLQMSS----FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWS 100
           LGL +S+      +C+  L    R +DL+SR+TL+EK + + D +  +PRLG+ ++ WWS
Sbjct: 11  LGLSLSATAQLLPYCNPDLSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWS 70

Query: 101 EALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR- 159
           EALHG +N+G          G T FP  +   ASFN+ L +++  A S E RA YN    
Sbjct: 71  EALHGAANMG----------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNRRML 120

Query: 160 --------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
                     L+ W+PN+N+ RDPRWGR  ET GEDP++        VRGLQ  E     
Sbjct: 121 NGGEDEKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPE----- 175

Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFD-ARVTEQDMEETFLRPFEMCVKEGDA 270
               ++  K+ +C KHYA +    +    R+  + A V+ +D+ ET+L  F+  V E   
Sbjct: 176 ---TAKYRKLWACAKHYAVHSGPEYT---RHTANVADVSPRDLWETYLPAFKTLVTEAKV 229

Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
             VMC+Y R++  P C++ +LL Q +R EW  +  +V+DC ++  +  NHK  +D+   A
Sbjct: 230 REVMCAYQRLDDDPCCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVH-A 288

Query: 331 VAQTLKAGLDLDCGQYYTNFT-GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV 389
            A+   AG D++CG  Y   T   AV++G + E ++DK +  L      LG  D  P+ V
Sbjct: 289 AAKAAVAGTDVECGFGYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMD-DPKLV 347

Query: 390 SLGK---QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
              K     + S  + +LA + AR+ +VLL+N    LPL +   + +AV+GP+A+    M
Sbjct: 348 EWSKIPASVMDSKAHRQLALDMARQSLVLLQNKGGVLPLKAGG-EPIAVIGPNADDGPMM 406

Query: 447 IGNYAGIPCRYMSPIAGFSG-YANVTYKTGCDDVACKSNNSIF 488
            GNY G P R ++ + G    +  VTY  GCD    K+ NS+ 
Sbjct: 407 WGNYNGTPNRTVTILDGIKARHKRVTYLKGCDLTDTKTVNSLL 449



 Score = 95.5 bits (236), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 78/291 (26%), Positives = 129/291 (44%), Gaps = 63/291 (21%)

Query: 501 IILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
           + + G+  ++E E +          DR ++ LP  Q   +  + E  K    +V ++  G
Sbjct: 603 VFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK---TVVFVNCSG 659

Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
             IA         AIL A Y G+EGG A++DV+FG  NP G+LP+T+Y       LP   
Sbjct: 660 SAIALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFYK--RTDQLP--- 714

Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
                 +     GRTY++++ P L+ FGYGLSYT F++                      
Sbjct: 715 ----DYEDYSMRGRTYRYFSDP-LFAFGYGLSYTTFRFG--------------------- 748

Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
              ++A+              +  +   V   N G+  G +VV VY +  A+     +K 
Sbjct: 749 RAHAEAA--------------EGGYRLSVPLTNTGTRPGEEVVQVYIRRVADTNGP-LKS 793

Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL--LPAGEHTIFVGN 779
           +  F+RV ++AG +  ++   +  KS    D + NT+  LP G++ +  GN
Sbjct: 794 LRAFRRVALKAGESTTVEIPLSR-KSFECFDESTNTMRTLP-GDYELMYGN 842


>gi|423293673|ref|ZP_17271800.1| hypothetical protein HMPREF1070_00465 [Bacteroides ovatus
           CL03T12C18]
 gi|392677631|gb|EIY71047.1| hypothetical protein HMPREF1070_00465 [Bacteroides ovatus
           CL03T12C18]
          Length = 800

 Score =  253 bits (645), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 237/862 (27%), Positives = 372/862 (43%), Gaps = 165/862 (19%)

Query: 5   VSSLLCFSLSIALLVFSTNAVDANG---------SSSPVFVCDPGRFSKLGLQMSSFLFC 55
           +  LLC +L ++     + ++ AN          +S  ++      F+K G++    ++ 
Sbjct: 1   MKKLLCLALLVSAGSIYSGSISANNKPTDNKSGNNSKDIYKKTWIDFNKNGIKD---VYE 57

Query: 56  DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSNV-- 109
           D S P   R+ DL+S+MTL+EK  Q+    +G  R+     P   W +E    G+ N+  
Sbjct: 58  DPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTTGWSTEIWKDGIGNIDE 116

Query: 110 ---GPGTHFDDV-------------------------IP--------------GATSFPT 127
              G G    ++                         IP               AT FP 
Sbjct: 117 QANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPA 176

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
                A++N+ L  +I +  + EA+A   LG   +  +SP +++A+DPRWGR+ E+ GED
Sbjct: 177 QCGQGATWNKKLIGEIAKVTADEAKA---LGYTNI--YSPILDIAQDPRWGRVVESYGED 231

Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
           P++VG      + GLQ+ EG             + +  KH+A Y +           D  
Sbjct: 232 PYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRTDPH 277

Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
           V  ++M+  +L PF   ++E  A  VM SYN  +G P       L + +R +W   GYIV
Sbjct: 278 VAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYIV 337

Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAVQQG 358
           +D ++++ +   H+ +  ++E+  AQ + AGL++      TNFT           A+ +G
Sbjct: 338 SDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPPQDFILPLRRAINEG 391

Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
           KV    +D+ +  +  V   +G FD   P      +  + +D +  ++ +AA E IVLLK
Sbjct: 392 KVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESIVLLK 451

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKT 474
           N+   LPL S     +AV+GP+      +   Y        +   G   Y   + V Y  
Sbjct: 452 NENQMLPL-SKNFSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAK 510

Query: 475 GCDDV-----ACKSNN---------SIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
           GCD +       + NN          I  A E AK +D  I++ G +     E   R +L
Sbjct: 511 GCDIIDKYFPESELNNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFSRTNL 570

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
            L G Q QL+  V    K PVILV++      I +A  N  I AI+ A +PGE  G AIA
Sbjct: 571 DLCGRQQQLLEAVYATGK-PVILVMVDGRAATINWA--NKYIPAIIHAWFPGEFMGDAIA 627

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGY 639
            V+FG +NPGGRL +T+     V  +P  + P +P  DS G      K      LYPFGY
Sbjct: 628 KVLFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDSKG------KVRVDGVLYPFGY 678

Query: 640 GLSYTQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           GLSYT F Y+ L  +K +   Q N+                           L C     
Sbjct: 679 GLSYTTFGYSDLKISKPVIGPQENIT--------------------------LSC----- 707

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
                +N G   G +VV +Y +       TY K + GF+R+ ++ G  + + F     + 
Sbjct: 708 ---TVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QD 763

Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
           L + D      +  G  ++ VG
Sbjct: 764 LGLWDKNNRFTVEPGSFSVMVG 785


>gi|53712134|ref|YP_098126.1| beta-glucosidase [Bacteroides fragilis YCH46]
 gi|52214999|dbj|BAD47592.1| periplasmic beta-glucosidase precursor [Bacteroides fragilis YCH46]
          Length = 812

 Score =  253 bits (645), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 238/853 (27%), Positives = 359/853 (42%), Gaps = 149/853 (17%)

Query: 8   LLCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSLPYSIRVKD 67
           L+CF +     +F   A +  G            F  L        + + S P   RV+ 
Sbjct: 5   LICFLMLSVFFIFPVRAKNTFGKKKDKVTRL--HFYDLNKNGRMDTYENPSAPVEYRVEH 62

Query: 68  LVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSEALHGVSNVGPGT-- 113
           L+S+MTL+EKV Q+            G+     P+L     E+   +L G     P T  
Sbjct: 63  LLSQMTLEEKVGQMLTSLGWPMYERVGEDIRLTPQLEKEIGEYHIGSLWGFMRADPWTQR 122

Query: 114 ------------------------HFDDVIP--------------GATSFPTVILTTASF 135
                                   H    IP              G T FPT I   +++
Sbjct: 123 TLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIGTTVFPTSIGQASTW 182

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYA 195
           N  L +++G+ ++ EA A     +     + P +++ARDPRW R+ ET GEDP++ G   
Sbjct: 183 NPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVEETYGEDPYLNGVMG 237

Query: 196 VNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEE 255
              VRG Q     E   D  S    V +  KH+A+Y    W         A + E+++EE
Sbjct: 238 TALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGHNGGTAHIGERELEE 286

Query: 256 TFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQV 315
               PF   V  G A SVM SYN ++G P      LL   ++  W   G++V+D  ++  
Sbjct: 287 AIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQFKGFVVSDLYAVGG 345

Query: 316 MVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVKETDIDKSLKYLYT 374
           + ++   +A +  +A  + + AG+D D G   Y      AV++G V    IDK+++ + +
Sbjct: 346 LREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDVAVATIDKAVRRILS 403

Query: 375 VLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVA 434
           +  ++G FD          Q + S E+  LA E AR+ IVLLKN    LPL    ++T+A
Sbjct: 404 LKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIVLLKNKDKLLPLKK-DIRTLA 462

Query: 435 VVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKTGCDDVACKSNNSIF 488
           V+GP+A+    M+G+Y      G     +  I    S    V Y  GC  V   S     
Sbjct: 463 VIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAKGC-AVRDSSRTGFK 521

Query: 489 AASEAAKTADATIILAG----LDLSVE-------------------AESLDREDLWLPGY 525
            A E A+ ADA +++ G     D S E                    E  DR  L L G 
Sbjct: 522 DAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMESGEGYDRATLHLMGR 581

Query: 526 QTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFG 585
           Q +L+ +++ + K PV+L+      ++ A  E     +AI+ A YPG +GG A+ADV+FG
Sbjct: 582 QLELLEEISRLGK-PVVLIKGRPLLMEGAIQEA----EAIVDAWYPGMQGGNAVADVLFG 636

Query: 586 KFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQ 645
            +NP GRL ++      V  LP+     R     G   R Y    G   YPFGYGLSYT 
Sbjct: 637 DYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YVEEPGTPRYPFGYGLSYTT 689

Query: 646 FKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVG 705
           F Y  +                                V V +   D + +  V  QN G
Sbjct: 690 FSYTDMK-------------------------------VQVTEGSDDCWVDVTVTIQNQG 718

Query: 706 STDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAAN 765
           + DG +V  +Y +       T  KQ+  F R+ ++AG ++ + F  +  KSL +      
Sbjct: 719 TADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAGESREVTFTLDK-KSLALYMQEGE 777

Query: 766 TLLPAGEHTIFVG 778
            ++  G  TI VG
Sbjct: 778 WVVEPGRFTIMVG 790


>gi|293373755|ref|ZP_06620101.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
 gi|292631245|gb|EFF49877.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus SD CMC 3f]
          Length = 800

 Score =  253 bits (645), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 232/862 (26%), Positives = 371/862 (43%), Gaps = 165/862 (19%)

Query: 5   VSSLLCFSLSIALLVFSTNAVDANG---------SSSPVFVCDPGRFSKLGLQMSSFLFC 55
           +  LLC +L ++     + ++ AN          +S  ++      F+K G++    ++ 
Sbjct: 1   MKKLLCLALLVSAGSIYSGSISANNKPTDNKSGNNSKDIYKKTWIDFNKNGIKD---VYE 57

Query: 56  DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSNV-- 109
           D S P   R+ DL+S+MTL+EK  Q+    +G  R+     P   W +E    G+ N+  
Sbjct: 58  DPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNIDE 116

Query: 110 ---GPGTHFDDV-------------------------IP--------------GATSFPT 127
              G G    ++                         IP               AT FP 
Sbjct: 117 QANGLGKFGSEISYPYANSVKNRHTIQRWFMEQTRLGIPVDFTNEGIRGLCHDRATMFPA 176

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
                A++N+ L ++I +  + EA+A   LG   +  +SP +++A+DPRWGR+ E+ GED
Sbjct: 177 QCGQGATWNKKLIREIAKVTADEAKA---LGYTNI--YSPILDIAQDPRWGRVVESYGED 231

Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
           P++VG      + GLQ+ EG             + +  KH+A Y +           D  
Sbjct: 232 PYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRTDPH 277

Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
           V  ++M+  +L PF   ++E  A  VM SYN  +G P       L + +R +W   GY+V
Sbjct: 278 VAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVV 337

Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAVQQG 358
           +D ++++ +   H+ +  ++E+  AQ + AGL++      TNFT           A+ +G
Sbjct: 338 SDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPPQDFILPLRRAINEG 391

Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
           KV    +D+ +  +  V   +G FD   P      +  + +D +  ++ +AA E IVLLK
Sbjct: 392 KVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESIVLLK 451

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKT 474
           N+   LPL S     +AV+GP+      +   Y        +   G   Y   + V Y  
Sbjct: 452 NENQMLPL-SKNFSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYVK 510

Query: 475 GCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
           GCD +                +    I  A E AK +D  I++ G +     E   R +L
Sbjct: 511 GCDIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDVAILVLGGNEKTVREEFSRTNL 570

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
            L G Q QL+  V    K PV+LV++      I +A  N  + AI+ A +PGE  G AIA
Sbjct: 571 DLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIA 627

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGY 639
            V+FG +NPGGRL +T+     V  +P  + P +P  DS G      K      LYPFGY
Sbjct: 628 KVLFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDSKG------KVRVDGVLYPFGY 678

Query: 640 GLSYTQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           GLSYT F Y+ L  +K +   Q N+                           L C     
Sbjct: 679 GLSYTTFGYSDLKISKPVIGPQENIT--------------------------LSC----- 707

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
                +N G   G +VV +Y +       TY K + GF+R+ ++ G  + + F     + 
Sbjct: 708 ---TVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QD 763

Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
           L + D      +  G  ++ VG
Sbjct: 764 LGLWDKNNRFTVEPGSFSVMVG 785


>gi|372223664|ref|ZP_09502085.1| glycoside hydrolase family 3 protein [Mesoflavibacter
           zeaxanthinifaciens S86]
          Length = 768

 Score =  253 bits (645), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 225/812 (27%), Positives = 364/812 (44%), Gaps = 136/812 (16%)

Query: 41  RFSKLGLQMSSFLFCDS--------SLPYSIRVKDLVSRMTLDEKVQQL-----GDFAHG 87
           +F  LGL +   L C+          LPY   V  +++ MTL+EK+ QL     GD   G
Sbjct: 4   KFIALGLLVLITLSCNEQKPAQPSQELPYQKEVDSILALMTLEEKIGQLNLPSSGDITTG 63

Query: 88  VPRLGLPQYEWWSEALHGVSNVGPGTHFD--------------------DVIPG-ATSFP 126
             +      +  +  + G+ N+                           DVI G  ++FP
Sbjct: 64  QAKSSDIASKIAAGKVGGLFNIKTAAKIKEVQRIAVEESRLKIPLLFGMDVIHGYQSTFP 123

Query: 127 TVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPG 185
             +   AS++  L ++  +  + EA A       G+ + +SP ++++RDPRWGRI+E  G
Sbjct: 124 IPLGLAASWDMDLIQQTARVAAQEASA------DGINWTFSPMVDISRDPRWGRISEGSG 177

Query: 186 EDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFD 245
           EDPF+ G+ A   VRG Q  +   N T L        +C KH+A Y      G D    D
Sbjct: 178 EDPFLGGKIAAAMVRGYQGDDLSANNTLL--------ACVKHFALYGASE-AGRDYNTVD 228

Query: 246 ARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGY 305
             ++   M   +L P++  +  G  +SVM S+N V+GIP+ A+  LL   +R +W  +G+
Sbjct: 229 --MSRVRMYNDYLPPYKAAIDAG-VASVMASFNEVDGIPATANKWLLTDVLREQWGFNGF 285

Query: 306 IVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETD 364
           +V+D   I  MV +      + +   A+ L AGLD+D  G+ +      ++++G V ET 
Sbjct: 286 VVSDYTGINEMVAHG---IGNLQQVSARALNAGLDMDMVGEGFLTTLKKSLEEGLVSETT 342

Query: 365 IDKSLKYLYTVLMRLGFFDGSPQY--VSLGKQDICSDENIELAAEAAREGIVLLKNDQNT 422
           ID ++K + T   +LG FD   +Y   +  K ++ + EN + A + + E +VLLKN +  
Sbjct: 343 IDTAVKRILTAKYQLGLFDDPYKYCDTTRTKNEVFTKENRDFARKVSAESMVLLKN-EGL 401

Query: 423 LPLNSAKVKTVAVVGPHANATVAMIGNY--AGIPCRYMSPIAGFSGYA----NVTYKTGC 476
           LPL   K  ++A++GP AN    M G +  A    + +S + G    A     + Y  G 
Sbjct: 402 LPLK--KSGSIALIGPLANTPHNMAGTWSVATQQEKSISVLEGLKEVAGEAVTINYAKGS 459

Query: 477 D---DVACKSNNSIFA----------------ASEAAKTADATIILAGLDLSVEAESLDR 517
           +   D A +   ++F                 A   AK +D  +   G       ES   
Sbjct: 460 NVAYDEAYEKRITMFGKEITRDGRTDAQLLAEALAVAKKSDVVVAAIGETAERSGESSSI 519

Query: 518 EDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGR 577
            +L +P  Q  L++ +    K PV++V+ +  G  +A  +      AI+ A +PG E G 
Sbjct: 520 TNLQIPKAQQDLLDALLATGK-PVVVVLFT--GRPLAITKIQEEAPAIINAWFPGSEAGL 576

Query: 578 AIADVVFGKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPVDSL--GYPGRTYKFY 629
           AIADV+FG  NP G+L  T+     V  +PL      T  PL P  +   G+   T  + 
Sbjct: 577 AIADVLFGAVNPSGKLTATFPRN--VGQVPLFYAHKNTGRPLDPAKTADCGFQKFTSNYL 634

Query: 630 ---NGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLV 686
              N P LYPFG+GLSYT F Y+ ++  K                               
Sbjct: 635 DVCNTP-LYPFGFGLSYTTFSYSDITLDKA------------------------------ 663

Query: 687 NDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKR 746
            +L  +D     V  +N G+ DG +VV +Y +         ++++ GF+++F++    + 
Sbjct: 664 -ELGPNDSITVSVKVKNTGNFDGKEVVQLYVRDVVRSTTPPVRELKGFKKIFLKKDEEQI 722

Query: 747 IKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           ++F     + L   D   N +   GE  +FVG
Sbjct: 723 VQFKLQ-TEDLKFYDTDLNFIAEPGEFQVFVG 753


>gi|86141717|ref|ZP_01060241.1| beta-glucosidase [Leeuwenhoekiella blandensis MED217]
 gi|85831280|gb|EAQ49736.1| beta-glucosidase [Leeuwenhoekiella blandensis MED217]
          Length = 758

 Score =  253 bits (645), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 213/740 (28%), Positives = 345/740 (46%), Gaps = 117/740 (15%)

Query: 76  EKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTTASF 135
           +K++Q  + A    RLG+P     S+ +HG                 T+FP  +  ++S+
Sbjct: 85  DKIRQAQEIAVKNTRLGIPLL-IGSDIIHGYK---------------TTFPIPLGLSSSW 128

Query: 136 NESLWKKIGQAVSTEARAMYNLGRAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRY 194
           +  L +K  Q  + EA A       G+ + +SP +++ARDPRWGRI+E  GEDP++    
Sbjct: 129 DMELIEKTAQIAAKEATA------DGINWNFSPMVDIARDPRWGRISEGAGEDPYLGSAI 182

Query: 195 AVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDME 254
           A   V G Q     E+ T+ N+    + S  KH+A Y         R +    ++   M 
Sbjct: 183 AKAMVTGYQ----QEDLTEENT----MISTVKHFALYGAAEG---GRDYNTTDMSRVKMF 231

Query: 255 ETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQ 314
             +L P++  +  G A SVM S+N V+G+P+  +  LL   +R +W   G++V+D  S+ 
Sbjct: 232 NEYLPPYKAAIDAG-AESVMSSFNDVDGVPASGNKWLLTHLLREQWGFEGFVVSDYTSVN 290

Query: 315 VMVDNHKFLADSKEDAVAQTLKAGLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLY 373
            M+ +   L D +    A ++ AGLD+D  G+ +      +V +GKV E  I  + + + 
Sbjct: 291 EMIAHG--LGDLQA-VSALSINAGLDMDMVGEGFLTTLKKSVDEGKVSEATITNACRRIL 347

Query: 374 TVLMRLGFFDGSPQYVSLGK--QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVK 431
               +LG FD   +Y    +  +DI +  N E+A +AAR+  VLLKN+  TLPL+  K  
Sbjct: 348 EAKYKLGLFDDPYKYSDSKRPERDILTAANKEIARDAARKSFVLLKNENKTLPLD--KTA 405

Query: 432 TVAVVGPHANATVAMIGNYA--GIPCRYMSPIAGFSGYANV------TYKTGC---DDVA 480
            +A++GP AN    M+G +A  G P +  +PI  F G  NV      +Y  G    +D A
Sbjct: 406 KIALIGPLANNKNNMLGTWAPTGDP-QLSTPI--FEGLKNVAPNAEISYTKGANISNDTA 462

Query: 481 CKSNNSIFA----------------ASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
                ++F                 A + A+TAD  + + G    +  ES  R D+ +P 
Sbjct: 463 YAKKINVFGPRIEISEATPETLLEEALQNAETADVVVAVVGEATEMSGESSSRTDITIPE 522

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q  LI ++ ++ K PV+LV+MS   +DI   E      +IL   +PG + G A+ADV+F
Sbjct: 523 SQKTLIQELVKIGK-PVVLVLMSGRPLDI--TEELALPVSILQVWHPGIQAGNAVADVLF 579

Query: 585 GKFNPGGRLPITWYNGDYVQMLPL------TSMPLRPVDSLGYPGRTYKFYNGPTLYPFG 638
           G +NP G+L  +W     V  +P+      T  P    + L +  +     N P L  FG
Sbjct: 580 GDYNPSGKLTASWPQN--VGQIPVYHSMKTTGRPAPSAEFLKFKSQYLDTPNAPAL-AFG 636

Query: 639 YGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFK 698
           YGLSYT F+Y+ L  +             +++    D +                     
Sbjct: 637 YGLSYTTFEYSNLKLS------------SKSIGQNEDVT-------------------VM 665

Query: 699 VDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLN 758
           VD  N G+ DG++VV +Y           ++ + GFQ+V ++ G  K ++    A + L 
Sbjct: 666 VDVTNTGAYDGTEVVQLYIHDVVRSITPPMRTLKGFQKVSLKQGETKTVELTLKA-EDLK 724

Query: 759 IVDYAANTLLPAGEHTIFVG 778
             + +   +   GE  +FVG
Sbjct: 725 FYNGSLEFISEPGEFEVFVG 744


>gi|160884749|ref|ZP_02065752.1| hypothetical protein BACOVA_02738 [Bacteroides ovatus ATCC 8483]
 gi|156109784|gb|EDO11529.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           ovatus ATCC 8483]
          Length = 800

 Score =  253 bits (645), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 235/862 (27%), Positives = 370/862 (42%), Gaps = 165/862 (19%)

Query: 5   VSSLLCFSLSIALLVFSTNAVDANG---------SSSPVFVCDPGRFSKLGLQMSSFLFC 55
           +  LLC +L ++     + ++ AN          +S  ++      F+K G++    ++ 
Sbjct: 1   MKKLLCLALLVSAGSIYSGSISANNKPTDNKSGNNSKDIYKKTWIDFNKNGIKD---VYE 57

Query: 56  DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSNV-- 109
           D S P   R+ DL+S+MTL+EK  Q+    +G  R+     P   W +E    G+ N+  
Sbjct: 58  DPSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTTGWSTEIWKDGIGNIDE 116

Query: 110 ---GPGTHFDDV-------------------------IP--------------GATSFPT 127
              G G    ++                         IP               AT FP 
Sbjct: 117 QANGLGKFGSEISYPYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPA 176

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
                A++N+ L  +I +  + EA+A   LG   +  +SP +++A+DPRWGR+ E+ GED
Sbjct: 177 QCGQGATWNKKLIGEIAKVTADEAKA---LGYTNI--YSPILDIAQDPRWGRVVESYGED 231

Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
           P++VG      + GLQ+ EG             + +  KH+A Y +           D  
Sbjct: 232 PYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRTDPH 277

Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
           V  ++M+  +L PF   ++E  A  VM SYN  +G P       L + +R +W   GYIV
Sbjct: 278 VAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYIV 337

Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAVQQG 358
           +D ++++ +   H+ +  ++E+  AQ + AGL++      TNFT           A+ +G
Sbjct: 338 SDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPPQDFILPLRRAINEG 391

Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
           KV    +D+ +  +  V   +G FD   P      +  + +D +  ++ +AA E IVLLK
Sbjct: 392 KVSLHTLDQRVGEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESIVLLK 451

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKT 474
           N+   LPL S     +AV+GP+      +   Y        +   G   Y   + V Y  
Sbjct: 452 NENQMLPL-SKNFSKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAK 510

Query: 475 GCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
           GCD +                +    I  A E AK +D  I++ G +     E   R +L
Sbjct: 511 GCDIIDKYFPESELYNVPLDTQEQAMIQEAVELAKASDIAILVLGGNEKTVREEFSRTNL 570

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
            L G Q QL+  V    K PVILV++      I +A  N  I AI+ A +PGE  G AIA
Sbjct: 571 DLCGRQQQLLEAVYATGK-PVILVMVDGRAATINWA--NKYIPAIIHAWFPGEFMGDAIA 627

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGY 639
            V+FG +NPGGRL +T+     V  +P  + P +P  DS G      K      LYPFGY
Sbjct: 628 KVLFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPGSDSKG------KVRVDGVLYPFGY 678

Query: 640 GLSYTQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           GLSYT F Y+ L  +K +   Q N+                           L C     
Sbjct: 679 GLSYTTFGYSDLKISKPVIGPQENIT--------------------------LSC----- 707

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
                +N G   G +VV +Y +       TY K + GF+R+ ++ G  + + F     + 
Sbjct: 708 ---TVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVNFTLTP-QD 763

Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
           L + D      +  G  ++ VG
Sbjct: 764 LGLWDKNNQFTVEPGSFSVMVG 785


>gi|336412865|ref|ZP_08593218.1| hypothetical protein HMPREF1017_00326 [Bacteroides ovatus
           3_8_47FAA]
 gi|335942911|gb|EGN04753.1| hypothetical protein HMPREF1017_00326 [Bacteroides ovatus
           3_8_47FAA]
          Length = 800

 Score =  252 bits (644), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 232/862 (26%), Positives = 372/862 (43%), Gaps = 165/862 (19%)

Query: 5   VSSLLCFSLSIALLVFSTNAVDANG---------SSSPVFVCDPGRFSKLGLQMSSFLFC 55
           +  LLC +L ++     + ++ AN          +S  ++      F+K G++    ++ 
Sbjct: 1   MKKLLCLALLVSAGSIYSESISANNKPTDNKSGNNSKDIYKKTWIDFNKNGIKD---VYE 57

Query: 56  DSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEWWSEALH-GVSNV-- 109
           D S P   R+ DL+S+MTL+EK  Q+    +G  R+     P   W +E    G+ N+  
Sbjct: 58  DLSAPIEARIADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAGWSAEIWKDGIGNIDE 116

Query: 110 ---GPGTHFDDV-------------------------IP--------------GATSFPT 127
              G G    ++                         IP               AT FP 
Sbjct: 117 QANGLGKFGSEISYSYANSVKNRHTIQRWFVEQTRLGIPVDFTNEGIRGLCHDRATMFPA 176

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
                A++N+ L ++I +  + EA+A   LG   +  +SP +++A+DPRWGR+ E+ GED
Sbjct: 177 QCGQGATWNKKLIREIAKVTANEAKA---LGYTNI--YSPILDIAQDPRWGRVVESYGED 231

Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
           P++VG      + GLQ+ EG             + +  KH+A Y +           D  
Sbjct: 232 PYLVGELGKQMILGLQN-EG-------------IVATPKHFAVYSIPVGGRDGGTRTDPH 277

Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
           V  ++M+  +L PF   ++E  A  VM SYN  +G P       L + +R +W   GY+V
Sbjct: 278 VAPREMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVV 337

Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAVQQG 358
           +D ++++ +   H+ +  ++E+  AQ + AGL++      TNFT           A+ +G
Sbjct: 338 SDSEAVEFLHTKHR-ITPTEEEMAAQVVNAGLNI-----RTNFTPPQDFILPLRRAIDEG 391

Query: 359 KVKETDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLK 417
           KV    +++ +  +  V   +G FD   P      +  + +D +  ++ +AA E IVLLK
Sbjct: 392 KVSLHTLNQRVSEILRVKFMMGLFDNPYPGDDRRPETVVHNDAHKAVSMKAALESIVLLK 451

Query: 418 NDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKT 474
           N+   LPL S   K +AV+GP+      +   Y        +   G   Y   + V Y  
Sbjct: 452 NENQMLPL-SKNFKKIAVIGPNGEEVKELTCRYGPANASIKTVYQGIKEYLPNSEVRYAK 510

Query: 475 GCDDV--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDL 520
           GCD +                +    I  A E AK +D  I++ G +     E   R +L
Sbjct: 511 GCDIIDKYFPESELYNVPLDTQEQAMIHEAVELAKASDIAILVLGGNEKTVREEFSRTNL 570

Query: 521 WLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIA 580
            L G Q QL+  V    K PV+LV++      I +A  N  + AI+ A +PGE  G AIA
Sbjct: 571 DLCGRQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIIHAWFPGEFMGDAIA 627

Query: 581 DVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGY 639
            V+FG +NPGGRL +T+     V  +P  + P +P  DS G      K      LYPFGY
Sbjct: 628 KVLFGDYNPGGRLAVTFPKS--VGQIPF-AFPFKPDSDSKG------KVRVDGVLYPFGY 678

Query: 640 GLSYTQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           GLSYT F Y+ L  +K +   Q N+                           L C     
Sbjct: 679 GLSYTIFGYSDLKISKPVIGPQENIT--------------------------LSC----- 707

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
                +N G   G +VV +Y +       TY K + GF+R+ ++ G  + + F     + 
Sbjct: 708 ---TVKNTGKKAGDEVVQLYIRDDFSSVTTYDKVLRGFERIHLQPGEEQTVSFTLTP-QD 763

Query: 757 LNIVDYAANTLLPAGEHTIFVG 778
           L + D      +  G  ++ VG
Sbjct: 764 LGLWDKNNQFTVEPGSFSVMVG 785


>gi|393786770|ref|ZP_10374902.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
           CL02T12C05]
 gi|392658005|gb|EIY51635.1| hypothetical protein HMPREF1068_01182 [Bacteroides nordii
           CL02T12C05]
          Length = 864

 Score =  252 bits (644), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 158/453 (34%), Positives = 238/453 (52%), Gaps = 40/453 (8%)

Query: 50  SSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNV 109
           S   + D +L    R  DL+ R+T++EKV  + + + G+ RLG+  YEWW+EALHGV+  
Sbjct: 26  SQLPYQDPNLTPEQRATDLLQRLTIEEKVSLMQNNSPGILRLGIKPYEWWNEALHGVARA 85

Query: 110 GPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARA----MYNLGR----AG 161
           G           AT FP  I   ASF+++L  ++  A+S EARA       LG+     G
Sbjct: 86  GL----------ATVFPQTIGMAASFDDTLIYEVFNAISDEARAKNRHFNTLGQYKRYQG 135

Query: 162 LTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKV 221
           LT W+PNIN+ RDPRWGR  ET GEDP++  R  V  V+GLQ  +        ++R  K+
Sbjct: 136 LTMWTPNINIFRDPRWGRGQETYGEDPYLTSRMGVAVVKGLQGPD--------SARYNKL 187

Query: 222 SSCCKHYAAYDVDNWKGVDRYHFDAR-VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRV 280
            +C KH+A +    W   +R+ F+A  +  +D+ ET+L  F+  V+E D   VMC+YNR 
Sbjct: 188 HACAKHFAVHSGPEW---NRHSFNAENIIPRDLWETYLPAFKTLVQEADVKEVMCAYNRF 244

Query: 281 NGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVM--VDNHKFLADSKEDAVAQTLKAG 338
            G P C   +LL Q +R EW   G +V+DC +I        H    D+   A A+ +  G
Sbjct: 245 EGDPCCGSNRLLTQILRNEWGFKGIVVSDCGAISDFWGTKKHNTHPDAAH-ASAEAVLNG 303

Query: 339 LDLDCGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICS 398
            DL+CG  Y   T  A++ G + E  I+ S+K L      LG  +    + +L    + S
Sbjct: 304 TDLECGSNYRKLT-EAIKAGIISEKQINVSVKRLLKARFELGEMENIHPW-TLPYSIVDS 361

Query: 399 DENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYM 458
            ++  LA + A E + LL+N    LPL+  K   +A++GP+AN +V   GNY G P    
Sbjct: 362 PKHRCLALKMAHETMTLLQNKGKVLPLD--KQARIAIIGPNANDSVMQWGNYNGTPSHTS 419

Query: 459 SPIAGFSG---YANVTYKTGCDDVACKSNNSIF 488
           + ++ F      +++ Y+  C      + NS+F
Sbjct: 420 TLLSAFRKRLPISHLIYEPVCGLTDSITYNSLF 452



 Score =  121 bits (303), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 91/299 (30%), Positives = 134/299 (44%), Gaps = 56/299 (18%)

Query: 492 EAAKTADATIILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPV 541
           E  K  D  I   G+  S+E E +          DR D+  P  Q +++  + E  K  V
Sbjct: 596 EKLKDIDIIIFAGGISPSLEGEEMNVSATGFKGGDRTDIEFPAVQRKVLAALKEAGK-KV 654

Query: 542 ILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGD 601
           ILV  S  G  +A      +  AIL A YPGEEGG AI +V+FG +NP GRLPIT+Y   
Sbjct: 655 ILVNFS--GSAMALTPETKSCDAILQAWYPGEEGGMAIVNVLFGDYNPAGRLPITFYKS- 711

Query: 602 YVQMLPLTSMPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNL 661
            +  LP         ++    GRTY++     L+PFGYGLSYT F +         ++++
Sbjct: 712 -IDQLP-------DFENYSMKGRTYRYMQEEPLFPFGYGLSYTTFAFG--------KIHI 755

Query: 662 NKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPA 721
           NK                       N L   +     +  +N+G  DG +VV +Y +  A
Sbjct: 756 NK-----------------------NSLSAGEKVTLHIPIKNIGDRDGVEVVQIYIQRQA 792

Query: 722 EIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLP-AGEHTIFVGN 779
           +     +K +  F+RV +  G+ + +K       +    D   NT+ P  GE+ I  GN
Sbjct: 793 DKEGP-VKTLRAFKRVEIPKGKTQEVKIELPYV-AFEWFDPTTNTMRPIQGEYNILYGN 849


>gi|153807033|ref|ZP_01959701.1| hypothetical protein BACCAC_01310 [Bacteroides caccae ATCC 43185]
 gi|423219984|ref|ZP_17206480.1| hypothetical protein HMPREF1061_03253 [Bacteroides caccae
           CL03T12C61]
 gi|149130153|gb|EDM21363.1| glycosyl hydrolase family 3 C-terminal domain protein [Bacteroides
           caccae ATCC 43185]
 gi|392624247|gb|EIY18340.1| hypothetical protein HMPREF1061_03253 [Bacteroides caccae
           CL03T12C61]
          Length = 786

 Score =  252 bits (644), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 234/858 (27%), Positives = 375/858 (43%), Gaps = 167/858 (19%)

Query: 1   MAKVVSSL-LCFSLSIALLVFSTNAVDANGSSSPVFVCDPGRFSKLGLQMSSFLFCDSSL 59
           M K+V  L LC S+     +F+       GS+  ++  +   F+K G++    ++ D + 
Sbjct: 1   MKKLVCGLTLCLSVGN---IFA-------GSTKDIYKKNWIDFNKNGVKD---VYEDPAA 47

Query: 60  PYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRL---GLPQYEW----WSEAL-------HG 105
           P   RV DL+S+MTL+EK  Q+    +G  R+     P  EW    W + +       +G
Sbjct: 48  PIEARVADLLSQMTLEEKTCQMATL-YGSGRVLKDAWPTAEWSKEIWKDGIGNIDEQANG 106

Query: 106 VSNVG-----------------------------PGTHFDDVIPG-----ATSFPTVILT 131
           +   G                             P    ++ I G     AT FP     
Sbjct: 107 LGKFGSELSYPYANSVKNRHEIQRWFVEQTRLGIPVDFTNEGIRGLCHNRATMFPAQCGQ 166

Query: 132 TASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGEDPFVV 191
            A++N+ L ++I +  + EA+A   LG   +  ++P +++A+DPRWGR+ E+ GEDP++ 
Sbjct: 167 GATWNKKLIREIAKVTADEAKA---LGYTNI--YAPILDIAQDPRWGRVVESYGEDPYLA 221

Query: 192 GRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQ 251
           G      + GLQ  EG             +++  KH+A Y +           D  V  +
Sbjct: 222 GELGKQMILGLQ-AEG-------------LAATPKHFAVYSIPVGGRDGGTRTDPHVAPR 267

Query: 252 DMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCD 311
           +M+  +L PF   ++E  A  VM SYN  +G P       L + +R +W   GY+V+D +
Sbjct: 268 EMKTLYLEPFRKGIQEAGALGVMSSYNDYDGEPVSGSYHFLTEILRQQWGFKGYVVSDSE 327

Query: 312 SIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCGQYYTNFT---------GNAVQQGKVKE 362
           +++ +   H+ +  ++E+  AQ + AGL++      TNFT           A+ +GK+  
Sbjct: 328 AVEFLHTKHR-ITPTEEEMAAQVVNAGLNIR-----TNFTPPQDFILPLRRAISEGKISL 381

Query: 363 TDIDKSLKYLYTVLMRLGFFDGS-PQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQN 421
             +D+ +  +  V   LG FD   P      +  + +  + E++ +AA E IVLLKN+  
Sbjct: 382 HTLDQRVGEILRVKFMLGLFDNPYPGDDRHPETVVHNAAHQEVSMKAALESIVLLKNENQ 441

Query: 422 TLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSGY---ANVTYKTGCDD 478
            LPL S  +  +AV+GP+A     +   Y        +   G   Y   A V+Y  GC+ 
Sbjct: 442 MLPL-SKSLNKIAVIGPNAEEVKELTCRYGPAHAPIKTVYQGIKEYLPNAEVSYAKGCNI 500

Query: 479 V--------------ACKSNNSIFAASEAAKTADATIILAGLDLSVEAESLDREDLWLPG 524
           +                +    I  A E AK +D  I++ G +     E   R  L L G
Sbjct: 501 IDKYFPESELYNVPLDTQEQAMINEAVELAKVSDIAILVLGGNEKTVREEFSRTSLDLCG 560

Query: 525 YQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVF 584
            Q QL+  V    K PV+LV++      I +A  N  + AI+ A +PGE  G AIA V+F
Sbjct: 561 RQQQLLEAVYATGK-PVVLVMVDGRAATINWA--NKYVPAIVHAWFPGEFMGNAIAKVLF 617

Query: 585 GKFNPGGRLPITWYNGDYVQMLPLTSMPLRP-VDSLGYPGRTYKFYNGPTLYPFGYGLSY 643
           G +NPGGRL +T+     V  +P  + P +P  DS G      +      LYPFGYGLSY
Sbjct: 618 GDYNPGGRLAVTFPKS--VGQVPF-AFPFKPGSDSKG------RVRVDGVLYPFGYGLSY 668

Query: 644 TQFKYNLLSFTKTI---QVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFEFKVD 700
           T F+Y+ L  +K +   Q N+                           L C         
Sbjct: 669 TTFEYSALKISKPVIGPQENMT--------------------------LSC--------I 694

Query: 701 FQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIV 760
            +N G   G +VV +Y +       TY K + GF+R+ ++ G  + I F     + L + 
Sbjct: 695 VKNTGKRAGDEVVQLYIRDDFSSVTTYDKMLRGFERIHLQPGEEQTISFTLTP-QDLGLW 753

Query: 761 DYAANTLLPAGEHTIFVG 778
           D      +  G  +I +G
Sbjct: 754 DKNNQFTVEPGSFSIMIG 771


>gi|329963878|ref|ZP_08301220.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
 gi|328527131|gb|EGF54137.1| glycosyl hydrolase family 3 protein [Bacteroides fluxus YIT 12057]
          Length = 766

 Score =  252 bits (644), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 200/703 (28%), Positives = 335/703 (47%), Gaps = 109/703 (15%)

Query: 101 EALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRA 160
           +A+HG +N           PG T +PT I    SF+  +  +I +  + E RAM      
Sbjct: 132 DAIHGNANA----------PGNTVYPTNINLACSFDTLMAYRIARETAKEMRAMN----- 176

Query: 161 GLTYWS--PNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRP 218
              +W+  PN+ VARD RWGR+ ET GEDP++V R       G+Q V+G++ + D     
Sbjct: 177 --MHWTFNPNVEVARDARWGRVGETFGEDPYLVTRM------GVQSVKGYQGSLDSKE-- 226

Query: 219 LKVSSCCKHY--AAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCS 276
             V +C KH+   +  ++   G       A ++E+ + E F  PFE  VK G A S+M +
Sbjct: 227 -DVLACIKHFVGGSEPINGTNGSP-----ADLSERTLREVFFPPFEAGVKAG-AMSLMTA 279

Query: 277 YNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLK 336
           +N +NG+P  ++  L+   +RGEW+  G++V+D   I+   D H   A++ ++A  Q++ 
Sbjct: 280 HNELNGVPCHSNEWLMADVLRGEWNFPGFVVSDWMDIEHTHDLHA-TAENLKEAFYQSIM 338

Query: 337 AGLDLDC-GQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQD 395
           +G+D+   G ++       V++G++ E+ ID+S++ +  +  RLG F+     V    + 
Sbjct: 339 SGMDMHMHGIHWNEMVVELVKEGRIPESRIDESVRRILDIKFRLGLFEQPYADVEETMKI 398

Query: 396 ICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPC 455
               E+   A EAAR GIVLLKN +  LPL+ +K K + V G +A+    ++G+++  P 
Sbjct: 399 RLCGEHRATALEAARNGIVLLKN-EGVLPLDPSKYKKIMVTGINADDQ-NILGDWSA-PE 455

Query: 456 RYMSPIAGFSGYANVTYKTGCD------DVACKSNNSIFAASEAAKTADATIILAGLDL- 508
           +  +      G   +   T  D      D        +  A+  AK AD  I++AG  + 
Sbjct: 456 KEENVTTILEGLRMIAPDTQFDFVDQGWDPRNMDPKKVDEAAAHAKNADLNIVVAGEYMM 515

Query: 509 ------SVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNI 562
                   + E  DR DL L G Q +LI +VA   K P +LV+++   + + +A  N  +
Sbjct: 516 RFRWNDRTDGEDTDRSDLDLVGLQEELIEKVAASGK-PTVLVLVNGRPLSVRWAAEN--L 572

Query: 563 KAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYP 622
            AI+ A  PG +GG+A+A++++GK NP  +L IT         +P +   L+ +    Y 
Sbjct: 573 PAIVEAWAPGMQGGQAVAEILYGKVNPSAKLAIT---------IPHSVGQLQMI----YN 619

Query: 623 GRTYKFYN-------GPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSD 675
            +  ++++          LYPFGYGLSYT +KY  L+  +                    
Sbjct: 620 HKPSQYFHPYVAGKPSTPLYPFGYGLSYTTYKYEDLNLDR-------------------- 659

Query: 676 ASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQ 735
                       ++  D      V   N GS DG ++V +Y +         +K++  F 
Sbjct: 660 -----------KEIEKDGSVGVSVKVTNTGSRDGVEIVQLYIRDKFSCVTRPVKELKDFA 708

Query: 736 RVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           RV ++AG ++ + F     K L   D     ++  GE  + VG
Sbjct: 709 RVPLKAGESRVVNFKITPDK-LAFYDIKMKKVVEPGEFIVMVG 750


>gi|288925400|ref|ZP_06419334.1| beta-glucosidase [Prevotella buccae D17]
 gi|288337871|gb|EFC76223.1| beta-glucosidase [Prevotella buccae D17]
          Length = 858

 Score =  252 bits (644), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 162/463 (34%), Positives = 247/463 (53%), Gaps = 43/463 (9%)

Query: 45  LGLQMSS----FLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGDFAHGVPRLGLPQYEWWS 100
           LGL +S+      +C+  L    R +DL+SR+TL+EK + + D +  +PRLG+ ++ WWS
Sbjct: 11  LGLSLSATAQLLPYCNPDLSARERARDLLSRLTLEEKARLMLDESPAIPRLGIKKFFWWS 70

Query: 101 EALHGVSNVGPGTHFDDVIPGATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGR- 159
           EALHG +N+G          G T FP  +   ASFN+ L +++  A S E RA YN    
Sbjct: 71  EALHGAANMG----------GVTVFPEPVGMAASFNDGLLRRVFDAASDEMRAQYNRRML 120

Query: 160 --------AGLTYWSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENA 211
                     L+ W+PN+N+ RDPRWGR  ET GEDP++        VRGLQ  E     
Sbjct: 121 NGGEDEKFHSLSVWTPNVNIFRDPRWGRGQETYGEDPYLTSVMGTAVVRGLQGPE----- 175

Query: 212 TDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFD-ARVTEQDMEETFLRPFEMCVKEGDA 270
               ++  K+ +C KHYA +    +    R+  + A V+ +D+ ET+L  F+  V E   
Sbjct: 176 ---TAKYRKLWACAKHYAVHSGPEYT---RHTANVADVSPRDLWETYLPAFKTLVTEAKV 229

Query: 271 SSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDA 330
             VMC+Y R++  P C++ +LL Q +R EW  +  +V+DC ++  +  NHK  +D+   A
Sbjct: 230 REVMCAYQRLDDDPCCSNNRLLQQILRDEWGFNYLVVSDCGAVTDIYANHKTSSDAVH-A 288

Query: 331 VAQTLKAGLDLDCGQYYTNFT-GNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV 389
            A+   AG D++CG  Y   T   AV++G + E ++DK +  L      LG  D  P+ V
Sbjct: 289 AAKAAVAGTDVECGFGYAYKTIPEAVRRGLITEAEVDKHVLRLLEGRFDLGEMD-DPKLV 347

Query: 390 SLGK---QDICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAM 446
              K     + S  + +LA + AR+ +VLL+N    LPL +   + +AV+GP+A+    M
Sbjct: 348 EWSKIPASVMDSKAHRQLALDMARQSLVLLQNKGGVLPLKAGG-EPIAVIGPNADDGPMM 406

Query: 447 IGNYAGIPCRYMSPIAGFS-GYANVTYKTGCDDVACKSNNSIF 488
            GNY G P R ++ + G    +  VTY  GCD    K+ NS+ 
Sbjct: 407 WGNYNGTPNRTVTILNGIKVRHKRVTYLKGCDLTDTKTVNSLL 449



 Score = 96.7 bits (239), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 78/291 (26%), Positives = 127/291 (43%), Gaps = 63/291 (21%)

Query: 501 IILAGLDLSVEAESL----------DREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGG 550
           + + G+  ++E E +          DR ++ LP  Q   +  + E  K    +V ++  G
Sbjct: 603 VFVGGISAALEGEEMPVDIDGFKGGDRTNIELPKVQRDFLRALHEAGK---TVVFVNCSG 659

Query: 551 VDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTS 610
             IA         AIL A Y G+EGG A++DV+FG  NP G+LP+T+Y       LP   
Sbjct: 660 SAIALEPEMETCDAILQAWYAGQEGGTAVSDVLFGTVNPSGKLPVTFYK--RTDQLP--- 714

Query: 611 MPLRPVDSLGYPGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
                 +     GRTY++++ P L+ FGYGLSYT F++                      
Sbjct: 715 ----DYEDYSMRGRTYRYFSDP-LFAFGYGLSYTTFRF---------------------- 747

Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
                  + R           +  +   V   N G+  G +VV VY +  A+     +K 
Sbjct: 748 ------GRARAEA-------AEGGYRLSVPLTNTGTRPGEEVVQVYIRRVADTNGP-LKS 793

Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTL--LPAGEHTIFVGN 779
           +  F+RV ++AG +  ++   +  KS    D + NT+  LP G++ +  GN
Sbjct: 794 LRAFRRVALKAGESTTVEIPLSR-KSFECFDESTNTMRTLP-GDYELMYGN 842


>gi|224536364|ref|ZP_03676903.1| hypothetical protein BACCELL_01238, partial [Bacteroides
           cellulosilyticus DSM 14838]
 gi|224522024|gb|EEF91129.1| hypothetical protein BACCELL_01238 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 808

 Score =  252 bits (644), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 155/407 (38%), Positives = 233/407 (57%), Gaps = 41/407 (10%)

Query: 73  TLDEKVQQLGDFAHGVPRLGLPQYEWWSEALHGVSNVGPGTHFDDVIPGATSFPTVILTT 132
           T++EK+  L   + G+ RL +P+Y   +EALHGV  V PG          T FP  I   
Sbjct: 1   TVEEKISLLRATSPGISRLDIPKYYHGNEALHGV--VRPGRF--------TVFPQAIGLA 50

Query: 133 ASFNESLWKKIGQAVSTEARAMYNLGRAG----------LTYWSPNINVARDPRWGRITE 182
           A++N  L  ++   +S EARA +N    G          LT+WSP +N+ARDPRWGR  E
Sbjct: 51  ATWNPELQLQVATVISDEARARWNELDQGREQKSQFSDLLTFWSPTVNMARDPRWGRTPE 110

Query: 183 TPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRY 242
           T GEDP++ G     +V+GLQ   G ++      R LK+ S  KH+AA + ++    +R+
Sbjct: 111 TYGEDPYLSGIMGTAFVKGLQ---GDDD------RYLKIVSTPKHFAANNEEH----NRF 157

Query: 243 HFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDL 302
             + +++E+ + E +L  FE CVK+G ++S+M +YN +N +P   +  LL + +R +W  
Sbjct: 158 VCNPQISEKQLREYYLPAFEACVKDGKSASIMSAYNALNDVPCTLNAWLLTKVLRKDWGF 217

Query: 303 HGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKVK 361
            GY+V+DC    ++V+ HK++  +KE A A ++KAGLDL+CG   Y     +A +Q  V 
Sbjct: 218 KGYVVSDCGGPSLLVNAHKYVK-TKEAAAALSIKAGLDLECGDDVYDQPLLSAYRQYMVT 276

Query: 362 ETDIDKSLKYLYTVLMRLGFFDGSPQ--YVSLGKQDICSDENIELAAEAAREGIVLLKND 419
           + DID +   +    M LG FD   Q  Y  +    I S E+ E+A  AARE IVLLKN 
Sbjct: 277 DADIDSAAYRVLRARMELGLFDSGEQNPYTKISPAVIGSAEHQEVALNAARECIVLLKNQ 336

Query: 420 QNTLPLNSAKVKTVAVVGPHANATVAMIGNYAGIPCRYMSPIAGFSG 466
           +  LPLN+ KVK++AVVG   NA  +  G+Y+G+P   ++PI+   G
Sbjct: 337 KKMLPLNARKVKSIAVVG--INAGSSEFGDYSGLPV--IAPISVLQG 379



 Score =  145 bits (367), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 100/293 (34%), Positives = 147/293 (50%), Gaps = 54/293 (18%)

Query: 490 ASEAAKTADATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAG 549
           A +A +  +  + + G++ S+E E  DR D+ LP  Q + + ++ +V   P I+V++ AG
Sbjct: 549 AGKAVRECETVVAVLGINKSIEREGQDRYDIQLPADQQEFLQEIYKV--NPNIVVVLVAG 606

Query: 550 GVDIAFAETNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLT 609
              +A    + +I AI+ A YPGE GG+A+A+V+FG +NPGGRLP+T+Y         L 
Sbjct: 607 S-SLAINWMDEHIPAIVNAWYPGESGGKAVAEVLFGDYNPGGRLPLTYYRS-------LD 658

Query: 610 SMPLRPVDSLGY-PGRTYKFYNGPTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCR 668
            +P  P D      GRTYK++ G  LYPFGYGLSYT FKY+       +QV         
Sbjct: 659 ELP--PFDDYDITKGRTYKYFKGDVLYPFGYGLSYTTFKYS------NLQV--------- 701

Query: 669 NLNYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQ--NVGSTDGSDVVIVYSKPPAEIAAT 726
                                  D   E  V FQ  N G   G +V  VY K P      
Sbjct: 702 ----------------------ADGEEEINVSFQLKNSGKYAGDEVAQVYVKLPERDEIM 739

Query: 727 YIKQVIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLL-PAGEHTIFVG 778
            IK++ GF+RV +++G NK++         L   D A +  + P+G++TI VG
Sbjct: 740 PIKELKGFERVTLKSGENKKVTLKLRK-DLLRYWDEAKDKFVCPSGDYTIMVG 791


>gi|336408356|ref|ZP_08588849.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
 gi|335937834|gb|EGM99730.1| hypothetical protein HMPREF1018_00864 [Bacteroides sp. 2_1_56FAA]
          Length = 805

 Score =  252 bits (644), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 231/807 (28%), Positives = 347/807 (42%), Gaps = 145/807 (17%)

Query: 54  FCDSSLPYSIRVKDLVSRMTLDEKVQQL------------GDFAHGVPRLGLPQYEWWSE 101
           + + S P   RV+ L+S+MTL+EKV Q+            G+     P+L     E+   
Sbjct: 40  YENPSAPVEYRVEHLLSQMTLEEKVGQMLTSLGWPMYKRVGEDIRLTPQLEKEIGEYHIG 99

Query: 102 ALHGVSNVGPGT--------------------------HFDDVIP--------------G 121
           +L G     P T                          H    IP              G
Sbjct: 100 SLWGFMRADPWTQRTLHTGLNPSLAARASNRLQSYVIEHSRLGIPLFLAEECPHGHMAIG 159

Query: 122 ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRIT 181
            T FPT I   +++N  L +++G+ ++ EA A     +     + P +++ARDPRW R+ 
Sbjct: 160 TTVFPTSIGQASTWNPELIRQMGRVIAIEASA-----QGAHIGYGPVLDLARDPRWSRVE 214

Query: 182 ETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDR 241
           ET GEDP++ G      VRG Q     E   D  S    V +  KH+A+Y    W     
Sbjct: 215 ETYGEDPYLNGVMGTALVRGFQG----ETLNDGKS----VIATLKHFASY---GWTEGGH 263

Query: 242 YHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWD 301
               A + E+++EE    PF   V  G A SVM SYN ++G P      LL   ++  W 
Sbjct: 264 NGGTAHIGERELEEAIFPPFREAVGAG-ALSVMSSYNEIDGNPCTGSRYLLTDILKDRWQ 322

Query: 302 LHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDCG-QYYTNFTGNAVQQGKV 360
             G++V+D  ++  + ++   +A +  +A  + + AG+D D G   Y      AV++G V
Sbjct: 323 FKGFVVSDLYAVGGLREHG--VAGNDYEAAIKAVNAGVDSDLGTNVYAEQLVAAVKRGDV 380

Query: 361 KETDIDKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQ 420
               IDK+++ + ++  ++G FD          Q + S E+  LA E AR+ IVLLKN  
Sbjct: 381 AVATIDKAVRRILSLKFQMGLFDDPFVDEKQAVQLVASSEHTGLAREVARQSIVLLKNKD 440

Query: 421 NTLPLNSAKVKTVAVVGPHANATVAMIGNYA-----GIPCRYMSPI-AGFSGYANVTYKT 474
             LPL    ++T+AV+GP+A+    M+G+Y      G     +  I    S    V Y  
Sbjct: 441 KLLPLKK-DIRTLAVIGPNADNVYNMLGDYTAPQADGTVVTVLDGIRQKVSKETRVLYAK 499

Query: 475 GCDDVACKSNNSIFAASEAAKTADATIILAG----LDLSVE------------------- 511
           GC  V   S      A E A+ ADA +++ G     D S E                   
Sbjct: 500 GC-AVRDSSRTGFKDAIETARNADAVVMVMGGSSARDFSSEYEETGAAKVTINQISDMES 558

Query: 512 AESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYP 571
            E  DR  L L G Q +L+ +++ + K PV+LV++   G  +         +AI+ A YP
Sbjct: 559 GEGYDRATLHLMGRQLELLEEISRLGK-PVVLVLIK--GRPLLMEGAIQEAEAIVDAWYP 615

Query: 572 GEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVDSLGYPGRTYKFYNG 631
           G +GG A+ADV+FG +NP GRL ++      V  LP+     R     G   R Y    G
Sbjct: 616 GMQGGNAVADVLFGDYNPAGRLTLSVPRS--VGQLPVYYNTRRK----GNRSR-YIEEPG 668

Query: 632 PTLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRC 691
              YPFGYGLSYT F Y  +     +QV                           +D R 
Sbjct: 669 TPRYPFGYGLSYTTFSYTDMK----VQVTEGS-----------------------DDCRV 701

Query: 692 DDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVF 751
           D      V  QN G+ DG +V  +Y +       T  KQ+  F R+ ++A  ++ + F  
Sbjct: 702 D----VTVTIQNQGTADGDEVAQLYFRDDVSSFTTPAKQLRAFSRIHLKAAESREVTFTL 757

Query: 752 NACKSLNIVDYAANTLLPAGEHTIFVG 778
           +  KSL +       ++  G  TI VG
Sbjct: 758 DK-KSLALYMQEGEWVVEPGRFTIMVG 783


>gi|336255157|ref|YP_004598264.1| beta-glucosidase [Halopiger xanaduensis SH-6]
 gi|335339146|gb|AEH38385.1| Beta-glucosidase [Halopiger xanaduensis SH-6]
          Length = 774

 Score =  252 bits (644), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 225/803 (28%), Positives = 369/803 (45%), Gaps = 138/803 (17%)

Query: 48  QMSSFLFCDSSLPYSIRVKDLVSRMTLDEKVQQLGD---------------------FAH 86
           ++S+  + D S     RV+DL+ RMT++EK  QLG                       AH
Sbjct: 4   ELSTAAYQDESESVENRVEDLLERMTVEEKAAQLGSVNADRLLDEDGEIDWDAVDEWLAH 63

Query: 87  GV---PRLGLPQYEWWSEA----------LHGVSNVG-PGTHFDDVI-----PGATSFPT 127
           G+    RLG       SEA          L   + +G P    ++ +     P AT+FP 
Sbjct: 64  GIGHFTRLGGEGSLAPSEAARVTNELQTYLREETRLGIPAIPHEECLSGYMGPEATTFPQ 123

Query: 128 VILTTASFNESLWKKIGQAVSTEARAMYNLGRAGLTYWSPNINVARDPRWGRITETPGED 187
           ++   +S+N  L + + + +  E       G   +   SP ++VARD RWGR+ ET GED
Sbjct: 124 MLGMASSWNPELLQTVTETIRGELE-----GIGTVHALSPVLDVARDLRWGRVEETFGED 178

Query: 188 PFVVGRYAVNYVRGLQDVEGHENATDLNSRPLKVSSCCKHYAAYDVDNWKGVDRYHFDAR 247
           P++V   A  YV GLQ           + R   +S+  KH+  +   +  G +R   +  
Sbjct: 179 PYMVAEMARAYVSGLQG----------DGRADGISATLKHFVGHGATD-GGKNRSSLN-- 225

Query: 248 VTEQDMEETFLRPFEMCVKEGDASSVMCSYNRVNGIPSCADPKLLNQTVRGEWDLHGYIV 307
           V  +++ ET L P+E  + E +A SVM +Y+ ++G+P      LL + +RGE+   G +V
Sbjct: 226 VGPRELRETHLFPYEAVISEANAESVMNAYHDLDGVPCANSEWLLTEVLRGEFGFDGTVV 285

Query: 308 ADCDSIQVMVDNHKFLADSKEDAVAQTLKAGLDLDC--GQYYTNFTGNAVQQGKVKETDI 365
           +D  S++ +V  H+  A +K +A  Q L+AG+D++    +YY      AV++G + E  +
Sbjct: 286 SDYYSVRHLVTEHE-TASTKPEAAVQALEAGIDVELPYTEYYGEHLVEAVEEGDLAEETL 344

Query: 366 DKSLKYLYTVLMRLGFFDGSPQYVSLGKQDICSDENIELAAEAAREGIVLLKNDQNTLPL 425
           ++S++ +     R G FD     V        +DE  E+  EAAR+ + LLKN+ +    
Sbjct: 345 NESVRRILREKFRKGVFDDPAVDVDAAADAFHTDEAREVTREAARQSMTLLKNEDDL--- 401

Query: 426 NSAKVKTVAVVGPHANATVAMIGNYA--------GIPCRYMSPIAGFSGY--ANVTYKTG 475
               V  VAVVGP A+    ++G+YA              ++P+         +VTY+ G
Sbjct: 402 LPLDVDDVAVVGPKADNPKELMGDYAYAAHYPEEEYEADAVTPLEALEARDGLDVTYEQG 461

Query: 476 CDDVACKSNNSIFAASEAAKTADATIILAG----LDLS-VEAESLDRED----------- 519
           C  ++  S +   AA++AA  AD  +   G    +D S VEAE  ++             
Sbjct: 462 C-TISGPSTDGFDAAADAAADADVALAFVGARSAVDFSDVEAEKEEKPSVPTSGEGCDVT 520

Query: 520 -LWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAETNTNIKAILWAGYPGEEGGRA 578
            L LPG Q +L+ ++ E    PV++V++S  G   A  +      AIL+A  PG+EGG A
Sbjct: 521 HLGLPGVQEELVAELLET-DTPVVVVLVS--GKPHAIEDIAAEAPAILYAWLPGDEGGTA 577

Query: 579 IADVVFGKFNPGGRLPITWYNGDYVQMLP--LTSMPLRPVDSLGYPGRTYKFYNGPTLYP 636
           IA+ +FG+ NP G+LP++         LP  +  +P+          + Y + +   +YP
Sbjct: 578 IAETLFGENNPAGKLPVS---------LPKSVGQLPVYYNRKENTANKDYVYTDSEPVYP 628

Query: 637 FGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNLNYTSDASKTRCPGVLVNDLRCDDYFE 696
           FG+G SYT+F+Y  +S +      L                                 F 
Sbjct: 629 FGHGESYTEFEYGDVSLSTDSVTPLGS-------------------------------FT 657

Query: 697 FKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQVIGFQRVFVRAGRNKRIKFVFNACKS 756
             V   NVG   G ++V  Y +      A  +++++GF+RV +  G +KR+ F  +A + 
Sbjct: 658 ASVTVANVGDRAGDEIVQCYGRATNASQARPVQELLGFERVSLEPGESKRVAFDLSATQ- 716

Query: 757 LNIVDYAANTLLPAGEHTIFVGN 779
           L   D + N  +  G + I +G 
Sbjct: 717 LAFHDLSMNLAVEEGPYEIRIGR 739


>gi|345302417|ref|YP_004824319.1| beta-glucosidase [Rhodothermus marinus SG0.5JP17-172]
 gi|345111650|gb|AEN72482.1| Beta-glucosidase [Rhodothermus marinus SG0.5JP17-172]
          Length = 754

 Score =  252 bits (643), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 229/768 (29%), Positives = 359/768 (46%), Gaps = 116/768 (15%)

Query: 65  VKDLVSRMTLDEKVQQL----GDFAHGVP-------------RLGLPQYEWWSEALHGV- 106
           ++ L++RMTL+EK+ QL    G  A   P             R+G     + +EA+  + 
Sbjct: 33  IEALLARMTLEEKLGQLTLYNGGMAETGPVVREGEPDAIRRGRVGAVMNFFGAEAVCAMQ 92

Query: 107 ------SNVG-PGTHFDDVIPG-ATSFPTVILTTASFNESLWKKIGQAVSTEARAMYNLG 158
                 S +G P     DVI G  T FP  +   A+F+ +L ++  +  + EA A+    
Sbjct: 93  RQAVEESRLGIPLLFALDVIHGFRTIFPVPLAEAATFDPALVEQAARVAAGEASAV---- 148

Query: 159 RAGLTY-WSPNINVARDPRWGRITETPGEDPFVVGRYAVNYVRGLQDVEGHENATDLNSR 217
             GL + ++P +++ARD RWGRI E  GEDP++    A   VRG Q         DL   
Sbjct: 149 --GLNWTFAPMVDIARDARWGRIVEGSGEDPYLGAVMAAARVRGFQ-------GRDLRD- 198

Query: 218 PLKVSSCCKHYAAYDVDNWKGVDRYHFDARVTEQDMEETFLRPFEMCVKEGDASSVMCSY 277
           P  + +  KH+AAY      G D    D  V+E+ + E +L PFE  V+ G A S+M ++
Sbjct: 199 PTTILATAKHFAAYGAAE-AGRDYNTVD--VSERTLREVYLPPFEAAVRAG-ALSIMSAF 254

Query: 278 NRVNGIPSCADPKLLNQTVRGEWDLHGYIVADCDSIQVMVDNHKFLADSKEDAVAQTLKA 337
           N + G+P+ AD  LL   +R EW   G +V+D  S+  ++  H   ADS E    + L+A
Sbjct: 255 NEIGGVPATADRWLLTDVLRHEWGFEGLVVSDYTSVWELL-FHGIAADSAEVG-RKALEA 312

Query: 338 GLDLD-CGQYYTNFTGNAVQQGKVKETDIDKSLKYLYTVLMRLGFFDGSPQYV--SLGKQ 394
           G+D+D     Y       V+ G++ E  +D++++ +  V  RLG F+   +Y   +  +Q
Sbjct: 313 GVDMDMVSGIYVRKLAEEVRAGRLSEAVVDEAVRRVLRVKYRLGLFEDPYRYCRDASREQ 372

Query: 395 DICSDENIELAAEAAREGIVLLKNDQNTLPLNSAKVKTVAVVGPHANATVAMIGNYA--G 452
            + S  +  LA E AR+ IVLLKN+   LPL    ++ VAV+G  AN + +++G +A  G
Sbjct: 373 VLLSPAHRRLAREVARKAIVLLKNEGELLPLAD-TLQRVAVIGALANDSASVLGPWAAAG 431

Query: 453 IPCRYMSPIAGFSGY---ANVTYKTGCDDV-----------ACKSNNSIFAASEA-AKTA 497
            P   ++ + G       A V Y  G  +V           A   + S FA +EA A+ A
Sbjct: 432 RPEDAVTILEGIRAALPGATVRYAPGYAEVPSGSFQEMVAAALSPDTSGFAEAEAVARWA 491

Query: 498 DATIILAGLDLSVEAESLDREDLWLPGYQTQLINQVAEVAKGPVILVIMSAGGVDIAFAE 557
           +  I++ G    +  E+  R  + LPG Q  L  ++  + + PV++V+M+  G  +A  E
Sbjct: 492 EVVILVLGEHRELSGEAASRASVELPGVQLALARRLLALGR-PVVVVLMN--GRPLAIPE 548

Query: 558 TNTNIKAILWAGYPGEEGGRAIADVVFGKFNPGGRLPITWYNGDYVQMLPLTSMPLRPVD 617
                 AI+ A + G E G A+ADV+ GK +PGGRLP+++      + L     P     
Sbjct: 549 LAALAPAIVEAWFLGTEMGHAVADVLLGKASPGGRLPVSFPRATGQEPLYYNHKP----- 603

Query: 618 SLGYPGR-----TYKFYNGP--TLYPFGYGLSYTQFKYNLLSFTKTIQVNLNKLQHCRNL 670
             G P R     T K+ + P   LYPFGYGL+YT F Y+ L  ++               
Sbjct: 604 -TGRPPRAEEKYTSKYVDVPWTPLYPFGYGLTYTTFAYDSLRLSRRRLG----------- 651

Query: 671 NYTSDASKTRCPGVLVNDLRCDDYFEFKVDFQNVGSTDGSDVVIVYSKPPAEIAATYIKQ 730
                                DD  E  V   N G   G +VV +Y +         +K+
Sbjct: 652 --------------------LDDTLEVVVSVTNTGRRRGEEVVQLYVRDEVASVTRPVKE 691

Query: 731 VIGFQRVFVRAGRNKRIKFVFNACKSLNIVDYAANTLLPAGEHTIFVG 778
           + GF RV +  G  K ++F     ++L         ++  G  T++VG
Sbjct: 692 LKGFARVELAPGETKAVQFRLP-VRALRFWGLEGGWVVEPGWFTLWVG 738


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.136    0.414 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 12,641,970,642
Number of Sequences: 23463169
Number of extensions: 547030438
Number of successful extensions: 1193614
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6086
Number of HSP's successfully gapped in prelim test: 1462
Number of HSP's that attempted gapping in prelim test: 1142885
Number of HSP's gapped (non-prelim): 17018
length of query: 792
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 641
effective length of database: 8,816,256,848
effective search space: 5651220639568
effective search space used: 5651220639568
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 81 (35.8 bits)